1
|
Negro A, Cesarani A, Cortellari M, Bionda A, Fresi P, Macciotta NPP, Grande S, Biffani S, Crepaldi P. A comparison of genetic and genomic breeding values in Saanen and Alpine goats. Animal 2024; 18:101118. [PMID: 38508133 DOI: 10.1016/j.animal.2024.101118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 02/17/2024] [Accepted: 02/20/2024] [Indexed: 03/22/2024] Open
Abstract
Nowadays, several countries are developing or adopting genomic selection in the dairy goat sector. The most used method to estimate breeding values is Single-Step Genomic Best Linear Unbiased Prediction (ssGBLUP) which offers several advantages in terms of computational process and accuracy of the estimated breeding values (EBVs). Saanen and Alpine are the predominant dairy goat breeds in Italy, and both have similar breeding programs where EBVs for productive traits are currently calculated using BLUP. This work describes the implementation of genomic selection for these two breeds in Italy, aligning with the selection practices already carried out in the international landscape. The available dataset included 3 611 genotyped animals, 11 470 lactation records, five traits (milk, protein and fat yields, and fat and protein percentages), and three-generation pedigrees. EBVs were estimated using BLUP, GBLUP, and ssGBLUP both with single and multiple trait approaches. The methods were compared in terms of correlation between EBVs and genetic trends. Results were also validated with the linear regression method excluding part of the phenotypic data. In both breeds, EBVs and GEBVs were strongly correlated and the trend of each trait was similar comparing the three methods. The average increase in accuracy across traits and methods amounted to +13 and +10% from BLUP to ssGBLUP for Alpine and Saanen breeds, respectively. Results indicated higher prediction accuracy and correlation for GBLUP and ssGBLUP compared to BLUP, implying that the use of genotypes increases the accuracy of EBVs, particularly in the absence of phenotypic data. Therefore, ssGBLUP is likely to be the most effective method to enhance genetic gain in Italian Saanen and Alpine goats.
Collapse
Affiliation(s)
- A Negro
- Ufficio Studi, Associazione Nazionale della Pastorizia, 00187 Rome, Italy; Dipartimento di Scienze Agrarie e alimentari, Università degli studi di Milano, 20133 Milan, Italy
| | - A Cesarani
- Dipartimento di Scienze Agrarie, Università degli Studi di Sassari, 07100 Sassari, Italy; Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - M Cortellari
- Dipartimento di Scienze Agrarie e alimentari, Università degli studi di Milano, 20133 Milan, Italy
| | - A Bionda
- Dipartimento di Scienze Agrarie e alimentari, Università degli studi di Milano, 20133 Milan, Italy.
| | - P Fresi
- Ufficio Studi, Associazione Nazionale della Pastorizia, 00187 Rome, Italy
| | - N P P Macciotta
- Dipartimento di Scienze Agrarie, Università degli Studi di Sassari, 07100 Sassari, Italy
| | - S Grande
- Ufficio Studi, Associazione Nazionale della Pastorizia, 00187 Rome, Italy
| | - S Biffani
- Istituto di Biologia e Biotecnologia, Consiglio Nazionale delle Ricerche, 20133 Milan, Italy
| | - P Crepaldi
- Dipartimento di Scienze Agrarie e alimentari, Università degli studi di Milano, 20133 Milan, Italy
| |
Collapse
|
2
|
López-Correa RD, Legarra A, Aguilar I. Modelling missing pedigree with metafounders and validating single-step genomic predictions in a small dairy cattle population with a great influence of foreign genetics. J Dairy Sci 2024:S0022-0302(24)00054-7. [PMID: 38310956 DOI: 10.3168/jds.2023-23732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 12/22/2023] [Indexed: 02/06/2024]
Abstract
Genetic improvement in small countries rely heavily on foreign genetics. In an importing country such as Uruguay, consideration of unknown parent groups (UPG) for foreign sires is essential. However, the use of UPG in genomic model evaluations may lead to bias in genomic estimated breeding values. The objective of this study was to study different models including UPG or Metafounders (MF) in the Uruguayan Holstein evaluation and to analyze bias, dispersion, and accuracy of (G)EBV predictions in BLUP and ssGBLUP. Gamma matrix (Γ) was estimated either by using base allele population frequencies obtained by bounded linear regression (MFbounded), or by using 2 values to design Γ, i.e., a single value for the diagonal and a different value for the off-diagonal (MFrobust). Both Γ estimators performed well in terms of GEBV predictions, but MFbounded was the best option. There is, however, some bias whose origin was not completely understood. UPG or MF seem to model correctly genetic progress for unknown parents except for the very first groups (earlier time period). As for validation bulls, bias was observed across all models, whereas for validation cows it was only observed with UPG in BLUP. Overdispersion was found in all models, but it was mostly detected in validation bulls. Ratio of accuracies indicated that ssGBLUP gave better predictions than BLUP.
Collapse
Affiliation(s)
- R D López-Correa
- Universidad de la República, Facultad de Agronomía, Montevideo, Uruguay; Universidad de la República, Facultad de Veterinaria, Montevideo, Uruguay.
| | - A Legarra
- Council on Dairy Cattle Breeding: Bowie, Maryland, US
| | - I Aguilar
- Instituto Nacional de Investigación Agropecuaria, Montevideo, Uruguay
| |
Collapse
|
3
|
Cesarani A, Corte Pause F, Hidalgo J, Garcia A, Degano L, Vicario D, Macciota NPP, Stradaioli G. Genetic background of semen parameters in Italian Simmental bulls. ITALIAN JOURNAL OF ANIMAL SCIENCE 2023. [DOI: 10.1080/1828051x.2022.2160665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Alberto Cesarani
- Dipartimento di Agraria, University of Sassari, Sassari, Italy
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Francesca Corte Pause
- Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, Udine, Italy
| | - Jorge Hidalgo
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Andre Garcia
- Angus Genetics Inc. - American Angus Association, Saint Joseph, MO, USA
| | - Lorenzo Degano
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), Udine, Italy
| | - Daniele Vicario
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), Udine, Italy
| | | | - Giuseppe Stradaioli
- Department of Agricultural, Food, Environmental and Animal Sciences, University of Udine, Udine, Italy
| |
Collapse
|
4
|
Maugan LH, Rostellato R, Tribout T, Mattalia S, Ducrocq V. Combined single-step evaluation of functional longevity of dairy cows including correlated traits. Genet Sel Evol 2023; 55:75. [PMID: 37880580 PMCID: PMC10601146 DOI: 10.1186/s12711-023-00839-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 09/09/2023] [Indexed: 10/27/2023] Open
Abstract
BACKGROUND For years, multiple trait genetic evaluations have been used to increase the accuracy of estimated breeding values (EBV) using information from correlated traits. In France, accurate approximations of multiple trait evaluations were implemented for traits that are described by different models by combining the results of univariate best linear unbiased prediction (BLUP) evaluations. Functional longevity (FL) is the trait that has most benefited from this approach. Currently, with many single-step (SS) evaluations, only univariate FL evaluations can be run. The aim of this study was to implement a "combined" SS (CSS) evaluation that extends the "combined" BLUP evaluation to obtain more accurate genomic (G) EBV for FL when information from five correlated traits (somatic cell score, clinical mastitis, conception rate for heifers and cows, and udder depth) is added. RESULTS GEBV obtained from univariate SS (USS) evaluations and from a CSS evaluation were compared. The correlations between these GEBV showed the benefits of including information from correlated traits. Indeed, a CSS evaluation run without any performances on FL showed that the indirect information from correlated traits to evaluate FL was substantial. USS and CSS evaluations that mimic SS evaluations with data available in 2016 were compared. For each evaluation separately, the GEBV were sorted and then split into 10 consecutive groups (deciles). Survival curves were calculated for each group, based on the observed productive life of these cows as known in 2021. Regardless of their genotyping status, the worst group of heifers based on their GEBV in 2016 was well identified in the CSS evaluation and they had a substantially shorter herd life, while those in the best heifer group had a longer herd life. The gaps between groups were more important for the genotyped than the ungenotyped heifers, which indicates better prediction of future survival. CONCLUSIONS A CSS evaluation is an efficient tool to improve FL. It allows a proper combination of information on functional traits that influence culling. In contrast, because of the strong selection intensity on young bulls for functional traits, the benefit of such a "combined" evaluation of functional traits is more modest for these males.
Collapse
Affiliation(s)
- Laure-Hélène Maugan
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France.
| | | | - Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Sophie Mattalia
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
- Idele, 78350, Jouy-en-Josas, France
| | - Vincent Ducrocq
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| |
Collapse
|
5
|
Carta S, Cesarani A, Correddu F, Macciotta NPP. Understanding the phenotypic and genetic background of the lactose content in Sarda dairy sheep. J Dairy Sci 2023; 106:3312-3320. [PMID: 37028961 DOI: 10.3168/jds.2022-22579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/17/2022] [Indexed: 04/09/2023]
Abstract
Lactose, the principal carbohydrate found in milk, plays an important role in the physiological processes of milk production because it is related to milk volume, and it is responsible for the osmotic equilibrium between blood and milk in the mammary gland. In this study, factors affecting lactose content (LC) in sheep milk are investigated. For this purpose, 2,358 test-day records were sampled from 509 ewes (3-7 records per animal). The LC and other main milk traits were analyzed using a mixed linear model that included days in milk (DIM) class, parity, lambing month, and type of lambing as fixed effects and animal, permanent environment, and flock test day as random effects. The pedigree-based approach was used to estimate the heritability and repeatability of LC. Moreover, the genomic background of LC was investigated through a GWAS. The LC was affected by all tested factors (i.e., DIM class, parity, lambing month, and type of lambing). Low heritability (0.10 ± 0.05) and moderate repeatability (0.42 ± 0.02) were estimated for LC. High negative genetic correlations were estimated between LC and NaCl (-0.99 ± 0.01) and between LC and somatic cell count (-0.94 ± 0.05). Only 2 markers passed the chromosome-wide Bonferroni threshold. Results of the present study, although obtained on a relatively small sample, suggest the possibility to include LC in the breeding programs, particularly because of its strong relationship with NaCl and somatic cell count.
Collapse
Affiliation(s)
- S Carta
- Dipartimento di Agraria, University of Sassari, 07100, Sassari, Italy
| | - A Cesarani
- Dipartimento di Agraria, University of Sassari, 07100, Sassari, Italy; Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | - F Correddu
- Dipartimento di Agraria, University of Sassari, 07100, Sassari, Italy.
| | - N P P Macciotta
- Dipartimento di Agraria, University of Sassari, 07100, Sassari, Italy
| |
Collapse
|
6
|
Cesarani A, Bermann M, Dimauro C, Degano L, Vicario D, Lourenco D, Macciotta NPP. Strategies for choosing core animals in the algorithm for proven and young and their impact on the accuracy of single-step genomic predictions in cattle. Animal 2023; 17:100766. [PMID: 37001441 DOI: 10.1016/j.animal.2023.100766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 02/24/2023] [Accepted: 02/28/2023] [Indexed: 03/16/2023] Open
Abstract
Nowadays, in some populations, the number of genotyped animals is too large to obtain the inverse of the genomic relationship matrix. The algorithm for proven and young animals (APY) can be used to overcome this problem. In the present work, different strategies for defining core animals in APY were tested using either simulated or real data. In particular, core definitions based on random choice or on the contribution to the genomic relationship matrix (GCONTR) calculated using Principal Component Analysis were tested. Core sizes able to explain 90, 95, 98, and 99% of the total variance of the genomic relationship matrix (G) were used. Analyzed phenotypes were three simulated traits for 3 000 individuals, and milkability records for 136 406 Italian Simmental cows. The number of genotypes was 4 100 for the simulated dataset, and 11 636 for the Simmental data, respectively. The GCONTR values in Simmental dataset were moderately correlated with the analyzed phenotype, and they showed a decreasing trend according to the year of birth of genotyped animals. The accuracy increased as the size of the core increased in both datasets. The inclusion in the core of animals with largest GCONTR values led to the lowest accuracies (0.50 and 0.71 for the simulated and Simmental datasets, respectively; average across traits and core sizes). On the contrary, the selection of animals with the lowest rank according to their contribution to the G provided slightly higher accuracies, especially in the simulated dataset (0.68 for the simulated dataset, and 0.76 for the Simmental data; average across traits and core sizes). In real data, particularly for larger sizes of core animals, the criteria of choice appear less important, confirming the results of earlier studies. Anyway, the inclusion in the core of animals with the lowest values of GCONTR led to increases in accuracy. These are preliminary results based on a small sample size that need to be confirmed on a larger number of genotypes.
Collapse
Affiliation(s)
- A Cesarani
- Dipartimento di Agraria, Università di Sassari, 07100 Sassari, Italy; Department of Animal and Dairy Science, University of Georgia, 30602 Athens, GA, USA.
| | - M Bermann
- Department of Animal and Dairy Science, University of Georgia, 30602 Athens, GA, USA
| | - C Dimauro
- Dipartimento di Agraria, Università di Sassari, 07100 Sassari, Italy
| | - L Degano
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), 33100 Udine, Italy
| | - D Vicario
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), 33100 Udine, Italy
| | - D Lourenco
- Department of Animal and Dairy Science, University of Georgia, 30602 Athens, GA, USA
| | - N P P Macciotta
- Dipartimento di Agraria, Università di Sassari, 07100 Sassari, Italy
| |
Collapse
|
7
|
Bussiman F, Chen CY, Holl J, Bermann M, Legarra A, Misztal I, Lourenco D. Boundaries for genotype, phenotype, and pedigree truncation in genomic evaluations in pigs. J Anim Sci 2023; 101:skad273. [PMID: 37584978 PMCID: PMC10464514 DOI: 10.1093/jas/skad273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 08/10/2023] [Indexed: 08/17/2023] Open
Abstract
Historical data collection for genetic evaluation purposes is a common practice in animal populations; however, the larger the dataset, the higher the computing power needed to perform the analyses. Also, fitting the same model to historical and recent data may be inappropriate. Data truncation can reduce the number of equations to solve, consequently decreasing computing costs; however, the large volume of genotypes is responsible for most of the increase in computations. This study aimed to assess the impact of removing genotypes along with phenotypes and pedigree on the computing performance, reliability, and inflation of genomic predicted breeding value (GEBV) from single-step genomic best linear unbiased predictor for selection candidates. Data from two pig lines, a terminal sire (L1) and a maternal line (L2), were analyzed in this study. Four analyses were implemented: growth and "weaning to finish" mortality on L1, pre-weaning and reproductive traits on L2. Four genotype removal scenarios were proposed: removing genotyped animals without phenotypes and progeny (noInfo), removing genotyped animals based on birth year (Age), the combination of noInfo and Age scenarios (noInfo + Age), and no genotype removal (AllGen). In all scenarios, phenotypes were removed, based on birth year, and three pedigree depths were tested: two and three generations traced back and using the entire pedigree. The full dataset contained 1,452,257 phenotypes for growth traits, 324,397 for weaning to finish mortality, 517,446 for pre-weaning traits, and 7,853,629 for reproductive traits in pure and crossbred pigs. Pedigree files for lines L1 and L2 comprised 3,601,369 and 11,240,865 animals, of which 168,734 and 170,121 were genotyped, respectively. In each truncation scenario, the linear regression method was used to assess the reliability and dispersion of GEBV for genotyped parents (born after 2019). The number of years of data that could be removed without harming reliability depended on the number of records, type of analyses (multitrait vs. single trait), the heritability of the trait, and data structure. All scenarios had similar reliabilities, except for noInfo, which performed better in the growth analysis. Based on the data used in this study, considering the last ten years of phenotypes, tracing three generations back in the pedigree, and removing genotyped animals not contributing own or progeny phenotypes, increases computing efficiency with no change in the ability to predict breeding values.
Collapse
Affiliation(s)
- Fernando Bussiman
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | | | | | - Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
8
|
Jones HE, Wilson PB. Progress and opportunities through use of genomics in animal production. Trends Genet 2022; 38:1228-1252. [PMID: 35945076 DOI: 10.1016/j.tig.2022.06.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 06/08/2022] [Accepted: 06/17/2022] [Indexed: 01/24/2023]
Abstract
The rearing of farmed animals is a vital component of global food production systems, but its impact on the environment, human health, animal welfare, and biodiversity is being increasingly challenged. Developments in genetic and genomic technologies have had a key role in improving the productivity of farmed animals for decades. Advances in genome sequencing, annotation, and editing offer a means not only to continue that trend, but also, when combined with advanced data collection, analytics, cloud computing, appropriate infrastructure, and regulation, to take precision livestock farming (PLF) and conservation to an advanced level. Such an approach could generate substantial additional benefits in terms of reducing use of resources, health treatments, and environmental impact, while also improving animal health and welfare.
Collapse
Affiliation(s)
- Huw E Jones
- UK Genetics for Livestock and Equines (UKGLE) Committee, Department for Environment, Food and Rural Affairs, Nobel House, 17 Smith Square, London, SW1P 3JR, UK; Nottingham Trent University, Brackenhurst Campus, Brackenhurst Lane, Southwell, NG25 0QF, UK.
| | - Philippe B Wilson
- UK Genetics for Livestock and Equines (UKGLE) Committee, Department for Environment, Food and Rural Affairs, Nobel House, 17 Smith Square, London, SW1P 3JR, UK; Nottingham Trent University, Brackenhurst Campus, Brackenhurst Lane, Southwell, NG25 0QF, UK
| |
Collapse
|
9
|
Callister AN, Bermann M, Elms S, Bradshaw BP, Lourenco D, Brawner JT. Accounting for population structure in genomic predictions of Eucalyptus globulus. G3 GENES|GENOMES|GENETICS 2022; 12:6654591. [PMID: 35920792 PMCID: PMC9434241 DOI: 10.1093/g3journal/jkac180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 06/29/2022] [Indexed: 12/02/2022]
Abstract
Genetic groups have been widely adopted in tree breeding to account for provenance effects within pedigree-derived relationship matrices. However, provenances or genetic groups have not yet been incorporated into single-step genomic BLUP (“HBLUP”) analyses of tree populations. To quantify the impact of accounting for population structure in Eucalyptus globulus, we used HBLUP to compare breeding value predictions from models excluding base population effects and models including either fixed genetic groups or the marker-derived proxies, also known as metafounders. Full-sib families from 2 separate breeding populations were evaluated across 13 sites in the “Green Triangle” region of Australia. Gamma matrices (Γ) describing similarities among metafounders reflected the geographic distribution of populations and the origins of 2 land races were identified. Diagonal elements of Γ provided population diversity or allelic covariation estimates between 0.24 and 0.56. Genetic group solutions were strongly correlated with metafounder solutions across models and metafounder effects influenced the genetic solutions of base population parents. The accuracy, stability, dispersion, and bias of model solutions were compared using the linear regression method. Addition of genomic information increased accuracy from 0.41 to 0.47 and stability from 0.68 to 0.71, while increasing bias slightly. Dispersion was within 0.10 of the ideal value (1.0) for all models. Although inclusion of metafounders did not strongly affect accuracy or stability and had mixed effects on bias, we nevertheless recommend the incorporation of metafounders in prediction models to represent the hierarchical genetic population structure of recently domesticated populations.
Collapse
Affiliation(s)
| | - Matias Bermann
- Department of Animal and Dairy Science, University of Georgia , Athens, GA 30602, USA
| | - Stephen Elms
- HVP Plantations , Churchill, VIC 3842, Australia
| | - Ben P Bradshaw
- Australian Bluegum Plantations , Albany, WA 6330, Australia
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia , Athens, GA 30602, USA
| | - Jeremy T Brawner
- Department of Plant Pathology, University of Florida , Gainesville, FL 32611, USA
| |
Collapse
|
10
|
Belay TK, Eikje LS, Gjuvsland AB, Nordbø Ø, Tribout T, Meuwissen T. Correcting for base-population differences and unknown parent groups in single-step genomic predictions of Norwegian Red Cattle. J Anim Sci 2022; 100:6618053. [PMID: 35752161 PMCID: PMC9467032 DOI: 10.1093/jas/skac227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 06/23/2022] [Indexed: 11/15/2022] Open
Abstract
Bias and inflation in genomic evaluation with the single-step methods have been reported in several studies. Incompatibility between the base-populations of the pedigree-based and the genomic relationship matrix (G) could be a reason for these biases. Inappropriate ways of accounting for missing parents could be another reason for biases in genetic evaluations with or without genomic information. To handle these problems, we fitted and evaluated a fixed covariate (J) that contains ones for genotyped animals and zeros for unrelated non-genotyped animals, or pedigree-based regression coefficients for related non-genotyped animals. We also evaluated alternative ways of fitting the J covariate together with genetic groups on biases and stability of breeding value estimates, and of including it into G as a random effect. In a whole vs. partial data set comparison, four scenarios were investigated for the partial data: genotypes missing, phenotypes missing, both genotypes and phenotypes missing, and pedigree missing. Fitting J either as fixed or random reduced level-bias and inflation and increased stability of genomic predictions as compared to the basic model where neither J nor genetic groups were fitted. In most models, genomic predictions were largely biased for scenarios with missing genotype and phenotype information. The biases were reduced for models which combined group and J effects. Models with these corrected group covariates performed better than the recently published model where genetic groups were encapsulated and fitted as random via the Quaas and Pollak transformation. In our Norwegian Red cattle data, a model which combined group and J regression coefficients was preferred because it showed least bias and highest stability of genomic predictions across the scenarios.
Collapse
Affiliation(s)
- Tesfaye K Belay
- Department of animal and aquacultural Sciences, Norwegian University of Life Sciences, NMBU, Norway
| | | | | | | | - Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, JouyenJosas, France
| | - Theo Meuwissen
- Department of animal and aquacultural Sciences, Norwegian University of Life Sciences, NMBU, Norway
| |
Collapse
|
11
|
Bermann M, Cesarani A, Misztal I, Lourenco D. Past, present, and future developments in single-step genomic models. ITALIAN JOURNAL OF ANIMAL SCIENCE 2022. [DOI: 10.1080/1828051x.2022.2053366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Alberto Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Ignacy Misztal
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| |
Collapse
|
12
|
Macedo FL, Astruc JM, Meuwissen THE, Legarra A. Removing data and using metafounders alleviates biases for all traits in Lacaune dairy sheep predictions. J Dairy Sci 2022; 105:2439-2452. [PMID: 35033343 DOI: 10.3168/jds.2021-20860] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 11/23/2021] [Indexed: 11/19/2022]
Abstract
Bias in dairy genetic evaluations, when it exists, has to be understood and properly addressed. The origin of biases is not always clear. We analyzed 40 yr of records from the Lacaune dairy sheep breeding program to evaluate the extent of bias, assess possible corrections, and emit hypotheses on its origin. The data set included 7 traits (milk yield, fat and protein contents, somatic cell score, teat angle, udder cleft, and udder depth) with records from 600,000 to 5 million depending on the trait, ∼1,900,000 animals, and ∼5,900 genotyped elite artificial insemination rams. For the ∼8% animals with missing sire, we fit 25 unknown parent groups. We used the linear regression method to compare "partial" and "whole" predictions of young rams before and after progeny testing, with 7 cut-off points, and we obtained estimates of their bias, (over)dispersion, and accuracy in early proofs. We tried (1) several scenarios as follows: multiple or single trait, the "official" (routine) evaluation, which is a mixture of both single and multiple trait, and "deletion" of data before 1990; and (2) several models as follows: BLUP and single-step genomic (SSG)BLUP with fixed unknown parent groups or metafounders, where, for metafounders, their relationship matrix gamma was estimated using either a model for inbreeding trend, or base allele frequencies estimated by peeling. The estimate of gamma obtained by modeling the inbreeding trend resulted in an estimated increase of inbreeding, based on markers, faster than the pedigree-based one. The estimated genetic trends were similar for most models and scenarios across all traits, but were shrunken when gamma was estimated by peeling. This was due to shrinking of the estimates of metafounders in the latter case. Across scenarios, all traits showed bias, generally as an overestimate of genetic trend for milk yield and an underestimate for the other traits. As for the slope, it showed overdispersion of estimated breeding values for all traits. Using multiple-trait models slightly reduced the overestimate of genetic trend and the overdispersion, as did including genomic information (i.e., SSGBLUP) when the gamma matrix was estimated by the model for inbreeding trend. However, only deletion of historical data before 1990 resulted in elimination of both kind of biases. The SSGBLUP resulted in more accurate early proofs than BLUP for all traits. We considered that a snowball effect of small errors in each genetic evaluation, combined with selection, may have resulted in biased evaluations. Improving statistical methods reduced some bias but not all, and a simple solution for this data set was to remove historical records.
Collapse
Affiliation(s)
- F L Macedo
- INRA, GenPhySE, Castanet-Tolosan 31320, France; Facultad de Veterinaria, Universidad de la República, 11600 Montevideo, Uruguay; Deptartment of Animal Breeding and Genetics SLU, S-75007 Uppsala, Sweden
| | - J M Astruc
- Institut de l'Elevage, Castanet-Tolosan 31321, France
| | - T H E Meuwissen
- Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, 1432 Ås, Norway
| | - A Legarra
- INRA, GenPhySE, Castanet-Tolosan 31320, France.
| |
Collapse
|
13
|
Cesarani A, Lourenco D, Tsuruta S, Legarra A, Nicolazzi E, VanRaden P, Misztal I. Multibreed genomic evaluation for production traits of dairy cattle in the United States using single-step genomic best linear unbiased predictor. J Dairy Sci 2022; 105:5141-5152. [DOI: 10.3168/jds.2021-21505] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 01/27/2022] [Indexed: 01/01/2023]
|
14
|
Masuda Y, VanRaden PM, Tsuruta S, Lourenco DAL, Misztal I. Invited review: Unknown-parent groups and metafounders in single-step genomic BLUP. J Dairy Sci 2021; 105:923-939. [PMID: 34799109 DOI: 10.3168/jds.2021-20293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 09/26/2021] [Indexed: 11/19/2022]
Abstract
Single-step genomic BLUP (ssGBLUP) is a method for genomic prediction that integrates matrices of pedigree (A) and genomic (G) relationships into a single unified additive relationship matrix whose inverse is incorporated into a set of mixed model equations (MME) to compute genomic predictions. Pedigree information in dairy cattle is often incomplete. Missing pedigree potentially causes biases and inflation in genomic estimated breeding values (GEBV) obtained with ssGBLUP. Three major issues are associated with missing pedigree in ssGBLUP, namely biased predictions by selection, missing inbreeding in pedigree relationships, and incompatibility between G and A in level and scale. These issues can be solved using a proper model for unknown-parent groups (UPG). The theory behind the use of UPG is well established for pedigree BLUP, but not for ssGBLUP. This study reviews the development of the UPG model in pedigree BLUP, the properties of UPG models in ssGBLUP, and the effect of UPG on genetic trends and genomic predictions. Similarities and differences between UPG and metafounder (MF) models, a generalized UPG model, are also reviewed. A UPG model (QP) derived using a transformation of the MME has a good convergence behavior. However, with insufficient data, the QP model may yield biased genetic trends and may underestimate UPG. The QP model can be altered by removing the genomic relationships linking GEBV and UPG effects from MME. This altered QP model exhibits less bias in genetic trends and less inflation in genomic predictions than the QP model, especially with large data sets. Recently, a new model, which encapsulates the UPG equations into the pedigree relationships for genotyped animals, was proposed in simulated purebred populations. The MF model is a comprehensive solution to the missing pedigree issue. This model can be a choice for multibreed or crossbred evaluations if the data set allows the estimation of a reasonable relationship matrix for MF. Missing pedigree influences genetic trends, but its effect on the predictability of genetic merit for genotyped animals should be negligible when many proven bulls are genotyped. The SNP effects can be back-solved using GEBV from older genotyped animals, and these predicted SNP effects can be used to calculate GEBV for young-genotyped animals with missing parents.
Collapse
Affiliation(s)
- Yutaka Masuda
- Department of Animal and Dairy Science, University of Georgia, Athens 30602.
| | - Paul M VanRaden
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, US Department of Agriculture, Beltsville, MD 20705
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| |
Collapse
|