1
|
Reliabilities of estimated breeding values in models with metafounders. Genet Sel Evol 2023; 55:6. [PMID: 36690938 PMCID: PMC9869531 DOI: 10.1186/s12711-023-00778-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 01/04/2023] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Reliabilities of best linear unbiased predictions (BLUP) of breeding values are defined as the squared correlation between true and estimated breeding values and are helpful in assessing risk and genetic gain. Reliabilities can be computed from the prediction error variances for models with a single base population but are undefined for models that include several base populations and when unknown parent groups are modeled as fixed effects. In such a case, the use of metafounders in principle enables reliabilities to be derived. METHODS We propose to compute the reliability of the contrast of an individual's estimated breeding value with that of a metafounder based on the prediction error variances of the individual and the metafounder, their prediction error covariance, and their genetic relationship. Computation of the required terms demands only little extra work once the sparse inverse of the mixed model equations is obtained, or they can be approximated. This also allows the reliabilities of the metafounders to be obtained. We studied the reliabilities for both BLUP and single-step genomic BLUP (ssGBLUP), using several definitions of reliability in a large dataset with 1,961,687 dairy sheep and rams, most of which had phenotypes and among which 27,000 rams were genotyped with a 50K single nucleotide polymorphism (SNP) chip. There were 23 metafounders with progeny sizes between 100,000 and 2000 individuals. RESULTS In models with metafounders, directly using the prediction error variance instead of the contrast with a metafounder leads to artificially low reliabilities because they refer to a population with maximum heterozygosity. When only one metafounder is fitted in the model, the reliability of the contrast is shown to be equivalent to the reliability of the individual in a model without metafounders. When there are several metafounders in the model, using a contrast with the oldest metafounder yields reliabilities that are on a meaningful scale and very close to reliabilities obtained from models without metafounders. The reliabilities using contrasts with ssGBLUP also resulted in meaningful values. CONCLUSIONS This work provides a general method to obtain reliabilities for both BLUP and ssGBLUP when several base populations are included through metafounders.
Collapse
|
2
|
Impact of genomic preselection on subsequent genetic evaluations with ssGBLUP using real data from pigs. Genet Sel Evol 2022; 54:48. [PMID: 35764921 PMCID: PMC9238012 DOI: 10.1186/s12711-022-00727-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 05/02/2022] [Indexed: 11/29/2022] Open
Abstract
Background Empirically assessing the impact of preselection on genetic evaluation of preselected animals requires comparing scenarios that take different approaches into account, including scenarios without preselection. However, preselection is almost always performed in animal breeding programs, so it is difficult to have a dataset without preselection. Hence, most studies on preselection have used simulated datasets, and have concluded that genomic estimated breeding values (GEBV) from subsequent single-step genomic best linear unbiased prediction (ssGBLUP) evaluations are unbiased. The aim of this study was to investigate the impact of genomic preselection (GPS) on accuracy and bias in subsequent ssGBLUP evaluations, using data from a commercial pig breeding program. Methods We used data on average daily gain during performance testing, average daily gain throughout life, backfat thickness, and loin depth from one sire line and one dam line of pigs. As these traits have different weights in the breeding goals of the two lines, we analyzed the lines separately. For each line, we implemented a reference GPS scenario that kept all available data, against which the next two scenarios were compared. We then implemented two other scenarios with additional layers of GPS by removing all animals without progeny either (i) only in the validation generation, or (ii) in all generations. We conducted subsequent ssGBLUP evaluations for each GPS scenario, using all the data remaining after implementing the GPS scenario. Accuracy and bias were computed by comparing GEBV against progeny yield deviations of validation animals. Results Results for all traits and in both lines showed a marginal loss in accuracy due to the additional layers of GPS. Average accuracies across all GPS scenarios in the two lines were 0.39, 0.47, 0.56, and 0.60, for average daily gain during performance testing and throughout life, backfat thickness, and loin depth, respectively. Biases were largely absent, and when present, did not differ greatly between the GPS scenarios. Conclusions We conclude that the impact of preselection on accuracy and bias in subsequent ssGBLUP evaluations of selection candidates in pigs is generally minimal. We expect this conclusion to apply for other animal breeding programs as well, since preselection of any type or intensity generally has the same effect in animal breeding programs. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00727-5.
Collapse
|
3
|
Inclusion of Sire by Herd interaction effect in the genomic evaluation for weaning weight of American Angus. J Anim Sci 2022; 100:6537149. [PMID: 35213718 PMCID: PMC9030219 DOI: 10.1093/jas/skac057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 02/23/2022] [Indexed: 11/12/2022] Open
Abstract
A spurious negative genetic correlation between direct and maternal effects of weaning weight (WW) in beef cattle has historically been problematic for researchers and industry. Previous research has suggested the covariance between sires and herds may be contributing to this relationship. The objective of this study was to estimate the variance components (VC) for WW in American Angus with and without sire by herd (S×H) interaction effect when genomic information is used or not. Five subsets of ~100k animals for each subset were used. When genomic information was included, genotypes were added for 15,637 animals. Five replicates were performed. Four different models were tested, namely, M1: without S×H interaction effect and with covariance between direct and maternal effect (σam) ≠ 0; M2: with S×H interaction effect and σam ≠ 0; M3: without S×H interaction effect and with σam = 0; M4: with S×H interaction effect and σam = 0. VC were estimated using the restricted maximum likelihood (REML) and single-step genomic REML (ssGREML) with the average information algorithm. Breeding values were computed using single-step genomic BLUP for the models above and one additional model, which had the covariance zeroed after the estimation of VC (M5). The ability of each model to predict future breeding values was investigated with the linear regression method. Under REML, when the S×H interaction effect was added to the model, both direct and maternal genetic variances were greatly reduced, and the negative covariance became positive (i.e., when moving from M1 to M2). Similar patterns were observed under ssGREML, but with less reduction in the direct and maternal genetic variances and still a negative covariance. Models with the S×H interaction effect (M2 and M4) had a better fit according to the Akaike information criteria. Breeding values from those models were more accurate and had less bias than the other three models. The rankings and breeding values of artificial insemination sires (N = 1,977) greatly changed when the S×H interaction effect was fit in the model. Although the S×H interaction effect accounted for 3% to 5% of the total phenotypic variance and improved the model fit, this change in the evaluation model will cause severe reranking among animals. A spurious negative genetic correlation between direct and maternal effects of weaning weight (WW) in beef cattle has been problematic for researchers and industry. Previous research suggested the covariance between sires and herds may contribute to this relationship. The objective of this study was to estimate the variance components (VC) for WW in American Angus with and without sire by herd (S×H) interaction effect when genomic information is used or not. Four models were designed to investigate the S×H effect. The restricted maximum likelihood (REML) and single-step genomic REML (ssGREML) were used to estimate VC. Breeding values were computed using single-step genomic BLUP and the validation was done through the linear regression method. Under REML, when the S×H was added to the model, both direct and maternal genetic variances were greatly reduced, and the negative covariance became positive. Similar patterns were observed under ssGREML, but with less reduction in the direct and maternal genetic variances and still a negative covariance. Breeding values from models with S×H were more accurate and had less bias than the other models. Although the S×H improved the model, this change in the evaluation model will cause severe reranking among key animals.
Collapse
|
4
|
Genome-wide associations for heat stress response suggest potential candidate genes underlying milk fatty acid composition in dairy cattle. J Dairy Sci 2022; 105:3323-3340. [PMID: 35094857 DOI: 10.3168/jds.2021-21152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 12/01/2021] [Indexed: 11/19/2022]
Abstract
Contents of milk fatty acids (FA) display remarkable alterations along climatic gradients. Detecting candidate genes underlying such alterations might be beneficial for the exploration of climate sensitivity in dairy cattle. Consequently, we aimed on the definition of FA heat stress indicators, considering FA breeding values in response to temperature-humidity index (THI) alterations. Indicators were used in GWAS, in ongoing gene annotations and for the estimation of chromosome-wide variance components. The phenotypic data set consisted of 39,600 test-day milk FA records from 5,757 first-lactation Holstein dairy cows kept in 16 large-scale German cooperator herds. The FA traits were C18:0, polyunsaturated fatty acids (PUFA), saturated fatty acids (SFA), and unsaturated fatty acids (UFA). After genotype quality control, 40,523 SNP markers from 3,266 cows and 930 sires were considered. Meteorological data from the weather station in closest herd distance were used for the calculation of maximum hourly daily THI, which were allocated to 10 different THI classes. The same FA from 3 stages of lactation were considered as different, but genetically correlated traits. Consequently, a 3-trait reaction norm model was used to estimate genetic parameters and breeding values for FA along THI classes, considering either pedigree (A) or genomic (G) relationship matrices. De-regressed proofs and genomic estimated breeding values at the intermediate THI class 5 and at the extreme THI class 10 were used as pseudophenotypes in ongoing genomic analyses for thermoneutral (TNC) and heat stress conditions (HSC), respectively. The differences in de-regressed proofs and in genomic estimated breeding values from both THI classes were pseudophenotypes for heat stress response (HSR). Genetic correlations between the same FA under TNC and HSC were smallest in the first lactation stage and ranged from 0.20 for PUFA to 0.87 for SFA when modeling with the A matrix, and from 0.35 for UFA to 0.86 for SFA when modeling with the G matrix. In the first lactation stage, larger additive genetic variances under HSC compared with TNC indicate climate sensitivity for C18:0, PUFA, and UFA. Climate sensitivity was also reflected by pronounced chromosome-wide genetic variances for HSR of PUFA and UFA in the first stage of lactation. For all FA under TNC, HSC, and HSR, quite large genetic variance proportions were explained by BTA14. In GWAS, 30 SNP (within or close to 38 potential candidate genes) overlapped for HSR of the different FA. One unique potential candidate gene (AMFR) was detected for HSR of PUFA, 15 for HSR of SFA (ADGRB1, DENND3, DUSP16, EFR3A, EMP1, ENSBTAG00000003838, EPS8, MGP, PIK3C2G, STYK1, TMEM71, GSG1, SMARCE1, CCDC57, and FASN) and 3 for HSR of UFA (ENSBTAG00000048091, PAEP, and EPPK1). The identified unique genes play key roles in milk FA synthesis and are associated with disease resistance in dairy cattle. The results suggest consideration of FA in combination with climatic responses when inferring genetic mechanisms of heat stress in dairy cows.
Collapse
|
5
|
Genome-wide association study for residual concentrate intake using different approaches in Italian Brown Swiss. ITALIAN JOURNAL OF ANIMAL SCIENCE 2021. [DOI: 10.1080/1828051x.2021.1963864] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
6
|
Bias, dispersion, and accuracy of genomic predictions for feedlot and carcase traits in Australian Angus steers. Genet Sel Evol 2021; 53:77. [PMID: 34565347 PMCID: PMC8474816 DOI: 10.1186/s12711-021-00673-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 09/15/2021] [Indexed: 12/03/2022] Open
Abstract
Background Improving feedlot performance, carcase weight and quality is a primary goal of the beef industry worldwide. Here, we used data from 3408 Australian Angus steers from seven years of birth (YOB) cohorts (2011–2017) with a minimal level of sire linkage and that were genotyped for 45,152 SNPs. Phenotypic records included two feedlot and five carcase traits, namely average daily gain (ADG), average daily dry matter intake (DMI), carcase weight (CWT), carcase eye muscle area (EMA), carcase Meat Standard Australia marbling score (MBL), carcase ossification score (OSS) and carcase subcutaneous rib fat depth (RIB). Using a 7-way cross-validation based on YOB cohorts, we tested the quality of genomic predictions using the linear regression (LR) method compared to the traditional method (Pearson’s correlation between the genomic estimated breeding value (GEBV) and its associated adjusted phenotype divided by the square root of heritability); explored the factors, such as heritability, validation cohort, and phenotype that affect estimates of accuracy, bias, and dispersion calculated with the LR method; and suggested a novel interpretation for translating differences in accuracy into phenotypic differences, based on GEBV quartiles (Q1Q4). Results Heritability (h2) estimates were generally moderate to high (from 0.29 for ADG to 0.53 for CWT). We found a strong correlation (0.73, P-value < 0.001) between accuracies using the traditional method and those using the LR method, although the LR method was less affected by random variation within and across years and showed a better ability to discriminate between extreme GEBV quartiles. We confirmed that bias of GEBV was not significantly affected by h2, validation cohort or trait. Similarly, validation cohort was not a significant source of variation for any of the GEBV quality metrics. Finally, we observed that the phenotypic differences were larger for higher accuracies. Conclusions Our estimates of h2 and GEBV quality metrics suggest a potential for accurate genomic selection of Australian Angus for feedlot performance and carcase traits. In addition, the Q1Q4 measure presented here easily translates into possible gains of genomic selection in terms of phenotypic differences and thus provides a more tangible output for commercial beef cattle producers. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00673-8.
Collapse
|
7
|
Investigating pig survival in different production phases using genomic models. J Anim Sci 2021; 99:skab217. [PMID: 34282454 PMCID: PMC8404463 DOI: 10.1093/jas/skab217] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 07/18/2021] [Indexed: 11/12/2022] Open
Abstract
Pig survival is an economically important trait with relevant social welfare implications, thus standing out as an important selection criterion for the current pig farming system. We aimed to estimate (co)variance components for survival in different production phases in a crossbred pig population as well as to investigate the benefit of including genomic information through single-step genomic best linear unbiased prediction (ssGBLUP) on the prediction accuracy of survival traits compared with results from traditional BLUP. Individual survival records on, at most, 64,894 crossbred piglets were evaluated under two multi-trait threshold models. The first model included farrowing, lactation, and combined postweaning survival, whereas the second model included nursery and finishing survival. Direct and maternal breeding values were estimated using BLUP and ssGBLUP methods. Furthermore, prediction accuracy, bias, and dispersion were accessed using the linear regression validation method. Direct heritability estimates for survival in all studied phases were low (from 0.02 to 0.08). Survival in preweaning phases (farrowing and lactation) was controlled by the dam and piglet additive genetic effects, although the maternal side was more important. Postweaning phases (nursery, finishing, and the combination of both) showed the same or higher direct heritabilities compared with preweaning phases. The genetic correlations between survival traits within preweaning and postweaning phases were favorable and strong, but correlations between preweaning and postweaning phases were moderate. The prediction accuracy of survival traits was low, although it increased by including genomic information through ssGBLUP compared with the prediction accuracy from BLUP. Direct and maternal breeding values were similarly accurate with BLUP, but direct breeding values benefited more from genomic information. Overall, a slight increase in bias was observed when genomic information was included, whereas dispersion of breeding values was greatly reduced. Combined postweaning survival presented higher direct heritability than in the preweaning phases and the highest prediction accuracy among all evaluated production phases, therefore standing out as a candidate trait for improving survival. Survival is a complex trait with low heritability; however, important genetic gains can still be obtained, especially under a genomic prediction framework.
Collapse
|
8
|
Genomic investigation of milk production in Italian buffalo. ITALIAN JOURNAL OF ANIMAL SCIENCE 2021. [DOI: 10.1080/1828051x.2021.1902404] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
9
|
Genomic Analysis, Progress and Future Perspectives in Dairy Cattle Selection: A Review. Animals (Basel) 2021; 11:599. [PMID: 33668747 PMCID: PMC7996307 DOI: 10.3390/ani11030599] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 12/16/2022] Open
Abstract
Genomics comprises a set of current and valuable technologies implemented as selection tools in dairy cattle commercial breeding programs. The intensive progeny testing for production and reproductive traits based on genomic breeding values (GEBVs) has been crucial to increasing dairy cattle productivity. The knowledge of key genes and haplotypes, including their regulation mechanisms, as markers for productivity traits, may improve the strategies on the present and future for dairy cattle selection. Genome-wide association studies (GWAS) such as quantitative trait loci (QTL), single nucleotide polymorphisms (SNPs), or single-step genomic best linear unbiased prediction (ssGBLUP) methods have already been included in global dairy programs for the estimation of marker-assisted selection-derived effects. The increase in genetic progress based on genomic predicting accuracy has also contributed to the understanding of genetic effects in dairy cattle offspring. However, the crossing within inbred-lines critically increased homozygosis with accumulated negative effects of inbreeding like a decline in reproductive performance. Thus, inaccurate-biased estimations based on empirical-conventional models of dairy production systems face an increased risk of providing suboptimal results derived from errors in the selection of candidates of high genetic merit-based just on low-heritability phenotypic traits. This extends the generation intervals and increases costs due to the significant reduction of genetic gains. The remarkable progress of genomic prediction increases the accurate selection of superior candidates. The scope of the present review is to summarize and discuss the advances and challenges of genomic tools for dairy cattle selection for optimizing breeding programs and controlling negative inbreeding depression effects on productivity and consequently, achieving economic-effective advances in food production efficiency. Particular attention is given to the potential genomic selection-derived results to facilitate precision management on modern dairy farms, including an overview of novel genome editing methodologies as perspectives toward the future.
Collapse
|
10
|
Genetic association among feeding behavior, feed efficiency, and growth traits in growing indicine cattle. J Anim Sci 2021; 98:5944080. [PMID: 33125460 DOI: 10.1093/jas/skaa350] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 10/23/2020] [Indexed: 12/31/2022] Open
Abstract
This study aimed to estimate genetic parameters, including genomic data, for feeding behavior, feed efficiency, and growth traits in Nellore cattle. The following feeding behavior traits were studied (861 animals with records): time spent at the feed bunk (TF), duration of one feeding event (FD), frequency of visits to the bunk (FF), feeding rate (FR), and dry matter intake (DMI) per visit (DMIv). The feed efficiency traits (1,543 animals with records) included residual feed intake (RFI), residual weight gain (RWG), and feed conversion (FC). The growth traits studied were average daily gain (ADG, n = 1,543 animals) and selection (postweaning) weight (WSel, n = 9,549 animals). The (co)variance components were estimated by the maximum restricted likelihood method, fitting animal models that did (single-step genomic best linear unbiased prediction) or did not include (best linear unbiased prediction) genomic information in two-trait analyses. The direct responses to selection were calculated for the feed efficiency traits, ADG, and WSel, as well as the correlated responses in feed efficiency and growth by direct selection for shorter TF. The estimated heritabilities were 0.51 ± 0.06, 0.35 ± 0.06, 0.27 ± 0.07, 0.34 ± 0.06, and 0.33 ± 0.06 for TF, FD, FF, FR, and DMIv, respectively. In general, TF and FD showed positive genetic correlations with all feed efficiency traits (RFI, RWG, and FC), ADG, DMI, and WSel. Additionally, TF showed high and positive genetic and phenotypic correlations with RFI (0.71 ± 0.10 and 0.46 ± 0.02, respectively) and DMI (0.56 ± 0.09 and 0.48 ± 0.03), and medium to weak genetic correlations with growth (0.32 ± 0.11 with ADG and 0.14 ± 0.09 with WSel). The results suggest that TF is a strong indicator trait of feed efficiency, which exhibits high heritability and a weak positive genetic correlation with growth. In a context of a selection index, the inclusion of TF to select animals for shorter TF may accelerate the genetic gain in feed efficiency by reducing RFI but with zero or slightly negative genetic gain in growth traits.
Collapse
|
11
|
Identification of QTL regions and candidate genes for growth and feed efficiency in broilers. Genet Sel Evol 2021; 53:13. [PMID: 33549052 PMCID: PMC7866652 DOI: 10.1186/s12711-021-00608-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 01/26/2021] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Feed accounts for about 70% of the total cost of poultry meat production. Residual feed intake (RFI) has become the preferred measure of feed efficiency because it is phenotypically independent of growth rate and body weight. In this study, our aim was to estimate genetic parameters and identify quantitative trait loci (QTL) for feed efficiency in 3314 purebred broilers using a genome-wide association study. Broilers were genotyped using a custom 55 K single nucleotide polymorphism (SNP) array. RESULTS Estimates of genomic heritability for seven growth and feed efficiency traits, including body weight at 28 days of age (BW28), BW42, average daily feed intake (ADFI), RFI, and RFI adjusted for weight of abdominal fat (RFIa), ranged from 0.12 to 0.26. Eleven genome-wide significant SNPs and 15 suggestively significant SNPs were detected, of which 19 clustered around two genomic regions. A region on chromosome 16 (2.34-2.66 Mb) was associated with both BW28 and BW42, and the most significant SNP in this region, AX_101003762, accounted for 7.6% of the genetic variance of BW28. The other region, on chromosome 1 (91.27-92.43 Mb) was associated with RFI and ADFI, and contains the NSUN3 and EPHA6 as candidate genes. The most significant SNP in this region, AX_172588157, accounted for 4.4% of the genetic variance of RFI. In addition, a genomic region containing the gene AGK on chromosome 1 was found to be associated with RFIa. The NSUN3 and AGK genes were found to be differentially expressed in breast muscle, thigh muscle, and abdominal fat between male broilers with high and low RFI. CONCLUSIONS We identified QTL regions for BW28 and BW42 (spanning 0.32 Mb) and RFI (spanning 1.16 Mb). The NSUN3, EPHA6, and AGK were identified as the most likely candidate genes for these QTL. These genes are involved in mitochondrial function and behavioral regulation. These results contribute to the identification of candidate genes and variants for growth and feed efficiency in poultry.
Collapse
|
12
|
Applying the Metafounders Approach for Genomic Evaluation in a Multibreed Beef Cattle Population. Front Genet 2021; 11:556399. [PMID: 33424914 PMCID: PMC7793833 DOI: 10.3389/fgene.2020.556399] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 10/29/2020] [Indexed: 11/23/2022] Open
Abstract
Pedigree information is incomplete by nature and commonly not well-established because many of the genetic ties are not known a priori or can be wrong. The genomic era brought new opportunities to assess relationships between individuals. However, when pedigree and genomic information are used simultaneously, which is the case of single-step genomic BLUP (ssGBLUP), defining the genetic base is still a challenge. One alternative to overcome this challenge is to use metafounders, which are pseudo-individuals that describe the genetic relationship between the base population individuals. The purpose of this study was to evaluate the impact of metafounders on the estimation of breeding values for tick resistance under ssGBLUP for a multibreed population composed by Hereford, Braford, and Zebu animals. Three different scenarios were studied: pedigree-based model (BLUP), ssGBLUP, and ssGBLUP with metafounders (ssGBLUPm). In ssGBLUPm, a total of four different metafounders based on breed of origin (i.e., Hereford, Braford, Zebu, and unknown) were included for the animals with missing parents. The relationship coefficient between metafounders was in average 0.54 (ranging from 0.34 to 0.96) suggesting an overlap between ancestor populations. The estimates of metafounder relationships indicate that Hereford and Zebu breeds have a possible common ancestral relationship. Inbreeding coefficients calculated following the metafounder approach had less negative values, suggesting that ancestral populations were large enough and that gametes inherited from the historical population were not identical. Variance components were estimated based on ssGBLUPm, ssGBLUP, and BLUP, but the values from ssGBLUPm were scaled to provide a fair comparison with estimates from the other two models. In general, additive, residual, and phenotypic variance components in the Hereford population were smaller than in Braford across different models. The addition of genomic information increased heritability for Hereford, possibly because of improved genetic relationships. As expected, genomic models had greater predictive ability, with an additional gain for ssGBLUPm over ssGBLUP. The increase in predictive ability was greater for Herefords. Our results show the potential of using metafounders to increase accuracy of GEBV, and therefore, the rate of genetic gain in beef cattle populations with partial levels of missing pedigree information.
Collapse
|
13
|
Impact of missing pedigrees in single-step genomic evaluation. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Context
A common problem in mixed model-based genetic evaluation schemes for livestock is that cohorts of animals differ systematically in mean genetic merit, for example, due to missing pedigree. This can be modelled by fitting genetic groups. Single-step genomic evaluation (ssGBLUP) combining information from genotyped and non-genotyped individuals has become routine, but little is known of the effects of unknown parents in this context.
Aims
To investigate the effects of missing pedigrees on accuracy and bias of predicted breeding values for ssGBLUP analyses.
Methods
A simulation study was used to examine alternative ways to account for genetic groups in ssGBLUP, for multi-generation data with strong selection and rapidly increasing numbers of genotyped animals in the most recent generations.
Key results
Results demonstrated that missing pedigrees can markedly impair predicted breeding values. With selection, alignment of genomic and pedigree relationship matrices is essential when fitting unknown parent groups (UPG). Genomic relationships are complete; that is, they ‘automatically’ reference the genomic base, which typically differs from the genetic base for pedigreed animals. This can lead to biased comparisons between genotyped and non-genotyped animals with unknown parents when the two categories of animals are assigned to the same UPG. Allocating genotyped individuals to a separate UPG across all generations for each strain or breed was shown to be a simple and effective way to reduce misalignment bias. In contrast, fitting metafounders modified pedigree-based relationships to account for ancestral genomic relationships and inbreeding rather than the genomic relationship matrix. Thus, no bias due to different types of animals assigned to the same metafounders was apparent. Overall, fitting metafounders yielded slightly higher correlations between true and predicted breeding values than did UPG models, which assume genetic groups to be unrelated.
Conclusions
Missing pedigrees are more problematic with ssGBLUP than for analyses considering pedigree-based relationships only. UPG models with separation of genotyped and non-genotyped individuals and analyses fitting metafounders yielded comparable predictions of breeding values in terms of accuracy and bias.
Implications
A previously unidentified incompatibility between alignment of founder populations and assignment of genotyped and non-genotyped animals to the same UPG has been reported. Implementation of the proposed strategy to reduce ‘double counting’ is straightforward and can improve results of ssGBLUP analyses.
Collapse
|