Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lourenco DAL, Tsuruta S, Fragomeni BO, Chen CY, Herring WO, Misztal I. Crossbreed evaluations in single-step genomic best linear unbiased predictor using adjusted realized relationship matrices. J Anim Sci 2016;94:909-19. [PMID: 27065253 DOI: 10.2527/jas.2015-9748] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Number

Cited by Other Article(s)

Ma H, Li H, Ge F, Zhao H, Zhu B, Zhang L, Gao H, Xu L, Li J, Wang Z. Improving Genomic Predictions in Multi-Breed Cattle Populations: A Comparative Analysis of BayesR and GBLUP Models. Genes (Basel) 2024;15:253. [PMID: 38397242 PMCID: PMC10887749 DOI: 10.3390/genes15020253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 02/09/2024] [Accepted: 02/16/2024] [Indexed: 02/25/2024] Open

Abstract

Numerous studies have shown that combining populations from similar or closely related genetic breeds improves the accuracy of genomic predictions (GP). Extensive experimentation with diverse Bayesian and genomic best linear unbiased prediction (GBLUP) models have been developed to explore multi-breed genomic selection (GS) in livestock, ultimately establishing them as successful approaches for predicting genomic estimated breeding value (GEBV). This study aimed to assess the effectiveness of using BayesR and GBLUP models with linkage disequilibrium (LD)-weighted genomic relationship matrices (GRMs) for genomic prediction in three different beef cattle breeds to identify the best approach for enhancing the accuracy of multi-breed genomic selection in beef cattle. Additionally, a comparison was conducted to evaluate the predictive precision of different marker densities and genetic correlations among the three breeds of beef cattle. The GRM between Yunling cattle (YL) and other breeds demonstrated modest affinity and highlighted a notable genetic concordance of 0.87 between Chinese Wagyu (WG) and Huaxi (HX) cattle. In the within-breed GS, BayesR demonstrated an advantage over GBLUP. The prediction accuracies for HX cattle using the BayesR model were 0.52 with BovineHD BeadChip data (HD) and 0.46 with whole-genome sequencing data (WGS). In comparison to the GBLUP model, the accuracy increased by 26.8% for HD data and 9.5% for WGS data. For WG and YL, BayesR doubled the within-breed prediction accuracy to 14.3% from 7.1%, outperforming GBLUP across both HD and WGS datasets. Moreover, analyzing multiple breeds using genomic selection showed that BayesR consistently outperformed GBLUP in terms of predictive accuracy, especially when using WGS. For instance, in a mixed reference population of HX and WG, BayesR achieved a significant accuracy of 0.53 using WGS for HX, which was a substantial enhancement over the accuracies obtained with GBLUP models. The research further highlights the benefit of including various breeds in the reference group, leading to enhanced accuracy in predictions and emphasizing the importance of comprehensive genomic selection methods. Our research findings indicate that BayesR exhibits superior performance compared to GBLUP in multi-breed genomic prediction accuracy, achieving a maximum improvement of 33.3%, especially in genetically diverse breeds. The improvement can be attributed to the effective utilization of higher single nucleotide polymorphism (SNP) marker density by BayesR, resulting in enhanced prediction accuracy. This evidence conclusively demonstrates the significant impact of BayesR on enhancing genomic predictions in diverse cattle populations, underscoring the crucial role of genetic relatedness in selection methodologies. In parallel, subsequent studies should focus on refining GRM and exploring alternative models for GP.

Collapse

Aldridge M, Vandenplas J, Duenk P, Henshall J, Hawken R, Calus M. Validation with single-step SNPBLUP shows that evaluations can continue using a single mean of genotyped individuals, even with multiple breeds. Genet Sel Evol 2023;55:19. [PMID: 36949392 PMCID: PMC10031914 DOI: 10.1186/s12711-023-00787-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 02/13/2023] [Indexed: 03/24/2023] Open

Abstract

BACKGROUND

In genomic prediction, it is common to centre the genotypes of single nucleotide polymorphisms based on the allele frequencies in the current population, rather than those in the base generation. The mean breeding value of non-genotyped animals is conditional on the mean performance of genotyped relatives, but can be corrected by fitting the mean performance of genotyped individuals as a fixed regression. The associated covariate vector has been referred to as a 'J-factor', which if fitted as a fixed effect can improve the accuracy and dispersion bias of sire genomic estimated breeding values (GEBV). To date, this has only been performed on populations with a single breed. Here, we investigated whether there was any benefit in fitting a separate J-factor for each breed in a three-way crossbred population, and in using pedigree-based expected or genome-based estimated breed fractions to define the J-factors.

RESULTS

For body weight at 7 days, dispersion bias decreased when fitting multiple J-factors, but only with a low proportion of genotyped individuals with selective genotyping. On average, the mean regression coefficients of validation records on those of GEBV increased with one J-factor compared to none, and further increased with multiple J-factors. However, for body weight at 35 days this was not observed. The accuracy of GEBV remained unchanged regardless of the J-factor method used. Differences between the J-factor methods were limited with correlations approaching 1 for the estimated covariate vector, the estimated coefficients of the regression on the J-factors, and the GEBV.

CONCLUSIONS

Based on our results and in the particular design analysed here, i.e. all the animals with phenotype are of the same type of crossbreds, fitting a single J-factor should be sufficient, to reduce dispersion bias. Fitting multiple J-factors may reduce dispersion bias further but this depends on the trait and genotyping rate. For the crossbred population analysed, fitting multiple J-factors has no adverse consequences and if this is done, it does not matter if the breed fractions used are based on the pedigree-expectation or the genomic estimates. Finally, when GEBV are estimated from crossbred data, any observed bias can potentially be reduced by including a straightforward regression on actual breed proportions.

Collapse

Mei Q, Liu H, Zhao S, Xiang T, Christensen OF. Genomic evaluation for two-way crossbred performance in cattle. Genet Sel Evol 2023;55:17. [PMID: 36932324 PMCID: PMC10022181 DOI: 10.1186/s12711-023-00792-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 03/08/2023] [Indexed: 03/19/2023] Open

Abstract

BACKGROUND

Dairy cattle production systems are mostly based on purebreds, but recently the use of crossbreeding has received increased interest. For genetic evaluations including crossbreds, several methods based on single-step genomic best linear unbiased prediction (ssGBLUP) have been proposed, including metafounder ssGBLUP (MF-ssGBLUP) and breed-specific ssGBLUP (BS-ssGBLUP). Ideally, models that account for breed effects should perform better than simple models, but knowledge on the performance of these methods is lacking for two-way crossbred cattle. In addition, the differences in the estimates of genetic parameters (such as the genetic variance component and heritability) between these methods have rarely been investigated. Therefore, the aims of this study were to (1) compare the estimates of genetic parameters for average daily gain (ADG) and feed conversion ratio (FCR) between these methods; and (2) evaluate the impact of these methods on the predictive ability for crossbred performance.

METHODS

Bivariate models using standard ssGBLUP, MF-ssGBLUP and BS-ssGBLUP for the genetic evaluation of ADG and FCR were investigated. To measure the predictive ability of these three methods, we estimated four estimators, bias, dispersion, population accuracy and ratio of population accuracies, using the linear regression (LR) method.

RESULTS

The results show that, for both ADG and FCR, the heritabilities were low with the three methods. For FCR, the differences in the estimated genetic parameters were small between the three methods, while for ADG, those estimated with BS-ssGBLUP deviated largely from those estimated with the other two methods. Bias and dispersion were similar across the three methods. Population accuracies for both ADG and FCR were always higher with MF-ssGBLUP than with ssGBLUP, while with BS-ssGBLUP the population accuracy was highest for FCR and lowest for ADG.

CONCLUSIONS

Our results indicate that in the genetic evaluation for crossbred performance in a two-way crossbred cattle production system, the predictive ability of MF-ssGBLUP and BS-ssGBLUP is greater than that of ssGBLUP, when the estimated variance components are consistent across the three methods. Compared with BS-ssGBLUP, MF-ssGBLUP is more robust in its superiority over ssGBLUP.

Collapse

Leite NG, Chen CY, Herring WO, Holl J, Tsuruta S, Lourenco D. Leveraging low-density crossbred genotypes to offset crossbred phenotypes and their impact on purebred predictions. J Anim Sci 2022;100:6780296. [PMID: 36309902 PMCID: PMC9733505 DOI: 10.1093/jas/skac359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 10/27/2022] [Indexed: 12/15/2022] Open

Abstract

The objectives of this study were to 1) investigate the predictability and bias of genomic breeding values (GEBV) of purebred (PB) sires for CB performance when CB genotypes imputed from a low-density panel are available, 2) assess if the availability of those CB genotypes can be used to partially offset CB phenotypic recording, and 3) investigate the impact of including imputed CB genotypes in genomic analyses when using the algorithm for proven and young (APY). Two pig populations with up to 207,375 PB and 32,893 CB phenotypic records per trait and 138,026 PB and 32,893 CB genotypes were evaluated. PB sires were genotyped for a 50K panel, whereas CB animals were genotyped for a low-density panel of 600 SNP and imputed to 50K. The predictability and bias of GEBV of PB sires for backfat thickness (BFX) and average daily gain recorded (ADGX) recorded on CB animals were assessed when CB genotypes were available or not in the analyses. In the first set of analyses, direct inverses of the genomic relationship matrix (G) were used with phenotypic datasets truncated at different time points. In the next step, we evaluated the APY algorithm with core compositions differing in the CB genotype contributions. After that, the performance of core compositions was compared with an analysis using a random PB core from a purely PB genomic set. The number of rounds to convergence was recorded for all APY analyses. With the direct inverse of G in the first set of analyses, adding CB genotypes imputed from a low-density panel (600 SNP) did not improve predictability or reduce the bias of PB sires' GEBV for CB performance, even for sires with fewer CB progeny phenotypes in the analysis. That indicates that the inclusion of CB genotypes primarily used for inferring pedigree in commercial farms is of no benefit to offset CB phenotyping. When CB genotypes were incorporated into APY, a random core composition or a core with no CB genotypes reduced bias and the number of rounds to convergence but did not affect predictability. Still, a PB random core composition from a genomic set with only PB genotypes resulted in the highest predictability and the smallest number of rounds to convergence, although bias increased. Genotyping CB individuals for low-density panels is a valuable identification tool for linking CB phenotypes to pedigree; however, the inclusion of those CB genotypes imputed from a low-density panel (600 SNP) might not benefit genomic predictions for PB individuals or offset CB phenotyping for the evaluated CB performance traits. Further studies will help understand the usefulness of those imputed CB genotypes for traits with lower PB-CB genetic correlations and traits not recorded in the PB environment, such as mortality and disease traits.

Collapse

Lozada-Soto EA, Lourenco D, Maltecca C, Fix J, Schwab C, Shull C, Tiezzi F. Genotyping and phenotyping strategies for genetic improvement of meat quality and carcass composition in swine. Genet Sel Evol 2022;54:42. [PMID: 35672700 PMCID: PMC9171933 DOI: 10.1186/s12711-022-00736-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 05/25/2022] [Indexed: 12/04/2022] Open

Cesarani A, Lourenco D, Tsuruta S, Legarra A, Nicolazzi E, VanRaden P, Misztal I. Multibreed genomic evaluation for production traits of dairy cattle in the United States using single-step genomic best linear unbiased predictor. J Dairy Sci 2022;105:5141-5152. [DOI: 10.3168/jds.2021-21505] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 01/27/2022] [Indexed: 01/01/2023]

Misztal I, Steyn Y, Lourenco D. Genomic evaluation with multibreed and crossbred data. JDS Communications 2022;3:156-159. [PMID: 36339739 PMCID: PMC9623721 DOI: 10.3168/jdsc.2021-0177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 11/21/2021] [Indexed: 11/19/2022]

Guillenea A, Su G, Lund MS, Karaman E. Genomic prediction in Nordic Red dairy cattle considering breed origin of alleles. J Dairy Sci 2022;105:2426-2438. [PMID: 35033341 DOI: 10.3168/jds.2021-21173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 11/23/2021] [Indexed: 01/02/2023]

Abstract

This study investigated the reliability of genomic prediction (GP) using breed origin of alleles (BOA) approach in the Nordic Red (RDC) population, which has an admixed population structure. The RDC population consists of animals with varying degrees of genetic materials from the Danish Red (RDM), Swedish Red (SRB), Finnish Ayrshire (FAY), and Holstein (HOL) because bulls have been used across the breeds. The BOA approach was tested using 39,550 RDC animals in the reference population and 11,786 in the validation population. Deregressed proofs (DRP) of milk, fat and protein were used as response variable for GP. Direct genomic breeding values (DGV) for animals in the validation population were calculated with (BOA model) or without (joint model) considering breed origin of alleles. The joint model assumed homogeneous marker effects and a single set of marker effects were estimated, whereas BOA model assumed heterogeneous marker effects, and different sets of marker effects were estimated across the breeds. For the BOA approach, we tested scenarios assuming both correlated (BOA_cor) and uncorrelated (BOA_uncor) marker effects between the breeds. Additionally, we investigated GP using a standard Illumina 50K chip and including SNP selected from imputed whole-genome sequencing (50K+WGS). We also studied the effect of estimating (co)variances for genome regions of different sizes to exploit the information of the genome regions contributing to the (co)variance between the breeds. Region sizes were set as 1 SNP, a group of 30 or 100 adjacent SNP, or the whole genome. Reliability of DGV was measured as squared correlations between DGV and DRP divided by the reliability of DRP. Across the 3 traits, in general, RS30 and RS100 SNP yielded the highest reliabilities. Including WGS SNP improved reliabilities in almost all scenarios (0.297 on average for 50K and 0.307 on average for 50K+WGS). The BOA_uncor (0.233 on average) was inferior to the joint model (0.339 on average), but the reliabilities obtained using BOA_cor (0.334 on average) in most cases were not significantly different from those obtained using the joint model. The results indicate that both including additional whole-genome sequencing SNP and dividing the genome into fixed regions improve GP in the RDC. The BOA models have the potential to increase the reliability of GP, but the benefit is limited in populations with a high exchange of genetic material for a long time, as is the case for RDC.

Collapse

Buaban S, Prempree S, Sumreddee P, Duangjinda M, Masuda Y. Genomic prediction of milk-production traits and somatic cell score using single-step genomic best linear unbiased predictor with random regression test-day model in Thai dairy cattle. J Dairy Sci 2021;104:12713-12723. [PMID: 34538484 DOI: 10.3168/jds.2021-20263] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 08/04/2021] [Indexed: 12/15/2022]

Abstract

Cow genotypes are expected to improve the accuracy of genomic estimated breeding values (GEBV) for young bulls in relatively small populations such as Thai Holstein-Friesian crossbred dairy cattle in Thailand. The objective of this study was to investigate the effect of cow genotypes on the predictive ability and individual accuracies of GEBV for young dairy bulls in Thailand. Test-day data included milk yield (n = 170,666), milk component traits (fat yield, protein yield, total solids yield, fat percentage, protein percentage, and total solids percentage; n = 160,526), and somatic cell score (n = 82,378) from 23,201, 82,378, and 13,737 (for milk yield, milk component traits, and SCS, respectively) cows calving between 1993 and 2017, respectively. Pedigree information included 51,128; 48,834; and 32,743 animals for milk yield, milk component traits, and somatic cell score, respectively. Additionally, 876, 868, and 632 pedigreed animals (for milk yield, milk component traits, and SCS, respectively) were genotyped (152 bulls and 724 cows), respectively, using Illumina Bovine SNP50 BeadChip. We cut off the data in the last 6 yr, and the validation animals were defined as genotyped bulls with no daughters in the truncated set. We calculated GEBV using a single-step random regression test-day model (SS-RR-TDM), in comparison with estimated breed value (EBV) based on the pedigree-based model used as the official method in Thailand (RR-TDM). Individual accuracies of GEBV were obtained by inverting the coefficient matrix of the mixed model equations, whereas validation accuracies were measured by the Pearson correlation between deregressed EBV from the full data set and (G)EBV predicted with the reduced data set. When only bull genotypes were used, on average, SS-RR-TDM increased individual accuracies by 0.22 and validation accuracies by 0.07, compared with RR-TDM. With cow genotypes, the additional increase was 0.02 for individual accuracies and 0.06 for validation accuracies. The inflation of GEBV tended to be reduced using cow genotypes. Genomic evaluation by SS-RR-TDM is feasible to select young bulls for the longitudinal traits in Thai dairy cattle, and the accuracy of selection is expected to be increased with more genotypes. Genomic selection using the SS-RR-TDM should be implemented in the routine genetic evaluation of the Thai dairy cattle population. The genetic evaluation should consider including genotypes of both sires and cows.

Collapse

Kluska S, Masuda Y, Ferraz JBS, Tsuruta S, Eler JP, Baldi F, Lourenco D. Metafounders May Reduce Bias in Composite Cattle Genomic Predictions. Front Genet 2021;12:678587. [PMID: 34490031 PMCID: PMC8417888 DOI: 10.3389/fgene.2021.678587] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 07/15/2021] [Indexed: 11/13/2022] Open

Abstract

Metafounders are pseudo-individuals that act as proxies for animals in base populations. When metafounders are used, individuals from different breeds can be related through pedigree, improving the compatibility between genomic and pedigree relationships. The aim of this study was to investigate the use of metafounders and unknown parent groups (UPGs) for the genomic evaluation of a composite beef cattle population. Phenotypes were available for scrotal circumference at 14 months of age (SC14), post weaning gain (PWG), weaning weight (WW), and birth weight (BW). The pedigree included 680,551 animals, of which 1,899 were genotyped for or imputed to around 30,000 single-nucleotide polymorphisms (SNPs). Evaluations were performed based on pedigree (BLUP), pedigree with UPGs (BLUP_UPG), pedigree with metafounders (BLUP_MF), single-step genomic BLUP (ssGBLUP), ssGBLUP with UPGs for genomic and pedigree relationship matrices (ssGBLUP_UPG) or only for the pedigree relationship matrix (ssGBLUP_UPGA), and ssGBLUP with metafounders (ssGBLUP_MF). Each evaluation considered either four or 10 groups that were assigned based on breed of founders and intermediate crosses. To evaluate model performance, we used a validation method based on linear regression statistics to obtain accuracy, stability, dispersion, and bias of (genomic) estimated breeding value [(G)EBV]. Overall, relationships within and among metafounders were stronger in the scenario with 10 metafounders. Accuracy was greater for models with genomic information than for BLUP. Also, the stability of (G)EBVs was greater when genomic information was taken into account. Overall, pedigree-based methods showed lower inflation/deflation (regression coefficients close to 1.0) for SC14, WWM, and BWD traits. The level of inflation/deflation for genomic models was small and trait-dependent. Compared with regular ssGBLUP, ssGBLUP_MF4 displayed regression coefficient closer to one SC14, PWG, WWM, and BWD. Genomic models with metafounders seemed to be slightly more stable than models with UPGs based on higher similarity of results with different numbers of groups. Further, metafounders can help to reduce bias in genomic evaluations of composite beef cattle populations without reducing the stability of GEBVs.

Collapse

Abdollahi-Arpanahi R, Lourenco D, Misztal I. Detecting effective starting point of genomic selection by divergent trends from best linear unbiased prediction and single-step genomic best linear unbiased prediction in pigs, beef cattle, and broilers. J Anim Sci 2021;99:6352407. [PMID: 34390341 PMCID: PMC8420679 DOI: 10.1093/jas/skab243] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 08/12/2021] [Indexed: 12/12/2022] Open

Abstract

Genomic selection has been adopted nationally and internationally in different livestock and plant species. However, understanding whether genomic selection has been effective or not is an essential question for both industry and academia. Once genomic evaluation started being used, estimation of breeding values with pedigree best linear unbiased prediction (BLUP) became biased because this method does not consider selection using genomic information. Hence, the effective starting point of genomic selection can be detected in two possible ways including the divergence of genetic trends and Realized Mendelian sampling (RMS) trends obtained with BLUP and single-step genomic BLUP (ssGBLUP). This study aimed to find the start date of genomic selection for a set of economically important traits in three livestock species by comparing trends obtained using BLUP and ssGBLUP. Three datasets were used for this purpose: 1) a pig dataset with 117k genotypes and 1.3M animals in pedigree, 2) an Angus cattle dataset consisted of ~842k genotypes and 11.5M animals in pedigree, and 3) a purebred broiler chicken dataset included ~154k genotypes and 1.3M birds in pedigree were used. The genetic trends for pigs diverged for the genotyped animals born in 2014 for average daily gain (ADG) and backfat (BF). In beef cattle, the trends started diverging in 2009 for weaning weight (WW) and in 2016 for postweaning gain (PWG), with little divergence for birth weight (BTW). In broiler chickens, the genetic trends estimated by ssGBLUP and BLUP diverged at breeding cycle 6 for two out of the three production traits. The RMS trends for the genotyped pigs diverged for animals born in 2014, more for ADG than for BF. In beef cattle, the RMS trends started diverging in 2009 for WW and in 2016 for PWG, with a trivial trend for BTW. In broiler chickens, the RMS trends from ssGBLUP and BLUP diverged strongly for two production traits at breeding cycle 6, with a slight divergence for another trait. Divergence of the genetic trends from ssGBLUP and BLUP indicates the onset of the genomic selection. The presence of trends for RMS indicates selective genotyping, with or without the genomic selection. The onset of genomic selection and genotyping strategies agrees with industry practices across the three species. In summary, the effective start of genomic selection can be detected by the divergence between genetic and RMS trends from BLUP and ssGBLUP.

Collapse

Steyn Y, Gonzalez-Pena D, Bernal Rubio YL, Vukasinovic N, DeNise SK, Lourenco DAL, Misztal I. Indirect genomic predictions for milk yield in crossbred Holstein-Jersey dairy cattle. J Dairy Sci 2021;104:5728-5737. [PMID: 33685678 DOI: 10.3168/jds.2020-19451] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 01/05/2021] [Indexed: 11/19/2022]

See GM, Mote BE, Spangler ML. Selective genotyping and phenotypic data inclusion strategies of crossbred progeny for combined crossbred and purebred selection in swine breeding. J Anim Sci 2021;99:6131744. [PMID: 33560334 DOI: 10.1093/jas/skab041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 02/04/2021] [Indexed: 11/14/2022] Open

Abstract

Inclusion of crossbred (CB) data into traditionally purebred (PB) genetic evaluations has been shown to increase the response in CB performance. Currently, it is unrealistic to collect data on all CB animals in swine production systems, thus, a subset of CB animals must be selected to contribute genomic/phenotypic information. The aim of this study was to evaluate selective genotyping strategies in a simulated 3-way swine crossbreeding scheme. The swine crossbreeding scheme was simulated and produced 3-way CB animals for 6 generations with 3 distinct PB breeds each with 25 and 175 mating males and females, respectively. F1 crosses (400 mating females) produced 4,000 terminal CB progeny which were subjected to selective genotyping. The genome consisted of 18 chromosomes with 1,800 QTL and 72k SNP markers. Selection was performed using estimated breeding values (EBV) for CB performance. It was assumed that both PB and CB performance was moderately heritable (h2=0.4). Several scenarios altering the genetic correlation between PB and CB performance (rpc=0.1, 0.3, 0.5, 0.7,or 0.9) were considered. CB animals were chosen based on phenotypes to select 200, 400, or 800 CB animals to genotype per generation. Selection strategies included: (1) Random: random selection, (2) Top: highest phenotype, (3) Bottom: lowest phenotype, (4) Extreme: half highest and half lowest phenotypes, and (5) Middle: average phenotype. Each selective genotyping strategy, except for Random, was considered by selecting animals in half-sib (HS) or full-sib (FS) families. The number of PB animals with genotypes and phenotypes each generation was fixed at 1,680. Each unique genotyping strategy and rpc scenario was replicated 10 times. Selection of CB animals based on the Extreme strategy resulted in the highest (P < 0.05) rates of genetic gain in CB performance (ΔG) when rpc<0.9. For highly correlated traits (rpc=0.9) selective genotyping did not impact (P > 0.05) ΔG. No differences (P > 0.05) were observed in ΔG between top, bottom, or middle when rpc>0.1. Higher correlations between true breeding values (TBV) and EBV were observed using Extreme when rpc<0.9. In general, family sampling method did not impact ΔG or the correlation between TBV and EBV. Overall, the Extreme genotyping strategy produced the greatest genetic gain and the highest correlations between TBV and EBV, suggesting that 2-tailed sampling of CB animals is the most informative when CB performance is the selection goal.

Collapse

Warburton CL, Costilla R, Engle BN, Corbet NJ, Allen JM, Fordyce G, McGowan MR, Burns BM, Hayes BJ. Breed-adjusted genomic relationship matrices as a method to account for population stratification in multibreed populations of tropically adapted beef heifers. Anim Prod Sci 2021. [DOI: 10.1071/an21057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abstract Context Beef cattle breeds in Australia can broadly be broken up into two subspecies, namely, Bos indicus and Bos taurus. Due to the time since divergence between the subspecies, it is likely that mutations affecting quantitative traits have developed independently in each. Aims We hypothesise that this will affect the prediction accuracy of genomic selection of admixed and composite populations that include both ancestral subspecies. Our study investigates methods to quantify population stratification in a multibreed population of tropically adapted heifers, with the aim of improving prediction accuracy of genomic selection for reproductive maturity score. Methods We used genotypes and reproductive maturity phenotypes from 3695 tropically adapted heifers from three purebred populations, namely, Brahman, Santa Gertrudis and Droughtmaster. Two of these breeds, Santa Gertrudis and Droughtmaster, are stabilised composites of varying B. indicus × B. taurus ancestry, and the third breed, Brahman, has predominately B. indicus ancestry. Genotypes were imputed to three marker-panel densities and population stratification was accounted for in genomic relationship matrices by using breed-specific allele frequencies when calculating the genomic relationships among animals. Prediction accuracy and bias were determined using a five-fold cross validation of randomly selected multibreed cohorts. Key Results Our results showed that the use of breed-adjusted genomic relationship matrices did not improve either prediction accuracy or bias for a lowly heritable trait such as reproductive maturity score. However, using breed-adjusted genomic relationship matrices allowed the capture of a higher proportion of additive genetic effects when estimating variance components. Conclusions These findings suggest that, despite seeing no improvement in prediction accuracy, it may still be beneficial to use breed-adjusted genomic relationship matrices in multibreed populations to improve the estimation of variance components. Implications As such, genomic evaluations using breed-adjusted genomic relationship matrices may be beneficial in multibreed populations. Collapse

See GM, Mote BE, Spangler ML. Impact of inclusion rates of crossbred phenotypes and genotypes in nucleus selection programs. J Anim Sci 2020;98:5979488. [PMID: 33180915 DOI: 10.1093/jas/skaa360] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Accepted: 11/05/2020] [Indexed: 01/19/2023] Open

Abstract

Numerous methods have been suggested to incorporate crossbred (CB) phenotypes and genotypes into swine selection programs, yet little research has focused on the implicit trade-off decisions between generating data at the nucleus or commercial level. The aim of this study was to investigate the impact of altering the proportion of purebred (PB) and CB phenotypes and genotypes in genetic evaluations on the response to selection of CB performance. Assuming CB and PB performance with moderate heritabilities (h2=0.4), a three-breed swine crossbreeding scheme was simulated and selection was practiced for six generations, where the goal was to increase CB performance. Phenotypes, genotypes, and pedigrees for three PB breeds (25 and 175 mating males and females for each breed, respectively), F1 crosses (400 mating females), and terminal cross progeny (2,500) were simulated. The genome consisted of 18 chromosomes with 1,800 quantitative trait loci and 72k single nucleotide polymorphism (SNP) markers. Selection was performed in PB breeds using estimated breeding value for each phenotyping/genotyping strategy. Strategies investigated were: 1) increasing the proportion of CB with genotypes, phenotypes, and sire pedigree relationships, 2) decreasing the proportion of PB phenotypes and genotypes, and 3) altering the genetic correlation between PB and CB performance (rpc). Each unique rpc scenario and data collection strategy was replicated 10 times. Results showed that including CB data improved the CB performance regardless of rpc or data collection strategy compared with when no CB data were included. Compared with using only PB information, including 10% of CB progeny per generation with sire pedigrees and phenotypes increased the response in CB phenotype by 134%, 55%, 33%, 23%, and 21% when rpc was 0.1, 0.3, 0.5, 0.7, and 0.9, respectively. When the same 10% of CB progeny were also genotyped, CB performance increased by 243%, 54%, 38%, 23%, and 20% when the rpc was 0.1, 0.3, 0.5, 0.7, and 0.9, respectively, compared with when no CB data were utilized. Minimal change was observed in the average CB phenotype when PB phenotypes were included or proportionally removed when CB were genotyped. Removal of both PB phenotypes and genotypes when CB were genotyped greatly reduced the response in CB performance. In practice, the optimal inclusion rate of CB and PB data depends upon the genetic correlation between CB and PB animals and the expense of additional CB data collection compared with the economic benefit associated with increased CB performance.

Collapse

Wientjes YCJ, Bijma P, Calus MPL. Optimizing genomic reference populations to improve crossbred performance. Genet Sel Evol 2020;52:65. [PMID: 33158416 PMCID: PMC7648379 DOI: 10.1186/s12711-020-00573-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 09/18/2020] [Indexed: 11/10/2022] Open

Abstract

Background

In pig and poultry breeding, the objective is to improve the performance of crossbred production animals, while selection takes place in the purebred parent lines. One way to achieve this is to use genomic prediction with a crossbred reference population. A crossbred reference population benefits from expressing the breeding goal trait but suffers from a lower genetic relatedness with the purebred selection candidates than a purebred reference population. Our aim was to investigate the benefit of using a crossbred reference population for genomic prediction of crossbred performance for: (1) different levels of relatedness between the crossbred reference population and purebred selection candidates, (2) different levels of the purebred-crossbred correlation, and (3) different reference population sizes. We simulated a crossbred breeding program with 0, 1 or 2 multiplication steps to generate the crossbreds, and compared the accuracy of genomic prediction of crossbred performance in one generation using either a purebred or a crossbred reference population. For each scenario, we investigated the empirical accuracy based on simulation and the predicted accuracy based on the estimated effective number of independent chromosome segments between the reference animals and selection candidates.

Results

When the purebred-crossbred correlation was 0.75, the accuracy was highest for a two-way crossbred reference population but similar for purebred and four-way crossbred reference populations, for all reference population sizes. When the purebred-crossbred correlation was 0.5, a purebred reference population always resulted in the lowest accuracy. Among the different crossbred reference populations, the accuracy was slightly lower when more multiplication steps were used to create the crossbreds. In general, the benefit of crossbred reference populations increased when the size of the reference population increased. All predicted accuracies overestimated their corresponding empirical accuracies, but the different scenarios were ranked accurately when the reference population was large.

Conclusions

The benefit of a crossbred reference population becomes larger when the crossbred population is more related to the purebred selection candidates, when the purebred-crossbred correlation is lower, and when the reference population is larger. The purebred-crossbred correlation and reference population size interact with each other with respect to their impact on the accuracy of genomic estimated breeding values.

Collapse

Lourenco D, Legarra A, Tsuruta S, Masuda Y, Aguilar I, Misztal I. Single-Step Genomic Evaluations from Theory to Practice: Using SNP Chips and Sequence Data in BLUPF90. Genes (Basel) 2020;11:E790. [PMID: 32674271 PMCID: PMC7397237 DOI: 10.3390/genes11070790] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/03/2020] [Accepted: 07/06/2020] [Indexed: 11/16/2022] Open

Calus MPL, Vandenplas J, Hulsegge I, Borg R, Henshall JM, Hawken R. Assessment of sire contribution and breed-of-origin of alleles in a three-way crossbred broiler dataset. Poult Sci 2020;98:6270-6280. [PMID: 31393589 PMCID: PMC6870559 DOI: 10.3382/ps/pez458] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 07/28/2019] [Indexed: 11/30/2022] Open

Abstract

Broiler breeding programs rely on crossbreeding. With genomic selection, widespread use of crossbred performance in breeding programs comes within reach. Commercial crossbreds, however, may have unknown pedigrees and their genomes may include DNA from 2 to 4 different breeds. Our aim was, for a broiler dataset with a limited number of sires having both purebred and crossbred offspring generated using natural mating, to rapidly derive parentage, assess the distribution of the sire contribution to the offspring generation, and to assess breed-of-origin of alleles in crossbreds. The dataset contained genotypes for 56,075 SNPs for 5,882 purebred and 10,943 3-way crossbred offspring generated by natural mating of 164 purebred sires to 1,016 purebred and 1,386 F1 crossbred hens. Using our algorithm FindParents, joint parentage derivation for the offspring and parent generations required only 1 m 29 s to retrieve parentage for 20,253 animals considering 4,504 possible parents. FindParents was similarly accurate as a maximum likelihood based method, apart from situations where settings of FindParents did not match the genotyping error rate in the data. Numbers of offspring per sire had a very skewed distribution, ranging from 1 to 270 crossbreds and 1 to 154 purebreds. Derivation of breed-of-origin of alleles relied on phasing all genotypes, including 8,205, 372, and 720 animals from the 3 pure lines involved, and allocating haplotypes in the crossbreds to purebred lines based on observed frequencies in the purebred lines. Breed-of-origin could be derived for 96.94% of the alleles of the 1,386 F1 crossbred hens and for 91.88% of the alleles of the 10,943 3-way crossbred offspring, of which 49.49% to the sire line. The achieved percentage of assignment to the sire line was sufficient to proceed with subsequent analyses requiring only the breed-of-origin of the paternal alleles to be known. Although required number of animals may be population dependent, to increase the total percentage of assigned alleles, it seems advisable to use at least approx. 1,000 genotyped purebred animals for each of the lines involved.

Collapse

Alvarenga AB, Veroneze R, Oliveira HR, Marques DBD, Lopes PS, Silva FF, Brito LF. Comparing Alternative Single-Step GBLUP Approaches and Training Population Designs for Genomic Evaluation of Crossbred Animals. Front Genet 2020;11:263. [PMID: 32328083 PMCID: PMC7162606 DOI: 10.3389/fgene.2020.00263] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 03/05/2020] [Indexed: 02/06/2023] Open

Abstract

As crossbreeding is extensively used in some livestock species, we aimed to evaluate the performance of single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) methods to predict Genomic Estimated Breeding Values (GEBVs) of crossbred animals. Different training population scenarios were evaluated: (SC1) ssGBLUP based on a single-trait model considering purebred and crossbred animals in a joint training population; (SC2) ssGBLUP based on a multiple-trait model to enable considering phenotypes recorded in purebred and crossbred training animals as different traits; (SC3) WssGBLUP based on a single-trait model considering purebred and crossbred animals jointly in the training population (both populations were used for SNP weights' estimation); (SC4) WssGBLUP based on a single-trait model considering only purebred animals in the training population (crossbred population only used for SNP weights' estimation); (SC5) WssGBLUP based on a single-trait model and the training population characterized by purebred animals (purebred population used for SNP weights' estimation). A complex trait was simulated assuming alternative genetic architectures. Different scaling factors to blend the inverse of the genomic (G -1) and pedigree (A 22 - 1 ) relationship matrices were also tested. The predictive performance of each scenario was evaluated based on the validation accuracy and regression coefficient. The genetic correlations across simulated populations in the different scenarios ranged from moderate to high (0.71-0.99). The scenario mimicking a completely polygenic trait (h Q T L 2 = 0) yielded the lowest validation accuracy (0.12; for SC3 and SC4). The simulated scenarios assuming 4,500 QTLs affecting the trait andh Q T L 2 = h 2 resulted in the greatest GEBV accuracies (0.47; for SC1 and SC2). The regression coefficients ranged from 0.28 (for SC3 assuming polygenic effect) to 1.27 (for SC2 considering 4,500 QTLs). In general, SC3 and SC5 resulted in inflated GEBVs, whereas other scenarios yielded deflated GEBVs. The scaling factors used to combine G -1 andA 22 - 1 had a small influence on the validation accuracies, but a greater effect on the regression coefficients. Due to the complexity of multiple-trait models and WssGBLUP analyses, and a similar predictive performance across the methods evaluated, SC1 is recommended for genomic evaluation in crossbred populations with similar genetic structures [moderate-to-high (0.71-0.99) genetic correlations between purebred and crossbred populations].

Collapse

Misztal I, Lourenco D, Legarra A. Current status of genomic evaluation. J Anim Sci 2020;98:skaa101. [PMID: 32267923 PMCID: PMC7183352 DOI: 10.1093/jas/skaa101] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/07/2020] [Indexed: 12/14/2022] Open

Abstract

Early application of genomic selection relied on SNP estimation with phenotypes or de-regressed proofs (DRP). Chips of 50k SNP seemed sufficient for an accurate estimation of SNP effects. Genomic estimated breeding values (GEBV) were composed of an index with parent average, direct genomic value, and deduction of a parental index to eliminate double counting. Use of SNP selection or weighting increased accuracy with small data sets but had minimal to no impact with large data sets. Efforts to include potentially causative SNP derived from sequence data or high-density chips showed limited or no gain in accuracy. After the implementation of genomic selection, EBV by BLUP became biased because of genomic preselection and DRP computed based on EBV required adjustments, and the creation of DRP for females is hard and subject to double counting. Genomic selection was greatly simplified by single-step genomic BLUP (ssGBLUP). This method based on combining genomic and pedigree relationships automatically creates an index with all sources of information, can use any combination of male and female genotypes, and accounts for preselection. To avoid biases, especially under strong selection, ssGBLUP requires that pedigree and genomic relationships are compatible. Because the inversion of the genomic relationship matrix (G) becomes costly with more than 100k genotyped animals, large data computations in ssGBLUP were solved by exploiting limited dimensionality of genomic data due to limited effective population size. With such dimensionality ranging from 4k in chickens to about 15k in cattle, the inverse of G can be created directly (e.g., by the algorithm for proven and young) at a linear cost. Due to its simplicity and accuracy, ssGBLUP is routinely used for genomic selection by the major chicken, pig, and beef industries. Single step can be used to derive SNP effects for indirect prediction and for genome-wide association studies, including computations of the P-values. Alternative single-step formulations exist that use SNP effects for genotyped or for all animals. Although genomics is the new standard in breeding and genetics, there are still some problems that need to be solved. This involves new validation procedures that are unaffected by selection, parameter estimation that accounts for all the genomic data used in selection, and strategies to address reduction in genetic variances after genomic selection was implemented.

Collapse

Steyn Y, Lourenco DAL, Misztal I. Genomic predictions in purebreds with a multibreed genomic relationship matrix1. J Anim Sci 2020;97:4418-4427. [PMID: 31539424 DOI: 10.1093/jas/skz296] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 09/10/2019] [Indexed: 11/14/2022] Open

Abstract

Combining breeds in a multibreed evaluation can have a negative impact on prediction accuracy, especially if single nucleotide polymorphism (SNP) effects differ among breeds. The aim of this study was to evaluate the use of a multibreed genomic relationship matrix (G), where SNP effects are considered to be unique to each breed, that is, nonshared. This multibreed G was created by treating SNP of different breeds as if they were on nonoverlapping positions on the chromosome, although, in reality, they were not. This simple setup may avoid spurious Identity by state (IBS) relationships between breeds and automatically considers breed-specific allele frequencies. This scenario was contrasted to a regular multibreed evaluation where all SNPs were shared, that is, the same position, and to single-breed evaluations. Different SNP densities (9k and 45k) and different effective population sizes (Ne) were tested. Five breeds mimicking recent beef cattle populations that diverged from the same historical population were simulated using different selection criteria. It was assumed that quantitative trait locus (QTL) effects were the same over all breeds. For the recent population, generations 1-9 had approximately half of the animals genotyped, whereas all animals in generation 10 were genotyped. Generation 10 animals were set for validation; therefore, each breed had a validation group. Analyses were performed using single-step genomic best linear unbiased prediction. Prediction accuracy was calculated as the correlation between true (T) and genomic estimated breeding values (GEBV). Accuracies of GEBV were lower for the larger Ne and low SNP density. All three evaluation scenarios using 45k resulted in similar accuracies, suggesting that the marker density is high enough to account for relationships and linkage disequilibrium with QTL. A shared multibreed evaluation using 9k resulted in a decrease of accuracy of 0.08 for a smaller Ne and 0.12 for a larger Ne. This loss was mostly avoided when markers were treated as nonshared within the same G matrix. A G matrix with nonshared SNP enables multibreed evaluations without considerably changing accuracy, especially with limited information per breed.

Collapse

VanRaden PM, Tooker ME, Chud TCS, Norman HD, Megonigal JH, Haagen IW, Wiggans GR. Genomic predictions for crossbred dairy cattle. J Dairy Sci 2019;103:1620-1631. [PMID: 31837783 DOI: 10.3168/jds.2019-16634] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Accepted: 10/14/2019] [Indexed: 01/14/2023]

Abstract

Genomic evaluations are useful for crossbred as well as purebred populations when selection is applied to commercial herds. Dairy farmers had already spent more than $1 million to genotype over 32,000 crossbred animals before US genomic evaluations became available for those animals. Thus, new tools were needed to provide accurate genomic predictions for crossbreds. Genotypes for crossbreds are imputed more accurately when the imputation reference population includes purebreds. Therefore, genotypes of 6,296 crossbred animals were imputed from lower-density chips by including either 3,119 ancestors or 834,367 genotyped animals in the reference population. Crossbreds in the imputation study included 733 Jersey × Holstein F₁ animals, 55 Brown Swiss × Holstein F₁ animals, 2,300 Holstein backcrosses, 2,026 Jersey backcrosses, 27 Brown Swiss backcrosses, and 502 other crossbreds of various breed combinations. Another 653 animals appeared to be purebreds that owners had miscoded as a different breed. Genomic breed composition was estimated from 60,671 markers using the known breed identities for purebred, progeny-tested Holstein, Jersey, Brown Swiss, Ayrshire, and Guernsey bulls as the 5 traits (breed fractions) to be predicted. Estimates of breed composition were adjusted so that no percentages were negative or exceeded 100%, and breed percentages summed to 100%. Another adjustment set percentages above 93.5% equal to 100%, and the resulting value was termed breed base representation (BBR). Larger percentages of missing alleles were imputed by using a crossbred reference population rather than only the closest purebred reference population. Crossbred predictions were averages of genomic predictions computed using marker effects for each pure breed, which were weighted by the animal's BBR. Marker and polygenic effects were estimated separately for each breed on the all-breed scale instead of within-breed scales. For crossbreds, genomic predictions weighted by BBR were more accurate than the average of parents' breeding values and slightly more accurate than predictions using only the predominant breed. For purebreds, single-trait predictions using only within-breed data were as accurate as multi-trait predictions with allele effects in different breeds treated as correlated effects. Crossbred genomic predicted transmitting abilities were implemented by the Council on Dairy Cattle Breeding in April 2019 and will aid producers in managing their breeding programs and selecting replacement heifers.

Collapse

Duenk P, Calus MPL, Wientjes YCJ, Breen VP, Henshall JM, Hawken R, Bijma P. Validation of genomic predictions for body weight in broilers using crossbred information and considering breed-of-origin of alleles. Genet Sel Evol 2019;51:38. [PMID: 31286857 PMCID: PMC6613268 DOI: 10.1186/s12711-019-0481-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 06/25/2019] [Indexed: 02/01/2023] Open

Abstract

Background

Pig and poultry breeding programs aim at improving crossbred (CB) performance. Selection response may be suboptimal if only purebred (PB) performance is used to compute genomic estimated breeding values (GEBV) because the genetic correlation between PB and CB performance (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc) is often lower than 1. Thus, it may be beneficial to use information on both PB and CB performance. In addition, the accuracy of GEBV of PB animals for CB performance may improve when the breed-of-origin of alleles (BOA) is considered in the genomic relationship matrix (GRM). Thus, our aim was to compare scenarios where GEBV are computed and validated by using (1) either CB offspring averages or individual CB records for validation, (2) either a PB or CB reference population, and (3) a GRM that either accounts for or ignores BOA in the CB individuals. For this purpose, we used data on body weight measured at around 7 (BW7) or 35 (BW35) days in PB and CB broiler chickens and evaluated the accuracy of GEBV based on the correlation GEBV with phenotypes in the validation population (validation correlation).

Results

With validation on CB offspring averages, the validation correlation of GEBV of PB animals for CB performance was lower with a CB reference population than with a PB reference population for BW35 (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc = 0.96), and about equal for BW7 (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc = 0.80) when BOA was ignored. However, with validation on individual CB records, the validation correlation was higher with a CB reference population for both traits. The use of a GRM that took BOA into account increased the validation correlation for BW7 but reduced it for BW35.

Conclusions

We argue that the benefit of using a CB reference population for genomic prediction of PB animals for CB performance should be assessed either by validation on CB offspring averages, or by validation on individual CB records while using a GRM that accounts for BOA in the CB individuals. With this recommendation in mind, our results show that the accuracy of GEBV of PB animals for CB performance was equal to or higher with a CB reference population than with a PB reference population for a trait with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc of 0.8, but lower for a trait with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc of 0.96. In addition, taking BOA into account was beneficial for a trait with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc of 0.8 but not for a trait with an \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$r_{pc}$$\end{document}rpc of 0.96.

Electronic supplementary material

The online version of this article (10.1186/s12711-019-0481-7) contains supplementary material, which is available to authorized users.

Collapse

Pocrnic I, Lourenco DAL, Chen CY, Herring WO, Misztal I. Crossbred evaluations using single-step genomic BLUP and algorithm for proven and young with different sources of data1. J Anim Sci 2019;97:1513-1522. [PMID: 30726939 PMCID: PMC6447278 DOI: 10.1093/jas/skz042] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 01/28/2019] [Indexed: 11/14/2022] Open

Abstract

Genomic selection (GS) is routinely applied to many purebreds and lines of farm species. However, this method can be extended to predictions across purebreds as well as for crossbreds. This is useful for swine and poultry, for which selection in nucleus herds is typically performed on purebred animals, whereas the commercial product is the crossbred animal. Single-step genomic BLUP (ssGBLUP) is a widely applied method that can explore the recently developed algorithm for proven and young (APY). The APY allows for greater efficiency in computing BLUP solutions by exploiting the theory of limited dimensionality of genomic information and chromosome segments (Me). This study investigates the predictivity as a proxy for accuracy across and within 2 purebred pig lines and their crosses, under the application of APY in ssGBLUP setup, and different levels of Me overlapping across populations. The data consisted of approximately 210k phenotypic records for 2 traits (T1 and T2) with moderate heritability. Genotypes for 43k SNP markers were available for approximately 46k animals, from which 26k and 16k belong to 2 pure lines (L1 and L2), and approximately 4k are crossbreds. The complete pedigree had more than 720k animals. Different multivariate ssGBLUP models were applied, either with the regular or APY inverse of the genomic relationship matrix (G). The models included a standard bivariate animal model with 3 lines evaluated as 1 joint line, and for each trait individually, a 3-trait animal model with each line treated as a separate trait. Both models provided the same predictivity across and within the lines. Using either of the pure lines data as a training set resulted in a similar predictivity for the crossbreed animals (0.18 to 0.21). Across-line predictive ability was limited to less than half of the maximum predictivity for each line (L1T1 0.33, L1T2 0.25, L2T1 0.35, L2T2 0.36). For crossbred predictions, APY performed equivalently to regular G inverse when the number of core animals was equal to the number of eigenvalues explaining between 98% and 99% of the variance of G (4k to 8k) including all lines. Predictivity across the lines is achievable because of the shared Me between them. The number of those shared segments can be obtained via eigenvalue decomposition of genomic information available for each line.

Collapse

van Grevenhof EM, Vandenplas J, Calus MPL. Genomic prediction for crossbred performance using metafounders. J Anim Sci 2019;97:548-558. [PMID: 30423111 DOI: 10.1093/jas/sky433] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 11/09/2018] [Indexed: 01/01/2023] Open

Boré R, Brito LF, Jafarikia M, Bouquet A, Maignel L, Sullivan B, Schenkel FS. Genomic data reveals large similarities among Canadian and French maternal pig lines. Can J Anim Sci 2018. [DOI: 10.1139/cjas-2017-0103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Vandenplas J, Calus MPL, Ten Napel J. Sparse single-step genomic BLUP in crossbreeding schemes. J Anim Sci 2018;96:2060-2073. [PMID: 29873759 DOI: 10.1093/jas/sky136] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 04/16/2018] [Indexed: 12/20/2022] Open

Iversen MW, Nordbø Ø, Gjerlaug-Enger E, Grindflek E, Lopes M, Meuwissen THE. Including crossbred pigs in the genomic relationship matrix through utilization of both linkage disequilibrium and linkage analysis. J Anim Sci 2017;95:5197-5207. [PMID: 29293760 PMCID: PMC6292332 DOI: 10.2527/jas2017.1705] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 09/22/2017] [Indexed: 12/20/2022] Open

Abstract

In pig breeding, the final product is a crossbred (CB) animal, while selection is performed at the purebred (PB) level using mainly PB data. However, incorporating CB data in genetic evaluations is expected to result in greater genetic progress at the CB level. Currently, there is no optimal way to include CB genotypes into the genomic relationship matrix. This is because, in single-step genomic BLUP, which is the most commonly used method, genomic and pedigree relationships must refer to the same base. This may not be the case when several breeds and CB are included. An alternative to overcome this issue may be to use a genomic relationship matrix (G matrix) that accounts for both linkage disequilibrium (LD) and linkage analysis (LA), called G. The objectives of this study were to further develop the G matrix approach to utilize both PB and CB genotypes simultaneously, to investigate its performance, and the general added value of including CB genotypes in genomic evaluations. Data were available on Dutch Landrace, Large White, and the F1 cross of those breeds. In total, 7 different G matrix compositions (PB alone, PB together, each PB with the CB, all genotypes across breeds, and G) were tested on 3 maternal traits: total number born (TNB), live born (LB), and gestation length (GL). Results show that G gave the greatest prediction accuracy of all the relationship matrices tested for PB prediction, but not for CB prediction. Including CB genotypes in general increased prediction accuracy for all breeds. However, in some cases, these increases in prediction accuracy were not significant (at < 0.05). To conclude, CB genotypes increased prediction accuracy for some of the traits and breeds, but not for all. The G matrix had significantly greater prediction accuracy in PB than the other G matrix with both PB and CB genotypes, except in one case. While for CB, the G matrix with genotypes across all breeds gave the greatest accuracy, though this was not significantly different from G. Computation time was high for G, and research will be needed to reduce its computational costs to make it feasible for use in routine evaluations. The main conclusion is that inclusion of CB genotypes is beneficial for both PB and CB animals.

Collapse

Sevillano CA, Vandenplas J, Bastiaansen JWM, Bergsma R, Calus MPL. Genomic evaluation for a three-way crossbreeding system considering breed-of-origin of alleles. Genet Sel Evol 2017;49:75. [PMID: 29061123 PMCID: PMC5653471 DOI: 10.1186/s12711-017-0350-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2017] [Accepted: 10/10/2017] [Indexed: 12/31/2022] Open

Abstract

BACKGROUND

Genomic prediction of purebred animals for crossbred performance can be based on a model that estimates effects of single nucleotide polymorphisms (SNPs) in purebreds on crossbred performance. For crossbred performance, SNP effects might be breed-specific due to differences between breeds in allele frequencies and linkage disequilibrium patterns between SNPs and quantitative trait loci. Accurately tracing the breed-of-origin of alleles (BOA) in three-way crosses is possible with a recently developed procedure called BOA. A model that accounts for breed-specific SNP effects (BOA model), has never been tested empirically on a three-way crossbreeding scheme. Therefore, the objectives of this study were to evaluate the estimates of variance components and the predictive accuracy of the BOA model compared to models in which SNP effects for crossbred performance were assumed to be the same across breeds, using either breed-specific allele frequencies ([Formula: see text] model) or allele frequencies averaged across breeds ([Formula: see text] model). In this study, we used data from purebred and three-way crossbred pigs on average daily gain (ADG), back fat thickness (BF), and loin depth (LD).

RESULTS

Estimates of variance components for crossbred performance from the BOA model were mostly similar to estimates from models [Formula: see text] and [Formula: see text]. Heritabilities for crossbred performance ranged from 0.24 to 0.46 between traits. Genetic correlations between purebred and crossbred performance ([Formula: see text]) across breeds ranged from 0.30 to 0.62 for ADG and from 0.53 to 0.74 for BF and LD. For ADG, prediction accuracies of the BOA model were higher than those of the [Formula: see text] and [Formula: see text] models, with significantly higher accuracies only for one maternal breed. For BF and LD, prediction accuracies of models [Formula: see text] and [Formula: see text] were higher than those of the BOA model, with no significant differences. Across all traits, models [Formula: see text] and [Formula: see text] yielded similar predictions.

CONCLUSIONS

The BOA model yielded a higher prediction accuracy for ADG in one maternal breed, which had the lowest [Formula: see text] (0.30). Using the BOA model was especially relevant for traits with a low [Formula: see text]. In all other cases, the use of crossbred information in models [Formula: see text] and [Formula: see text], does not jeopardize predictions and these models are more easily implemented than the BOA model.

Collapse

Wientjes YCJ, Bijma P, Vandenplas J, Calus MPL. Multi-population Genomic Relationships for Estimating Current Genetic Variances Within and Genetic Correlations Between Populations. Genetics 2017;207:503-515. [PMID: 28821589 PMCID: PMC5629319 DOI: 10.1534/genetics.117.300152] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Accepted: 08/15/2017] [Indexed: 01/19/2023] Open

Lopes MS, Bovenhuis H, Hidalgo AM, van Arendonk JAM, Knol EF, Bastiaansen JWM. Genomic selection for crossbred performance accounting for breed-specific effects. Genet Sel Evol 2017. [PMID: 28651536 PMCID: PMC5485705 DOI: 10.1186/s12711-017-0328-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Abstract

Background

Breed-specific effects are observed when the same allele of a given genetic marker has a different effect depending on its breed origin, which results in different allele substitution effects across breeds. In such a case, single-breed breeding values may not be the most accurate predictors of crossbred performance. Our aim was to estimate the contribution of alleles from each parental breed to the genetic variance of traits that are measured in crossbred offspring, and to compare the prediction accuracies of estimated direct genomic values (DGV) from a traditional genomic selection model (GS) that are trained on purebred or crossbred data, with accuracies of DGV from a model that accounts for breed-specific effects (BS), trained on purebred or crossbred data. The final dataset was composed of 924 Large White, 924 Landrace and 924 two-way cross (F1) genotyped and phenotyped animals. The traits evaluated were litter size (LS) and gestation length (GL) in pigs.

Results

The genetic correlation between purebred and crossbred performance was higher than 0.88 for both LS and GL. For both traits, the additive genetic variance was larger for alleles inherited from the Large White breed compared to alleles inherited from the Landrace breed (0.74 and 0.56 for LS, and 0.42 and 0.40 for GL, respectively). The highest prediction accuracies of crossbred performance were obtained when training was done on crossbred data. For LS, prediction accuracies were the same for GS and BS DGV (0.23), while for GL, prediction accuracy for BS DGV was similar to the accuracy of GS DGV (0.53 and 0.52, respectively).

Conclusions

In this study, training on crossbred data resulted in higher prediction accuracy than training on purebred data and evidence of breed-specific effects for LS and GL was demonstrated. However, when training was done on crossbred data, both GS and BS models resulted in similar prediction accuracies. In future studies, traits with a lower genetic correlation between purebred and crossbred performance should be included to further assess the value of the BS model in genomic predictions.

Collapse

Vandenplas J, Windig JJ, Calus MPL. Prediction of the reliability of genomic breeding values for crossbred performance. Genet Sel Evol 2017;49:43. [PMID: 28499351 PMCID: PMC5439167 DOI: 10.1186/s12711-017-0318-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 04/27/2017] [Indexed: 01/12/2023] Open

Abstract

Background

In crossbreeding programs, various genomic prediction models have been proposed for using phenotypic records of crossbred animals to increase the selection response for crossbred performance in purebred animals. A possible model is a model that assumes identical single nucleotide polymorphism (SNP) effects for the crossbred performance trait across breeds (ASGM). Another model is a genomic model that assumes breed-specific effects of SNP alleles (BSAM) for crossbred performance. The aim of this study was to derive and validate equations for predicting the reliability of estimated genomic breeding values for crossbred performance in both these models. Prediction equations were derived for situations when all (phenotyping and) genotyping data have already been collected, i.e. based on the genetic evaluation model, and for situations when all genotyping data are not yet available, i.e. when designing breeding programs.

Results

When all genotyping data are available, prediction equations are based on selection index theory. Without availability of all genotyping data, prediction equations are based on population parameters (e.g., heritability of the traits involved, genetic correlation between purebred and crossbred performance, effective number of chromosome segments). Validation of the equations for predicting the reliability of genomic breeding values without all genotyping data was performed based on simulated data of a two-way crossbreeding program, using either two closely-related breeds, or two unrelated breeds, to produce crossbred animals. The proposed equations can be used for an easy comparison of the reliability of genomic estimated breeding values across many scenarios, especially if all genotyping data are available. We show that BSAM outperforms ASGM for a specific breed, if the effective number of chromosome segments that originate from this breed and are shared by selection candidates of this breed and crossbred reference animals is less than half the effective number of all chromosome segments that are independently segregating in the same animals.

Conclusions

The derived equations can be used to predict the reliability of genomic estimated breeding values for crossbred performance using ASGM or BSAM in many scenarios, and are thus useful to optimize the design of breeding programs. Scenarios can vary in terms of the genetic correlation between purebred and crossbred performances, heritabilities, number of reference animals, or distance between breeds.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-017-0318-1) contains supplementary material, which is available to authorized users.

Collapse

Garcia-Baccino CA, Legarra A, Christensen OF, Misztal I, Pocrnic I, Vitezica ZG, Cantet RJC. Metafounders are related to F _st fixation indices and reduce bias in single-step genomic evaluations. Genet Sel Evol 2017;49:34. [PMID: 28283016 PMCID: PMC5439149 DOI: 10.1186/s12711-017-0309-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2016] [Accepted: 03/03/2017] [Indexed: 01/03/2023] Open

Abstract

Background

Metafounders are pseudo-individuals that encapsulate genetic heterozygosity and relationships within and across base pedigree populations, i.e. ancestral populations. This work addresses the estimation and usefulness of metafounder relationships in single-step genomic best linear unbiased prediction (ssGBLUP).

Results

We show that ancestral relationship parameters are proportional to standardized covariances of base allelic frequencies across populations, such as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{\text{st}}$$\end{document}Fst fixation indexes. These covariances of base allelic frequencies can be estimated from marker genotypes of related recent individuals and pedigree. Simple methods for their estimation include naïve computation of allele frequencies from marker genotypes or a method of moments that equates average pedigree-based and marker-based relationships. Complex methods include generalized least squares (best linear unbiased estimator (BLUE)) or maximum likelihood based on pedigree relationships. To our knowledge, methods to infer \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{\text{st}}$$\end{document}Fst coefficients from marker data have not been developed for related individuals. We derived a genomic relationship matrix, compatible with pedigree relationships, that is constructed as a cross-product of {−1,0,1} codes and that is equivalent (apart from scale factors) to an identity-by-state relationship matrix at genome-wide markers. Using a simulation with a single population under selection in which only males and youngest animals are genotyped, we observed that generalized least squares or maximum likelihood gave accurate and unbiased estimates of the ancestral relationship parameter (true value: 0.40) whereas the naïve method and the method of moments were biased (average estimates of 0.43 and 0.35). We also observed that genomic evaluation by ssGBLUP using metafounders was less biased in terms of estimates of genetic trend (bias of 0.01 instead of 0.12), resulted in less overdispersed (0.94 instead of 0.99) and as accurate (0.74) estimates of breeding values than ssGBLUP without metafounders and provided consistent estimates of heritability.

Conclusions

Estimation of metafounder relationships can be achieved using BLUP-like methods with pedigree and markers. Inclusion of metafounder relationships reduces bias of genomic predictions with no loss in accuracy.

Collapse

Fragomeni BO, Lourenco DAL, Tsuruta S, Bradford HL, Gray KA, Huang Y, Misztal I. Using single-step genomic best linear unbiased predictor to enhance the mitigation of seasonal losses due to heat stress in pigs. J Anim Sci 2016;94:5004-5013. [DOI: 10.2527/jas.2016-0820] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Pocrnic I, Lourenco DAL, Masuda Y, Misztal I. Dimensionality of genomic information and performance of the Algorithm for Proven and Young for different livestock species. Genet Sel Evol 2016;48:82. [PMID: 27799053 PMCID: PMC5088690 DOI: 10.1186/s12711-016-0261-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 10/25/2016] [Indexed: 12/19/2022] Open

Abstract

Background

A genomic relationship matrix (GRM) can be inverted efficiently with the Algorithm for Proven and Young (APY) through recursion on a small number of core animals. The number of core animals is theoretically linked to effective population size (N_e). In a simulation study, the optimal number of core animals was equal to the number of largest eigenvalues of GRM that explained 98% of its variation. The purpose of this study was to find the optimal number of core animals and estimate N_e for different species.

Methods

Datasets included phenotypes, pedigrees, and genotypes for populations of Holstein, Jersey, and Angus cattle, pigs, and broiler chickens. The number of genotyped animals varied from 15,000 for broiler chickens to 77,000 for Holsteins, and the number of single-nucleotide polymorphisms used for genomic prediction varied from 37,000 to 61,000. Eigenvalue decomposition of the GRM for each population determined numbers of largest eigenvalues corresponding to 90, 95, 98, and 99% of variation.

Results

The number of eigenvalues corresponding to 90% (98%) of variation was 4527 (14,026) for Holstein, 3325 (11,500) for Jersey, 3654 (10,605) for Angus, 1239 (4103) for pig, and 1655 (4171) for broiler chicken. Each trait in each species was analyzed using the APY inverse of the GRM with randomly selected core animals, and their number was equal to the number of largest eigenvalues. Realized accuracies peaked with the number of core animals corresponding to 98% of variation for Holstein and Jersey and closer to 99% for other breed/species. N_e was estimated based on comparisons of eigenvalue decomposition in a simulation study. Assuming a genome length of 30 Morgan, N_e was equal to 149 for Holsteins, 101 for Jerseys, 113 for Angus, 32 for pigs, and 44 for broilers.

Conclusions

Eigenvalue profiles of GRM for common species are similar to those in simulation studies although they are affected by number of genotyped animals and genotyping quality. For all investigated species, the APY required less than 15,000 core animals. Realized accuracies were equal or greater with the APY inverse than with regular inversion. Eigenvalue analysis of GRM can provide a realistic estimate of N_e.

Collapse