Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pérez-Cabal MA, Vazquez AI, Gianola D, Rosa GJM, Weigel KA. Accuracy of Genome-Enabled Prediction in a Dairy Cattle Population using Different Cross-Validation Layouts. Front Genet 2012;3:27. [PMID: 22403583 PMCID: PMC3288819 DOI: 10.3389/fgene.2012.00027] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 02/13/2012] [Indexed: 11/26/2022] Open

For:	Pérez-Cabal MA, Vazquez AI, Gianola D, Rosa GJM, Weigel KA. Accuracy of Genome-Enabled Prediction in a Dairy Cattle Population using Different Cross-Validation Layouts. Front Genet 2012;3:27. [PMID: 22403583 PMCID: PMC3288819 DOI: 10.3389/fgene.2012.00027] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 02/13/2012] [Indexed: 11/26/2022] Open

Number

Cited by Other Article(s)

Wolf MJ, Neumann GB, Kokuć P, Yin T, Brockmann GA, König S, May K. Genetic evaluations for endangered dual-purpose German Black Pied cattle using 50K SNPs, a breed-specific 200K chip, and whole-genome sequencing. J Dairy Sci 2023;106:3345-3358. [PMID: 37028956 DOI: 10.3168/jds.2022-22665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 12/16/2022] [Indexed: 04/09/2023]

Abstract

Genetic evaluations of local cattle breeds are hampered due to small reference groups or biased due to the utilization of SNP effects estimated in other large populations. Against this background, there is a lack of studies addressing the possible advantage of whole-genome sequences (WGS) or consideration of specific variants from WGS data in genomic predictions for local breeds with small population size. Consequently, the aim of this study was to compare genetic parameters and accuracies of genomic estimated breeding values (GEBV) for 305-d production traits, fat-to protein ratio (FPR), and somatic cell score (SCS) at the first test date after calving and confirmation traits of the endangered German Black Pied cattle (DSN) breed using 4 different marker panels: (1) the commercial 50K Illumina BovineSNP50 BeadChip, (2) a customized 200K chip designed for DSN (DSN200K) which considers the most important variants for DSN from WGS, (3) randomly generated 200K chips based on WGS data, and (4) a WGS panel. The same number of animals was considered for all marker panel analyses (i.e., 1,811 genotyped or sequenced cows for conformation traits, 2,383 cows for lactation production traits, and 2,420 cows for FPR and SCS). Mixed models for the estimation of genetic parameters directly included the respective genomic relationship matrix from the different marker panels plus the trait-specific fixed effects. For the calculation of GEBV accuracies, we applied repeated random subsampling validation. In the process of separate cross-validations per trait, we created a validation set including 20% of cows with masked phenotypes, and a training set comprising 80% of the cows. The cows were selected randomly in a procedure with 10 replicates considering replacements in the different scenarios. The accuracy was defined as the correlation between the direct GEBV and the phenotypes with subtracted corresponding fixed effects for the cows in the validation set. For FPR and SCS, as well as for lactation production traits, heritabilities were largest based on WGS data, but the increase compared with the 50K or DSN200K applications was quite small in the range from 0.01 to 0.03. Also, for most of the conformation traits, heritabilities were largest based on WGS and DSN200K data, but the increase was in the range of the corresponding standard error. Accordingly, GEBV accuracies for most of the studied traits were highest based on WGS data or when utilizing the DSN200K chip, but the accuracy differences across the marker panels were quite small and nonsignificant. In conclusion, WGS data and the DSN200K chip only contributed to minor improvements in genomic predictions, still justifying the use of the commercial 50K chip. Nevertheless, WGS and the 200KDSN chip harbor breed-specific variants, which are valuable for studying causal genetic mechanisms in the endangered DSN population.

Collapse

Faggion S, Carnier P, Franch R, Babbucci M, Pascoli F, Dalla Rovere G, Caggiano M, Chavanne H, Toffan A, Bargelloni L. Viral nervous necrosis resistance in gilthead sea bream (Sparus aurata) at the larval stage: heritability and accuracy of genomic prediction with different training and testing settings. Genet Sel Evol 2023;55:22. [PMID: 37013478 PMCID: PMC10069116 DOI: 10.1186/s12711-023-00796-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 03/21/2023] [Indexed: 04/05/2023] Open

Abstract

BACKGROUND

The gilthead sea bream (Sparus aurata) has long been considered resistant to viral nervous necrosis (VNN), until recently, when significant mortalities caused by a reassortant nervous necrosis virus (NNV) strain were reported. Selective breeding to enhance resistance against NNV might be a preventive action. In this study, 972 sea bream larvae were subjected to a NNV challenge test and the symptomatology was recorded. All the experimental fish and their parents were genotyped using a genome-wide single nucleotide polymorphism (SNP) array consisting of over 26,000 markers.

RESULTS

Estimates of pedigree-based and genomic heritabilities of VNN symptomatology were consistent with each other (0.21, highest posterior density interval at 95% (HPD95%): 0.1-0.4; 0.19, HPD95%: 0.1-0.3, respectively). The genome-wide association study suggested one genomic region, i.e., in linkage group (LG) 23 that might be involved in sea bream VNN resistance, although it was far from the genome-wide significance threshold. The accuracies (r) of the predicted estimated breeding values (EBV) provided by three Bayesian genomic regression models (Bayes B, Bayes C, and Ridge Regression) were consistent and on average were equal to 0.90 when assessed in a set of cross-validation (CV) procedures. When genomic relationships between training and testing sets were minimized, accuracy decreased greatly (r = 0.53 for a validation based on genomic clustering, r = 0.12 for a validation based on a leave-one-family-out approach focused on the parents of the challenged fish). Classification of the phenotype using the genomic predictions of the phenotype or using the genomic predictions of the pedigree-based, all data included, EBV as classifiers was moderately accurate (area under the ROC curve 0.60 and 0.66, respectively).

CONCLUSIONS

The estimate of the heritability for VNN symptomatology indicates that it is feasible to implement selective breeding programs for increased resistance to VNN of sea bream larvae/juveniles. Exploiting genomic information offers the opportunity of developing prediction tools for VNN resistance, and genomic models can be trained on EBV using all data or phenotypes, with minimal differences in classification performance of the trait phenotype. In a long-term view, the weakening of the genomic ties between animals in the training and test sets leads to decreased genomic prediction accuracies, thus periodical update of the reference population with new data is mandatory.

Collapse

Anilkumar C, Sunitha NC, Devate NB, Ramesh S. Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review. PLANTA 2022;256:87. [PMID: 36149531 DOI: 10.1007/s00425-022-03996-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 09/11/2022] [Indexed: 06/16/2023]

Abstract

Genomic selection and its importance in crop breeding. Integration of GS with new breeding tools and developing SOP for GS to achieve maximum genetic gain with low cost and time. The success of conventional breeding approaches is not sufficient to meet the demand of a growing population for nutritious food and other plant-based products. Whereas, marker assisted selection (MAS) is not efficient in capturing all the favorable alleles responsible for economic traits in the process of crop improvement. Genomic selection (GS) developed in livestock breeding and then adapted to plant breeding promised to overcome the drawbacks of MAS and significantly improve complicated traits controlled by gene/QTL with small effects. Large-scale deployment of GS in important crops, as well as simulation studies in a variety of contexts, addressed G × E interaction effects and non-additive effects, as well as lowering breeding costs and time. The current study provides a complete overview of genomic selection, its process, and importance in modern plant breeding, along with insights into its application. GS has been implemented in the improvement of complex traits including tolerance to biotic and abiotic stresses. Furthermore, this review hypothesises that using GS in conjunction with other crop improvement platforms accelerates the breeding process to increase genetic gain. The objective of this review is to highlight the development of an appropriate GS model, the global open source network for GS, and trans-disciplinary approaches for effective accelerated crop improvement. The current study focused on the application of data science, including machine learning and deep learning tools, to enhance the accuracy of prediction models. Present study emphasizes on developing plant breeding strategies centered on GS combined with routine conventional breeding principles by developing GS-SOP to achieve enhanced genetic gain.

Collapse

Liu D, Xu Z, Zhao W, Wang S, Li T, Zhu K, Liu G, Zhao X, Wang Q, Pan Y, Ma P. Genetic parameters and genome-wide association for milk production traits and somatic cell score in different lactation stages of Shanghai Holstein population. Front Genet 2022;13:940650. [PMID: 36134029 PMCID: PMC9483179 DOI: 10.3389/fgene.2022.940650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 08/04/2022] [Indexed: 11/13/2022] Open

Abstract

The aim of this study was to investigate the genetic parameters and genetic architectures of six milk production traits in the Shanghai Holstein population. The data used to estimate the genetic parameters consisted of 1,968,589 test-day records for 305,031 primiparous cows. Among the cows with phenotypes, 3,016 cows were genotyped with Illumina Bovine SNP50K BeadChip, GeneSeek Bovine 50K BeadChip, GeneSeek Bovine LD BeadChip v4, GeneSeek Bovine 150K BeadChip, or low-depth whole-genome sequencing. A genome-wide association study was performed to identify quantitative trait loci and genes associated with milk production traits in the Shanghai Holstein population using genotypes imputed to whole-genome sequences and both fixed and random model circulating probability unification and a mixed linear model with rMVP software. Estimated heritabilities (h2) varied from 0.04 to 0.14 for somatic cell score (SCS), 0.07 to 0.22 for fat percentage (FP), 0.09 to 0.27 for milk yield (MY), 0.06 to 0.23 for fat yield (FY), 0.09 to 0.26 for protein yield (PY), and 0.07 to 0.35 for protein percentage (PP), respectively. Within lactation, genetic correlations for SCS, FP, MY, FY, PY, and PP at different stages of lactation estimated in random regression model were ranged from -0.02 to 0.99, 0.18 to 0.99, 0.04 to 0.99, 0.04 to 0.99, 0.01 to 0.99, and 0.33 to 0.99, respectively. The genetic correlations were highest between adjacent DIM but decreased as DIM got further apart. Candidate genes included those related to production traits (DGAT1, MGST1, PTK2, and SCRIB), disease-related (LY6K, COL22A1, TECPR2, and PLCB1), heat stress–related (ITGA9, NDST4, TECPR2, and HSF1), and reproduction-related (7SK and DOCK2) genes. This study has shown that there are differences in the genetic mechanisms of milk production traits at different stages of lactation. Therefore, it is necessary to conduct research on milk production traits at different stages of lactation as different traits. Our results can also provide a theoretical basis for subsequent molecular breeding, especially for the novel genetic loci.

Collapse

Meher PK, Rustgi S, Kumar A. Performance of Bayesian and BLUP alphabets for genomic prediction: analysis, comparison and results. Heredity (Edinb) 2022;128:519-530. [PMID: 35508540 DOI: 10.1038/s41437-022-00539-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 04/19/2022] [Accepted: 04/19/2022] [Indexed: 11/09/2022] Open

Rios EF, Andrade MHML, Resende MFR, Kirst M, de Resende MDV, de Almeida Filho JE, Gezan SA, Munoz P. Genomic prediction in family bulks using different traits and cross-validations in pine. G3-GENES GENOMES GENETICS 2021;11:6321952. [PMID: 34544139 PMCID: PMC8496210 DOI: 10.1093/g3journal/jkab249] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 07/02/2021] [Indexed: 11/13/2022]

Krishnappa G, Savadi S, Tyagi BS, Singh SK, Mamrutha HM, Kumar S, Mishra CN, Khan H, Gangadhara K, Uday G, Singh G, Singh GP. Integrated genomic selection for rapid improvement of crops. Genomics 2021;113:1070-1086. [PMID: 33610797 DOI: 10.1016/j.ygeno.2021.02.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Revised: 11/08/2020] [Accepted: 02/15/2021] [Indexed: 11/15/2022]

Brunes LC, Baldi F, Lopes FB, Narciso MG, Lobo RB, Espigolan R, Costa MFO, Magnabosco CU. Genomic prediction ability for feed efficiency traits using different models and pseudo-phenotypes under several validation strategies in Nelore cattle. Animal 2020;15:100085. [PMID: 33573965 DOI: 10.1016/j.animal.2020.100085] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 09/09/2020] [Accepted: 09/15/2020] [Indexed: 10/22/2022] Open

Abstract

There is a growing interest to improve feed efficiency (FE) traits in cattle. The genomic selection was proposed to improve these traits since they are difficult and expensive to measure. Up to date, there are scarce studies about the implementation of genomic selection for FE traits in indicine cattle under different scenarios of pseudo-phenotypes, models, and validation strategies on a commercial large scale. Thus, the aim was to evaluate the feasibility of genomic selection implementation for FE traits in Nelore cattle applying different models and pseudo-phenotypes under validation strategies. Phenotypic and genotypic information from 4 329 and 3 467 animals were used, respectively, which were tested for residual feed intake, DM intake, feed efficiency, feed conversion ratio, residual BW gain, and residual intake and BW gain. Six prediction methods were used: single-step genomic best linear unbiased prediction, Bayes A, Bayes B, Bayes Cπ, Bayesian least absolute shrinkage and selection operator (BLASSO), and Bayes R. Phenotypes adjusted for fixed effects (Y*), estimated breeding value (EBV), and EBV deregressed (DEBV) were used as pseudo-phenotypes. The validation approaches used were: (1) random: the data was randomly divided into ten subsets and the validation was done in each subset at a time; (2) age: the partition into training and testing sets was based on year of birth and testing animals were born after 2016; and (3) EBV accuracy: the data was split into two groups, being animals with accuracy above 0.45 the training set; and below 0.45 the validation set. In the analyses that used the Y* as pseudo-phenotype, prediction ability (PA) was obtained by dividing the correlation between pseudo-phenotype and genomic EBV (GEBV) by the square root of the heritability of the trait. When EBV and DEBV were used as the pseudo-phenotype, the simple correlation of this quantity with the GEBV was considered as PA. The prediction methods show similar results for PA and bias. The random cross-validation presented higher PA (0.17) than EBV accuracy (0.14) and age (0.13). The PA was higher for Y* than for EBV and DEBV (30.0 and 34.3%, respectively). Random validation presented the highest PA, being indicated for use in populations composed mainly of young animals and traits with few generations of data recording. For high heritability traits, the validation can be done by age, enabling the prediction of the next-generation genetic merit. These results would support breeders to identify genomic approaches that are more viable for genomic prediction for FE-related traits.

Collapse

High-frequency marker haplotypes in the genomic selection of dairy cattle. J Appl Genet 2019;60:179-186. [PMID: 30877657 PMCID: PMC6483952 DOI: 10.1007/s13353-019-00489-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 01/18/2019] [Accepted: 02/28/2019] [Indexed: 11/05/2022]

Wang X, Miao J, Chang T, Xia J, An B, Li Y, Xu L, Zhang L, Gao X, Li J, Gao H. Evaluation of GBLUP, BayesB and elastic net for genomic prediction in Chinese Simmental beef cattle. PLoS One 2019;14:e0210442. [PMID: 30817758 PMCID: PMC6394919 DOI: 10.1371/journal.pone.0210442] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Accepted: 12/21/2018] [Indexed: 11/24/2022] Open

Karimi Z, Sargolzaei M, Robinson J, Schenkel F. Assessing haplotype-based models for genomic evaluation in Holstein cattle. CANADIAN JOURNAL OF ANIMAL SCIENCE 2018. [DOI: 10.1139/cjas-2018-0009] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Momen M, Mehrgardi AA, Sheikhi A, Kranis A, Tusell L, Morota G, Rosa GJM, Gianola D. Predictive ability of genome-assisted statistical models under various forms of gene action. Sci Rep 2018;8:12309. [PMID: 30120288 PMCID: PMC6098164 DOI: 10.1038/s41598-018-30089-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Accepted: 07/24/2018] [Indexed: 11/09/2022] Open

Muleta KT, Bulli P, Zhang Z, Chen X, Pumphrey M. Unlocking Diversity in Germplasm Collections via Genomic Selection: A Case Study Based on Quantitative Adult Plant Resistance to Stripe Rust in Spring Wheat. THE PLANT GENOME 2017;10. [PMID: 29293811 DOI: 10.3835/plantgenome2016.12.0124] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]

Silva RMO, Fragomeni BO, Lourenco DAL, Magalhães AFB, Irano N, Carvalheiro R, Canesin RC, Mercadante MEZ, Boligon AA, Baldi FS, Misztal I, Albuquerque LG. Accuracies of genomic prediction of feed efficiency traits using different prediction and validation methods in an experimental Nelore cattle population. J Anim Sci 2017;94:3613-3623. [PMID: 27898889 DOI: 10.2527/jas.2016-0401] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Abstract

Animal feeding is the most important economic component of beef production systems. Selection for feed efficiency has not been effective mainly due to difficult and high costs to obtain the phenotypes. The application of genomic selection using SNP can decrease the cost of animal evaluation as well as the generation interval. The objective of this study was to compare methods for genomic evaluation of feed efficiency traits using different cross-validation layouts in an experimental beef cattle population genotyped for a high-density SNP panel (BovineHD BeadChip assay 700k, Illumina Inc., San Diego, CA). After quality control, a total of 437,197 SNP genotypes were available for 761 Nelore animals from the Institute of Animal Science, Sertãozinho, São Paulo, Brazil. The studied traits were residual feed intake, feed conversion ratio, ADG, and DMI. Methods of analysis were traditional BLUP, single-step genomic BLUP (ssGBLUP), genomic BLUP (GBLUP), and a Bayesian regression method (BayesCπ). Direct genomic values (DGV) from the last 2 methods were compared directly or in an index that combines DGV with parent average. Three cross-validation approaches were used to validate the models: 1) YOUNG, in which the partition into training and testing sets was based on year of birth and testing animals were born after 2010; 2) UNREL, in which the data set was split into 3 less related subsets and the validation was done in each subset a time; and 3) RANDOM, in which the data set was randomly divided into 4 subsets (considering the contemporary groups) and the validation was done in each subset at a time. On average, the RANDOM design provided the most accurate predictions. Average accuracies ranged from 0.10 to 0.58 using BLUP, from 0.09 to 0.48 using GBLUP, from 0.06 to 0.49 using BayesCπ, and from 0.22 to 0.49 using ssGBLUP. The most accurate and consistent predictions were obtained using ssGBLUP for all analyzed traits. The ssGBLUP seems to be more suitable to obtain genomic predictions for feed efficiency traits on an experimental population of genotyped animals.

Collapse

Jenko J, Wiggans G, Cooper T, Eaglen S, Luff W, Bichard M, Pong-Wong R, Woolliams J. Cow genotyping strategies for genomic selection in a small dairy cattle population. J Dairy Sci 2017;100:439-452. [DOI: 10.3168/jds.2016-11479] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 09/21/2016] [Indexed: 01/22/2023]

Using Genetic Distance to Infer the Accuracy of Genomic Prediction. PLoS Genet 2016;12:e1006288. [PMID: 27589268 PMCID: PMC5010218 DOI: 10.1371/journal.pgen.1006288] [Citation(s) in RCA: 79] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Accepted: 08/10/2016] [Indexed: 12/12/2022] Open

Abstract

The prediction of phenotypic traits using high-density genomic data has many applications such as the selection of plants and animals of commercial interest; and it is expected to play an increasing role in medical diagnostics. Statistical models used for this task are usually tested using cross-validation, which implicitly assumes that new individuals (whose phenotypes we would like to predict) originate from the same population the genomic prediction model is trained on. In this paper we propose an approach based on clustering and resampling to investigate the effect of increasing genetic distance between training and target populations when predicting quantitative traits. This is important for plant and animal genetics, where genomic selection programs rely on the precision of predictions in future rounds of breeding. Therefore, estimating how quickly predictive accuracy decays is important in deciding which training population to use and how often the model has to be recalibrated. We find that the correlation between true and predicted values decays approximately linearly with respect to either F_ST or mean kinship between the training and the target populations. We illustrate this relationship using simulations and a collection of data sets from mice, wheat and human genetics.

The availability of increasing amounts of genomic data is making the use of statistical models to predict traits of interest a mainstay of many applications in life sciences. Applications range from medical diagnostics for common and rare diseases to breeding characteristics such as disease resistance in plants and animals of commercial interest. We explored an implicit assumption of how such prediction models are often assessed: that the individuals whose traits we would like to predict originate from the same population as those that are used to train the models. This is commonly not the case, especially in the case of plants and animals that are parts of selection programs. To study this problem we proposed a model-agnostic approach to infer the accuracy of prediction models as a function of two common measures of genetic distance. Using data from plant, animal and human genetics, we find that accuracy decays approximately linearly in either of those measures. Quantifying this decay has fundamental applications in all branches of genetics, as it measures how studies generalise to different populations.

Collapse

Karaman E, Cheng H, Firat MZ, Garrick DJ, Fernando RL. An Upper Bound for Accuracy of Prediction Using GBLUP. PLoS One 2016;11:e0161054. [PMID: 27529480 PMCID: PMC4986954 DOI: 10.1371/journal.pone.0161054] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 07/29/2016] [Indexed: 11/26/2022] Open

Fernandes Júnior GA, Rosa GJM, Valente BD, Carvalheiro R, Baldi F, Garcia DA, Gordo DGM, Espigolan R, Takada L, Tonussi RL, de Andrade WBF, Magalhães AFB, Chardulo LAL, Tonhati H, de Albuquerque LG. Genomic prediction of breeding values for carcass traits in Nellore cattle. Genet Sel Evol 2016;48:7. [PMID: 26830208 PMCID: PMC4734869 DOI: 10.1186/s12711-016-0188-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Accepted: 01/18/2016] [Indexed: 01/20/2023] Open

Yin T, König S. Genomics for phenotype prediction and management purposes. Anim Front 2016. [DOI: 10.2527/af.2016-0010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

Morota G, Gianola D. Kernel-based whole-genome prediction of complex traits: a review. Front Genet 2014;5:363. [PMID: 25360145 PMCID: PMC4199321 DOI: 10.3389/fgene.2014.00363] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2014] [Accepted: 09/29/2014] [Indexed: 01/18/2023] Open

Calus MP, Schrooten C, Veerkamp RF. Genomic prediction of breeding values using previously estimated SNP variances. Genet Sel Evol 2014;46:52. [PMID: 25928875 PMCID: PMC4176585 DOI: 10.1186/s12711-014-0052-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2014] [Accepted: 07/17/2014] [Indexed: 11/10/2022] Open

Abstract

Background

Genomic prediction requires estimation of variances of effects of single nucleotide polymorphisms (SNPs), which is computationally demanding, and uses these variances for prediction. We have developed models with separate estimation of SNP variances, which can be applied infrequently, and genomic prediction, which can be applied routinely.

Methods

SNP variances were estimated with Bayes Stochastic Search Variable Selection (BSSVS) and BayesC. Genome-enhanced breeding values (GEBV) were estimated with RR-BLUP (ridge regression best linear unbiased prediction), using either variances obtained from BSSVS (BLUP-SSVS) or BayesC (BLUP-C), or assuming equal variances for each SNP. Datasets used to estimate SNP variances comprised (1) all animals, (2) 50% random animals (RAN50), (3) 50% best animals (TOP50), or (4) 50% worst animals (BOT50). Traits analysed were protein yield, udder depth, somatic cell score, interval between first and last insemination, direct longevity, and longevity including information from predictors.

Results

BLUP-SSVS and BLUP-C yielded similar GEBV as the equivalent Bayesian models that simultaneously estimated SNP variances. Reliabilities of these GEBV were consistently higher than from RR-BLUP, although only significantly for direct longevity. Across scenarios that used data subsets to estimate GEBV, observed reliabilities were generally higher for TOP50 than for RAN50, and much higher than for BOT50. Reliabilities of TOP50 were higher because the training data contained more ancestors of selection candidates. Using estimated SNP variances based on random or non-random subsets of the data, while using all data to estimate GEBV, did not affect reliabilities of the BLUP models. A convergence criterion of 10⁻⁸ instead of 10⁻¹⁰ for BLUP models yielded similar GEBV, while the required number of iterations decreased by 71 to 90%. Including a separate polygenic effect consistently improved reliabilities of the GEBV, but also substantially increased the required number of iterations to reach convergence with RR-BLUP. SNP variances converged faster for BayesC than for BSSVS.

Conclusions

Combining Bayesian variable selection models to re-estimate SNP variances and BLUP models that use those SNP variances, yields GEBV that are similar to those from full Bayesian models. Moreover, these combined models yield predictions with higher reliability and less bias than the commonly used RR-BLUP model.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-014-0052-x) contains supplementary material, which is available to authorized users.

Collapse

Desta ZA, Ortiz R. Genomic selection: genome-wide prediction in plant improvement. TRENDS IN PLANT SCIENCE 2014;19:592-601. [PMID: 24970707 DOI: 10.1016/j.tplants.2014.05.006] [Citation(s) in RCA: 278] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Revised: 05/08/2014] [Accepted: 05/23/2014] [Indexed: 05/18/2023]

Yao C, Leng N, Weigel KA, Lee KE, Engelman CD, Meyers KJ. Prediction of genetic contributions to complex traits using whole genome sequencing data. BMC Proc 2014;8:S68. [PMID: 25519339 PMCID: PMC4143683 DOI: 10.1186/1753-6561-8-s1-s68] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Accuracy of estimation of genomic breeding values in pigs using low-density genotypes and imputation. G3-GENES GENOMES GENETICS 2014;4:623-31. [PMID: 24531728 PMCID: PMC4059235 DOI: 10.1534/g3.114.010504] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

Genomic selection has the potential to increase genetic progress. Genotype imputation of high-density single-nucleotide polymorphism (SNP) genotypes can improve the cost efficiency of genomic breeding value (GEBV) prediction for pig breeding. Consequently, the objectives of this work were to: (1) estimate accuracy of genomic evaluation and GEBV for three traits in a Yorkshire population and (2) quantify the loss of accuracy of genomic evaluation and GEBV when genotypes were imputed under two scenarios: a high-cost, high-accuracy scenario in which only selection candidates were imputed from a low-density platform and a low-cost, low-accuracy scenario in which all animals were imputed using a small reference panel of haplotypes. Phenotypes and genotypes obtained with the PorcineSNP60 BeadChip were available for 983 Yorkshire boars. Genotypes of selection candidates were masked and imputed using tagSNP in the GeneSeek Genomic Profiler (10K). Imputation was performed with BEAGLE using 128 or 1800 haplotypes as reference panels. GEBV were obtained through an animal-centric ridge regression model using de-regressed breeding values as response variables. Accuracy of genomic evaluation was estimated as the correlation between estimated breeding values and GEBV in a 10-fold cross validation design. Accuracy of genomic evaluation using observed genotypes was high for all traits (0.65−0.68). Using genotypes imputed from a large reference panel (accuracy: R² = 0.95) for genomic evaluation did not significantly decrease accuracy, whereas a scenario with genotypes imputed from a small reference panel (R² = 0.88) did show a significant decrease in accuracy. Genomic evaluation based on imputed genotypes in selection candidates can be implemented at a fraction of the cost of a genomic evaluation using observed genotypes and still yield virtually the same accuracy. On the other side, using a very small reference panel of haplotypes to impute training animals and candidates for selection results in lower accuracy of genomic evaluation.

Collapse

Silva FF, Mulder HA, Knol EF, Lopes MS, Guimarães SEF, Lopes PS, Mathur PK, Viana JMS, Bastiaansen JWM. Sire evaluation for total number born in pigs using a genomic reaction norms approach. J Anim Sci 2014;92:3825-34. [PMID: 24492557 DOI: 10.2527/jas.2013-6486] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

In the era of genome-wide selection (GWS), genotype-by-environment (G×E) interactions can be studied using genomic information, thus enabling the estimation of SNP marker effects and the prediction of genomic estimated breeding values (GEBV) for young candidates for selection in different environments. Although G×E studies in pigs are scarce, the use of artificial insemination has enabled the distribution of genetic material from sires across multiple environments. Given the relevance of reproductive traits, such as the total number born (TNB) and the variation in environmental conditions encountered by commercial dams, understanding G×E interactions can be essential for choosing the best sires for different environments. The present work proposes a two-step reaction norm approach for G×E analysis using genomic information. The first step provided estimates of environmental effects (herd-year-season, HYS), and the second step provided estimates of the intercept and slope for the TNB across different HYS levels, obtained from the first step, using a random regression model. In both steps, pedigree ( A: ) and genomic ( G: ) relationship matrices were considered. The genetic parameters (variance components, h(2) and genetic correlations) were very similar when estimated using the A: and G: relationship matrices. The reaction norm graphs showed considerable differences in environmental sensitivity between sires, indicating a reranking of sires in terms of genetic merit across the HYS levels. Based on the G: matrix analysis, SNP by environment interactions were observed. For some SNP, the effects increased at increasing HYS levels, while for others, the effects decreased at increasing HYS levels or showed no changes between HYS levels. Cross-validation analysis demonstrated better performance of the genomic approach with respect to traditional pedigrees for both the G×E and standard models. The genomic reaction norm model resulted in an accuracy of GEBV for "juvenile" boars varying from 0.14 to 0.44 across different HYS levels, while the accuracy of the standard genomic prediction model, without reaction norms, varied from 0.09 to 0.28. These results show that it is important and feasible to consider G×E interactions in evaluations of sires using genomic prediction models and that genomic information can increase the accuracy of selection across environments.

Collapse

Vazquez AI, de los Campos G, Klimentidis YC, Rosa GJM, Gianola D, Yi N, Allison DB. A comprehensive genetic approach for improving prediction of skin cancer risk in humans. Genetics 2012;192:1493-502. [PMID: 23051645 PMCID: PMC3512154 DOI: 10.1534/genetics.112.141705] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2012] [Accepted: 09/07/2012] [Indexed: 01/09/2023] Open

Predicted accuracy of and response to genomic selection for new traits in dairy cattle. Animal 2012;7:183-91. [PMID: 23031684 DOI: 10.1017/s1751731112001450] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Abstract

Genomic selection relaxes the requirement of traditional selection tools to have phenotypic measurements on close relatives of all selection candidates. This opens up possibilities to select for traits that are difficult or expensive to measure. The objectives of this paper were to predict accuracy of and response to genomic selection for a new trait, considering that only a cow reference population of moderate size was available for the new trait, and that selection simultaneously targeted an index and this new trait. Accuracy for and response to selection were deterministically evaluated for three different breeding goals. Single trait selection for the new trait based only on a limited cow reference population of up to 10 000 cows, showed that maximum genetic responses of 0.20 and 0.28 genetic standard deviation (s.d.) per year can be achieved for traits with a heritability of 0.05 and 0.30, respectively. Adding information from the index based on a reference population of 5000 bulls, and assuming a genetic correlation of 0.5, increased genetic response for both heritability levels by up to 0.14 genetic s.d. per year. The scenario with simultaneous selection for the new trait and the index, yielded a substantially lower response for the new trait, especially when the genetic correlation with the index was negative. Despite the lower response for the index, whenever the new trait had considerable economic value, including the cow reference population considerably improved the genetic response for the new trait. For scenarios with a zero or negative genetic correlation with the index and equal economic value for the index and the new trait, a reference population of 2000 cows increased genetic response for the new trait with at least 0.10 and 0.20 genetic s.d. per year, for heritability levels of 0.05 and 0.30, respectively. We conclude that for new traits with a very small or positive genetic correlation with the index, and a high positive economic value, considerable genetic response can already be achieved based on a cow reference population with only 2000 records, even when the reliability of individual genomic breeding values is much lower than currently accepted in dairy cattle breeding programs. New traits may generally have a negative genetic correlation with the index and a small positive economic value. For such new traits, cow reference populations of at least 10 000 cows may be required to achieve acceptable levels of genetic response for the new trait and for the whole breeding goal.

Collapse