1
|
Cai W, Cole JB, Goddard ME, Li J, Zhang S, Song J. Mammary gland multi-omics data reveals new genetic insights into milk production traits in dairy cattle. PLoS Genet 2025; 21:e1011675. [PMID: 40245050 PMCID: PMC12054919 DOI: 10.1371/journal.pgen.1011675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 05/06/2025] [Accepted: 04/03/2025] [Indexed: 04/19/2025] Open
Abstract
Although many sequence variants have been discovered in cattle, deciphering the relationship between genome and phenome remains a significant challenge. In this study, we identified functional classes, including mammary-specific genes, lactation-associated genes, novel long non-coding RNAs, miRNAs, RNA editing sites, DNA methylation, histone modifications, and expression quantitative trait loci. We estimated their contributions to genetic variance for milk production traits using 3 million variants in 23,566 Holstein bulls. Sequence variants in the 5'-UTR, synonymous, and splicing regions disproportionately contributed to genetic variance of milk production traits compared to other genomic regions. Genes specifically expressed in the mammary gland, particularly those active in lactating tissue (e.g., GLYCAM1, DGAT1), account for significantly more genetic variance of milk production traits than specific genes from non-mammary tissues. We identified 8,560 differentially expressed genes (DEGs) between lactating and non-lactating tissues. Among these, both up-regulated and small-fold changes of down-regulated DEGs exhibited greater genetic variance enrichment of milk production traits than other genes. Mammary enhancers (e.g., H3K27ac, H3K4Me1) explained more variance than repressive elements, while small changes in DNA methylation level (≤0.2) contributed more variance than that with larger changes (> 0.2). Notably, lactation-associated RNA editing sites in mammary explained more variance for milk production traits than expected by chance. We proposed a novel miRNA prioritization strategy for selecting candidate miRNAs related to milk production traits, based on the overlaps between significant enrichment tests of miRNA target correlations and the relatively large variance explained by these targets. Additionally, we integrated these nine functional classes into the variance component analysis simultaneously, revealing that sQTLs, histone modification and DEGs showed the highest per-SNP variance enrichment. Finally, we constructed a new 624K SNP panel, which improved the reliabilities of genomic predictions by 0.22%. Dividing routine SNPs into two groups based on functional classes improved the reliabilities by 0.21%, particularly for milk protein percentage (0.68% improvement). Overall, incorporating prior biological knowledge of the mammary gland directly enhances our understanding of milk production's genetic architecture and improves the reliability of genomic predictions for milk production traits. This integrative approach establishes a paradigm for translating biological knowledge into agricultural genomics applications.
Collapse
Affiliation(s)
- Wentao Cai
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
- Department of Animal and Avian Science, University of Maryland, College Park, Maryland, United States of America
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - John B. Cole
- Animal Genomics and Improvement Laboratory, USDA, Beltsville, Maryland, United States of America
- Department of Animal Sciences, University of Florida, Gainesville, Florida, United States of America
- Department of Animal Science, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Michael E. Goddard
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, Victoria, Australia
| | - Junya Li
- Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Jiuzhou Song
- Department of Animal and Avian Science, University of Maryland, College Park, Maryland, United States of America
| |
Collapse
|
2
|
Yuan C, Gillon A, Gualdrón Duarte JL, Takeda H, Coppieters W, Georges M, Druet T. Evaluation of genomic selection models using whole genome sequence data and functional annotation in Belgian Blue cattle. Genet Sel Evol 2025; 57:10. [PMID: 40038647 PMCID: PMC11881496 DOI: 10.1186/s12711-025-00955-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 02/10/2025] [Indexed: 03/06/2025] Open
Abstract
BACKGROUND The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation. RESULTS Here, we used data from 16,508 cows, with phenotypes for five muscular development traits and imputed at the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We evaluated first GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed an increased reliability compared to the official GBLUP model at medium marker density (on average 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods. CONCLUSIONS GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.
Collapse
Affiliation(s)
- Can Yuan
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de l'Hôpital, 1, 4000, Liège, Belgium.
| | - Alain Gillon
- Walloon Breeders Association, Rue Des Champs Elysées, 4, 5590, Ciney, Belgium
| | | | - Haruko Takeda
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de l'Hôpital, 1, 4000, Liège, Belgium
| | - Wouter Coppieters
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de l'Hôpital, 1, 4000, Liège, Belgium
| | - Michel Georges
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de l'Hôpital, 1, 4000, Liège, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de l'Hôpital, 1, 4000, Liège, Belgium
| |
Collapse
|
3
|
Cai X, Zhang W, Gao N, Wei C, Wu X, Si J, Gao Y, Li J, Yin T, Zhang Z. Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations. J Anim Breed Genet 2025; 142:223-236. [PMID: 39215551 DOI: 10.1111/jbg.12896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 07/18/2024] [Accepted: 08/18/2024] [Indexed: 09/04/2024]
Abstract
The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.
Collapse
Affiliation(s)
- Xiaodian Cai
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Wenjing Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Ning Gao
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Chen Wei
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xibo Wu
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Jinglei Si
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Yahui Gao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Tong Yin
- Institute of Animal Breeding and Genetics, Justus Liebig University, Giessen, Germany
| | - Zhe Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
4
|
Ali B, Mary‐Huard T, Charcosset A, Moreau L, Rincent R. Improvement in genomic prediction of maize with prior gene ontology information depends on traits and environmental conditions. THE PLANT GENOME 2025; 18:e20553. [PMID: 39779652 PMCID: PMC11711123 DOI: 10.1002/tpg2.20553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 11/12/2024] [Accepted: 11/14/2024] [Indexed: 01/11/2025]
Abstract
Classical genomic prediction approaches rely on statistical associations between traits and markers rather than their biological significance. Biologically informed selection of genomic regions can help prioritize polymorphisms by considering underlying biological processes, making prediction models robust and accurate. Gene ontology (GO) terms can be used for this purpose, and the information can be integrated into genomic prediction models through marker categorization. It allows likely causal markers to account for a certain portion of genetic variance independently from the remaining markers. We systematically tested a list of 5110 GO terms for their predictive performance for physiological (platform traits) and productivity traits (field grain yield) in a maize (Zea mays L.) panel using genomic features best linear unbiased prediction (GFBLUP) model. Predictive abilities were compared to the classical genomic best linear unbiased prediction (GBLUP). Predictive gains with categorizing markers based on a given GO term strongly depend on the trait and on the growth conditions, as a term can be useful for a given trait in a given condition or somewhat similar conditions but not useful for the same trait in a different condition. Overall, results of all GFBLUP models compared to GBLUP show that the former might be less efficient than the latter. Even though we could not identify a prior criterion to determine which GO terms can offer benefit to a given trait, we could a posteriori find biological interpretations of the results, meaning that GFBLUP could be helpful if more about the gene functions and their relationships with the growth conditions was known.
Collapse
Affiliation(s)
- Baber Ali
- INRAE, CNRS, AgroParisTech, GQE–Le MoulonUniversité Paris‐SaclayGif‐sur‐YvetteFrance
| | - Tristan Mary‐Huard
- INRAE, CNRS, AgroParisTech, GQE–Le MoulonUniversité Paris‐SaclayGif‐sur‐YvetteFrance
- MIA Paris‐Saclay, INRAE, AgroParisTechUniversité Paris‐SaclayPalaiseauFrance
| | - Alain Charcosset
- INRAE, CNRS, AgroParisTech, GQE–Le MoulonUniversité Paris‐SaclayGif‐sur‐YvetteFrance
| | - Laurence Moreau
- INRAE, CNRS, AgroParisTech, GQE–Le MoulonUniversité Paris‐SaclayGif‐sur‐YvetteFrance
| | - Renaud Rincent
- INRAE, CNRS, AgroParisTech, GQE–Le MoulonUniversité Paris‐SaclayGif‐sur‐YvetteFrance
| |
Collapse
|
5
|
Klimkowski Arango N, Morgante F. Comparing statistical learning methods for complex trait prediction from gene expression. PLoS One 2025; 20:e0317516. [PMID: 39932918 PMCID: PMC11813155 DOI: 10.1371/journal.pone.0317516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 12/30/2024] [Indexed: 02/13/2025] Open
Abstract
Accurate prediction of complex traits is an important task in quantitative genetics. Genotypes have been used for trait prediction using a variety of methods such as mixed models, Bayesian methods, penalized regression methods, dimension reduction methods, and machine learning methods. Recent studies have shown that gene expression levels can produce higher prediction accuracy than genotypes. However, only a few prediction methods were tested in these studies. Thus, a comprehensive assessment of methods is needed to fully evaluate the potential of gene expression as a predictor of complex trait phenotypes. Here, we used data from the Drosophila Genetic Reference Panel (DGRP) to compare the ability of several existing statistical learning methods to predict starvation resistance and startle response from gene expression in the two sexes separately. The methods considered differ in assumptions about the distribution of gene effects-ranging from models that assume that every gene affects the trait to more sparse models-and their ability to capture gene-gene interactions. We also used functional annotation (i.e., Gene Ontology (GO)) as a source of biological information to inform prediction models. The results show that differences in prediction accuracy exist. For example, methods performing variable selection achieved higher prediction accuracy for starvation resistance in females, while they generally had lower accuracy for startle response in both sexes. Incorporating GO annotations further improved prediction accuracy for a few GO terms of biological significance. Biological significance extended to the genes underlying highly predictive GO terms. Notably, the Insulin-like Receptor (InR) was prevalent across methods and sexes for starvation resistance. For startle response, crumbs (crb) and imaginal disc growth factor 2 (Idgf2) were found for females and males, respectively. Our results confirmed the potential of transcriptomic prediction and highlighted the importance of selecting appropriate methods and strategies in order to achieve accurate predictions.
Collapse
Affiliation(s)
- Noah Klimkowski Arango
- Center for Human Genetics, Clemson University, Greenwood, SC, United States of America
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States of America
| | - Fabio Morgante
- Center for Human Genetics, Clemson University, Greenwood, SC, United States of America
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States of America
| |
Collapse
|
6
|
Liu K, Yin Y, Wang B, Liu C, Zhou W, Niu P, Huang R, Li P, Zhao Q. Integrating Significant SNPs Identified by GWAS for Genomic Prediction of the Number of Ribs and Carcass Length in Suhuai Pigs. Animals (Basel) 2025; 15:412. [PMID: 39943181 PMCID: PMC11816145 DOI: 10.3390/ani15030412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Revised: 01/17/2025] [Accepted: 01/27/2025] [Indexed: 02/16/2025] Open
Abstract
The number of ribs (NRs) and the carcass length (CL) are important economic traits. The traits are usually measured after slaughter. To improve the prediction performance of genomic selection (GS) for NRs and CL, one strategy is to integrate the significant loci identified from whole-genome sequencing (WGS) data by genome-wide association study (GWAS) into the genomic prediction (GP) model. This study investigated the GP of different genomic best linear unbiased prediction (GBLUP) and Bayesian models using chip genotype data, imputed WGS (iWGS) data and modeling significant single-nucleotide polymorphisms (SNPs) in different ways for the GP of NRs and CL in the Suhuai pig population. The prediction accuracy, bias and running time of 15 different GP models were evaluated by 10-fold cross-validation. The prediction accuracy of GBLUP using chip data for NRs and CL was 0.314 ± 0.022 and 0.194 ± 0.040, respectively. For NRs, based on the iWGS data, treating the most significant SNP as fixed effects in the GBLUP model had the highest predictive performance, with a prediction accuracy of 0.528 ± 0.023. For CL, based on the chip data, the model that added all the significant SNPs identified by imputed data by GWAS into the multi-trait GBLUP as the second random additive effect was the highest predictive performance, with a prediction accuracy of 0.305 ± 0.027. This study provides insights into optimizing GP models for small populations with phenotypes that are difficult to measure.
Collapse
Affiliation(s)
- Kaiyue Liu
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
| | - Yanzhen Yin
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
| | - Binbin Wang
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
| | - Chenxi Liu
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
| | - Wuduo Zhou
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
| | - Peipei Niu
- Huaian Academy, Nanjing Agricultural University, Huai’an 223005, China;
| | - Ruihua Huang
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
- Huaian Academy, Nanjing Agricultural University, Huai’an 223005, China;
| | - Pinghua Li
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
- Huaian Academy, Nanjing Agricultural University, Huai’an 223005, China;
| | - Qingbo Zhao
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs Resources, Ministry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural University, Nanjing 210095, China; (K.L.); (Y.Y.); (B.W.); (C.L.); (W.Z.); (R.H.)
- Huaian Academy, Nanjing Agricultural University, Huai’an 223005, China;
| |
Collapse
|
7
|
Shi H, Geng H, Yang B, Yin Z, Liu Y. Integrating QTL and expression QTL of PigGTEx to improve the accuracy of genomic prediction for small population in Yorkshire pigs. Anim Genet 2025; 56:e70001. [PMID: 39916449 DOI: 10.1111/age.70001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 01/16/2025] [Accepted: 01/17/2025] [Indexed: 05/08/2025]
Abstract
The size of the reference population and sufficient phenotypic records are crucial for the accuracy of genomic selection. However, for small-to-medium-sized pig farms or breeds with limited population sizes, conducting genomic breeding programs presents significant challenges. In this study, 2295 Yorkshire pigs were selected from three distinct regions, including 1500 from an American line, 500 from a Canadian line, and 295 from a Danish line. All populations were genotyped using the GeneSeek 50K GGP Porcine HD chip. To enhance genomic selection accuracy, we proposed strategies that combined multiple populations and leveraged multi-omics prior information. Cis-QTL from the PigGTEx database and QTL identified through genome-wide association studies were incorporated into the genomic feature best linear unbiased prediction (GFBLUP) model to predict the ADG100 and the BF100 traits. Results demonstrated that combining multiple populations effectively improved prediction accuracy for small population, accuracy for ADG100 increased by an average of 0.29 and accuracy for BF100 by 0.05. The GFBLUP model, which integrates biological priors, showed some improvements in prediction accuracy for the BF100 trait. Specifically, for the small population, accuracy increased by 0.09 in Scheme 1, where each population size was predicted independently. In Scheme 3, where the large population was used as a reference group to predict the small population, accuracy increased by 0.03. However, the GFBLUP model did not provide additional benefits in predicting the ADG100 trait. These findings offer effective strategies for genetic improvement in developing regions and highlight the potential of multi-omics integration to enhance prediction models.
Collapse
Affiliation(s)
- Haoran Shi
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - He Geng
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - Bin Yang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| | - Zongjun Yin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei, China
| | - Yang Liu
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
8
|
Chen Y, Yang F, Yang Y, Hu Y, Meng Y, Zhang Y, Ran M, He J, Yin Y, Gao N. Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions. Int J Mol Sci 2024; 25:13155. [PMID: 39684865 DOI: 10.3390/ijms252313155] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2024] [Revised: 12/03/2024] [Accepted: 12/05/2024] [Indexed: 12/18/2024] Open
Abstract
In the context of modern pig farming, the central role of boars is underscored by large-scale centralized breeding and the widespread application of artificial insemination techniques. However, previous studies and breeding programs have focused mainly on product efficiency traits, such as growth rate, lean meat yield, and litter size, often neglecting boar semen traits. In this study, we estimated the genetic parameters and assessed the genomic prediction accuracy of boar semen traits with phenotypes evaluated from 274,332 ejections in a large population consisting of 2467 Duroc boars. Heritability of sperm morphological abnormality rate (ABN), fresh semen volume (VOL), sperm concentration (DEN), and motility (MOT) were estimated to be 0.43, 0.22, 0.23, and 0.16, respectively. GBLUP achieved a moderate predictive ability of semen traits, with a range of 0.32-0.50. Incorporating gene interactions indicated by the KEGG pathways (biBLUP) significantly improved predictive accuracy over the classical additive model (GBLUP) and epistatic model (RKHS). Moreover, biBLUP showed an improvement from 9.50% to 20.10% among the studied traits compared with GBLUP, with the greatest improvement (0.40 vs. 0.48) observed in sperm morphological abnormality rate. In conclusion, moderate to low heritability was estimated for the Duroc boar semen traits. Genomic prediction was able to achieve moderate accuracy, with a range from 0.32 to 0.56, for the studied traits. Considering gene interactions within KEGG pathways enhanced the predictive ability of boar semen traits.
Collapse
Affiliation(s)
- Yantong Chen
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Fang Yang
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Yanda Yang
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Yulong Hu
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Yang Meng
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Yuebo Zhang
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Maoliang Ran
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Jun He
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
| | - Yulong Yin
- Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha 410125, China
| | - Ning Gao
- Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510006, China
| |
Collapse
|
9
|
Harrison BR, Lee MB, Zhang S, Young B, Han K, Sukomol J, Paus V, Tran S, Kim D, Fitchett H, Pan Y, Tesfaye P, Johnson AW, Zhao X, Djukovic D, Raftery D, Promislow DEL. Wide-ranging genetic variation in sensitivity to rapamycin in Drosophila melanogaster. Aging Cell 2024; 23:e14292. [PMID: 39135281 PMCID: PMC11561674 DOI: 10.1111/acel.14292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 06/18/2024] [Accepted: 07/16/2024] [Indexed: 09/05/2024] Open
Abstract
The progress made in aging research using laboratory organisms is undeniable. Yet, with few exceptions, these studies are conducted in a limited number of isogenic strains. The path from laboratory discoveries to treatment in human populations is complicated by the reality of genetic variation in nature. To model the effect of genetic variation on the action of the drug rapamycin, here we use the growth of Drosophila melanogaster larvae. We screened 140 lines from the Drosophila Genetic References Panel for the extent of developmental delay and found wide-ranging variation in their response, from lines whose development time is nearly doubled by rapamycin, to those that appear to be completely resistant. Sensitivity did not associate with any single genetic marker, nor with any gene. However, variation at the level of genetic pathways was associated with rapamycin sensitivity and might provide insight into sensitivity. In contrast to the genetic analysis, metabolomic analysis showed a strong response of the metabolome to rapamycin, but only among the sensitive larvae. In particular, we found that rapamycin altered levels of amino acids in sensitive larvae, and in a direction strikingly similar to the metabolome response to nutrient deprivation. This work demonstrates the need to evaluate interventions across genetic backgrounds and highlights the potential of omic approaches to reveal biomarkers of drug efficacy and to shed light on mechanisms underlying sensitivity to interventions aimed at increasing lifespan.
Collapse
Affiliation(s)
- Benjamin R. Harrison
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | | | - Shufan Zhang
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Bill Young
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Kenneth Han
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Jiranut Sukomol
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Vanessa Paus
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Sarina Tran
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - David Kim
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Hannah Fitchett
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Yu‐Chen Pan
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Philmon Tesfaye
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Alia W. Johnson
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Xiaqing Zhao
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Danijel Djukovic
- Northwest Metabolomics Research Center, Department of Anesthesiology and Pain MedicineUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Daniel Raftery
- Northwest Metabolomics Research Center, Department of Anesthesiology and Pain MedicineUniversity of Washington School of MedicineSeattleWashingtonUSA
| | - Daniel E. L. Promislow
- Department of Laboratory Medicine and PathologyUniversity of Washington School of MedicineSeattleWashingtonUSA
- Department of BiologyUniversity of WashingtonSeattleWashingtonUSA
- Jean Mayer USDA Human Nutrition Research Center on AgingTufts UniversityBostonMassachusettsUSA
| |
Collapse
|
10
|
Zhang Z, Zhao W, Wang Z, Pan Y, Wang Q, Zhang Z. Integration of ssGWAS and ROH analyses for uncovering genetic variants associated with reproduction traits in Large White pigs. Anim Genet 2024; 55:714-724. [PMID: 39129705 DOI: 10.1111/age.13465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 05/26/2024] [Accepted: 07/05/2024] [Indexed: 08/13/2024]
Abstract
The low heritability of reproduction traits such as total number born (TNB), number born alive (NBA) and adjusted litter weight until 21 days at weaning (ALW) poses a challenge for genetic improvement. In this study, we aimed to identify genetic variants that influence these traits and evaluate the accuracy of genomic selection (GS) using these variants as genomic features. We performed single-step genome-wide association studies (ssGWAS) on 17 823 Large White (LW) pigs, of which 2770 were genotyped by 50K single nucleotide polymorphism (SNP) chips. Additionally, we analyzed runs of homozygosity (ROH) in the population and tested their effects on the traits. The genomic feature best linear unbiased prediction (GFBLUP) was then carried out in an independent population of 350 LW pigs using identified trait-related SNP subsets as genomic features. As a result, our findings identified five, one and four SNP windows that explaining more than 1% of genetic variance for ALW, TNB, and NBA, respectively and discovered 358 hotspots and nine ROH islands. The ROH SSC1:21814570-27186456 and SSC11:7220366-14276394 were found to be significantly associated with ALW and NBA, respectively. We assessed the genomic estimated breeding value accuracy through 20 replicates of five-fold cross-validation. Our findings demonstrate that GFBLUP, incorporating SNPs located in effective ROH (p-value < 0.05) as genomic features, might enhance GS accuracy for ALW compared with GBLUP. Additionally, using SNPs explaining more than 0.1% of the genetic variance in ssGWAS for NBA as genomic features might improve the GS accuracy, too. However, it is important to note that the incorporation of inappropriate genomic features can significantly reduce GS accuracy. In conclusion, our findings provide valuable insights into the genetic mechanisms of reproductive traits in pigs and suggest that the ssGWAS and ROH have the potential to enhance the accuracy of GS for reproductive traits in LW pigs.
Collapse
Affiliation(s)
- Zhenyang Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Wei Zhao
- SciGene Biotechnology Co. Ltd, Hefei, China
| | - Zhen Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Yuchun Pan
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
- Hainan Institute, Zhejiang University, Sanya, China
| | - Qishan Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
- Hainan Institute, Zhejiang University, Sanya, China
| | - Zhe Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
11
|
Song H, Dong T, Wang W, Yan X, Geng C, Bai S, Hu H. GWAS Enhances Genomic Prediction Accuracy of Caviar Yield, Caviar Color and Body Weight Traits in Sturgeons Using Whole-Genome Sequencing Data. Int J Mol Sci 2024; 25:9756. [PMID: 39273703 PMCID: PMC11395957 DOI: 10.3390/ijms25179756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2024] [Revised: 09/05/2024] [Accepted: 09/07/2024] [Indexed: 09/15/2024] Open
Abstract
Caviar yield, caviar color, and body weight are crucial economic traits in sturgeon breeding. Understanding the molecular mechanisms behind these traits is essential for their genetic improvement. In this study, we performed whole-genome sequencing on 673 Russian sturgeons, renowned for their high-quality caviar. With an average sequencing depth of 13.69×, we obtained approximately 10.41 million high-quality single nucleotide polymorphisms (SNPs). Using a genome-wide association study (GWAS) with a single-marker regression model, we identified SNPs and genes associated with these traits. Our findings revealed several candidate genes for each trait: caviar yield: TFAP2A, RPS6KA3, CRB3, TUBB, H2AFX, morc3, BAG1, RANBP2, PLA2G1B, and NYAP1; caviar color: NFX1, OTULIN, SRFBP1, PLEK, INHBA, and NARS; body weight: ACVR1, HTR4, fmnl2, INSIG2, GPD2, ACVR1C, TANC1, KCNH7, SLC16A13, XKR4, GALR2, RPL39, ACVR2A, ADCY10, and ZEB2. Additionally, using the genomic feature BLUP (GFBLUP) method, which combines linkage disequilibrium (LD) pruning markers with GWAS prior information, we improved genomic prediction accuracy by 2%, 1.9%, and 3.1% for caviar yield, caviar color, and body weight traits, respectively, compared to the GBLUP method. In conclusion, this study enhances our understanding of the genetic mechanisms underlying caviar yield, caviar color, and body weight traits in sturgeons, providing opportunities for genetic improvement of these traits through genomic selection.
Collapse
Affiliation(s)
- Hailiang Song
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
- National Innovation Center for Digital Seed Industry, Beijing 100097, China
| | - Tian Dong
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Wei Wang
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Xiaoyu Yan
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Chenfan Geng
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Song Bai
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Hongxia Hu
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
- National Innovation Center for Digital Seed Industry, Beijing 100097, China
| |
Collapse
|
12
|
Wang H, Li C, Li J, Zhang R, An X, Yuan C, Guo T, Yue Y. Genomic Selection for Weaning Weight in Alpine Merino Sheep Based on GWAS Prior Marker Information. Animals (Basel) 2024; 14:1904. [PMID: 38998016 PMCID: PMC11240623 DOI: 10.3390/ani14131904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 06/19/2024] [Accepted: 06/24/2024] [Indexed: 07/14/2024] Open
Abstract
This study aims to compare the accuracy of genomic estimated breeding values (GEBV) estimated using a genomic best linear unbiased prediction (GBLUP) method and GEBV estimates incorporating prior marker information from a genome-wide association study (GWAS) for the weaning weight trait in highland Merino sheep. The objective is to provide theoretical and technical support for improving the accuracy of genomic selection. The study used a population of 1007 highland Merino ewes, with the weaning weight at 3 months as the target trait. The population was randomly divided into two groups. The first group was used for GWAS analysis to identify significant markers, and the top 5%, top 10%, top 15%, and top 20% markers were selected as prior marker information. The second group was used to estimate genetic parameters and compare the accuracy of GEBV predictions using different prior marker information. The accuracy was obtained using a five-fold cross-validation. Finally, both groups were subjected to cross-validation. The study's findings revealed that the heritability of the weaning weight trait, as calculated using the GBLUP model, ranged from 0.122 to 0.394, with corresponding prediction accuracies falling between 0.075 and 0.228. By incorporating prior marker information from GWAS, the heritability was enhanced to a range of 0.125 to 0.407. The inclusion of the top 5% to top 20% significant SNPs from GWAS results as prior information into GS showed potential for improving the accuracy of predicting genomic breeding value.
Collapse
Affiliation(s)
- Haifeng Wang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Chenglan Li
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Jianye Li
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Rui Zhang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Xuejiao An
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Chao Yuan
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Tingting Guo
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Yaojing Yue
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| |
Collapse
|
13
|
Arango NK, Morgante F. Comparing statistical learning methods for complex trait prediction from gene expression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.01.596951. [PMID: 38895364 PMCID: PMC11185554 DOI: 10.1101/2024.06.01.596951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Accurate prediction of complex traits is an important task in quantitative genetics that has become increasingly relevant for personalized medicine. Genotypes have traditionally been used for trait prediction using a variety of methods such as mixed models, Bayesian methods, penalized regressions, dimension reductions, and machine learning methods. Recent studies have shown that gene expression levels can produce higher prediction accuracy than genotypes. However, only a few prediction methods were used in these studies. Thus, a comprehensive assessment of methods is needed to fully evaluate the potential of gene expression as a predictor of complex trait phenotypes. Here, we used data from the Drosophila Genetic Reference Panel (DGRP) to compare the ability of several existing statistical learning methods to predict starvation resistance from gene expression in the two sexes separately. The methods considered differ in assumptions about the distribution of gene effect sizes - ranging from models that assume that every gene affects the trait to more sparse models - and their ability to capture gene-gene interactions. We also used functional annotation (i.e., Gene Ontology (GO)) as an external source of biological information to inform prediction models. The results show that differences in prediction accuracy between methods exist, although they are generally not large. Methods performing variable selection gave higher accuracy in females while methods assuming a more polygenic architecture performed better in males. Incorporating GO annotations further improved prediction accuracy for a few GO terms of biological significance. Biological significance extended to the genes underlying highly predictive GO terms with different genes emerging between sexes. Notably, the Insulin-like Receptor (InR) was prevalent across methods and sexes. Our results confirmed the potential of transcriptomic prediction and highlighted the importance of selecting appropriate methods and strategies in order to achieve accurate predictions.
Collapse
Affiliation(s)
- Noah Klimkowski Arango
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Fabio Morgante
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| |
Collapse
|
14
|
Xie Z, Xu X, Li L, Wu C, Ma Y, He J, Wei S, Wang J, Feng X. Residual networks without pooling layers improve the accuracy of genomic predictions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:138. [PMID: 38771334 DOI: 10.1007/s00122-024-04649-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 05/10/2024] [Indexed: 05/22/2024]
Abstract
KEY MESSAGE Residual neural network genomic selection is the first GS algorithm to reach 35 layers, and its prediction accuracy surpasses previous algorithms. With the decrease in DNA sequencing costs and the development of deep learning, phenotype prediction accuracy by genomic selection (GS) continues to improve. Residual networks, a widely validated deep learning technique, are introduced to deep learning for GS. Since each locus has a different weighted impact on the phenotype, strided convolutions are more suitable for GS problems than pooling layers. Through the above technological innovations, we propose a GS deep learning algorithm, residual neural network for genomic selection (ResGS). ResGS is the first neural network to reach 35 layers in GS. In 15 cases from four public data, the prediction accuracy of ResGS is higher than that of ridge-regression best linear unbiased prediction, support vector regression, random forest, gradient boosting regressor, and deep neural network genomic prediction in most cases. ResGS performs well in dealing with gene-environment interaction. Phenotypes from other environments are imported into ResGS along with genetic data. The prediction results are much better than just providing genetic data as input, which demonstrates the effectiveness of GS multi-modal learning. Standard deviation is recommended as an auxiliary GS evaluation metric, which could improve the distribution of predicted results. Deep learning for GS, such as ResGS, is becoming more accurate in phenotype prediction.
Collapse
Affiliation(s)
| | - Xiaogang Xu
- School of Computer Science and Technology, Zhejiang Gongshang University, Hangzhou, 310012, China.
| | - Ling Li
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Cuiling Wu
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Yinxing Ma
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jingjing He
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Sidi Wei
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jun Wang
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Xianzhong Feng
- Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
| |
Collapse
|
15
|
Zhang Y, Zhuang Z, Liu Y, Huang J, Luan M, Zhao X, Dong L, Ye J, Yang M, Zheng E, Cai G, Wu Z, Yang J. Genomic prediction based on preselected single-nucleotide polymorphisms from genome-wide association study and imputed whole-genome sequence data annotation for growth traits in Duroc pigs. Evol Appl 2024; 17:e13651. [PMID: 38362509 PMCID: PMC10868536 DOI: 10.1111/eva.13651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 10/31/2023] [Accepted: 01/13/2024] [Indexed: 02/17/2024] Open
Abstract
The use of whole-genome sequence (WGS) data is expected to improve genomic prediction (GP) power of complex traits because it may contain mutations that in strong linkage disequilibrium pattern with causal mutations. However, a few previous studies have shown no or small improvement in prediction accuracy using WGS data. Incorporating prior biological information into GP seems to be an attractive strategy that might improve prediction accuracy. In this study, a total of 6334 pigs were genotyped using 50K chips and subsequently imputed to the WGS level. This cohort includes two prior discovery populations that comprise 294 Landrace pigs and 186 Duroc pigs, as well as two validation populations that consist of 3770 American Duroc pigs and 2084 Canadian Duroc pigs. Then we used annotation information and genome-wide association study (GWAS) from the WGS data to make GP for six growth traits in two Duroc pig populations. Based on variant annotation, we partitioned different genomic classes, such as intron, intergenic, and untranslated regions, for imputed WGS data. Based on GWAS results of WGS data, we obtained trait-associated single-nucleotide polymorphisms (SNPs). We then applied the genomic feature best linear unbiased prediction (GFBLUP) and genomic best linear unbiased prediction (GBLUP) models to estimate the genomic estimated breeding values for growth traits with these different variant panels, including six genomic classes and trait-associated SNPs. Compared with 50K chip data, GBLUP with imputed WGS data had no increase in prediction accuracy. Using only annotations resulted in no increase in prediction accuracy compared to GBLUP with 50K, but adding annotation information into the GFBLUP model with imputed WGS data could improve the prediction accuracy with increases of 0.00%-2.82%. In conclusion, a GFBLUP model that incorporated prior biological information might increase the advantage of using imputed WGS data for GP.
Collapse
Affiliation(s)
- Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Jinyan Huang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Menghao Luan
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Xiang Zhao
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Linsong Dong
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Jian Ye
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Ming Yang
- College of Animal Science and TechnologyZhongkai University of Agriculture and EngineeringGuangzhouChina
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| |
Collapse
|
16
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
17
|
Li C, Li J, Wang H, Zhang R, An X, Yuan C, Guo T, Yue Y. Genomic Selection for Live Weight in the 14th Month in Alpine Merino Sheep Combining GWAS Information. Animals (Basel) 2023; 13:3516. [PMID: 38003134 PMCID: PMC10668700 DOI: 10.3390/ani13223516] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 10/25/2023] [Accepted: 11/07/2023] [Indexed: 11/26/2023] Open
Abstract
Alpine Merino Sheep is a novel breed reared from Australian Merino Sheep as the father and Gansu Alpine Fine-Wool Sheep as the mother, living all year in cold and arid alpine areas with exceptional wool quality and meat performance. Body weight is an important economic trait of the Alpine Merino Sheep, but there is limited research on identifying the genes associated with live weight in the 14th month for improving the accuracy of the genomic prediction of this trait. Therefore, this study's sample comprised 1310 Alpine Merino Sheep ewes, and the Fine Wool Sheep 50K Panel was used for genome-wide association study (GWAS) analysis to identify candidate genes. Moreover, the trial population (1310 ewes) in this study was randomly divided into two groups. One group was used as the population for GWAS analysis and screened for the most significant top 5%, top 10%, top 15%, and top 20% SNPs to obtain prior marker information. The other group was used to estimate the genetic parameters based on the weight assigned by heritability combined with different prior marker information. The aim of this study was to compare the accuracy of genomic breeding value estimation when combined with prior marker information from GWAS analysis with the optimal linear unbiased prediction method for genome selection (GBLUP) for the breeding value of target traits. Finally, the accuracy was evaluated using the five-fold cross-validation method. This research provides theoretical and technical support to improve the accuracy of sheep genome selection and better guide breeding. The results demonstrated that eight candidate genes were associated with GWAS analysis, and the gene function query and literature search results suggested that FAM184B, NCAPG, MACF1, ANKRD44, DCAF16, FUK, LCORL, and SYN3 were candidate genes affecting live weight in the 14th month (WT), which regulated the growth of muscle and bone in sheep. In genome selection analysis, the heritability of GBLUP to calculate the WT was 0.335-0.374, the accuracy after five-fold cross-verification was 0.154-0.190, and after assigning different weights to the top 5%, top 10%, top 15%, and top 20% of the GWAS results in accordance with previous information to construct the G matrix, the accuracy of the WT in the GBLUP model was improved by 2.59-7.79%.
Collapse
Affiliation(s)
- Chenglan Li
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Jianye Li
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Haifeng Wang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Rui Zhang
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Xuejiao An
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Chao Yuan
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Tingting Guo
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| | - Yaojing Yue
- Key Laboratory of Animal Genetics and Breeding on Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou 730050, China; (C.L.)
- Sheep Breeding Engineering Technology Research Center of Chinese Academy of Agricultural Sciences, Lanzhou 730050, China
| |
Collapse
|
18
|
Zhu D, Zhao Y, Zhang R, Wu H, Cai G, Wu Z, Wang Y, Hu X. Genomic prediction based on selective linkage disequilibrium pruning of low-coverage whole-genome sequence variants in a pure Duroc population. Genet Sel Evol 2023; 55:72. [PMID: 37853325 PMCID: PMC10583454 DOI: 10.1186/s12711-023-00843-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 09/14/2023] [Indexed: 10/20/2023] Open
Abstract
BACKGROUND Although the accumulation of whole-genome sequencing (WGS) data has accelerated the identification of mutations underlying complex traits, its impact on the accuracy of genomic predictions is limited. Reliable genotyping data and pre-selected beneficial loci can be used to improve prediction accuracy. Previously, we reported a low-coverage sequencing genotyping method that yielded 11.3 million highly accurate single-nucleotide polymorphisms (SNPs) in pigs. Here, we introduce a method termed selective linkage disequilibrium pruning (SLDP), which refines the set of SNPs that show a large gain during prediction of complex traits using whole-genome SNP data. RESULTS We used the SLDP method to identify and select markers among millions of SNPs based on genome-wide association study (GWAS) prior information. We evaluated the performance of SLDP with respect to three real traits and six simulated traits with varying genetic architectures using two representative models (genomic best linear unbiased prediction and BayesR) on samples from 3579 Duroc boars. SLDP was determined by testing 180 combinations of two core parameters (GWAS P-value thresholds and linkage disequilibrium r2). The parameters for each trait were optimized in the training population by five fold cross-validation and then tested in the validation population. Similar to previous GWAS prior-based methods, the performance of SLDP was mainly affected by the genetic architecture of the traits analyzed. Specifically, SLDP performed better for traits controlled by major quantitative trait loci (QTL) or a small number of quantitative trait nucleotides (QTN). Compared with two commercial SNP chips, genotyping-by-sequencing data, and an unselected whole-genome SNP panel, the SLDP strategy led to significant improvements in prediction accuracy, which ranged from 0.84 to 3.22% for real traits controlled by major or moderate QTL and from 1.23 to 11.47% for simulated traits controlled by a small number of QTN. CONCLUSIONS The SLDP marker selection method can be incorporated into mainstream prediction models to yield accuracy improvements for traits with a relatively simple genetic architecture, however, it has no significant advantage for traits not controlled by major QTL. The main factors that affect its performance are the genetic architecture of traits and the reliability of GWAS prior information. Our findings can facilitate the application of WGS-based genomic selection.
Collapse
Affiliation(s)
- Di Zhu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yiqiang Zhao
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ran Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Hanyu Wu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, China.
| | - Yuzhe Wang
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China.
| | - Xiaoxiang Hu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
19
|
Wang M, Bissonnette N, Laterrière M, Dudemaine PL, Gagné D, Roy JP, Sirard MA, Ibeagha-Awemu EM. Gene co-expression in response to Staphylococcus aureus infection reveals networks of genes with specific functions during bovine subclinical mastitis. J Dairy Sci 2023; 106:5517-5536. [PMID: 37291036 DOI: 10.3168/jds.2022-22757] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 02/08/2023] [Indexed: 06/10/2023]
Abstract
Staphylococcus aureus is one of the most prevalent contagious bacterial pathogen of bovine mastitis. The subclinical mastitis it causes has long-term economic implications and it is difficult to control. To further understanding of the genetic basis of mammary gland defense against S. aureus infection, the transcriptomes of milk somatic cells from 15 cows with persistent natural S. aureus infection (S. aureus-positive, SAP) and 10 healthy control cows (HC) were studied by deep RNA-sequencing technology. Comparing the transcriptomes of SAP to HC group revealed 4,077 differentially expressed genes (DEG; 1,616 up- and 2,461 downregulated). Functional annotation indicated enrichment of DEG in 94 Gene Ontology (GO) and 47 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. Terms related to the immune response and disease processes were mostly enriched for by upregulated DEG, whereas biological process terms related to cell adhesion, cell movement and localization, and tissue development were mostly enriched for by downregulated DEG. Weighted gene co-expression network analysis grouped DEG into 7 modules, the most important module (colored turquoise by software and here referred to as Turquoise module) was positively significantly correlated with S. aureus subclinical mastitis. The 1,546 genes in the Turquoise module were significantly enriched in 48 GO terms and 72 KEGG pathways, with 80% of them being disease- and immune-related terms [e.g., immune system process (GO:0002376), cytokine-cytokine receptor interaction (bta04060) and S. aureus infection (bta05150)]. Some DEG such as IFNG, IL18, IL1B, NFKB1, CXCL8, and IL12B were enriched in immune and disease pathways suggesting their possible involvement in the regulation of the host response to S. aureus infection. Four modules (Yellow, Brown, Blue, and Red) were negatively correlated (significantly) with S. aureus subclinical mastitis, and were enriched in functional annotations involved in the regulation of cell migration, cell communication, metabolic process, and blood circulatory system development, respectively. Application of sparse partial least squares discriminant analysis to genes of the Turquoise module identified 5 genes (NR2F6, PDLIM5, RAB11FIP5, ACOT4, and TMEM53) capable of explaining the majority of the differences in the expression patterns between SAP and HC cows. In conclusion, this study has furthered understanding of the genetic changes in the mammary gland and the molecular mechanisms underlying S. aureus mastitis, as well as revealed a list of candidate discriminant genes with potential regulatory roles in response to S. aureus infection.
Collapse
Affiliation(s)
- Mengqi Wang
- Sherbrooke Research and Development Centre, Agriculture and Agri-Food Canada, Sherbrooke, Quebec, J1MOC8, Canada; Department of Animal Science, Laval University, Quebec City, Quebec, G1V 0A6, Canada
| | - Nathalie Bissonnette
- Sherbrooke Research and Development Centre, Agriculture and Agri-Food Canada, Sherbrooke, Quebec, J1MOC8, Canada
| | - Mario Laterrière
- Quebec Research and Development Centre, Agriculture and Agri-Food Canada, Quebec City, Quebec, G1V 2J3, Canada
| | - Pier-Luc Dudemaine
- Sherbrooke Research and Development Centre, Agriculture and Agri-Food Canada, Sherbrooke, Quebec, J1MOC8, Canada
| | - David Gagné
- Quebec Research and Development Centre, Agriculture and Agri-Food Canada, Quebec City, Quebec, G1V 2J3, Canada
| | - Jean-Philippe Roy
- Department of Clinical Sciences, Faculty of Veterinary Medicine, Université de Montréal, St-Hyacinthe, Quebec, H3T 1J4, Canada
| | - Marc-André Sirard
- Department of Animal Science, Laval University, Quebec City, Quebec, G1V 0A6, Canada
| | - Eveline M Ibeagha-Awemu
- Sherbrooke Research and Development Centre, Agriculture and Agri-Food Canada, Sherbrooke, Quebec, J1MOC8, Canada.
| |
Collapse
|
20
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
21
|
Wu Y, Li D, Hu Y, Li H, Ramstein GP, Zhou S, Zhang X, Bao Z, Zhang Y, Song B, Zhou Y, Zhou Y, Gagnon E, Särkinen T, Knapp S, Zhang C, Städler T, Buckler ES, Huang S. Phylogenomic discovery of deleterious mutations facilitates hybrid potato breeding. Cell 2023; 186:2313-2328.e15. [PMID: 37146612 DOI: 10.1016/j.cell.2023.04.008] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 02/20/2023] [Accepted: 04/05/2023] [Indexed: 05/07/2023]
Abstract
Hybrid potato breeding will transform the crop from a clonally propagated tetraploid to a seed-reproducing diploid. Historical accumulation of deleterious mutations in potato genomes has hindered the development of elite inbred lines and hybrids. Utilizing a whole-genome phylogeny of 92 Solanaceae and its sister clade species, we employ an evolutionary strategy to identify deleterious mutations. The deep phylogeny reveals the genome-wide landscape of highly constrained sites, comprising ∼2.4% of the genome. Based on a diploid potato diversity panel, we infer 367,499 deleterious variants, of which 50% occur at non-coding and 15% at synonymous sites. Counterintuitively, diploid lines with relatively high homozygous deleterious burden can be better starting material for inbred-line development, despite showing less vigorous growth. Inclusion of inferred deleterious mutations increases genomic-prediction accuracy for yield by 24.7%. Our study generates insights into the genome-wide incidence and properties of deleterious mutations and their far-reaching consequences for breeding.
Collapse
Affiliation(s)
- Yaoyao Wu
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA
| | - Dawei Li
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; State Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan 571101, China
| | - Yong Hu
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; The AGISCAAS-YNNU Joint Academy of Potato Sciences, Yunnan Normal University, Kunming, Yunnan 650500, China
| | - Hongbo Li
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Guillaume P Ramstein
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus 8000, Denmark
| | - Shaoqun Zhou
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Xinyan Zhang
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Zhigui Bao
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Yu Zhang
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; School of Agriculture, Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Baoxing Song
- Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261000, China
| | - Yao Zhou
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100094, China
| | - Yongfeng Zhou
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Edeline Gagnon
- Technische Universität München, TUM School of Life Sciences, Emil-Ramann-Strasse 2, 85354 Freising, Germany
| | - Tiina Särkinen
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, UK
| | - Sandra Knapp
- Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - Chunzhi Zhang
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Thomas Städler
- Institute of Integrative Biology and Zurich-Basel Plant Science Center, ETH Zurich, 8092 Zurich, Switzerland
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; USDA-ARS, Ithaca, NY 14853, USA
| | - Sanwen Huang
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China; State Key Laboratory of Tropical Crop Breeding, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan 571101, China.
| |
Collapse
|
22
|
Tahir MS, Porto-Neto LR, Reverter-Gomez T, Olasege BS, Sajid MR, Wockner KB, Tan AWL, Fortes MRS. Utility of multi-omics data to inform genomic prediction of heifer fertility traits. J Anim Sci 2022; 100:skac340. [PMID: 36239447 PMCID: PMC9733504 DOI: 10.1093/jas/skac340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 10/12/2022] [Indexed: 12/15/2022] Open
Abstract
Biologically informed single nucleotide polymorphisms (SNPs) impact genomic prediction accuracy of the target traits. Our previous genomics, proteomics, and transcriptomics work identified candidate genes related to puberty and fertility in Brahman heifers. We aimed to test this biological information for capturing heritability and predicting heifer fertility traits in another breed i.e., Tropical Composite. The SNP from the identified genes including 10 kilobases (kb) region on either side were selected as biologically informed SNP set. The SNP from the rest of the Bos taurus genes including 10-kb region on either side were selected as biologically uninformed SNP set. Bovine high-density (HD) complete SNP set (628,323 SNP) was used as a control. Two populations-Tropical Composites (N = 1331) and Brahman (N = 2310)-had records for three traits: pregnancy after first mating season (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5). Using the best linear unbiased prediction method, effectiveness of each SNP set to predict the traits was tested in two scenarios: a 5-fold cross-validation within Tropical Composites using biological information from Brahman studies, and application of prediction equations from one breed to the other. The accuracy of prediction was calculated as the correlation between genomic estimated breeding values and adjusted phenotypes. Results show that biologically informed SNP set estimated heritabilities not significantly better than the control HD complete SNP set in Tropical Composites; however, it captured all the observed genetic variance in PREG1 and FCS when modeled together with the biologically uninformed SNP set. In 5-fold cross-validation within Tropical Composites, the biologically informed SNP set performed marginally better (statistically insignificant) in terms of prediction accuracies (PREG1: 0.20, FCS: 0.13, and REB: 0.12) as compared to HD complete SNP set (PREG1: 0.17, FCS: 0.10, and REB: 0.11), and biologically uninformed SNP set (PREG1: 0.16, FCS: 0.10, and REB: 0.11). Across-breed use of prediction equations still remained a challenge: accuracies by all SNP sets dropped to around zero for all traits. The performance of biologically informed SNP was not significantly better than other sets in Tropical Composites. However, results indicate that biological information obtained from Brahman was successful to predict the fertility traits in Tropical Composite population.
Collapse
Affiliation(s)
- Muhammad S Tahir
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Laercio R Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Toni Reverter-Gomez
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Babatunde S Olasege
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Mirza R Sajid
- Department of Statistics, University of Gujrat, 50700 Punjab, Pakistan
| | - Kimberley B Wockner
- Queensland Department of Agriculture and Fisheries, Brisbane 4072, QLD, Australia
| | - Andre W L Tan
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Marina R S Fortes
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| |
Collapse
|
23
|
Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. GENETICS SELECTION EVOLUTION 2022; 54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]
Abstract
Background Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage. Methods We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests. Results The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected. Conclusions Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00756-0.
Collapse
|
24
|
Mollandin F, Gilbert H, Croiseau P, Rau A. Accounting for overlapping annotations in genomic prediction models of complex traits. BMC Bioinformatics 2022; 23:365. [PMID: 36068513 PMCID: PMC9446854 DOI: 10.1186/s12859-022-04914-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 08/25/2022] [Indexed: 11/10/2022] Open
Abstract
Background It is now widespread in livestock and plant breeding to use genotyping data to predict phenotypes with genomic prediction models. In parallel, genomic annotations related to a variety of traits are increasing in number and granularity, providing valuable insight into potentially important positions in the genome. The BayesRC model integrates this prior biological information by factorizing the genome according to disjoint annotation categories, in some cases enabling improved prediction of heritable traits. However, BayesRC is not adapted to cases where markers may have multiple annotations. Results We propose two novel Bayesian approaches to account for multi-annotated markers through a cumulative (BayesRC+) or preferential (BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π) model of the contribution of multiple annotation categories. We illustrate their performance on simulated data with various genetic architectures and types of annotations. We also explore their use on data from a backcross population of growing pigs in conjunction with annotations constructed using the PigQTLdb. In both simulated and real data, we observed a modest improvement in prediction quality with our models when used with informative annotations. In addition, our results show that BayesRC+ successfully prioritizes multi-annotated markers according to their posterior variance, while BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π provides a useful interpretation of informative annotations for multi-annotated markers. Finally, we explore several strategies for constructing annotations from a public database, highlighting the importance of careful consideration of this step. Conclusion When used with annotations that are relevant to the trait under study, BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π and BayesRC+ allow for improved prediction and prioritization of multi-annotated markers, and can provide useful biological insight into the genetic architecture of traits. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04914-5.
Collapse
Affiliation(s)
- Fanny Mollandin
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France.
| | - Hélène Gilbert
- GenPhySE, INRAE, ENVT, Université de Toulouse, 31320, Castanet Tolosan, France
| | - Pascal Croiseau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France
| | - Andrea Rau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France.,BioEcoAgro Joint Research Unit, INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, 50136, Estrée-Mons, France
| |
Collapse
|
25
|
Solodneva EV, Kuznetsov SB, Velieva AE, Stolpovsky YA. Molecular-Genetic Bases of Mammary Gland Development Using the Example of Cattle and Other Animal Species: I. Embryonic and Pubertal Developmental Stage. RUSS J GENET+ 2022. [DOI: 10.1134/s1022795422080087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
26
|
Wang Z, Zhang Z, Chen Z, Sun J, Cao C, Wu F, Xu Z, Zhao W, Sun H, Guo L, Zhang Z, Wang Q, Pan Y. PHARP: a pig haplotype reference panel for genotype imputation. Sci Rep 2022; 12:12645. [PMID: 35879321 PMCID: PMC9314402 DOI: 10.1038/s41598-022-15851-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 06/30/2022] [Indexed: 11/18/2022] Open
Abstract
Pigs not only function as a major meat source worldwide but also are commonly used as an animal model for studying human complex traits. A large haplotype reference panel has been used to facilitate efficient phasing and imputation of relatively sparse genome-wide microarray chips and low-coverage sequencing data. Using the imputed genotypes in the downstream analysis, such as GWASs, TWASs, eQTL mapping and genomic prediction (GS), is beneficial for obtaining novel findings. However, currently, there is still a lack of publicly available and high-quality pig reference panels with large sample sizes and high diversity, which greatly limits the application of genotype imputation in pigs. In response, we built the pig Haplotype Reference Panel (PHARP) database. PHARP provides a reference panel of 2012 pig haplotypes at 34 million SNPs constructed using whole-genome sequence data from more than 49 studies of 71 pig breeds. It also provides Web-based analytical tools that allow researchers to carry out phasing and imputation consistently and efficiently. PHARP is freely accessible at http://alphaindex.zju.edu.cn/PHARP/index.php . We demonstrate its applicability for pig commercial 50 K SNP arrays, by accurately imputing 2.6 billion genotypes at a concordance rate value of 0.971 in 81 Large White pigs (~ 17 × sequencing coverage). We also applied our reference panel to impute the low-density SNP chip into the high-density data for three GWASs and found novel significantly associated SNPs that might be casual variants.
Collapse
Affiliation(s)
- Zhen Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhenyang Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zitao Chen
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Jiabao Sun
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Caiyun Cao
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Fen Wu
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhong Xu
- Hubei Key Laboratory of Animal Embryo and Molecular Breeding, Institute of Animal Husbandry and Veterinary, Hubei Provincial Academy of Agricultural Sciences, Wuhan, 430064, China
| | - Wei Zhao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hao Sun
- Department of Animal Science, School of Animal Science, Jilin University, Changchun, 130062, China
| | - Longyu Guo
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhe Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Qishan Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Yuchun Pan
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| |
Collapse
|
27
|
Olguín V, Durán A, Las Heras M, Rubilar JC, Cubillos FA, Olguín P, Klein AD. Genetic Background Matters: Population-Based Studies in Model Organisms for Translational Research. Int J Mol Sci 2022; 23:7570. [PMID: 35886916 PMCID: PMC9316598 DOI: 10.3390/ijms23147570] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 06/28/2022] [Accepted: 07/04/2022] [Indexed: 02/01/2023] Open
Abstract
We are all similar but a bit different. These differences are partially due to variations in our genomes and are related to the heterogeneity of symptoms and responses to treatments that patients exhibit. Most animal studies are performed in one single strain with one manipulation. However, due to the lack of variability, therapies are not always reproducible when treatments are translated to humans. Panels of already sequenced organisms are valuable tools for mimicking human phenotypic heterogeneities and gene mapping. This review summarizes the current knowledge of mouse, fly, and yeast panels with insightful applications for translational research.
Collapse
Affiliation(s)
- Valeria Olguín
- Centro de Genética y Genómica, Facultad de Medicina, Clínica Alemana Universidad del Desarrollo, Santiago 7610658, Chile; (V.O.); (A.D.); (M.L.H.); (J.C.R.)
| | - Anyelo Durán
- Centro de Genética y Genómica, Facultad de Medicina, Clínica Alemana Universidad del Desarrollo, Santiago 7610658, Chile; (V.O.); (A.D.); (M.L.H.); (J.C.R.)
| | - Macarena Las Heras
- Centro de Genética y Genómica, Facultad de Medicina, Clínica Alemana Universidad del Desarrollo, Santiago 7610658, Chile; (V.O.); (A.D.); (M.L.H.); (J.C.R.)
| | - Juan Carlos Rubilar
- Centro de Genética y Genómica, Facultad de Medicina, Clínica Alemana Universidad del Desarrollo, Santiago 7610658, Chile; (V.O.); (A.D.); (M.L.H.); (J.C.R.)
| | - Francisco A. Cubillos
- Departamento de Biología, Santiago, Facultad de Química y Biología, Universidad de Santiago de Chile, Santiago 9170022, Chile;
- Millennium Institute for Integrative Biology (iBio), Santiago 7500565, Chile
| | - Patricio Olguín
- Program in Human Genetics, Institute of Biomedical Sciences, Biomedical Neurosciences Institute, Department of Neuroscience, Facultad de Medicina, Universidad de Chile, Santiago 8380453, Chile;
| | - Andrés D. Klein
- Centro de Genética y Genómica, Facultad de Medicina, Clínica Alemana Universidad del Desarrollo, Santiago 7610658, Chile; (V.O.); (A.D.); (M.L.H.); (J.C.R.)
| |
Collapse
|
28
|
Hershberger J, Tanaka R, Wood JC, Kaczmar N, Wu D, Hamilton JP, DellaPenna D, Buell CR, Gore MA. Transcriptome-wide association and prediction for carotenoids and tocochromanols in fresh sweet corn kernels. THE PLANT GENOME 2022; 15:e20197. [PMID: 35262278 DOI: 10.1002/tpg2.20197] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 01/23/2022] [Indexed: 06/14/2023]
Abstract
Sweet corn (Zea mays L.) is consistently one of the most highly consumed vegetables in the United States, providing a valuable opportunity to increase nutrient intake through biofortification. Significant variation for carotenoid (provitamin A, lutein, zeaxanthin) and tocochromanol (vitamin E, antioxidants) levels is present in temperate sweet corn germplasm, yet previous genome-wide association studies (GWAS) of these traits have been limited by low statistical power and mapping resolution. Here, we employed a high-quality transcriptomic dataset collected from fresh sweet corn kernels to conduct transcriptome-wide association studies (TWAS) and transcriptome prediction studies for 39 carotenoid and tocochromanol traits. In agreement with previous GWAS findings, TWAS detected significant associations for four causal genes, β-carotene hydroxylase (crtRB1), lycopene epsilon cyclase (lcyE), γ-tocopherol methyltransferase (vte4), and homogentisate geranylgeranyltransferase (hggt1) on a transcriptome-wide level. Pathway-level analysis revealed additional associations for deoxy-xylulose synthase2 (dxs2), diphosphocytidyl methyl erythritol synthase2 (dmes2), cytidine methyl kinase1 (cmk1), and geranylgeranyl hydrogenase1 (ggh1), of which, dmes2, cmk1, and ggh1 have not previously been identified through maize association studies. Evaluation of prediction models incorporating genome-wide markers and transcriptome-wide abundances revealed a trait-dependent benefit to the inclusion of both genomic and transcriptomic data over solely genomic data, but both transcriptome- and genome-wide datasets outperformed a priori candidate gene-targeted prediction models for most traits. Altogether, this study represents an important step toward understanding the role of regulatory variation in the accumulation of vitamins in fresh sweet corn kernels.
Collapse
Affiliation(s)
- Jenna Hershberger
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell Univ., Ithaca, NY, 14853, USA
| | - Ryokei Tanaka
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell Univ., Ithaca, NY, 14853, USA
| | - Joshua C Wood
- Dep. of Crop & Soil Sciences, Univ. of Georgia, Athens, GA, 30602, USA
| | - Nicholas Kaczmar
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell Univ., Ithaca, NY, 14853, USA
| | - Di Wu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell Univ., Ithaca, NY, 14853, USA
| | - John P Hamilton
- Dep. of Crop & Soil Sciences, Univ. of Georgia, Athens, GA, 30602, USA
| | - Dean DellaPenna
- Dep. of Biochemistry and Molecular Biology, Michigan State Univ., East Lansing, MI, 48824, USA
| | - C Robin Buell
- Dep. of Crop & Soil Sciences, Univ. of Georgia, Athens, GA, 30602, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell Univ., Ithaca, NY, 14853, USA
| |
Collapse
|
29
|
Utilization Strategies of Two Environment Phenotypes in Genomic Prediction. Genes (Basel) 2022; 13:genes13050722. [PMID: 35627107 PMCID: PMC9141986 DOI: 10.3390/genes13050722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/01/2023] Open
Abstract
Multiple environment phenotypes may be utilized to implement genomic prediction in plant breeding, while it is unclear about optimal utilization strategies according to its different availability. It is necessary to assess the utilization strategies of genomic prediction models based on different availability of multiple environment phenotypes. Here, we compared the prediction accuracy of three genomic prediction models (genomic prediction model (genomic best linear unbiased prediction (GBLUP), genomic best linear unbiased prediction (GFBLUP), and multi-trait genomic best linear unbiased prediction (mtGBLUP)) which leveraged diverse information from multiple environment phenotypes using a rice dataset containing 19 agronomic traits in two disparate seasons. We found that the prediction accuracy of genomic prediction models considering multiple environment phenotypes (GFBLUP and mtGBLUP) was better than the classical genomic prediction model (GBLUP model). The deviation of prediction accuracy of between GBLUP and mtGBLUP or GFBLUP was associated with the phenotypic correlation. In summary, the genomic prediction models considering multiple environment phenotypes (GFBLUP and mtGBLUP) demonstrated better prediction accuracy. In addition, we could utilize different genomic prediction strategies according to different availability of multiple environment phenotypes.
Collapse
|
30
|
Ogawa S, Matsuda H, Taniguchi Y, Watanabe T, Sugimoto Y, Iwaisaki H. Estimation of the autosomal contribution to total additive genetic variability of carcass traits in Japanese Black cattle. Anim Sci J 2022; 93:e13710. [PMID: 35416392 DOI: 10.1111/asj.13710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/18/2022] [Accepted: 03/18/2022] [Indexed: 11/29/2022]
Abstract
We attempted to estimate the additive genetic variance explained by each autosome, using genotype data of 33,657 single nucleotide polymorphism (SNP) markers in 2271 Japanese Black fattened steers. Traits were cold carcass weight, ribeye area, rib thickness, subcutaneous fat thickness, estimated yield percentage, and marbling score. Two mixed linear models were used: One is that (model 1) incorporating a genomic relationship matrix (G matrix) constructed by using all available SNPs, and another (model 2), incorporating two G matrices constructed by using the SNPs on one autosome and using those on the remaining autosomes. Genomic heritabilities estimated using model 1 were moderate to high. The sums of the proportions of the additive genetic variance explained by each autosome to the total genetic variance estimated by using model 2 were >90%. For carcass weight, the proportions explained by Bos taurus autosomes 6, 8, and 14 were higher than those explained by the remaining autosomes. In some cases, the estimated proportion was close to 0. The results obtained from model 2 could provide a novel insight into the genetic architecture, such as heritability per chromosome, of carcass traits in Japanese Black cattle, although further careful investigation would be required.
Collapse
Affiliation(s)
| | | | - Yukio Taniguchi
- Graduate School of Agriculture, Kyoto University, Kyoto, Japan
| | | | - Yoshikazu Sugimoto
- Shirakawa Institute of Animal Genetics, Japan Livestock Technology Association, Tokyo, Japan
| | | |
Collapse
|
31
|
Shi S, Zhang Z, Li B, Zhang S, Fang L. Incorporation of Trait-Specific Genetic Information into Genomic Prediction Models. Methods Mol Biol 2022; 2467:329-340. [PMID: 35451781 DOI: 10.1007/978-1-0716-2205-6_11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Due to the rapid development of high-throughput sequencing technology, we can easily obtain not only the genetic variants at the whole-genome sequence level (e.g., from 1000 Genomes project and 1000 Bull Genomes project), but also a wide range of functional annotations (e.g., enhancers and promoters from ENCODE, FAANG, and FarmGTEx projects) across a wide range of tissues, cell types, developmental stages, and environmental conditions. This huge amount of information leads to a revolution in studying genetics and genomics of complex traits in humans, livestock, and plant species. In this chapter, we focused on and reviewed the genomic prediction methods that incorporate external biological information into genomic prediction, such as sequence ontology, linkage disequilibrium (LD) of SNPs, quantitative trait loci (QTL), and multi-layer omics data (e.g., transcriptome, epigenome, and microbiome).
Collapse
Affiliation(s)
- Shaolei Shi
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhe Zhang
- Department of Animal Breeding and genetics, College of Animal Science, South China Agricultural University (SCAU), Guangzhou, China
| | - Bingjie Li
- The Roslin Institute Building, Scotland's Rural College, Edinburgh, UK
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
32
|
Zhang Z, Ma P, Zhang Z, Wang Z, Wang Q, Pan Y. The construction of a haplotype reference panel using extremely low coverage whole genome sequences and its application in genome-wide association studies and genomic prediction in Duroc pigs. Genomics 2021; 114:340-350. [PMID: 34929285 DOI: 10.1016/j.ygeno.2021.12.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 10/11/2021] [Accepted: 12/15/2021] [Indexed: 12/30/2022]
Abstract
Extremely low coverage whole genome sequencing (lcWGS) is an economical technique to obtain high-density single nucleotide polymorphisms (SNPs). Here, we explored the feasibility of constructing a haplotype reference panel (lcHRP) using lcWGS and evaluated the effects of lcHRP through a genome-wide association study (GWAS) and genomic prediction in pigs. A total of 297 and 974 Duroc pigs were genotyped using lcWGS and a 50 K SNP array, respectively. We obtained 19,306,498 SNPs using lcWGS with an accuracy of 0.984. With the help of lcHRP, the accuracy of imputation from the SNP array to lcWGS was 0.922. Compared to the SNP array findings, those from the imputation-based GWAS identified more signals across four traits. With the integration of the top 1% imputation-based GWAS findings as genomic features, the accuracies of genomic prediction was improved by 6.0% to 13.2%. This study showed the great potential of lcWGS in pigs' molecular breeding.
Collapse
Affiliation(s)
- Zhe Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou 310058, PR China
| | - Peipei Ma
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, PR China
| | - Zhenyang Zhang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou 310058, PR China
| | - Zhen Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou 310058, PR China
| | - Qishan Wang
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou 310058, PR China.
| | - Yuchun Pan
- Department of Animal Science, College of Animal Sciences, Zhejiang University, Hangzhou 310058, PR China; Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya 572000, China.
| |
Collapse
|
33
|
Wang Z, Cheng H. Single-Trait and Multiple-Trait Genomic Prediction From Multi-Class Bayesian Alphabet Models Using Biological Information. Front Genet 2021; 12:717457. [PMID: 34707638 PMCID: PMC8542848 DOI: 10.3389/fgene.2021.717457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 08/23/2021] [Indexed: 11/13/2022] Open
Abstract
Genomic prediction has been widely used in multiple areas and various genomic prediction methods have been developed. The majority of these methods, however, focus on statistical properties and ignore the abundant useful biological information like genome annotation or previously discovered causal variants. Therefore, to improve prediction performance, several methods have been developed to incorporate biological information into genomic prediction, mostly in single-trait analysis. A commonly used method to incorporate biological information is allocating molecular markers into different classes based on the biological information and assigning separate priors to molecular markers in different classes. It has been shown that such methods can achieve higher prediction accuracy than conventional methods in some circumstances. However, these methods mainly focus on single-trait analysis, and available priors of these methods are limited. Thus, in both single-trait and multiple-trait analysis, we propose the multi-class Bayesian Alphabet methods, in which multiple Bayesian Alphabet priors, including RR-BLUP, BayesA, BayesB, BayesCΠ, and Bayesian LASSO, can be used for markers allocated to different classes. The superior performance of the multi-class Bayesian Alphabet in genomic prediction is demonstrated using both real and simulated data. The software tool JWAS offers open-source routines to perform these analyses.
Collapse
Affiliation(s)
- Zigui Wang
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| |
Collapse
|
34
|
Yang R, Xu Z, Wang Q, Zhu D, Bian C, Ren J, Huang Z, Zhu X, Tian Z, Wang Y, Jiang Z, Zhao Y, Zhang D, Li N, Hu X. Genome‑wide association study and genomic prediction for growth traits in yellow-plumage chicken using genotyping-by-sequencing. Genet Sel Evol 2021; 53:82. [PMID: 34706641 PMCID: PMC8555081 DOI: 10.1186/s12711-021-00672-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/08/2021] [Indexed: 12/25/2022] Open
Abstract
Background Growth traits are of great importance for poultry breeding and production and have been the topic of extensive investigation, with many quantitative trait loci (QTL) detected. However, due to their complex genetic background, few causative genes have been confirmed and the underlying molecular mechanisms remain unclear, thus limiting our understanding of QTL and their potential use for the genetic improvement of poultry. Therefore, deciphering the genetic architecture is a promising avenue for optimising genomic prediction strategies and exploiting genomic information for commercial breeding. The objectives of this study were to: (1) conduct a genome-wide association study to identify key genetic factors and explore the polygenicity of chicken growth traits; (2) investigate the efficiency of genomic prediction in broilers; and (3) evaluate genomic predictions that harness genomic features. Results We identified five significant QTL, including one on chromosome 4 with major effects and four on chromosomes 1, 2, 17, and 27 with minor effects, accounting for 14.5 to 34.1% and 0.2 to 2.6% of the genomic additive genetic variance, respectively, and 23.3 to 46.7% and 0.6 to 4.5% of the observed predictive accuracy of breeding values, respectively. Further analysis showed that the QTL with minor effects collectively had a considerable influence, reflecting the polygenicity of the genetic background. The accuracy of genomic best linear unbiased predictions (BLUP) was improved by 22.0 to 70.3% compared to that of the conventional pedigree-based BLUP model. The genomic feature BLUP model further improved the observed prediction accuracy by 13.8 to 15.2% compared to the genomic BLUP model. Conclusions A major QTL and four minor QTL were identified for growth traits; the remaining variance was due to QTL effects that were too small to be detected. The genomic BLUP and genomic feature BLUP models yielded considerably higher prediction accuracy compared to the pedigree-based BLUP model. This study revealed the polygenicity of growth traits in yellow-plumage chickens and demonstrated that the predictive ability can be greatly improved by using genomic information and related features. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00672-9.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhenqiang Xu
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Qi Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoning Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhixin Tian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ziqin Jiang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Dexiang Zhang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China.
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
35
|
Lyra DH, Griffiths CA, Watson A, Joynson R, Molero G, Igna AA, Hassani-Pak K, Reynolds MP, Hall A, Paul MJ. Gene-based mapping of trehalose biosynthetic pathway genes reveals association with source- and sink-related yield traits in a spring wheat panel. Food Energy Secur 2021; 10:e292. [PMID: 34594548 PMCID: PMC8459250 DOI: 10.1002/fes3.292] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 04/12/2021] [Accepted: 04/12/2021] [Indexed: 12/11/2022] Open
Abstract
Trehalose 6‐phosphate (T6P) signalling regulates carbon use and allocation and is a target to improve crop yields. However, the specific contributions of trehalose phosphate synthase (TPS) and trehalose phosphate phosphatase (TPP) genes to source‐ and sink‐related traits remain largely unknown. We used enrichment capture sequencing on TPS and TPP genes to estimate and partition the genetic variation of yield‐related traits in a spring wheat (Triticum aestivum) breeding panel specifically built to capture the diversity across the 75,000 CIMMYT wheat cultivar collection. Twelve phenotypes were correlated to variation in TPS and TPP genes including plant height and biomass (source), spikelets per spike, spike growth and grain filling traits (sink) which showed indications of both positive and negative gene selection. Individual genes explained proportions of heritability for biomass and grain‐related traits. Three TPS1 homologues were particularly significant for trait variation. Epistatic interactions were found within and between the TPS and TPP gene families for both plant height and grain‐related traits. Gene‐based prediction improved predictive ability for grain weight when gene effects were combined with the whole‐genome markers. Our study has generated a wealth of information on natural variation of TPS and TPP genes related to yield potential which confirms the role for T6P in resource allocation and in affecting traits such as grain number and size confirming other studies which now opens up the possibility of harnessing natural genetic variation more widely to better understand the contribution of native genes to yield traits for incorporation into breeding programmes.
Collapse
Affiliation(s)
- Danilo H Lyra
- Computational & Analytical Sciences Rothamsted Research Harpenden UK
| | | | - Amy Watson
- Plant Sciences Rothamsted Research Harpenden UK
| | | | - Gemma Molero
- Global Wheat Program, International Maize and Wheat Improvement Centre (CIMMYT) Texcoco Mexico
| | | | | | - Matthew P Reynolds
- Global Wheat Program, International Maize and Wheat Improvement Centre (CIMMYT) Texcoco Mexico
| | | | | |
Collapse
|
36
|
McGaugh SE, Lorenz AJ, Flagel LE. The utility of genomic prediction models in evolutionary genetics. Proc Biol Sci 2021; 288:20210693. [PMID: 34344180 PMCID: PMC8334854 DOI: 10.1098/rspb.2021.0693] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 07/15/2021] [Indexed: 12/25/2022] Open
Abstract
Variation in complex traits is the result of contributions from many loci of small effect. Based on this principle, genomic prediction methods are used to make predictions of breeding value for an individual using genome-wide molecular markers. In breeding, genomic prediction models have been used in plant and animal breeding for almost two decades to increase rates of genetic improvement and reduce the length of artificial selection experiments. However, evolutionary genomics studies have been slow to incorporate this technique to select individuals for breeding in a conservation context or to learn more about the genetic architecture of traits, the genetic value of missing individuals or microevolution of breeding values. Here, we outline the utility of genomic prediction and provide an overview of the methodology. We highlight opportunities to apply genomic prediction in evolutionary genetics of wild populations and the best practices when using these methods on field-collected phenotypes.
Collapse
Affiliation(s)
- Suzanne E. McGaugh
- Ecology, Evolution, and Behavior, University of Minnesota, 140 Gortner Lab, 1479 Gortner Avenue, Saint Paul, MN 55108, USA
| | - Aaron J. Lorenz
- Agronomy and Plant Genetics, University of Minnesota, 411 Borlaug Hall, 1991 Upper Buford Circle, Saint Paul, MN 55108, USA
| | - Lex E. Flagel
- Plant and Microbial Biology, University of Minnesota, 140 Gortner Lab, 1479 Gortner Avenue, Saint Paul, MN 55108, USA
- Bayer Crop Science, 700 W Chesterfield Parkway, Chesterfield, MO 63017, USA
| |
Collapse
|
37
|
Yang R, Guo X, Zhu D, Tan C, Bian C, Ren J, Huang Z, Zhao Y, Cai G, Liu D, Wu Z, Wang Y, Li N, Hu X. Accelerated deciphering of the genetic architecture of agricultural economic traits in pigs using a low-coverage whole-genome sequencing strategy. Gigascience 2021; 10:giab048. [PMID: 34282453 PMCID: PMC8290195 DOI: 10.1093/gigascience/giab048] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 03/21/2021] [Accepted: 06/15/2021] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Uncovering the genetic architecture of economic traits in pigs is important for agricultural breeding. However, high-density haplotype reference panels are unavailable in most agricultural species, limiting accurate genotype imputation in large populations. Moreover, the infinitesimal model of quantitative traits implies that weak association signals tend to be spread across most of the genome, further complicating the genetic analysis. Hence, there is a need to develop new methods for sequencing large cohorts without large reference panels. RESULTS We describe a Tn5-based highly accurate, cost- and time-efficient, low-coverage sequencing method to obtain 11.3 million whole-genome single-nucleotide polymorphisms in 2,869 Duroc boars at a mean depth of 0.73×. On the basis of these single-nucleotide polymorphisms, a genome-wide association study was performed, resulting in 14 quantitative trait loci (QTLs) for 7 of 21 important agricultural traits in pigs. These QTLs harbour genes, such as ABCD4 for total teat number and HMGA1 for back fat thickness, and provided a starting point for further investigation. The inheritance models of the different traits varied greatly. Most follow the minor-polygene model, but this can be attributed to different reasons, such as the shaping of genetic architecture by artificial selection for this population and sufficiently interconnected minor gene regulatory networks. CONCLUSIONS Genome-wide association study results for 21 important agricultural traits identified 14 QTLs/genes and showed their genetic architectures, providing guidance for genetic improvement harnessing genomic features. The Tn5-based low-coverage sequencing method can be applied to large-scale genome studies for any species without a good reference panel and can be used for agricultural breeding.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Xiaoli Guo
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Cheng Tan
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Dewu Liu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| |
Collapse
|
38
|
Song H, Hu H. Strategies to improve the accuracy and reduce costs of genomic prediction in aquaculture species. Evol Appl 2021; 15:578-590. [PMID: 35505889 PMCID: PMC9046917 DOI: 10.1111/eva.13262] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/30/2021] [Accepted: 06/07/2021] [Indexed: 11/27/2022] Open
Affiliation(s)
- Hailiang Song
- Beijing Fisheries Research Institute & Beijing Key Laboratory of Fishery Biotechnology Beijing China
| | - Hongxia Hu
- Beijing Fisheries Research Institute & Beijing Key Laboratory of Fishery Biotechnology Beijing China
| |
Collapse
|
39
|
Validation of the Prediction Accuracy for 13 Traits in Chinese Simmental Beef Cattle Using a Preselected Low-Density SNP Panel. Animals (Basel) 2021; 11:ani11071890. [PMID: 34202066 PMCID: PMC8300368 DOI: 10.3390/ani11071890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary To reduce the breeding costs and promote the application of genomic selection (GS) in Chinese Simmental beef cattle, we developed a customized low-density single-nucleotide polymorphism (SNP) panel consisting of 30,684 SNPs. When comparing the predictive performance of the low-density SNP panel to that of the BovineHD Beadchip for 13 traits, we found that this ~30 K panel achieved moderate to high prediction accuracies for most traits, while reducing the prediction accuracies of six traits by 0.04–0.09 and decreasing the prediction accuracy of one trait by 0.2. For the remaining six traits, the usage of the low-density SNP panel was associated with a slight increase in prediction accuracy. Our studies suggested that the low-density SNP panel (~30 K) is a feasible and promising tool for cost-effective genomic prediction in Chinese Simmental beef cattle, which may provide breeding organizations with a cheaper option and greater returns on investment. Abstract Chinese Simmental beef cattle play a key role in the Chinese beef industry due to their great adaptability and marketability. To achieve efficient genetic gain at a low breeding cost, it is crucial to develop a customized cost-effective low-density SNP panel for this cattle population. Thirteen growth, carcass, and meat quality traits and a BovineHD Beadchip genotyping of 1346 individuals were used to select trait-associated variants and variants contributing to great genetic variance. In addition, highly informative SNPs with high MAF in each 500 kb sliding window and in each genic region were also included separately. A low-density SNP panel consisting of 30,684 SNPs was developed, with an imputation accuracy of 97.4% when imputed to the 770 K level. Among 13 traits, the average prediction accuracy levels evaluated by genomic best linear unbiased prediction (GBLUP) and BayesA/B/Cπ were 0.22–0.47 and 0.18–0.60 for the ~30 K array and BovineHD Beadchip, respectively. Generally, the predictive performance of the ~30 K array was trait-dependent, with reduced prediction accuracies for seven traits. While differences in terms of prediction accuracy were observed among the 13 traits, the low-density SNP panel achieved moderate to high accuracies for most of the traits and even improved the accuracies for some traits.
Collapse
|
40
|
Rohde PD, Kristensen TN, Sarup P, Muñoz J, Malmendal A. Prediction of complex phenotypes using the Drosophila melanogaster metabolome. Heredity (Edinb) 2021; 126:717-732. [PMID: 33510469 PMCID: PMC8102504 DOI: 10.1038/s41437-021-00404-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/04/2021] [Accepted: 01/04/2021] [Indexed: 01/30/2023] Open
Abstract
Understanding the genotype-phenotype map and how variation at different levels of biological organization is associated are central topics in modern biology. Fast developments in sequencing technologies and other molecular omic tools enable researchers to obtain detailed information on variation at DNA level and on intermediate endophenotypes, such as RNA, proteins and metabolites. This can facilitate our understanding of the link between genotypes and molecular and functional organismal phenotypes. Here, we use the Drosophila melanogaster Genetic Reference Panel and nuclear magnetic resonance (NMR) metabolomics to investigate the ability of the metabolome to predict organismal phenotypes. We performed NMR metabolomics on four replicate pools of male flies from each of 170 different isogenic lines. Our results show that metabolite profiles are variable among the investigated lines and that this variation is highly heritable. Second, we identify genes associated with metabolome variation. Third, using the metabolome gave better prediction accuracies than genomic information for four of five quantitative traits analyzed. Our comprehensive characterization of population-scale diversity of metabolomes and its genetic basis illustrates that metabolites have large potential as predictors of organismal phenotypes. This finding is of great importance, e.g., in human medicine, evolutionary biology and animal and plant breeding.
Collapse
Affiliation(s)
- Palle Duun Rohde
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark.
| | - Torsten Nygaard Kristensen
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
- Department of Animal Science, Aarhus University, Tjele, Denmark
| | - Pernille Sarup
- Department of Molecular Biology and Genetics, Aarhus University, Tjele, Denmark
- Nordic Seed A/S, Odder, Denmark
| | - Joaquin Muñoz
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Anders Malmendal
- Department of Science and Environment, Roskilde University, Roskilde, Denmark.
| |
Collapse
|
41
|
Campbell MT, Hu H, Yeats TH, Brzozowski LJ, Caffe-Treml M, Gutiérrez L, Smith KP, Sorrells ME, Gore MA, Jannink JL. Improving Genomic Prediction for Seed Quality Traits in Oat (Avena sativa L.) Using Trait-Specific Relationship Matrices. Front Genet 2021; 12:643733. [PMID: 33868378 PMCID: PMC8044359 DOI: 10.3389/fgene.2021.643733] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 03/04/2021] [Indexed: 11/13/2022] Open
Abstract
The observable phenotype is the manifestation of information that is passed along different organization levels (transcriptional, translational, and metabolic) of a biological system. The widespread use of various omic technologies (RNA-sequencing, metabolomics, etc.) has provided plant genetics and breeders with a wealth of information on pertinent intermediate molecular processes that may help explain variation in conventional traits such as yield, seed quality, and fitness, among others. A major challenge is effectively using these data to help predict the genetic merit of new, unobserved individuals for conventional agronomic traits. Trait-specific genomic relationship matrices (TGRMs) model the relationships between individuals using genome-wide markers (SNPs) and place greater emphasis on markers that most relevant to the trait compared to conventional genomic relationship matrices. Given that these approaches define relationships based on putative causal loci, it is expected that these approaches should improve predictions for related traits. In this study we evaluated the use of TGRMs to accommodate information on intermediate molecular phenotypes (referred to as endophenotypes) and to predict an agronomic trait, total lipid content, in oat seed. Nine fatty acids were quantified in a panel of 336 oat lines. Marker effects were estimated for each endophenotype, and were used to construct TGRMs. A multikernel TRGM model (MK-TRGM-BLUP) was used to predict total seed lipid content in an independent panel of 210 oat lines. The MK-TRGM-BLUP approach significantly improved predictions for total lipid content when compared to a conventional genomic BLUP (gBLUP) approach. Given that the MK-TGRM-BLUP approach leverages information on the nine fatty acids to predict genetic values for total lipid content in unobserved individuals, we compared the MK-TGRM-BLUP approach to a multi-trait gBLUP (MT-gBLUP) approach that jointly fits phenotypes for fatty acids and total lipid content. The MK-TGRM-BLUP approach significantly outperformed MT-gBLUP. Collectively, these results highlight the utility of using TGRM to accommodate information on endophenotypes and improve genomic prediction for a conventional agronomic trait.
Collapse
Affiliation(s)
- Malachy T. Campbell
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Haixiao Hu
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Trevor H. Yeats
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Lauren J. Brzozowski
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Melanie Caffe-Treml
- Seed Technology Lab 113, Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, United States
| | - Lucía Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, United States
| | - Kevin P. Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, United States
| | - Mark E. Sorrells
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Michael A. Gore
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Jean-Luc Jannink
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
- R.W. Holley Center for Agriculture & Health, US Department of Agriculture, Agricultural Research Service, Ithaca, NY, United States
| |
Collapse
|
42
|
Campbell MT, Hu H, Yeats TH, Caffe-Treml M, Gutiérrez L, Smith KP, Sorrells ME, Gore MA, Jannink JL. Translating insights from the seed metabolome into improved prediction for lipid-composition traits in oat (Avena sativa L.). Genetics 2021; 217:iyaa043. [PMID: 33789350 PMCID: PMC8045723 DOI: 10.1093/genetics/iyaa043] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 12/08/2020] [Indexed: 12/13/2022] Open
Abstract
Oat (Avena sativa L.) seed is a rich resource of beneficial lipids, soluble fiber, protein, and antioxidants, and is considered a healthful food for humans. Little is known regarding the genetic controllers of variation for these compounds in oat seed. We characterized natural variation in the mature seed metabolome using untargeted metabolomics on 367 diverse lines and leveraged this information to improve prediction for seed quality traits. We used a latent factor approach to define unobserved variables that may drive covariance among metabolites. One hundred latent factors were identified, of which 21% were enriched for compounds associated with lipid metabolism. Through a combination of whole-genome regression and association mapping, we show that latent factors that generate covariance for many metabolites tend to have a complex genetic architecture. Nonetheless, we recovered significant associations for 23% of the latent factors. These associations were used to inform a multi-kernel genomic prediction model, which was used to predict seed lipid and protein traits in two independent studies. Predictions for 8 of the 12 traits were significantly improved compared to genomic best linear unbiased prediction when this prediction model was informed using associations from lipid-enriched factors. This study provides new insights into variation in the oat seed metabolome and provides genomic resources for breeders to improve selection for health-promoting seed quality traits. More broadly, we outline an approach to distill high-dimensional "omics" data to a set of biologically meaningful variables and translate inferences on these data into improved breeding decisions.
Collapse
Affiliation(s)
- Malachy T Campbell
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Haixiao Hu
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Trevor H Yeats
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Melanie Caffe-Treml
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD 57007, USA
| | - Lucía Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Mark E Sorrells
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Michael A Gore
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Jean-Luc Jannink
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
- R.W. Holley Center for Agriculture & Health US Department of Agriculture, Agricultural Research Service, Ithaca, NY 14853, USA
| |
Collapse
|
43
|
Farooq M, van Dijk ADJ, Nijveen H, Aarts MGM, Kruijer W, Nguyen TP, Mansoor S, de Ridder D. Prior Biological Knowledge Improves Genomic Prediction of Growth-Related Traits in Arabidopsis thaliana. Front Genet 2021; 11:609117. [PMID: 33552126 PMCID: PMC7855462 DOI: 10.3389/fgene.2020.609117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 12/21/2020] [Indexed: 01/11/2023] Open
Abstract
Prediction of growth-related complex traits is highly important for crop breeding. Photosynthesis efficiency and biomass are direct indicators of overall plant performance and therefore even minor improvements in these traits can result in significant breeding gains. Crop breeding for complex traits has been revolutionized by technological developments in genomics and phenomics. Capitalizing on the growing availability of genomics data, genome-wide marker-based prediction models allow for efficient selection of the best parents for the next generation without the need for phenotypic information. Until now such models mostly predict the phenotype directly from the genotype and fail to make use of relevant biological knowledge. It is an open question to what extent the use of such biological knowledge is beneficial for improving genomic prediction accuracy and reliability. In this study, we explored the use of publicly available biological information for genomic prediction of photosynthetic light use efficiency (Φ PSII ) and projected leaf area (PLA) in Arabidopsis thaliana. To explore the use of various types of knowledge, we mapped genomic polymorphisms to Gene Ontology (GO) terms and transcriptomics-based gene clusters, and applied these in a Genomic Feature Best Linear Unbiased Predictor (GFBLUP) model, which is an extension to the traditional Genomic BLUP (GBLUP) benchmark. Our results suggest that incorporation of prior biological knowledge can improve genomic prediction accuracy for both Φ PSII and PLA. The improvement achieved depends on the trait, type of knowledge and trait heritability. Moreover, transcriptomics offers complementary evidence to the Gene Ontology for improvement when used to define functional groups of genes. In conclusion, prior knowledge about trait-specific groups of genes can be directly translated into improved genomic prediction.
Collapse
Affiliation(s)
- Muhammad Farooq
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
- Molecular Virology and Gene Silencing Lab, Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Punjab, Pakistan
| | - Aalt D. J. van Dijk
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
- Biometris, Wageningen University, Wageningen, Netherlands
| | - Harm Nijveen
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| | - Mark G. M. Aarts
- Laboratory of Genetics, Wageningen University, Wageningen, Netherlands
| | - Willem Kruijer
- Biometris, Wageningen University, Wageningen, Netherlands
| | - Thu-Phuong Nguyen
- Laboratory of Genetics, Wageningen University, Wageningen, Netherlands
| | - Shahid Mansoor
- Molecular Virology and Gene Silencing Lab, Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Punjab, Pakistan
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| |
Collapse
|
44
|
Lee HJ, Chung YJ, Jang S, Seo DW, Lee HK, Yoon D, Lim D, Lee SH. Genome-wide identification of major genes and genomic prediction using high-density and text-mined gene-based SNP panels in Hanwoo (Korean cattle). PLoS One 2020; 15:e0241848. [PMID: 33264312 PMCID: PMC7710051 DOI: 10.1371/journal.pone.0241848] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 10/21/2020] [Indexed: 11/24/2022] Open
Abstract
It was hypothesized that single-nucleotide polymorphisms (SNPs) extracted from text-mined genes could be more tightly related to causal variant for each trait and that differentially weighting of this SNP panel in the GBLUP model could improve the performance of genomic prediction in cattle. Fitting two GRMs constructed by text-mined SNPs and SNPs except text-mined SNPs from 777k SNPs set (exp_777K) as different random effects showed better accuracy than fitting one GRM (Im_777K) for six traits (e.g. backfat thickness: + 0.002, eye muscle area: + 0.014, Warner–Bratzler Shear Force of semimembranosus and longissimus dorsi: + 0.024 and + 0.068, intramuscular fat content of semimembranosus and longissimus dorsi: + 0.008 and + 0.018). These results can suggest that attempts to incorporate text mining into genomic predictions seem valuable, and further study using text mining can be expected to present the significant results.
Collapse
Affiliation(s)
- Hyo Jun Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon, Korea
| | - Yoon Ji Chung
- Division of Animal and Dairy Science, Chungnam National University, Daejeon, Korea
| | - Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States of America
| | - Dong Won Seo
- Division of Animal and Dairy Science, Chungnam National University, Daejeon, Korea
| | - Hak Kyo Lee
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, Korea
| | - Duhak Yoon
- Department of Animal Science, Kyungpook National University, Sangju, Korea
| | - Dajeong Lim
- Animal Genome & Bioinformatics, National Institute of Animal Science, Wanju, Korea
- * E-mail: (DL); (SHL)
| | - Seung Hwan Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon, Korea
- * E-mail: (DL); (SHL)
| |
Collapse
|
45
|
Morgante F, Huang W, Sørensen P, Maltecca C, Mackay TFC. Leveraging Multiple Layers of Data To Predict Drosophila Complex Traits. G3 (BETHESDA, MD.) 2020; 10:4599-4613. [PMID: 33106232 PMCID: PMC7718734 DOI: 10.1534/g3.120.401847] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 10/12/2020] [Indexed: 02/07/2023]
Abstract
The ability to accurately predict complex trait phenotypes from genetic and genomic data are critical for the implementation of personalized medicine and precision agriculture; however, prediction accuracy for most complex traits is currently low. Here, we used data on whole genome sequences, deep RNA sequencing, and high quality phenotypes for three quantitative traits in the ∼200 inbred lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) to compare the prediction accuracies of gene expression and genotypes for three complex traits. We found that expression levels (r = 0.28 and 0.38, for females and males, respectively) provided higher prediction accuracy than genotypes (r = 0.07 and 0.15, for females and males, respectively) for starvation resistance, similar prediction accuracy for chill coma recovery (null for both models and sexes), and lower prediction accuracy for startle response (r = 0.15 and 0.14 for female and male genotypes, respectively; and r = 0.12 and 0.11, for females and male transcripts, respectively). Models including both genotype and expression levels did not outperform the best single component model. However, accuracy increased considerably for all the three traits when we included gene ontology (GO) category as an additional layer of information for both genomic variants and transcripts. We found strongly predictive GO terms for each of the three traits, some of which had a clear plausible biological interpretation. For example, for starvation resistance in females, GO:0033500 (r = 0.39 for transcripts) and GO:0032870 (r = 0.40 for transcripts), have been implicated in carbohydrate homeostasis and cellular response to hormone stimulus (including the insulin receptor signaling pathway), respectively. In summary, this study shows that integrating different sources of information improved prediction accuracy and helped elucidate the genetic architecture of three Drosophila complex phenotypes.
Collapse
Affiliation(s)
- Fabio Morgante
- Department of Biological Sciences and W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695
- Program in Genetics, North Carolina State University, Raleigh, NC 27695
| | - Wen Huang
- Department of Biological Sciences and W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695
- Program in Genetics, North Carolina State University, Raleigh, NC 27695
| | - Peter Sørensen
- Center of Quantitative Genetics and Genomics and Department of Molecular Biology and Genetics, Aarhus University, Tjele 8830, Denmark
| | - Christian Maltecca
- Program in Genetics, North Carolina State University, Raleigh, NC 27695
- Department of Animal Science, North Carolina State University, Raleigh, NC 27695
| | - Trudy F C Mackay
- Department of Biological Sciences and W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, NC 27695
- Program in Genetics, North Carolina State University, Raleigh, NC 27695
| |
Collapse
|
46
|
Ye S, Li J, Zhang Z. Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2020; 11:109. [PMID: 33292577 PMCID: PMC7708144 DOI: 10.1186/s40104-020-00515-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/22/2020] [Indexed: 12/02/2022] Open
Abstract
Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China.
| |
Collapse
|
47
|
Aono AH, Costa EA, Rody HVS, Nagai JS, Pimenta RJG, Mancini MC, Dos Santos FRC, Pinto LR, Landell MGDA, de Souza AP, Kuroshu RM. Machine learning approaches reveal genomic regions associated with sugarcane brown rust resistance. Sci Rep 2020; 10:20057. [PMID: 33208862 PMCID: PMC7676261 DOI: 10.1038/s41598-020-77063-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 08/24/2020] [Indexed: 12/18/2022] Open
Abstract
Sugarcane is an economically important crop, but its genomic complexity has hindered advances in molecular approaches for genetic breeding. New cultivars are released based on the identification of interesting traits, and for sugarcane, brown rust resistance is a desirable characteristic due to the large economic impact of the disease. Although marker-assisted selection for rust resistance has been successful, the genes involved are still unknown, and the associated regions vary among cultivars, thus restricting methodological generalization. We used genotyping by sequencing of full-sib progeny to relate genomic regions with brown rust phenotypes. We established a pipeline to identify reliable SNPs in complex polyploid data, which were used for phenotypic prediction via machine learning. We identified 14,540 SNPs, which led to a mean prediction accuracy of 50% when using different models. We also tested feature selection algorithms to increase predictive accuracy, resulting in a reduced dataset with more explanatory power for rust phenotypes. As a result of this approach, we achieved an accuracy of up to 95% with a dataset of 131 SNPs related to brown rust QTL regions and auxiliary genes. Therefore, our novel strategy has the potential to assist studies of the genomic organization of brown rust resistance in sugarcane.
Collapse
Affiliation(s)
- Alexandre Hild Aono
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | - Estela Araujo Costa
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - Hugo Vianna Silva Rody
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - James Shiniti Nagai
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil
| | - Ricardo José Gonzaga Pimenta
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | - Melina Cristina Mancini
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil
| | | | - Luciana Rossini Pinto
- Advanced Center of Sugarcane Agrobusiness Technological Research, Agronomic Institute of Campinas (IAC), Ribeirão Preto, SP, Brazil
| | | | - Anete Pereira de Souza
- Molecular Biology and Genetic Engineering Center (CBMEG), University of Campinas (UNICAMP), Campinas, SP, Brazil.
- Department of Plant Biology, Institute of Biology (IB), University of Campinas (UNICAMP), Campinas, SP, Brazil.
| | - Reginaldo Massanobu Kuroshu
- Instituto de Ciência e Tecnologia (ICT), Universidade Federal de São Paulo (UNIFESP), São José dos Campos, SP, Brazil.
| |
Collapse
|
48
|
Genomic Prediction Informed by Biological Processes Expands Our Understanding of the Genetic Architecture Underlying Free Amino Acid Traits in Dry Arabidopsis Seeds. G3-GENES GENOMES GENETICS 2020; 10:4227-4239. [PMID: 32978264 PMCID: PMC7642941 DOI: 10.1534/g3.120.401240] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Plant growth, development, and nutritional quality depends upon amino acid homeostasis, especially in seeds. However, our understanding of the underlying genetics influencing amino acid content and composition remains limited, with only a few candidate genes and quantitative trait loci identified to date. Improved knowledge of the genetics and biological processes that determine amino acid levels will enable researchers to use this information for plant breeding and biological discovery. Toward this goal, we used genomic prediction to identify biological processes that are associated with, and therefore potentially influence, free amino acid (FAA) composition in seeds of the model plant Arabidopsis thaliana. Markers were split into categories based on metabolic pathway annotations and fit using a genomic partitioning model to evaluate the influence of each pathway on heritability explained, model fit, and predictive ability. Selected pathways included processes known to influence FAA composition, albeit to an unknown degree, and spanned four categories: amino acid, core, specialized, and protein metabolism. Using this approach, we identified associations for pathways containing known variants for FAA traits, in addition to finding new trait-pathway associations. Markers related to amino acid metabolism, which are directly involved in FAA regulation, improved predictive ability for branched chain amino acids and histidine. The use of genomic partitioning also revealed patterns across biochemical families, in which serine-derived FAAs were associated with protein related annotations and aromatic FAAs were associated with specialized metabolic pathways. Taken together, these findings provide evidence that genomic partitioning is a viable strategy to uncover the relative contributions of biological processes to FAA traits in seeds, offering a promising framework to guide hypothesis testing and narrow the search space for candidate genes.
Collapse
|
49
|
Banos G, Lindsay V, Desta TT, Bettridge J, Sanchez-Molano E, Vallejo-Trujillo A, Matika O, Dessie T, Wigley P, Christley RM, Kaiser P, Hanotte O, Psifidi A. Integrating Genetic and Genomic Analyses of Combined Health Data Across Ecotypes to Improve Disease Resistance in Indigenous African Chickens. Front Genet 2020; 11:543890. [PMID: 33193617 PMCID: PMC7581896 DOI: 10.3389/fgene.2020.543890] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Accepted: 09/04/2020] [Indexed: 12/30/2022] Open
Abstract
Poultry play an important role in the agriculture of many African countries. The majority of chickens in sub-Saharan Africa are indigenous, raised in villages under semi-scavenging conditions. Vaccinations and biosecurity measures rarely apply, and infectious diseases remain a major cause of mortality and reduced productivity. Genomic selection for disease resistance offers a potentially sustainable solution but this requires sufficient numbers of individual birds with genomic and phenotypic data, which is often a challenge to collect in the small populations of indigenous chicken ecotypes. The use of information across-ecotypes presents an attractive possibility to increase the relevant numbers and the accuracy of genomic selection. In this study, we performed a joint analysis of two distinct Ethiopian indigenous chicken ecotypes to investigate the genomic architecture of important health and productivity traits and explore the feasibility of conducting genomic selection across-ecotype. Phenotypic traits considered were antibody response to Infectious Bursal Disease (IBDV), Marek's Disease (MDV), Fowl Cholera (PM) and Fowl Typhoid (SG), resistance to Eimeria and cestode parasitism, and productivity [body weight and body condition score (BCS)]. Combined data from the two chicken ecotypes, Horro (n = 384) and Jarso (n = 376), were jointly analyzed for genetic parameter estimation, genome-wide association studies (GWAS), genomic breeding value (GEBVs) calculation, genomic predictions, whole-genome sequencing (WGS), and pathways analyses. Estimates of across-ecotype heritability were significant and moderate in magnitude (0.22-0.47) for all traits except for SG and BCS. GWAS identified several significant genomic associations with health and productivity traits. The WGS analysis revealed putative candidate genes and mutations for IBDV (TOLLIP, ANGPTL5, BCL9, THEMIS2), MDV (GRM7), SG (MAP3K21), Eimeria (TOM1L1) and cestodes (TNFAIP1, ATG9A, NOS2) parasitism, which warrant further investigation. Reliability of GEBVs increased compared to within-ecotype calculations but accuracy of genomic prediction did not, probably because the genetic distance between the two ecotypes offset the benefit from increased sample size. However, for some traits genomic prediction was only feasible in across-ecotype analysis. Our results generally underpin the potential of genomic selection to enhance health and productivity across-ecotypes. Future studies should establish the required minimum sample size and genetic similarity between ecotypes to ensure accurate joint genomic selection.
Collapse
Affiliation(s)
- Georgios Banos
- The Roslin Institute, The University of Edinburgh, Edinburgh, United Kingdom
- Scotland’s Rural College, Edinburgh, United Kingdom
- Centre for Tropical Livestock Genetics and Health, Edinburgh, United Kingdom
| | - Victoria Lindsay
- Royal Veterinary College, University of London, London, United Kingdom
| | - Takele T. Desta
- School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Judy Bettridge
- Institute of Infection and Global Health, University of Liverpool, Liverpool, United Kingdom
- LiveGene – Centre for Tropical Livestock Genetics and Health, International Livestock Research Institute, Addis Ababa, Ethiopia
- Natural Resources Institute, University of Greenwich, London, United Kingdom
| | | | | | - Oswald Matika
- The Roslin Institute, The University of Edinburgh, Edinburgh, United Kingdom
| | - Tadelle Dessie
- LiveGene – Centre for Tropical Livestock Genetics and Health, International Livestock Research Institute, Addis Ababa, Ethiopia
| | - Paul Wigley
- Institute of Infection and Global Health, University of Liverpool, Liverpool, United Kingdom
| | - Robert M. Christley
- Institute of Infection and Global Health, University of Liverpool, Liverpool, United Kingdom
| | - Peter Kaiser
- The Roslin Institute, The University of Edinburgh, Edinburgh, United Kingdom
| | - Olivier Hanotte
- Centre for Tropical Livestock Genetics and Health, Edinburgh, United Kingdom
- School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
- LiveGene – Centre for Tropical Livestock Genetics and Health, International Livestock Research Institute, Addis Ababa, Ethiopia
| | - Androniki Psifidi
- The Roslin Institute, The University of Edinburgh, Edinburgh, United Kingdom
- Centre for Tropical Livestock Genetics and Health, Edinburgh, United Kingdom
- Royal Veterinary College, University of London, London, United Kingdom
| |
Collapse
|
50
|
Yan Z, Huang H, Freebern E, Santos DJA, Dai D, Si J, Ma C, Cao J, Guo G, Liu GE, Ma L, Fang L, Zhang Y. Integrating RNA-Seq with GWAS reveals novel insights into the molecular mechanism underpinning ketosis in cattle. BMC Genomics 2020; 21:489. [PMID: 32680461 PMCID: PMC7367229 DOI: 10.1186/s12864-020-06909-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 07/13/2020] [Indexed: 01/12/2023] Open
Abstract
Background Ketosis is a common metabolic disease during the transition period in dairy cattle, resulting in long-term economic loss to the dairy industry worldwide. While genetic selection of resistance to ketosis has been adopted by many countries, the genetic and biological basis underlying ketosis is poorly understood. Results We collected a total of 24 blood samples from 12 Holstein cows, including 4 healthy and 8 ketosis-diagnosed ones, before (2 weeks) and after (5 days) calving, respectively. We then generated RNA-Sequencing (RNA-Seq) data and seven blood biochemical indicators (bio-indicators) from leukocytes and plasma in each of these samples, respectively. By employing a weighted gene co-expression network analysis (WGCNA), we detected that 4 out of 16 gene-modules, which were significantly engaged in lipid metabolism and immune responses, were transcriptionally (FDR < 0.05) correlated with postpartum ketosis and several bio-indicators (e.g., high-density lipoprotein and low-density lipoprotein). By conducting genome-wide association signal (GWAS) enrichment analysis among six common health traits (ketosis, mastitis, displaced abomasum, metritis, hypocalcemia and livability), we found that 4 out of 16 modules were genetically (FDR < 0.05) associated with ketosis, among which three were correlated with postpartum ketosis based on WGCNA. We further identified five candidate genes for ketosis, including GRINA, MAF1, MAFA, C14H8orf82 and RECQL4. Our phenome-wide association analysis (Phe-WAS) demonstrated that human orthologues of these candidate genes were also significantly associated with many metabolic, endocrine, and immune traits in humans. For instance, MAFA, which is involved in insulin secretion, glucose response, and transcriptional regulation, showed a significantly higher association with metabolic and endocrine traits compared to other types of traits in humans. Conclusions In summary, our study provides novel insights into the molecular mechanism underlying ketosis in cattle, and highlights that an integrative analysis of omics data and cross-species mapping are promising for illustrating the genetic architecture underpinning complex traits.
Collapse
Affiliation(s)
- Ze Yan
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Hetian Huang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.,College of Animal Science and Veterinary Medicine, Henan Agricultural University, Zhengzhou, 450002, China
| | - Ellen Freebern
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Daniel J A Santos
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Dongmei Dai
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Jingfang Si
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Chong Ma
- Department of Clinical Veterinary Medicine, College of Veterinary Medicine, China Agricultural University, Beijing, 100193, China
| | - Jie Cao
- Department of Clinical Veterinary Medicine, College of Veterinary Medicine, China Agricultural University, Beijing, 100193, China
| | - Gang Guo
- Beijing Sunlon Livestock Development Co Ltd., Beijing, 100076, China
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK.
| | - Yi Zhang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|