1
|
Ayres L, Bovenhuis H, Calus MPL. A single-locus quantitative genetic model incorporating DNA methylation. J Theor Biol 2025; 607:112110. [PMID: 40189137 DOI: 10.1016/j.jtbi.2025.112110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2025] [Accepted: 03/31/2025] [Indexed: 04/29/2025]
Abstract
We describe a single-locus quantitative genetic model that incorporates effects due to DNA methylation. Extending Fisher's decomposition of the genotypic value, we distinguish two quantities to predict an individual's phenotypic or genetic values: the "basic genetic value" and the "expressed genetic value". We show how these quantities relate to the concept of breeding value and derive their corresponding formulas, along with those for phenotypic variance and covariance between relatives. The resulting parameters are influenced by several factors, including the population distribution of DNA methylation levels, the functional relationship between methylation and phenotype, the magnitudes of genetic and methylation effects, and allele frequencies. We show that under the conditions modeled, the presence of DNA methylation does not bias estimated breeding values.
Collapse
Affiliation(s)
- L Ayres
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH, the Netherlands.
| | - H Bovenhuis
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH, the Netherlands
| | - M P L Calus
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH, the Netherlands
| |
Collapse
|
2
|
Cai X, Zhang W, Gao N, Wei C, Wu X, Si J, Gao Y, Li J, Yin T, Zhang Z. Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations. J Anim Breed Genet 2025; 142:223-236. [PMID: 39215551 DOI: 10.1111/jbg.12896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 07/18/2024] [Accepted: 08/18/2024] [Indexed: 09/04/2024]
Abstract
The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.
Collapse
Affiliation(s)
- Xiaodian Cai
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Wenjing Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Ning Gao
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Chen Wei
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xibo Wu
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Jinglei Si
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Yahui Gao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Tong Yin
- Institute of Animal Breeding and Genetics, Justus Liebig University, Giessen, Germany
| | - Zhe Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
3
|
Klimkowski Arango N, Morgante F. Comparing statistical learning methods for complex trait prediction from gene expression. PLoS One 2025; 20:e0317516. [PMID: 39932918 PMCID: PMC11813155 DOI: 10.1371/journal.pone.0317516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 12/30/2024] [Indexed: 02/13/2025] Open
Abstract
Accurate prediction of complex traits is an important task in quantitative genetics. Genotypes have been used for trait prediction using a variety of methods such as mixed models, Bayesian methods, penalized regression methods, dimension reduction methods, and machine learning methods. Recent studies have shown that gene expression levels can produce higher prediction accuracy than genotypes. However, only a few prediction methods were tested in these studies. Thus, a comprehensive assessment of methods is needed to fully evaluate the potential of gene expression as a predictor of complex trait phenotypes. Here, we used data from the Drosophila Genetic Reference Panel (DGRP) to compare the ability of several existing statistical learning methods to predict starvation resistance and startle response from gene expression in the two sexes separately. The methods considered differ in assumptions about the distribution of gene effects-ranging from models that assume that every gene affects the trait to more sparse models-and their ability to capture gene-gene interactions. We also used functional annotation (i.e., Gene Ontology (GO)) as a source of biological information to inform prediction models. The results show that differences in prediction accuracy exist. For example, methods performing variable selection achieved higher prediction accuracy for starvation resistance in females, while they generally had lower accuracy for startle response in both sexes. Incorporating GO annotations further improved prediction accuracy for a few GO terms of biological significance. Biological significance extended to the genes underlying highly predictive GO terms. Notably, the Insulin-like Receptor (InR) was prevalent across methods and sexes for starvation resistance. For startle response, crumbs (crb) and imaginal disc growth factor 2 (Idgf2) were found for females and males, respectively. Our results confirmed the potential of transcriptomic prediction and highlighted the importance of selecting appropriate methods and strategies in order to achieve accurate predictions.
Collapse
Affiliation(s)
- Noah Klimkowski Arango
- Center for Human Genetics, Clemson University, Greenwood, SC, United States of America
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States of America
| | - Fabio Morgante
- Center for Human Genetics, Clemson University, Greenwood, SC, United States of America
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States of America
| |
Collapse
|
4
|
Raffo MA, Sarup P, Jensen J, Guo X, Jensen JD, Orabi J, Jahoor A, Christensen OF. Genomic prediction for yield and malting traits in barley using metabolomic and near-infrared spectra. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2025; 138:24. [PMID: 39786601 PMCID: PMC11717810 DOI: 10.1007/s00122-024-04806-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 12/19/2024] [Indexed: 01/12/2025]
Abstract
KEY MESSAGE Genetic variation for malting quality as well as metabolomic and near-infrared features was identified. However, metabolomic and near-infrared features as additional omics-information did not improve accuracy of predicted breeding values. Significant attention has recently been given to the potential benefits of metabolomics and near-infrared spectroscopy technologies for enhancing genetic evaluation in breeding programs. In this article, we used a commercial barley breeding population phenotyped for grain yield, grain protein content, and five malting quality traits: extract yield, wort viscosity, wort color, filtering speed, and β-glucan, and aimed to: (i) investigate genetic variation and heritability of metabolomic intensities and near-infrared wavelengths originating from leaf tissue and malted grain, respectively; (ii) investigate variance components and heritabilities for genomic models including metabolomics (GOBLUP-MI) or near-infrared wavelengths (GOBLUP-NIR); and (iii) evaluate the developed models for prediction of breeding values for traits of interest. In total, 639 barley lines were genotyped using an iSelect9K-Illumina barley chip and recorded with 30,468 metabolomic intensities and 141 near-infrared wavelengths. First, we found that a significant proportion of metabolomic intensities and near-infrared wavelengths had medium to high additive genetic variances and heritabilities. Second, we observed that both GOBLUP-MI and GOBLUP-NIR, increased the proportion of estimated genetic variance for grain yield, protein, malt extract, and β-glucan compared to a genomic model (GBLUP). Finally, we assessed these models to predict accurate breeding values in fivefold and leave-one-breeding-cycle-out cross-validations, and we generally observed a similar accuracy between GBLUP and GOBLUP-MI, and a worse accuracy for GOBLUP-NIR. Despite this trend, GOBLUP-MI and GOBLUP-NIR enhanced predictive ability compared to GBLUP by 4.6 and 2.4% for grain protein in leave-one-breeding-cycle-out and grain yield in fivefold cross-validations, respectively, but differences were not significant (P-value > 0.01).
Collapse
Affiliation(s)
- Miguel A Raffo
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark.
| | | | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark
| | - Xiangyu Guo
- Danish Pig Research Centre, Danish Agriculture & Food Council, Copenhagen V, Denmark
| | | | | | | | - Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark.
| |
Collapse
|
5
|
Arango NK, Morgante F. Comparing statistical learning methods for complex trait prediction from gene expression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.01.596951. [PMID: 38895364 PMCID: PMC11185554 DOI: 10.1101/2024.06.01.596951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Accurate prediction of complex traits is an important task in quantitative genetics that has become increasingly relevant for personalized medicine. Genotypes have traditionally been used for trait prediction using a variety of methods such as mixed models, Bayesian methods, penalized regressions, dimension reductions, and machine learning methods. Recent studies have shown that gene expression levels can produce higher prediction accuracy than genotypes. However, only a few prediction methods were used in these studies. Thus, a comprehensive assessment of methods is needed to fully evaluate the potential of gene expression as a predictor of complex trait phenotypes. Here, we used data from the Drosophila Genetic Reference Panel (DGRP) to compare the ability of several existing statistical learning methods to predict starvation resistance from gene expression in the two sexes separately. The methods considered differ in assumptions about the distribution of gene effect sizes - ranging from models that assume that every gene affects the trait to more sparse models - and their ability to capture gene-gene interactions. We also used functional annotation (i.e., Gene Ontology (GO)) as an external source of biological information to inform prediction models. The results show that differences in prediction accuracy between methods exist, although they are generally not large. Methods performing variable selection gave higher accuracy in females while methods assuming a more polygenic architecture performed better in males. Incorporating GO annotations further improved prediction accuracy for a few GO terms of biological significance. Biological significance extended to the genes underlying highly predictive GO terms with different genes emerging between sexes. Notably, the Insulin-like Receptor (InR) was prevalent across methods and sexes. Our results confirmed the potential of transcriptomic prediction and highlighted the importance of selecting appropriate methods and strategies in order to achieve accurate predictions.
Collapse
Affiliation(s)
- Noah Klimkowski Arango
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| | - Fabio Morgante
- Center for Human Genetics, Clemson University, Greenwood, SC, USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
| |
Collapse
|
6
|
Guo X, Sarup P, Jahoor A, Jensen J, Christensen OF. Metabolomic-genomic prediction can improve prediction accuracy of breeding values for malting quality traits in barley. Genet Sel Evol 2023; 55:61. [PMID: 37670243 PMCID: PMC10478459 DOI: 10.1186/s12711-023-00835-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 08/24/2023] [Indexed: 09/07/2023] Open
Abstract
BACKGROUND Metabolomics measures an intermediate stage between genotype and phenotype, and may therefore be useful for breeding. Our objectives were to investigate genetic parameters and accuracies of predicted breeding values for malting quality (MQ) traits when integrating both genomic and metabolomic information. In total, 2430 plots of 562 malting spring barley lines from three years and two locations were included. Five MQ traits were measured in wort produced from each plot. Metabolomic features used were 24,018 nuclear magnetic resonance intensities measured on each wort sample. Methods for statistical analyses were genomic best linear unbiased prediction (GBLUP) and metabolomic-genomic best linear unbiased prediction (MGBLUP). Accuracies of predicted breeding values were compared using two cross-validation strategies: leave-one-year-out (LOYO) and leave-one-line-out (LOLO), and the increase in accuracy from the successive inclusion of first, metabolomic data on the lines in the validation population (VP), and second, both metabolomic data and phenotypes on the lines in the VP, was investigated using the linear regression (LR) method. RESULTS For all traits, we saw that the metabolome-mediated heritability was substantial. Cross-validation results showed that, in general, prediction accuracies from MGBLUP and GBLUP were similar when phenotypes and metabolomic data were recorded on the same plots. Results from the LR method showed that for all traits, except one, accuracy of MGBLUP increased when including metabolomic data on the lines of the VP, and further increased when including also phenotypes. However, in general the increase in accuracy of MGBLUP when including both metabolomic data and phenotypes on lines of the VP was similar to the increase in accuracy of GBLUP when including phenotypes on the lines of the VP. Therefore, we found that, when metabolomic data were included on the lines of the VP, accuracies substantially increased for lines without phenotypic records, but they did not increase much when phenotypes were already known. CONCLUSIONS MGBLUP is a useful approach to combine phenotypic, genomic and metabolomic data for predicting breeding values for MQ traits. We believe that our results have significant implications for practical breeding of barley and potentially many other species.
Collapse
Affiliation(s)
- Xiangyu Guo
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
- Danish Pig Research Centre, Danish Agriculture and Food Council, 1609, Copenhagen V, Denmark
| | | | - Ahmed Jahoor
- Nordic Seed A/S, 8300, Odder, Denmark
- Department of Plant Breeding, The Swedish University of Agricultural Sciences, 2353, Alnarp, Sweden
| | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark.
| |
Collapse
|
7
|
Boggio GM, Christensen OF, Legarra A, Meynadier A, Marie-Etancelin C. Microbiability of milk composition and genetic control of microbiota effects in sheep. J Dairy Sci 2023; 106:6288-6298. [PMID: 37474364 DOI: 10.3168/jds.2022-22948] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 02/28/2023] [Indexed: 07/22/2023]
Abstract
Recently, high-dimensional omics data are becoming available in larger quantities, and models have been developed that integrate them with genomics to understand in finer detail the relationship between genotype and phenotype, and thus improve the performance of genetic evaluations. Our objectives are to quantify the effect of the inclusion of microbiome data in the genetic evaluation for dairy traits in sheep, through the estimation of the heritability, microbiability, and how the microbiome effect on dairy traits decomposes into genetic and nongenetic parts. In this study we analyzed milk and rumen samples of 795 Lacaune dairy ewes. We included, as phenotype, dairy traits and milk fatty acids and proteins composition; as omics measurements, 16S rRNA rumen bacterial abundances; and as genotyping, 54K SNP chip for all ewes. Two nested genomic models were used: a first model to predict the individual contributions of the genetic and microbial abundances to phenotypes, and a second model to predict the additive genetic effect of the microbial community. In addition, microbiome-wide association studies for all dairy traits were applied using the 2,059 rumen bacterial abundances, and the genetic correlations between microbiome principal components and dairy traits were estimated. Results showed that in general the inclusion of both genetic and microbiome effect did not improve the fit of the model compared with the model with the genetic effect only. In addition, for all dairy traits the total heritability was equal to the direct heritability after fitting microbiota effects, due to a microbiability being almost zero for most dairy traits and heritability of the microbial community was very close to zero. Microbiome-wide association studies did not show operational taxonomic units with major effect for any of the dairy traits evaluated, and the genetic correlations between the first 5 principal components and dairy traits were low to moderate. So far, we can conclude that, using a substantial data set of 795 Lacaune dairy ewes, rumen bacterial abundances do not provide improved genetic evaluation for dairy traits in sheep.
Collapse
Affiliation(s)
- G Martinez Boggio
- GenPhySE, Université de Toulouse, INRAE-ENVT, 31326, Castanet-Tolosan, France.
| | - O F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, DK-8000 Aarhus C, Denmark
| | - A Legarra
- GenPhySE, Université de Toulouse, INRAE-ENVT, 31326, Castanet-Tolosan, France
| | - A Meynadier
- GenPhySE, Université de Toulouse, INRAE-ENVT, 31326, Castanet-Tolosan, France
| | - C Marie-Etancelin
- GenPhySE, Université de Toulouse, INRAE-ENVT, 31326, Castanet-Tolosan, France.
| |
Collapse
|
8
|
Legarra A, Christensen O. Genomic evaluation methods to include intermediate correlated features such as high-throughput or omics phenotypes. JDS COMMUNICATIONS 2022; 4:55-60. [PMID: 36713125 PMCID: PMC9873823 DOI: 10.3168/jdsc.2022-0276] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 09/26/2022] [Indexed: 12/05/2022]
Abstract
Gene expression is supposed to be an intermediate between DNA and the phenotype, and it can be measured. Thus, for a trait, we may have intermediate measures, which are in fact a series of genetically controlled traits. Similarly, several traits may be measured or predicted using infrared spectra, accelerometers, and similar high-throughput measures that we will call "omics." Although these measurements have errors, many of them are heritable, and they may be more accurate or easier to record than the trait of interest. It is therefore important to develop methods to use intermediate measurements in selection. Here, we present methods and perspectives for selection based on massively recorded intermediate traits (omics). Recent developments allow a hierarchical integrated framework for prediction, in which a trait is partially controlled by omics. In addition, the omics measures are themselves partly controlled by genetics ("mediated breeding values") and partly by environment or residual factors. Thus, a part of the genetic determinism of a trait is mediated by omics, whereas the remaining part is not mediated, which results in "residual breeding values." In such a framework, genetic evaluations consist of 2 nested genomic BLUP-based models. In the first, the effect of omics on the trait (which can be seen as an improved estimate of the phenotype) and the residual breeding values are estimated. The second model extracts the mediated breeding values from the improved estimate of the phenotype, considering that omics themselves are heritable. The whole procedure is called GOBLUP (genomics omics BLUP) and it allows measures in only some individuals; that is, it is a "single-step"-like method. In this model, heritability is split into "mediated" and "not mediated" parts. This decomposition allows us to predict how accurate the omics measure of the trait would be compared with the direct measure. The ideal omics measure is heritable and explains a large part of the phenotypic variation of the trait. Ideally, this could be the case for some traits with low heritability. However, even if the omics measure explains only a small part of the phenotypic variation, when omics measurement themselves are heritable, the use of such a model would lead to more accurate selection. Expressions for upper bounds of reliability given omics measurements are also presented. More studies are needed to confirm the usefulness of omics or high-throughput prediction. Usefulness of the technology likely needs to be checked on a case-by-case basis.
Collapse
Affiliation(s)
- A. Legarra
- GenPhySE (Genetique, Physiologie et Systemes d'Elevage), INRA, 31326 Castanet-Tolosan, France,Corresponding author
| | - O.F. Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
9
|
Perez BC, Bink MCAM, Svenson KL, Churchill GA, Calus MPL. Adding gene transcripts into genomic prediction improves accuracy and reveals sampling time dependence. G3 (BETHESDA, MD.) 2022; 12:jkac258. [PMID: 36161485 PMCID: PMC9635642 DOI: 10.1093/g3journal/jkac258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 09/07/2022] [Indexed: 06/16/2023]
Abstract
Recent developments allowed generating multiple high-quality 'omics' data that could increase the predictive performance of genomic prediction for phenotypes and genetic merit in animals and plants. Here, we have assessed the performance of parametric and nonparametric models that leverage transcriptomics in genomic prediction for 13 complex traits recorded in 478 animals from an outbred mouse population. Parametric models were implemented using the best linear unbiased prediction, while nonparametric models were implemented using the gradient boosting machine algorithm. We also propose a new model named GTCBLUP that aims to remove between-omics-layer covariance from predictors, whereas its counterpart GTBLUP does not do that. While gradient boosting machine models captured more phenotypic variation, their predictive performance did not exceed the best linear unbiased prediction models for most traits. Models leveraging gene transcripts captured higher proportions of the phenotypic variance for almost all traits when these were measured closer to the moment of measuring gene transcripts in the liver. In most cases, the combination of layers was not able to outperform the best single-omics models to predict phenotypes. Using only gene transcripts, the gradient boosting machine model was able to outperform best linear unbiased prediction for most traits except body weight, but the same pattern was not observed when using both single nucleotide polymorphism genotypes and gene transcripts. Although the GTCBLUP model was not able to produce the most accurate phenotypic predictions, it showed the highest accuracies for breeding values for 9 out of 13 traits. We recommend using the GTBLUP model for prediction of phenotypes and using the GTCBLUP for prediction of breeding values.
Collapse
Affiliation(s)
- Bruno C Perez
- Hendrix Genetics B.V., Research and Technology Center (RTC), 5830 AC Boxmeer, The Netherlands
| | - Marco C A M Bink
- Hendrix Genetics B.V., Research and Technology Center (RTC), 5830 AC Boxmeer, The Netherlands
| | | | | | - Mario P L Calus
- Corresponding author: Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH Wageningen, The Netherlands.
| |
Collapse
|
10
|
Liang M, An B, Chang T, Deng T, Du L, Li K, Cao S, Du Y, Xu L, Zhang L, Gao X, Li J, Gao H. Incorporating kernelized multi-omics data improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2022; 13:103. [PMID: 36127743 PMCID: PMC9490992 DOI: 10.1186/s40104-022-00756-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 07/08/2022] [Indexed: 11/18/2022] Open
Abstract
Background Genomic selection (GS) has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes. Besides genome, transcriptome and metabolome information are increasingly considered new sources for GS. Difficulties in building the model with multi-omics data for GS and the limit of specimen availability have both delayed the progress of investigating multi-omics. Results We utilized the Cosine kernel to map genomic and transcriptomic data as \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${n}\times {n}$$\end{document}n×n symmetric matrix (G matrix and T matrix), combined with the best linear unbiased prediction (BLUP) for GS. Here, we defined five kernel-based prediction models: genomic BLUP (GBLUP), transcriptome-BLUP (TBLUP), multi-omics BLUP (MBLUP, \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\boldsymbol M=\mathrm{ratio}\times\boldsymbol G+(1-\mathrm{ratio})\times\boldsymbol T$$\end{document}M=ratio×G+(1-ratio)×T), multi-omics single-step BLUP (mssBLUP), and weighted multi-omics single-step BLUP (wmssBLUP) to integrate transcribed individuals and genotyped resource population. The predictive accuracy evaluations in four traits of the Chinese Simmental beef cattle population showed that (1) MBLUP was far preferred to GBLUP (ratio = 1.0), (2) the prediction accuracy of wmssBLUP and mssBLUP had 4.18% and 3.37% average improvement over GBLUP, (3) We also found the accuracy of wmssBLUP increased with the growing proportion of transcribed cattle in the whole resource population. Conclusions We concluded that the inclusion of transcriptome data in GS had the potential to improve accuracy. Moreover, wmssBLUP is accepted to be a promising alternative for the present situation in which plenty of individuals are genotyped when fewer are transcribed. Supplementary Information The online version contains supplementary material available at 10.1186/s40104-022-00756-6.
Collapse
Affiliation(s)
- Mang Liang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Bingxing An
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Tianpeng Chang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Tianyu Deng
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Lili Du
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Keanning Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Sheng Cao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Yueying Du
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, People's Republic of China.
| |
Collapse
|
11
|
Mollandin F, Gilbert H, Croiseau P, Rau A. Accounting for overlapping annotations in genomic prediction models of complex traits. BMC Bioinformatics 2022; 23:365. [PMID: 36068513 PMCID: PMC9446854 DOI: 10.1186/s12859-022-04914-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 08/25/2022] [Indexed: 11/10/2022] Open
Abstract
Background It is now widespread in livestock and plant breeding to use genotyping data to predict phenotypes with genomic prediction models. In parallel, genomic annotations related to a variety of traits are increasing in number and granularity, providing valuable insight into potentially important positions in the genome. The BayesRC model integrates this prior biological information by factorizing the genome according to disjoint annotation categories, in some cases enabling improved prediction of heritable traits. However, BayesRC is not adapted to cases where markers may have multiple annotations. Results We propose two novel Bayesian approaches to account for multi-annotated markers through a cumulative (BayesRC+) or preferential (BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π) model of the contribution of multiple annotation categories. We illustrate their performance on simulated data with various genetic architectures and types of annotations. We also explore their use on data from a backcross population of growing pigs in conjunction with annotations constructed using the PigQTLdb. In both simulated and real data, we observed a modest improvement in prediction quality with our models when used with informative annotations. In addition, our results show that BayesRC+ successfully prioritizes multi-annotated markers according to their posterior variance, while BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π provides a useful interpretation of informative annotations for multi-annotated markers. Finally, we explore several strategies for constructing annotations from a public database, highlighting the importance of careful consideration of this step. Conclusion When used with annotations that are relevant to the trait under study, BayesRC\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\pi$$\end{document}π and BayesRC+ allow for improved prediction and prioritization of multi-annotated markers, and can provide useful biological insight into the genetic architecture of traits. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04914-5.
Collapse
Affiliation(s)
- Fanny Mollandin
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France.
| | - Hélène Gilbert
- GenPhySE, INRAE, ENVT, Université de Toulouse, 31320, Castanet Tolosan, France
| | - Pascal Croiseau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France
| | - Andrea Rau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Allée de Vilvert, 78350, Jouy-en-Josas, France.,BioEcoAgro Joint Research Unit, INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, 50136, Estrée-Mons, France
| |
Collapse
|
12
|
Wade AR, Duruflé H, Sanchez L, Segura V. eQTLs are key players in the integration of genomic and transcriptomic data for phenotype prediction. BMC Genomics 2022; 23:476. [PMID: 35764918 PMCID: PMC9238188 DOI: 10.1186/s12864-022-08690-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 06/11/2022] [Indexed: 11/10/2022] Open
Abstract
Background Multi-omics represent a promising link between phenotypes and genome variation. Few studies yet address their integration to understand genetic architecture and improve predictability. Results Our study used 241 poplar genotypes, phenotyped in two common gardens, with xylem and cambium RNA sequenced at one site, yielding large phenotypic, genomic (SNP), and transcriptomic datasets. Prediction models for each trait were built separately for SNPs and transcripts, and compared to a third model integrated by concatenation of both omics. The advantage of integration varied across traits and, to understand such differences, an eQTL analysis was performed to characterize the interplay between the genome and transcriptome and classify the predicting features into cis or trans relationships. A strong, significant negative correlation was found between the change in predictability and the change in predictor ranking for trans eQTLs for traits evaluated in the site of transcriptomic sampling. Conclusions Consequently, beneficial integration happens when the redundancy of predictors is decreased, likely leaving the stage to other less prominent but complementary predictors. An additional gene ontology (GO) enrichment analysis appeared to corroborate such statistical output. To our knowledge, this is a novel finding delineating a promising method to explore data integration. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08690-7.
Collapse
|
13
|
Abstract
In this chapter, we discuss the motivation for integrating other types of omics data into genomic prediction methods. We give an overview of literature investigating the performance of omics-enhanced predictions, and highlight potential pitfalls when applying these methods in breeding. We emphasize that the statistical methods available for genomic data can be transferred to the general omics case. However, when using a framework of omic relationship matrices, the standardization of the variables may be more relevant than it is for a genomic relationship matrix based on single-nucleotide polymorphisms.
Collapse
Affiliation(s)
- Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico.
| | - Ning Gao
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico
| |
Collapse
|
14
|
Christensen OF, Börner V, Varona L, Legarra A. Genetic evaluation including intermediate omics features. Genetics 2021; 219:6345349. [PMID: 34849886 DOI: 10.1093/genetics/iyab130] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 07/13/2021] [Indexed: 11/14/2022] Open
Abstract
In animal and plant breeding and genetics, there has been an increasing interest in intermediate omics traits, such as metabolomics and transcriptomics, which mediate the effect of genetics on the phenotype of interest. For inclusion of such intermediate traits into a genetic evaluation system, there is a need for a statistical model that integrates phenotypes, genotypes, pedigree, and omics traits, and a need for associated computational methods that provide estimated breeding values. In this paper, a joint model for phenotypes and omics data is presented, and a formula for the breeding values on individuals is derived. For complete omics data, three equivalent methods for best linear unbiased prediction of breeding values are presented. In all three cases, this requires solving two mixed model equation systems. Estimation of parameters using restricted maximum likelihood is also presented. For incomplete omics data, extensions of two of these methods are presented, where in both cases, the extension consists of extending an omics-related similarity matrix to incorporate individuals without omics data. The methods are illustrated using a simulated data set.
Collapse
Affiliation(s)
- Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Vinzent Börner
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Luis Varona
- Departmento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, 50013 Saragoza, Spain
| | - Andres Legarra
- GenPhySE (Génétique, Physiologie et Systèmes d'Elevage), INRA, 31326 Castanet-Tolosan, France
| |
Collapse
|
15
|
Singh RS. Decoding 'Unnecessary Complexity': A Law of Complexity and a Concept of Hidden Variation Behind "Missing Heritability" in Precision Medicine. J Mol Evol 2021; 89:513-526. [PMID: 34341835 PMCID: PMC8327892 DOI: 10.1007/s00239-021-10023-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 07/20/2021] [Indexed: 01/06/2023]
Abstract
The high hopes for the Human Genome Project and personalized medicine were not met because the relationship between genotypes and phenotypes turned out to be more complex than expected. In a previous study we laid the foundation of a theory of complexity and showed that because of the blind nature of evolution, and molecular and historical contingency, cells have accumulated unnecessary complexity, complexity beyond what is necessary and sufficient to describe an organism. Here we provide empirical evidence and show that unnecessary complexity has become integrated into the genome in the form of redundancy and is relevant to molecular evolution of phenotypic complexity. Unnecessary complexity creates uncertainty between molecular and phenotypic complexity, such that phenotypic complexity (CP) is higher than molecular complexity (CM), which is higher than DNA complexity (CD). The qualitative inequality in complexity is based on the following hierarchy: CP > CM > CD. This law-like relationship holds true for all complex traits, including complex diseases. We present a hypothesis of two types of variation, namely open and closed (hidden) systems, show that hidden variation provides a hitherto undiscovered "third source" of phenotypic variation, beside genotype and environment, and argue that "missing heritability" for some complex diseases is likely to be a case of "diluted heritability". There is a need for radically new ways of thinking about the principles of genotype-phenotype relationship. Understanding how cells use hidden, pathway variation to respond to stress can shed light on why two individuals who share the same risk factors may not develop the same disease, or how cancer cells escape death.
Collapse
Affiliation(s)
- Rama S Singh
- Department of Biology, and Origins Institute, McMaster University, 1280 Main Street West, Hamilton, ON, L8S4K1, Canada.
| |
Collapse
|
16
|
Campbell MT, Hu H, Yeats TH, Brzozowski LJ, Caffe-Treml M, Gutiérrez L, Smith KP, Sorrells ME, Gore MA, Jannink JL. Improving Genomic Prediction for Seed Quality Traits in Oat (Avena sativa L.) Using Trait-Specific Relationship Matrices. Front Genet 2021; 12:643733. [PMID: 33868378 PMCID: PMC8044359 DOI: 10.3389/fgene.2021.643733] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 03/04/2021] [Indexed: 11/13/2022] Open
Abstract
The observable phenotype is the manifestation of information that is passed along different organization levels (transcriptional, translational, and metabolic) of a biological system. The widespread use of various omic technologies (RNA-sequencing, metabolomics, etc.) has provided plant genetics and breeders with a wealth of information on pertinent intermediate molecular processes that may help explain variation in conventional traits such as yield, seed quality, and fitness, among others. A major challenge is effectively using these data to help predict the genetic merit of new, unobserved individuals for conventional agronomic traits. Trait-specific genomic relationship matrices (TGRMs) model the relationships between individuals using genome-wide markers (SNPs) and place greater emphasis on markers that most relevant to the trait compared to conventional genomic relationship matrices. Given that these approaches define relationships based on putative causal loci, it is expected that these approaches should improve predictions for related traits. In this study we evaluated the use of TGRMs to accommodate information on intermediate molecular phenotypes (referred to as endophenotypes) and to predict an agronomic trait, total lipid content, in oat seed. Nine fatty acids were quantified in a panel of 336 oat lines. Marker effects were estimated for each endophenotype, and were used to construct TGRMs. A multikernel TRGM model (MK-TRGM-BLUP) was used to predict total seed lipid content in an independent panel of 210 oat lines. The MK-TRGM-BLUP approach significantly improved predictions for total lipid content when compared to a conventional genomic BLUP (gBLUP) approach. Given that the MK-TGRM-BLUP approach leverages information on the nine fatty acids to predict genetic values for total lipid content in unobserved individuals, we compared the MK-TGRM-BLUP approach to a multi-trait gBLUP (MT-gBLUP) approach that jointly fits phenotypes for fatty acids and total lipid content. The MK-TGRM-BLUP approach significantly outperformed MT-gBLUP. Collectively, these results highlight the utility of using TGRM to accommodate information on endophenotypes and improve genomic prediction for a conventional agronomic trait.
Collapse
Affiliation(s)
- Malachy T. Campbell
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Haixiao Hu
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Trevor H. Yeats
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Lauren J. Brzozowski
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Melanie Caffe-Treml
- Seed Technology Lab 113, Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, United States
| | - Lucía Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, United States
| | - Kevin P. Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, United States
| | - Mark E. Sorrells
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Michael A. Gore
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Jean-Luc Jannink
- Plant Breeding & Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
- R.W. Holley Center for Agriculture & Health, US Department of Agriculture, Agricultural Research Service, Ithaca, NY, United States
| |
Collapse
|
17
|
Ye S, Li J, Zhang Z. Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2020; 11:109. [PMID: 33292577 PMCID: PMC7708144 DOI: 10.1186/s40104-020-00515-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/22/2020] [Indexed: 12/02/2022] Open
Abstract
Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China.
| |
Collapse
|