1
|
Charles M, Gaiani N, Sanchez MP, Boussaha M, Hozé C, Boichard D, Rocha D, Boulling A. Functional impact of splicing variants in the elaboration of complex traits in cattle. Nat Commun 2025; 16:3893. [PMID: 40274775 PMCID: PMC12022281 DOI: 10.1038/s41467-025-58970-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 04/04/2025] [Indexed: 04/26/2025] Open
Abstract
GWAS conducted directly on imputed whole genome sequence have led to the identification of numerous genetic variants associated with agronomic traits in cattle. However, such variants are often simply markers in linkage disequilibrium with the actual causal variants, which is a limiting factor for the development of accurate genomic predictions. It is possible to identify causal variants by integrating information on how variants impact gene expression into GWAS output. RNA splicing plays a major role in regulating gene expression. Thus, assessing the effect of variants on RNA splicing may explain their function. Here, we use a high-throughput strategy to functionally analyse putative splice-disrupting variants in the bovine genome. Using GWAS, massively parallel reporter assay and deep learning algorithms designed to predict splice-disrupting variants, we identify 38 splice-disrupting variants associated with complex traits in cattle, three of which could be classified as causal. Our results indicate that splice-disrupting variants are widely found in the quantitative trait loci related to these phenotypes. Using our combined approach, we also assess the validity of splicing predictors originally developed to analyse human variants in the context of the bovine genome.
Collapse
Affiliation(s)
- Mathieu Charles
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
- INRAE, SIGENAE, 78350, Jouy-en-Josas, France
| | - Nicolas Gaiani
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Marie-Pierre Sanchez
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Mekki Boussaha
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Chris Hozé
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
- ELIANCE, 75012, Paris, France
| | - Didier Boichard
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Dominique Rocha
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Arnaud Boulling
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France.
| |
Collapse
|
2
|
Alemu SW, Lopdell TJ, Trevarton AJ, Snell RG, Littlejohn MD, Garrick DJ. Comparison of genomic prediction accuracies in dairy cattle lactation traits using five classes of functional variants versus generic SNP. Genet Sel Evol 2025; 57:20. [PMID: 40217496 PMCID: PMC11987224 DOI: 10.1186/s12711-025-00966-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 03/27/2025] [Indexed: 04/14/2025] Open
Abstract
BACKGROUND Genomic selection, typically employing genetic markers from SNP chips, is routine in modern dairy cattle breeding. This study assessed the impact of functional sequence variants on genomic prediction accuracy relative to 50 k SNP chip markers for fat percent, protein percent, milk volume, fat yield, and protein yield in lactating dairy cattle. The functional variants were identified through GWAS, RNA-seq, Histone modification ChIP-seq, ATAC-seq, or were coding variants. The genomic prediction accuracy obtained using each class of functional variants was compared with matched numbers of SNPs randomly selected from the Illumina 50 k SNP chip. RESULTS The investigation revealed that variants identified by GWAS or RNA-seq, significantly improved the prediction accuracy across all five traits. Contributions from ChIP-seq, ATAC-seq, and coding variants varied. Some variants identified using ChIP-seq showed marked improvements, while others reduced accuracy in protein yield predictions. Relative to a matched number of 32,595 SNPs from the SNP chip, pooling all the functional variants demonstrated prediction accuracy increases of 1.76% for fat percent, 2.97% for protein percent, 0.51% for milk volume, and 0.26% for fat yield, but with a slight decrease of 0.43% in protein yield. CONCLUSION The study demonstrates that functional variants can improve prediction accuracy relative to equivalent numbers of variants from a generic SNP panel, with percent traits showing more significant gains than yield traits. The main advantage of using functional variants for genomic prediction was achievement of comparable accuracy using a smaller, more selective set of loci. This is particularly evident in trait-specific scenarios. Our findings indicate that specific combinations of functional variants comprising 16 k variants can achieve genomic prediction accuracy comparable to employing a standard panel of twice the size (32.6 k), especially for percent traits. This highlights the potential for the development of more efficient, trait-focused SNP panels utilizing functional variants.
Collapse
Affiliation(s)
- Setegn Worku Alemu
- AL Rae Centre for Genetics and Breeding, Massey University, 10 Bisley Drive, Hamilton, 3240, New Zealand.
- Invermay Agricultural Centre, AgResearch Limited, Mosgiel, New Zealand.
| | | | | | - Russell G Snell
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Mathew D Littlejohn
- AL Rae Centre for Genetics and Breeding, Massey University, 10 Bisley Drive, Hamilton, 3240, New Zealand
- LIC, Hamilton, New Zealand
| | - Dorian J Garrick
- AL Rae Centre for Genetics and Breeding, Massey University, 10 Bisley Drive, Hamilton, 3240, New Zealand
| |
Collapse
|
3
|
Dos Santos TCF, Silva EN, Frezarim GB, Salatta BM, Baldi F, Fonseca LFS, Albuquerque LGD, Muniz MMM, Silva DBDS. Identification of cis-sQTL demonstrates genetic associations and functional implications of inflammatory processes in Nelore cattle muscle tissue. Mamm Genome 2025; 36:106-117. [PMID: 39825903 DOI: 10.1007/s00335-024-10100-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Accepted: 12/22/2024] [Indexed: 01/20/2025]
Abstract
This study aimed to identify splicing quantitative trait loci (cis-sQTL) in Nelore cattle muscle tissue and explore the involvement of spliced genes (sGenes) in immune system-related biological processes. Genotypic data from 80 intact male Nelore cattle were obtained using SNP-Chip technology, while RNA-Seq analysis was performed to measure gene expression levels, enabling the integration of genomic and transcriptomic datasets. The normalized expression levels of spliced transcripts were associated with single nucleotide polymorphisms (SNPs) through an analysis of variance using an additive linear model with the MatrixEQTL package. A permutation analysis then assessed the significance of the best SNPs for each spliced transcript. Functional enrichment analysis was performed on the sGenes to investigate their roles in the immune system. In total, 3,187 variants were linked to 3,202 spliced transcripts, with 83 sGenes involved in immune system processes. Of these, 31 sGenes were enriched for five transcription factors. Most cis-sQTL effects were found in intronic regions, with 27 sQTL variants associated with disease susceptibility and resistance in cattle. Key sGenes identified, such as GSDMA, NLRP6, CASP6, GZMA, CASP4, CASP1, TREM2, NLRP1, and NAIP, were related to inflammasome formation and pyroptosis. Additionally, genes like PIDD1, OPTN, NFKBIB, STAT1, TNIP3, and TREM2 were involved in regulating the NF-kB pathway. These findings lay the groundwork for breeding disease-resistant cattle and enhance our understanding of genetic mechanisms in immune responses.
Collapse
Affiliation(s)
- Thaís Cristina Ferreira Dos Santos
- Universidade Professor Edson Antônio Velano (UNIFENAS), Rodovia 179, Km 0, Alfenas, MG, 37132440, Brasil.
- Centro Nacional de Pesquisa em Energia e Materiais (CNPEM), Campinas, SP, Brasil.
| | - Evandro Neves Silva
- Universidade Professor Edson Antônio Velano (UNIFENAS), Rodovia 179, Km 0, Alfenas, MG, 37132440, Brasil
- Universidade Federal de Alfenas (UNIFAL), Alfenas, MG, Brasil
| | | | - Bruna Maria Salatta
- Faculdade de Ciências Agrárias e Veterinárias (FCAV-UNESP), Jaboticabal, SP, Brasil
| | - Fernando Baldi
- Faculdade de Ciências Agrárias e Veterinárias (FCAV-UNESP), Jaboticabal, SP, Brasil
| | | | - Lucia Galvão De Albuquerque
- Faculdade de Ciências Agrárias e Veterinárias (FCAV-UNESP), Jaboticabal, SP, Brasil
- Conselho Nacional de Desenvolvimento Científico e Tecnológico, Brasília, DF, Brasil
| | - Maria Malane Magalhães Muniz
- Faculdade de Ciências Agrárias e Veterinárias (FCAV-UNESP), Jaboticabal, SP, Brasil
- University of Guelph, UOGELPH, Guelph, Canada
| | - Danielly Beraldo Dos Santos Silva
- Universidade Professor Edson Antônio Velano (UNIFENAS), Rodovia 179, Km 0, Alfenas, MG, 37132440, Brasil.
- Faculdade de Ciências Agrárias e Veterinárias (FCAV-UNESP), Jaboticabal, SP, Brasil.
| |
Collapse
|
4
|
Križanac AM, Reimer C, Heise J, Liu Z, Pryce JE, Bennewitz J, Thaller G, Falker-Gieske C, Tetens J. Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits. Genet Sel Evol 2025; 57:3. [PMID: 39905301 PMCID: PMC11796172 DOI: 10.1186/s12711-025-00951-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 01/23/2025] [Indexed: 02/06/2025] Open
Abstract
BACKGROUND Milk production traits are complex and influenced by many genetic and environmental factors. Although extensive research has been performed for these traits, with many associations unveiled thus far, due to their crucial economic importance, complex genetic architecture, and the fact that causal variants in cattle are still scarce, there is a need for a better understanding of their genetic background. In this study, we aimed to identify new candidate loci associated with milk production traits in German Holstein cattle, the most important dairy breed in Germany and worldwide. For that purpose, 180,217 cattle were imputed to the sequence level and large-scale genome-wide association study (GWAS) followed by fine-mapping and evolutionary and functional annotation were carried out to identify and prioritize new association signals. RESULTS Using the imputed sequence data of a large cattle dataset, we identified 50,876 significant variants, confirming many known and identifying previously unreported candidate variants for milk (MY), fat (FY), and protein yield (PY). Genome-wide significant signals were fine-mapped with the Bayesian approach that determines the credible variant sets and generates the probability of causality for each signal. The variants with the highest probabilities of being causal were further classified using external information about the function and evolution, making the prioritization for subsequent validation experiments easier. The top potential causal variants determined with fine-mapping explained a large percentage of genetic variance compared to random ones; 178 variants explained 11.5%, 104 explained 7.7%, and 68 variants explained 3.9% of the variance for MY, FY, and PY, respectively, demonstrating the potential for causality. CONCLUSIONS Our findings proved the power of large samples and sequence-based GWAS in detecting new association signals. In order to fully exploit the power of GWAS, one should aim at very large samples combined with whole-genome sequence data. These can also come with both computational and time burdens, as presented in our study. Although milk production traits in cattle are comprehensively investigated, the genetic background of these traits is still not fully understood, with the potential for many new associations to be revealed, as shown. With constantly growing sample sizes, we expect more insights into the genetic architecture of milk production traits in the future.
Collapse
Affiliation(s)
- Ana-Marija Križanac
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany.
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany.
| | - Christian Reimer
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, 31535, Neustadt, Germany
| | - Johannes Heise
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283, Verden, Germany
| | - Zengting Liu
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283, Verden, Germany
| | - Jennie E Pryce
- Agriculture Victoria Research, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Jörn Bennewitz
- Institute of Animal Science, University of Hohenheim, 70599, Stuttgart, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University, 24118, Kiel, Germany
| | - Clemens Falker-Gieske
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
| |
Collapse
|
5
|
Leonard AS, Mapel XM, Pausch H. RNA-DNA differences in variant calls from cattle tissues result in erroneous eQTLs. BMC Genomics 2024; 25:750. [PMID: 39090567 PMCID: PMC11295900 DOI: 10.1186/s12864-024-10645-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Accepted: 07/22/2024] [Indexed: 08/04/2024] Open
Abstract
BACKGROUND Association testing between molecular phenotypes and genomic variants can help to understand how genotype affects phenotype. RNA sequencing provides access to molecular phenotypes such as gene expression and alternative splicing while DNA sequencing or microarray genotyping are the prevailing options to obtain genomic variants. RESULTS We genotype variants for 74 male Braunvieh cattle from both DNA (~ 13-fold coverage) and deep total RNA sequencing from testis, vas deferens, and epididymis tissue (~ 250 million reads per tissue). We show that RNA sequencing can be used to identify approximately 40% of variants (7-10 million) called from DNA sequencing, with over 80% precision. Within highly expressed coding regions, over 92% of expected variants were called with nearly 98% precision. Allele-specific expression and putative post-transcriptional modifications negatively impact variant genotyping accuracy from RNA sequencing and contribute to RNA-DNA differences. Variants called from RNA sequencing detect roughly 75% of eGenes identified using variants called from DNA sequencing, demonstrating a nearly 2-fold enrichment of eQTL variants. We observe a moderate-to-strong correlation in nominal association p-values (Spearman ρ2 ~ 0.6), although only 9% of eGenes have the same top associated variant. CONCLUSIONS We find hundreds of thousands of RNA-DNA differences in variants called from RNA and DNA sequencing on the same individuals. We identify several highly significant eQTL when using RNA sequencing variant genotypes which are not found with DNA sequencing variant genotypes, suggesting that using RNA sequencing variant genotypes for association testing results in an increased number of false positives. Our findings demonstrate that caution must be exercised beyond filtering for variant quality or imputation accuracy when analysing or imputing variants called from RNA sequencing.
Collapse
Affiliation(s)
- Alexander S Leonard
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland.
| | - Xena M Mapel
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland
| | - Hubert Pausch
- Animal Genomics, ETH Zurich, Universitaetstrasse 2, Zurich, 8092, Switzerland.
| |
Collapse
|
6
|
Ooi E, Xiang R, Chamberlain AJ, Goddard ME. Archetypal clustering reveals physiological mechanisms linking milk yield and fertility in dairy cattle. J Dairy Sci 2024; 107:4726-4742. [PMID: 38369117 DOI: 10.3168/jds.2023-23699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 01/11/2024] [Indexed: 02/20/2024]
Abstract
Fertility in dairy cattle has declined as an unintended consequence of single-trait selection for high milk yield. The unfavorable genetic correlation between milk yield and fertility is now well documented; however, the underlying physiological mechanisms are still uncertain. To understand the relationship between these traits, we developed a method that clusters variants with similar patterns of effects and, after the integration of gene expression data, identifies the genes through which they are likely to act. Biological processes that are enriched in the genes of each cluster were then identified. We identified several clusters with unique patterns of effects. One of the clusters included variants associated with increased milk yield and decreased fertility, where the "archetypal" variant (i.e., the one with the largest effect) was associated with the GC gene, whereas others were associated with TRIM32, LRRK2, and U6-associated snRNA. These genes have been linked to transcription and alternative splicing, suggesting that these processes are likely contributors to the unfavorable relationship between the 2 traits. Another cluster, with archetypal variant near DGAT1 and including variants associated with CDH2, BTRC, SFRP2, ZFHX3, and SLITRK5, appeared to affect milk yield but have little effect on fertility. These genes have been linked to insulin, adipose tissue, and energy metabolism. A third cluster with archetypal variant near ZNF613 and including variants associated with ROBO1, EFNA5, PALLD, GPC6, and PTPRT were associated with fertility but not milk yield. These genes have been linked to GnRH neuronal migration, embryonic development, or ovarian function. The use of archetypal clustering to group variants with similar patterns of effects may assist in identifying the biological processes underlying correlated traits. The method is hypothesis generating and requires experimental confirmation. However, we have uncovered several novel mechanisms potentially affecting milk production and fertility such as GnRH neuronal migration. We anticipate our method to be a starting point for experimental research into novel pathways, which have been previously unexplored within the context of dairy production.
Collapse
Affiliation(s)
- E Ooi
- Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, Victoria 3010, Australia; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria 3083, Australia.
| | - R Xiang
- Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, Victoria 3010, Australia; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria 3083, Australia
| | - A J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria 3083, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, Victoria 3083, Australia
| | - M E Goddard
- Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, Victoria 3010, Australia; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria 3083, Australia
| |
Collapse
|
7
|
Tang Y, Zhang J, Li W, Liu X, Chen S, Mi S, Yang J, Teng J, Fang L, Yu Y. Identification and characterization of whole blood gene expression and splicing quantitative trait loci during early to mid-lactation of dairy cattle. BMC Genomics 2024; 25:445. [PMID: 38711039 DOI: 10.1186/s12864-024-10346-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Accepted: 04/25/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND Characterization of regulatory variants (e.g., gene expression quantitative trait loci, eQTL; gene splicing QTL, sQTL) is crucial for biologically interpreting molecular mechanisms underlying loci associated with complex traits. However, regulatory variants in dairy cattle, particularly in specific biological contexts (e.g., distinct lactation stages), remain largely unknown. In this study, we explored regulatory variants in whole blood samples collected during early to mid-lactation (22-150 days after calving) of 101 Holstein cows and analyzed them to decipher the regulatory mechanisms underlying complex traits in dairy cattle. RESULTS We identified 14,303 genes and 227,705 intron clusters expressed in the white blood cells of 101 cattle. The average heritability of gene expression and intron excision ratio explained by cis-SNPs is 0.28 ± 0.13 and 0.25 ± 0.13, respectively. We identified 23,485 SNP-gene expression pairs and 18,166 SNP-intron cluster pairs in dairy cattle during early to mid-lactation. Compared with the 2,380,457 cis-eQTLs reported to be present in blood in the Cattle Genotype-Tissue Expression atlas (CattleGTEx), only 6,114 cis-eQTLs (P < 0.05) were detected in the present study. By conducting colocalization analysis between cis-e/sQTL and the results of genome-wide association studies (GWAS) from four traits, we identified a cis-e/sQTL (rs109421300) of the DGAT1 gene that might be a key marker in early to mid-lactation for milk yield, fat yield, protein yield, and somatic cell score (PP4 > 0.6). Finally, transcriptome-wide association studies (TWAS) revealed certain genes (e.g., FAM83H and TBC1D17) whose expression in white blood cells was significantly (P < 0.05) associated with complex traits. CONCLUSIONS This study investigated the genetic regulation of gene expression and alternative splicing in dairy cows during early to mid-lactation and provided new insights into the regulatory mechanisms underlying complex traits of economic importance.
Collapse
Affiliation(s)
- Yongjie Tang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Jinning Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Wenlong Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Xueqin Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Siqian Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Siyuan Mi
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Jinyan Yang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Jinyan Teng
- State Key Laboratory of Swine and Poultry Breeding Industry, National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark.
| | - Ying Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture & National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
8
|
Leonard AS, Mapel XM, Pausch H. Pangenome-genotyped structural variation improves molecular phenotype mapping in cattle. Genome Res 2024; 34:300-309. [PMID: 38355307 PMCID: PMC10984387 DOI: 10.1101/gr.278267.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 02/01/2024] [Indexed: 02/16/2024]
Abstract
Expression and splicing quantitative trait loci (e/sQTL) are large contributors to phenotypic variability. Achieving sufficient statistical power for e/sQTL mapping requires large cohorts with both genotypes and molecular phenotypes, and so, the genomic variation is often called from short-read alignments, which are unable to comprehensively resolve structural variation. Here we build a pangenome from 16 HiFi haplotype-resolved cattle assemblies to identify small and structural variation and genotype them with PanGenie in 307 short-read samples. We find high (>90%) concordance of PanGenie-genotyped and DeepVariant-called small variation and confidently genotype close to 21 million small and 43,000 structural variants in the larger population. We validate 85% of these structural variants (with MAF > 0.1) directly with a subset of 25 short-read samples that also have medium coverage HiFi reads. We then conduct e/sQTL mapping with this comprehensive variant set in a subset of 117 cattle that have testis transcriptome data, and find 92 structural variants as causal candidates for eQTL and 73 for sQTL. We find that roughly half of the top associated structural variants affecting expression or splicing are transposable elements, such as SV-eQTL for STN1 and MYH7 and SV-sQTL for CEP89 and ASAH2 Extensive linkage disequilibrium between small and structural variation results in only 28 additional eQTL and 17 sQTL discovered when including SVs, although many top associated SVs are compelling candidates.
Collapse
Affiliation(s)
| | - Xena M Mapel
- Animal Genomics, ETH Zurich, 8092 Zurich, Switzerland
| | - Hubert Pausch
- Animal Genomics, ETH Zurich, 8092 Zurich, Switzerland
| |
Collapse
|
9
|
Schneider H, Krizanac AM, Falker-Gieske C, Heise J, Tetens J, Thaller G, Bennewitz J. Genomic dissection of the correlation between milk yield and various health traits using functional and evolutionary information about imputed sequence variants of 34,497 German Holstein cows. BMC Genomics 2024; 25:265. [PMID: 38461236 PMCID: PMC11385139 DOI: 10.1186/s12864-024-10115-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 02/13/2024] [Indexed: 03/11/2024] Open
Abstract
BACKGROUND Over the last decades, it was subject of many studies to investigate the genomic connection of milk production and health traits in dairy cattle. Thereby, incorporating functional information in genomic analyses has been shown to improve the understanding of biological and molecular mechanisms shaping complex traits and the accuracies of genomic prediction, especially in small populations and across-breed settings. Still, little is known about the contribution of different functional and evolutionary genome partitioning subsets to milk production and dairy health. Thus, we performed a uni- and a bivariate analysis of milk yield (MY) and eight health traits using a set of ~34,497 German Holstein cows with 50K chip genotypes and ~17 million imputed sequence variants divided into 27 subsets depending on their functional and evolutionary annotation. In the bivariate analysis, eight trait-combinations were observed that contrasted MY with each health trait. Two genomic relationship matrices (GRM) were included, one consisting of the 50K chip variants and one consisting of each set of subset variants, to obtain subset heritabilities and genetic correlations. In addition, 50K chip heritabilities and genetic correlations were estimated applying merely the 50K GRM. RESULTS In general, 50K chip heritabilities were larger than the subset heritabilities. The largest heritabilities were found for MY, which was 0.4358 for the 50K and 0.2757 for the subset heritabilities. Whereas all 50K genetic correlations were negative, subset genetic correlations were both, positive and negative (ranging from -0.9324 between MY and mastitis to 0.6662 between MY and digital dermatitis). The subsets containing variants which were annotated as noncoding related, splice sites, untranslated regions, metabolic quantitative trait loci, and young variants ranked highest in terms of their contribution to the traits` genetic variance. We were able to show that linkage disequilibrium between subset variants and adjacent variants did not cause these subsets` high effect. CONCLUSION Our results confirm the connection of milk production and health traits in dairy cattle via the animals` metabolic state. In addition, they highlight the potential of including functional information in genomic analyses, which helps to dissect the extent and direction of the observed traits` connection in more detail.
Collapse
Affiliation(s)
- Helen Schneider
- Institute of Animal Science, University of Hohenheim, 70599, Stuttgart, Germany.
| | - Ana-Marija Krizanac
- Department of Animal Sciences, University of Göttingen, 37077, Göttingen, Germany
| | | | - Johannes Heise
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283, Verden, Germany
| | - Jens Tetens
- Department of Animal Sciences, University of Göttingen, 37077, Göttingen, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts University of Kiel, 24098, Kiel, Germany
| | - Jörn Bennewitz
- Institute of Animal Science, University of Hohenheim, 70599, Stuttgart, Germany
| |
Collapse
|
10
|
Xiang R, Fang L, Liu S, Macleod IM, Liu Z, Breen EJ, Gao Y, Liu GE, Tenesa A, Mason BA, Chamberlain AJ, Wray NR, Goddard ME. Gene expression and RNA splicing explain large proportions of the heritability for complex traits in cattle. CELL GENOMICS 2023; 3:100385. [PMID: 37868035 PMCID: PMC10589627 DOI: 10.1016/j.xgen.2023.100385] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 08/10/2022] [Accepted: 07/26/2023] [Indexed: 10/24/2023]
Abstract
Many quantitative trait loci (QTLs) are in non-coding regions. Therefore, QTLs are assumed to affect gene regulation. Gene expression and RNA splicing are primary steps of transcription, so DNA variants changing gene expression (eVariants) or RNA splicing (sVariants) are expected to significantly affect phenotypes. We quantify the contribution of eVariants and sVariants detected from 16 tissues (n = 4,725) to 37 traits of ∼120,000 cattle (average magnitude of genetic correlation between traits = 0.13). Analyzed in Bayesian mixture models, averaged across 37 traits, cis and trans eVariants and sVariants detected from 16 tissues jointly explain 69.2% (SE = 0.5%) of heritability, 44% more than expected from the same number of random variants. This 69.2% includes an average of 24% from trans e-/sVariants (14% more than expected). Averaged across 56 lipidomic traits, multi-tissue cis and trans e-/sVariants also explain 71.5% (SE = 0.3%) of heritability, demonstrating the essential role of proximal and distal regulatory variants in shaping mammalian phenotypes.
Collapse
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, the University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- Cambridge-Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, the University of Edinburgh, Edinburgh, UK
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
| | - Shuli Liu
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
| | - Iona M. Macleod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Zhiqian Liu
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Edmond J. Breen
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - George E. Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Albert Tenesa
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, the University of Edinburgh, Edinburgh, UK
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, the University of Edinburgh, Midlothian EH25 9RG, UK
| | - CattleGTEx Consortium
- Faculty of Veterinary & Agricultural Science, the University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- Cambridge-Baker Systems Genomics Initiative, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, the University of Edinburgh, Edinburgh, UK
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, the University of Edinburgh, Midlothian EH25 9RG, UK
- Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
- Queensland Brain Institute, the University of Queensland, Brisbane, QLD 4072, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Brett A. Mason
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Amanda J. Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Naomi R. Wray
- Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
- Queensland Brain Institute, the University of Queensland, Brisbane, QLD 4072, Australia
| | - Michael E. Goddard
- Faculty of Veterinary & Agricultural Science, the University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| |
Collapse
|
11
|
Zhang F, Velez-Irizarry D, Ernst CW, Huang W. Mapping splice QTLs reveals distinct transcriptional and post-transcriptional regulatory variation of gene expression and identifies putative alternative splicing variation mediating complex trait variation in pigs. BMC Genomics 2023; 24:240. [PMID: 37142954 PMCID: PMC10161646 DOI: 10.1186/s12864-023-09314-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 04/14/2023] [Indexed: 05/06/2023] Open
Abstract
BACKGROUND Alternative splicing is an important step in gene expression, generating multiple isoforms for the same genes and greatly expanding the diversity of proteomes. Genetic variation in alternative splicing contributes to phenotypic diversity in natural populations. However, the genetic basis of variation in alternative splicing in livestock including pigs remains poorly understood. RESULTS In this study, using a Duroc x Pietrain F2 pig population, we performed genome-wide analysis of alternative splicing estimated from stranded RNA-Seq data in skeletal muscle. We characterized the genetic architecture of alternative splicing and compared its basic features with those of overall gene expression. We detected a large number of novel alternative splicing events that were not previously annotated. We found heritability of quantitative alternative splicing scores (percent spliced in or PSI) to be lower than that of overall gene expression. In addition, heritabilities showed little correlation between alternative splicing and overall gene expression. We mapped expression QTLs (eQTLs) and splice QTLs (sQTLs) and found them to be largely non-overlapping. Finally, we integrated sQTL mapping with phenotype QTL (pQTL mapping to identify potential mediator of pQTL effect by alternative splicing. CONCLUSIONS Our results suggest that regulatory variation exists at multiple levels and that their genetic controls are distinct, offering opportunities for genetic improvement.
Collapse
Affiliation(s)
- Fei Zhang
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA.
| | | | - Catherine W Ernst
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA
| | - Wen Huang
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA.
| |
Collapse
|
12
|
Pausch H, Mapel XM. Review: Genetic mutations affecting bull fertility. Animal 2023; 17 Suppl 1:100742. [PMID: 37567657 DOI: 10.1016/j.animal.2023.100742] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 08/13/2023] Open
Abstract
Cattle are a well-suited "model organism" to study the genetic underpinnings of variation in male reproductive performance. The adoption of artificial insemination and genomic prediction in many cattle breeds provide access to microarray-derived genotypes and repeated measurements for semen quality and insemination success in several thousand bulls. Similar-sized mapping cohorts with phenotypes for male fertility are not available for most other species precluding powerful association testing. The repeated measurements of the artificial insemination bulls' semen quality enable the differentiation between transient and biologically relevant trait fluctuations, and thus, are an ideal source of phenotypes for variance components estimation and genome-wide association testing. Genome-wide case-control association testing involving bulls with either aberrant sperm quality or low insemination success revealed several causal recessive loss-of-function alleles underpinning monogenic reproductive disorders. These variants are routinely monitored with customised genotyping arrays in the male selection candidates to avoid the use of subfertile or infertile bulls for artificial insemination and natural service. Genome-wide association studies with quantitative measurements of semen quality and insemination success revealed quantitative trait loci for male fertility, but the underlying causal variants remain largely unknown. Moreover, these loci explain only a small part of the heritability of male fertility. Integrating genome-wide association studies with gene expression and other omics data from male reproductive tissues is required for the fine-mapping of candidate causal variants underlying variation in male reproductive performance in cattle.
Collapse
Affiliation(s)
- Hubert Pausch
- Animal Genomics, Department of Environmental Systems Science, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland.
| | - Xena Marie Mapel
- Animal Genomics, Department of Environmental Systems Science, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
| |
Collapse
|
13
|
Liu S, Gao Y, Canela-Xandri O, Wang S, Yu Y, Cai W, Li B, Xiang R, Chamberlain AJ, Pairo-Castineira E, D’Mellow K, Rawlik K, Xia C, Yao Y, Navarro P, Rocha D, Li X, Yan Z, Li C, Rosen BD, Van Tassell CP, Vanraden PM, Zhang S, Ma L, Cole JB, Liu GE, Tenesa A, Fang L. A multi-tissue atlas of regulatory variants in cattle. Nat Genet 2022; 54:1438-1447. [PMID: 35953587 PMCID: PMC7613894 DOI: 10.1038/s41588-022-01153-5] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 07/07/2022] [Indexed: 12/12/2022]
Abstract
Characterization of genetic regulatory variants acting on livestock gene expression is essential for interpreting the molecular mechanisms underlying traits of economic value and for increasing the rate of genetic gain through artificial selection. Here we build a Cattle Genotype-Tissue Expression atlas (CattleGTEx) as part of the pilot phase of the Farm animal GTEx (FarmGTEx) project for the research community based on 7,180 publicly available RNA-sequencing (RNA-seq) samples. We describe the transcriptomic landscape of more than 100 tissues/cell types and report hundreds of thousands of genetic associations with gene expression and alternative splicing for 23 distinct tissues. We evaluate the tissue-sharing patterns of these genetic regulatory effects, and functionally annotate them using multiomics data. Finally, we link gene expression in different tissues to 43 economically important traits using both transcriptome-wide association and colocalization analyses to decipher the molecular regulatory mechanisms underpinning such agronomic traits in cattle.
Collapse
Affiliation(s)
- Shuli Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
| | - Yahui Gao
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Oriol Canela-Xandri
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Ying Yu
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Wentao Cai
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing 100193, China
| | - Bingjie Li
- Scotland’s Rural College (SRUC), Roslin Institute Building, Midlothian EH25 9RG, UK
| | - Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville 3052, Victoria, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria 3083, Australia
| | - Amanda J. Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria 3083, Australia
| | - Erola Pairo-Castineira
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Kenton D’Mellow
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Konrad Rawlik
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - Charley Xia
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
| | - Yuelin Yao
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Pau Navarro
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Dominique Rocha
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas, F-78350, France
| | - Xiujin Li
- Guangdong Provincial Key Laboratory of Waterfowl Healthy Breeding, College of Animal Science & Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, Guangdong 510225, China
| | - Ze Yan
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Congjun Li
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Benjamin D. Rosen
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Curtis P. Van Tassell
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Paul M. Vanraden
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - John B. Cole
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - George E. Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
| | - Albert Tenesa
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian EH25 9RG, UK
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Lingzhao Fang
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, Maryland 20705, USA
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| |
Collapse
|
14
|
Mutant alleles differentially shape fitness and other complex traits in cattle. Commun Biol 2021; 4:1353. [PMID: 34857886 PMCID: PMC8640064 DOI: 10.1038/s42003-021-02874-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 11/09/2021] [Indexed: 11/08/2022] Open
Abstract
Mutant alleles (MAs) that have been classically recognised have large effects on phenotype and tend to be deleterious to traits and fitness. Is this the case for mutations with small effects? We infer MAs for 8 million sequence variants in 113k cattle and quantify the effects of MA on 37 complex traits. Heterozygosity for variants at genomic sites conserved across 100 vertebrate species increase fertility, stature, and milk production, positively associating these traits with fitness. MAs decrease stature and fat and protein concentration in milk, but increase gestation length and somatic cell count in milk (the latter indicative of mastitis). However, the frequency of MAs decreasing stature and fat and protein concentration, increasing gestation length and somatic cell count were lower than the frequency of MAs with the opposite effect. These results suggest bias in the mutations direction of effect (e.g. towards reduced protein in milk), but selection operating to reduce the frequency of these MAs. Taken together, our results imply two classes of genomic sites subject to long-term selection: sites conserved across vertebrates show hybrid vigour while sites subject to less long-term selection show a bias in mutation towards undesirable alleles.
Collapse
|
15
|
Schwartz JC, Maccari G, Heimeier D, Hammond JA. Highly-contiguous bovine genomes underpin accurate functional analyses and updated nomenclature of MHC class I. HLA 2021; 99:167-182. [PMID: 34802191 DOI: 10.1111/tan.14494] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 11/05/2021] [Accepted: 11/17/2021] [Indexed: 11/29/2022]
Abstract
The major histocompatibility complex (MHC) class I region of cattle is both highly polymorphic and, unlike many species, highly variable in gene content between haplotypes. Cattle MHC class I alleles were historically grouped by sequence similarity in the more conserved 3' end of the coding sequence to form phylogenetic allele groups. This has formed the basis of current cattle MHC class I nomenclature. We presently describe and compare five fully assembled MHC class I haplotypes using the latest cattle and yak genome assemblies. Of the five previously described "pseudogenes" in the cattle MHC class I region, Pseudogene 3 is putatively functional in all haplotypes and Pseudogene 6 and Pseudogene 7 are putatively functional in some haplotypes. This was reinforced by evidence of transcription. Based on full gene sequences as well as 3' coding sequence, we identified distinct subgroups of BoLA-3 and BoLA-6 that represent distinct genetic loci. We further examined allele-specific expression using transcriptomic data revealing that certain alleles are consistently weakly expressed compared to others. These observations will help to inform further studies into how MHC class I region variability influences T cell and natural killer cell functions in cattle.
Collapse
Affiliation(s)
| | - Giuseppe Maccari
- The Pirbright Institute, Pirbright, UK.,Anthony Nolan Research Institute, London, UK
| | | | | |
Collapse
|
16
|
Yuan Z, Ge L, Sun J, Zhang W, Wang S, Cao X, Sun W. Integrative analysis of Iso-Seq and RNA-seq data reveals transcriptome complexity and differentially expressed transcripts in sheep tail fat. PeerJ 2021; 9:e12454. [PMID: 34760406 PMCID: PMC8571958 DOI: 10.7717/peerj.12454] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 10/18/2021] [Indexed: 01/22/2023] Open
Abstract
Background Nowadays, both customers and producers prefer thin-tailed fat sheep. To effectively breed for this phenotype, it is important to identify candidate genes and uncover the genetic mechanism related to tail fat deposition in sheep. Accumulating evidence suggesting that post-transcriptional modification events of precursor-messenger RNA (pre-mRNA), including alternative splicing (AS) and alternative polyadenylation (APA), may regulate tail fat deposition in sheep. Differentially expressed transcripts (DETs) analysis is a way to identify candidate genes related to tail fat deposition. However, due to the technological limitation, post-transcriptional modification events in the tail fat of sheep and DETs between thin-tailed and fat-tailed sheep remains unclear. Methods In the present study, we applied pooled PacBio isoform sequencing (Iso-Seq) to generate transcriptomic data of tail fat tissue from six sheep (three thin-tailed sheep and three fat-tailed sheep). By comparing with reference genome, potential gene loci and novel transcripts were identified. Post-transcriptional modification events, including AS and APA, and lncRNA in sheep tail fat were uncovered using pooled Iso-Seq data. Combining Iso-Seq data with six RNA-sequencing (RNA-Seq) data, DETs between thin- and fat-tailed sheep were identified. Protein protein interaction (PPI) network, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were implemented to investigate the potential functions of DETs. Results In the present study, we revealed the transcriptomic complexity of the tail fat of sheep, result in 9,001 potential novel gene loci, 17,834 AS events, 5,791 APA events, and 3,764 lncRNAs. Combining Iso-Seq data with RNA-Seq data, we identified hundreds of DETs between thin- and fat-tailed sheep. Among them, 21 differentially expressed lncRNAs, such as ENSOART00020036299, ENSOART00020033641, ENSOART00020024562, ENSOART00020003848 and 9.53.1 may regulate tail fat deposition. Many novel transcripts were identified as DETs, including 15.527.13 (DGAT2), 13.624.23 (ACSS2), 11.689.28 (ACLY), 11.689.18 (ACLY), 11.689.14 (ACLY), 11.660.12 (ACLY), 22.289.6 (SCD), 22.289.3 (SCD) and 22.289.14 (SCD). Most of the identified DETs have been enriched in GO and KEGG pathways related to extracellular matrix (ECM). Our result revealed the transcriptome complexity and identified many candidate transcripts in tail fat, which could enhance the understanding of molecular mechanisms behind tail fat deposition.
Collapse
Affiliation(s)
- Zehu Yuan
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China
| | - Ling Ge
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Jingyi Sun
- College of Veterinary Medicine, Yangzhou University, Yangzhou, China
| | - Weibo Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Shanhe Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Xiukai Cao
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China
| | - Wei Sun
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education, Yangzhou University, Yangzhou, China.,College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| |
Collapse
|
17
|
Prowse-Wilkins CP, Wang J, Xiang R, Garner JB, Goddard ME, Chamberlain AJ. Putative Causal Variants Are Enriched in Annotated Functional Regions From Six Bovine Tissues. Front Genet 2021; 12:664379. [PMID: 34249087 PMCID: PMC8260860 DOI: 10.3389/fgene.2021.664379] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 05/24/2021] [Indexed: 12/13/2022] Open
Abstract
Genetic variants which affect complex traits (causal variants) are thought to be found in functional regions of the genome. Identifying causal variants would be useful for predicting complex trait phenotypes in dairy cows, however, functional regions are poorly annotated in the bovine genome. Functional regions can be identified on a genome-wide scale by assaying for post-translational modifications to histone proteins (histone modifications) and proteins interacting with the genome (e.g., transcription factors) using a method called Chromatin immunoprecipitation followed by sequencing (ChIP-seq). In this study ChIP-seq was performed to find functional regions in the bovine genome by assaying for four histone modifications (H3K4Me1, H3K4Me3, H3K27ac, and H3K27Me3) and one transcription factor (CTCF) in 6 tissues (heart, kidney, liver, lung, mammary and spleen) from 2 to 3 lactating dairy cows. Eighty-six ChIP-seq samples were generated in this study, identifying millions of functional regions in the bovine genome. Combinations of histone modifications and CTCF were found using ChromHMM and annotated by comparing with active and inactive genes across the genome. Functional marks differed between tissues highlighting areas which might be particularly important to tissue-specific regulation. Supporting the cis-regulatory role of functional regions, the read counts in some ChIP peaks correlated with nearby gene expression. The functional regions identified in this study were enriched for putative causal variants as seen in other species. Interestingly, regions which correlated with gene expression were particularly enriched for potential causal variants. This supports the hypothesis that complex traits are regulated by variants that alter gene expression. This study provides one of the largest ChIP-seq annotation resources in cattle including, for the first time, in the mammary gland of lactating cows. By linking regulatory regions to expression QTL and trait QTL we demonstrate a new strategy for identifying causal variants in cattle.
Collapse
Affiliation(s)
- Claire P Prowse-Wilkins
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Jianghui Wang
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Ruidong Xiang
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Josie B Garner
- Agriculture Victoria, Ellinbank Dairy Centre, Ellinbank, VIC, Australia
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| |
Collapse
|
18
|
Novel functional sequences uncovered through a bovine multiassembly graph. Proc Natl Acad Sci U S A 2021; 118:2101056118. [PMID: 33972446 DOI: 10.1073/pnas.2101056118] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Many genomic analyses start by aligning sequencing reads to a linear reference genome. However, linear reference genomes are imperfect, lacking millions of bases of unknown relevance and are unable to reflect the genetic diversity of populations. This makes reference-guided methods susceptible to reference-allele bias. To overcome such limitations, we build a pangenome from six reference-quality assemblies from taurine and indicine cattle as well as yak. The pangenome contains an additional 70,329,827 bases compared to the Bos taurus reference genome. Our multiassembly approach reveals 30 and 10.1 million bases private to yak and indicine cattle, respectively, and between 3.3 and 4.4 million bases unique to each taurine assembly. Utilizing transcriptomes from 56 cattle, we show that these nonreference sequences encode transcripts that hitherto remained undetected from the B. taurus reference genome. We uncover genes, primarily encoding proteins contributing to immune response and pathogen-mediated immunomodulation, differentially expressed between Mycobacterium bovis-infected and noninfected cattle that are also undetectable in the B. taurus reference genome. Using whole-genome sequencing data of cattle from five breeds, we show that reads which were previously misaligned against the Bos taurus reference genome now align accurately to the pangenome sequences. This enables us to discover 83,250 polymorphic sites that segregate within and between breeds of cattle and capture genetic differentiation across breeds. Our work makes a so-far unused source of variation amenable to genetic investigations and provides methods and a framework for establishing and exploiting a more diverse reference genome.
Collapse
|
19
|
Kern C, Wang Y, Xu X, Pan Z, Halstead M, Chanthavixay G, Saelao P, Waters S, Xiang R, Chamberlain A, Korf I, Delany ME, Cheng HH, Medrano JF, Van Eenennaam AL, Tuggle CK, Ernst C, Flicek P, Quon G, Ross P, Zhou H. Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research. Nat Commun 2021; 12:1821. [PMID: 33758196 PMCID: PMC7988148 DOI: 10.1038/s41467-021-22100-8] [Citation(s) in RCA: 112] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 03/01/2021] [Indexed: 01/31/2023] Open
Abstract
Gene regulatory elements are central drivers of phenotypic variation and thus of critical importance towards understanding the genetics of complex traits. The Functional Annotation of Animal Genomes consortium was formed to collaboratively annotate the functional elements in animal genomes, starting with domesticated animals. Here we present an expansive collection of datasets from eight diverse tissues in three important agricultural species: chicken (Gallus gallus), pig (Sus scrofa), and cattle (Bos taurus). Comparative analysis of these datasets and those from the human and mouse Encyclopedia of DNA Elements projects reveal that a core set of regulatory elements are functionally conserved independent of divergence between species, and that tissue-specific transcription factor occupancy at regulatory elements and their predicted target genes are also conserved. These datasets represent a unique opportunity for the emerging field of comparative epigenomics, as well as the agricultural research community, including species that are globally important food resources.
Collapse
Affiliation(s)
- Colin Kern
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Ying Wang
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Xiaoqin Xu
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Zhangyuan Pan
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Michelle Halstead
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Ganrea Chanthavixay
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Perot Saelao
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Susan Waters
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Ruidong Xiang
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Melbourne, VIC, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Amanda Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Ian Korf
- Genome Center, University of California, Davis, Davis, CA, USA
| | - Mary E Delany
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | - Hans H Cheng
- USDA-ARS, Avian Disease and Oncology Laboratory, East Lansing, MI, USA
| | - Juan F Medrano
- Department of Animal Science, University of California, Davis, Davis, CA, USA
| | | | - Chris K Tuggle
- Department of Animal Science, Iowa State University, Ames, IA, USA
| | - Catherine Ernst
- Department of Animal Science, Michigan State University, East Lansing, MI, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Gerald Quon
- Department of Molecular and Cellular Biology, University of California, David, Davis, CA, USA
| | - Pablo Ross
- Department of Animal Science, University of California, Davis, Davis, CA, USA.
| | - Huaijun Zhou
- Department of Animal Science, University of California, Davis, Davis, CA, USA.
| |
Collapse
|
20
|
Xiang R, MacLeod IM, Daetwyler HD, de Jong G, O’Connor E, Schrooten C, Chamberlain AJ, Goddard ME. Genome-wide fine-mapping identifies pleiotropic and functional variants that predict many traits across global cattle populations. Nat Commun 2021; 12:860. [PMID: 33558518 PMCID: PMC7870883 DOI: 10.1038/s41467-021-21001-0] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 11/23/2020] [Indexed: 02/08/2023] Open
Abstract
The difficulty in finding causative mutations has hampered their use in genomic prediction. Here, we present a methodology to fine-map potentially causal variants genome-wide by integrating the functional, evolutionary and pleiotropic information of variants using GWAS, variant clustering and Bayesian mixture models. Our analysis of 17 million sequence variants in 44,000+ Australian dairy cattle for 34 traits suggests, on average, one pleiotropic QTL existing in each 50 kb chromosome-segment. We selected a set of 80k variants representing potentially causal variants within each chromosome segment to develop a bovine XT-50K genotyping array. The custom array contains many pleiotropic variants with biological functions, including splicing QTLs and variants at conserved sites across 100 vertebrate species. This biology-informed custom array outperformed the standard array in predicting genetic value of multiple traits across populations in independent datasets of 90,000+ dairy cattle from the USA, Australia and New Zealand.
Collapse
Affiliation(s)
- Ruidong Xiang
- grid.1008.90000 0001 2179 088XFaculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC Australia ,grid.452283.a0000 0004 0407 2669Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC Australia
| | - Iona M. MacLeod
- grid.452283.a0000 0004 0407 2669Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC Australia
| | - Hans D. Daetwyler
- grid.452283.a0000 0004 0407 2669Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC Australia ,grid.1018.80000 0001 2342 0938School of Applied Systems Biology, La Trobe University, Bundoora, VIC Australia
| | | | | | | | - Amanda J. Chamberlain
- grid.452283.a0000 0004 0407 2669Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC Australia
| | - Michael E. Goddard
- grid.1008.90000 0001 2179 088XFaculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC Australia ,grid.452283.a0000 0004 0407 2669Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC Australia
| |
Collapse
|
21
|
Yuan Z, Sunduimijid B, Xiang R, Behrendt R, Knight MI, Mason BA, Reich CM, Prowse-Wilkins C, Vander Jagt CJ, Chamberlain AJ, MacLeod IM, Li F, Yue X, Daetwyler HD. Expression quantitative trait loci in sheep liver and muscle contribute to variations in meat traits. Genet Sel Evol 2021; 53:8. [PMID: 33461502 PMCID: PMC7812657 DOI: 10.1186/s12711-021-00602-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 01/08/2021] [Indexed: 11/10/2022] Open
Abstract
Background Variants that regulate transcription, such as expression quantitative trait loci (eQTL), have shown enrichment in genome-wide association studies (GWAS) for mammalian complex traits. However, no study has reported eQTL in sheep, although it is an important agricultural species for which many GWAS of complex meat traits have been conducted. Using RNA sequence data produced from liver and muscle from 149 sheep and imputed whole-genome single nucleotide polymorphisms (SNPs), our aim was to dissect the genetic architecture of the transcriptome by associating sheep genotypes with three major molecular phenotypes including gene expression (geQTL), exon expression (eeQTL) and RNA splicing (sQTL). We also examined these three types of eQTL for their enrichment in GWAS of multi-meat traits and fatty acid profiles. Results Whereas a relatively small number of molecular phenotypes were significantly heritable (h2 > 0, P < 0.05), their mean heritability ranged from 0.67 to 0.73 for liver and from 0.71 to 0.77 for muscle. Association analysis between molecular phenotypes and SNPs within ± 1 Mb identified many significant cis-eQTL (false discovery rate, FDR < 0.01). The median distance between the eQTL and transcription start sites (TSS) ranged from 68 to 153 kb across the three eQTL types. The number of common variants between geQTL, eeQTL and sQTL within each tissue, and the number of common variants between liver and muscle within each eQTL type were all significantly (P < 0.05) larger than expected by chance. The identified eQTL were significantly (P < 0.05) enriched in GWAS hits associated with 56 carcass traits and fatty acid profiles. For example, several geQTL in muscle mapped to the FAM184B gene, hundreds of sQTL in liver and muscle mapped to the CAST gene, and hundreds of sQTL in liver mapped to the C6 gene. These three genes are associated with body composition or fatty acid profiles. Conclusions We detected a large number of significant eQTL and found that the overlap of variants between eQTL types and tissues was prevalent. Many eQTL were also QTL for meat traits. Our study fills a gap in the knowledge on the regulatory variants and their role in complex traits for the sheep model.
Collapse
Affiliation(s)
- Zehu Yuan
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China.,Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Institutes of Agricultural Science and Technology Development (Joint International Research Laboratory of Agriculture & Agri-Product Safety), Yangzhou University, Yangzhou, 225000, People's Republic of China
| | - Bolormaa Sunduimijid
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Ruidong Xiang
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Ralph Behrendt
- Agriculture Victoria, Hamilton Centre, Hamilton, VIC, 3300, Australia
| | - Matthew I Knight
- Agriculture Victoria, Hamilton Centre, Hamilton, VIC, 3300, Australia
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Coralie M Reich
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Claire Prowse-Wilkins
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Fadi Li
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China
| | - Xiangpeng Yue
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China.
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia.
| |
Collapse
|
22
|
Xiang R, Breen EJ, Prowse-Wilkins CP, Chamberlain AJ, Goddard ME. Bayesian genome-wide analysis of cattle traits using variants with functional and evolutionary significance. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Context
Functional genomics studies have highlighted genomic regions with regulatory and evolutionary significance. Such information independent of association analysis may benefit fine-mapping and genomic selection of economically important traits. However, systematic evaluation of the use of functional information in mapping, and genomic selection of cattle traits, is lacking. Also, single-nucleotide polymorphisms (SNPs) from the high-density (HD) panel are known to tag informative variants, but the performance of genomic prediction using HD SNPs together with variants supported by different functional genomics is unknown.
Aims
We selected six sets of functionally important variants and modelled each set together with HD SNPs in Bayesian models to map and predict protein, fat and milk yield as well as mastitis, somatic cell count and temperament of dairy cattle.
Methods
Two models were used, namely (1) BayesR, which includes priors of four distribution of variant effects, and (2) BayesRC, which includes additional priors of different functional classes of variants. Bayesian models were trained in three breeds of 28 000 cows of Holstein, Jersey and Australian Red and predicted into 2600 independent bulls.
Key results
Adding functionally important variants significantly increased the enrichment of genetic variance explained for mapped variants, suggesting improved genome-wide mapping precision. Such improvement was significantly higher when the same set of variants was modelled by BayesRC than by BayesR. Combining functional variant sets with HD SNPs improves genomic prediction accuracy in the majority of the cases and such improvement was more common and stronger for non-Holstein breeds and traits such as mastitis, somatic cell count and temperament. In contrast, adding a large number of random sequence variants to HD SNPs reduces mapping precision and has a worse or similar prediction accuracy, compared with using HD SNPs alone to map or predict. While BayesRC tended to have better genomic prediction accuracy than did BayesR, the overall difference in prediction accuracy between the two models was insignificant.
Conclusions
Our findings demonstrated the usefulness of functional data in genomic mapping and prediction.
Implications
We have highlighted the need for effective tools exploiting complex functional datasets to improve genomic prediction.
Collapse
|
23
|
Dorji J, MacLeod IM, Chamberlain AJ, Vander Jagt CJ, Ho PN, Khansefid M, Mason BA, Prowse-Wilkins CP, Marett LC, Wales WJ, Cocks BG, Pryce JE, Daetwyler HD. Mitochondrial protein gene expression and the oxidative phosphorylation pathway associated with feed efficiency and energy balance in dairy cattle. J Dairy Sci 2020; 104:575-587. [PMID: 33162069 DOI: 10.3168/jds.2020-18503] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 08/20/2020] [Indexed: 12/12/2022]
Abstract
Feed efficiency and energy balance are important traits underpinning profitability and environmental sustainability in animal production. They are complex traits, and our understanding of their underlying biology is currently limited. One measure of feed efficiency is residual feed intake (RFI), which is the difference between actual and predicted intake. Variation in RFI among individuals is attributable to the metabolic efficiency of energy utilization. High RFI (H_RFI) animals require more energy per unit of weight gain or milk produced compared with low RFI (L_RFI) animals. Energy balance (EB) is a closely related trait calculated very similarly to RFI. Cellular energy metabolism in mitochondria involves mitochondrial protein (MiP) encoded by both nuclear (NuMiP) and mitochondrial (MtMiP) genomes. We hypothesized that MiP genes are differentially expressed (DE) between H_RFI and L_RFI animal groups and similarly between negative and positive EB groups. Our study aimed to characterize MiP gene expression in white blood cells of H_RFI and L_RFI cows using RNA sequencing to identify genes and biological pathways associated with feed efficiency in dairy cattle. We used the top and bottom 14 cows ranked for RFI and EB out of 109 animals as H_RFI and L_RFI, and positive and negative EB groups, respectively. The gene expression counts across all nuclear and mitochondrial genes for animals in each group were used for differential gene expression analyses, weighted gene correlation network analysis, functional enrichment, and identification of hub genes. Out of 244 DE genes between RFI groups, 38 were MiP genes. The DE genes were enriched for the oxidative phosphorylation (OXPHOS) and ribosome pathways. The DE MiP genes were underexpressed in L_RFI (and negative EB) compared with the H_RFI (and positive EB) groups, suggestive of reduced mitochondrial activity in the L_RFI group. None of the MtMiP genes were among the DE MiP genes between the groups, which suggests a non-rate limiting role of MtMiP genes in feed efficiency and warrants further investigation. The role of MiP, particularly the NuMiP and OXPHOS pathways in RFI, was also supported by our gene correlation network analysis and the hub gene identification. We validated the findings in an independent data set. Overall, our study suggested that differences in feed efficiency in dairy cows may be linked to differences in cellular energy demand. This study broadens our knowledge of the biology of feed efficiency in dairy cattle.
Collapse
Affiliation(s)
- Jigme Dorji
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia, 3083; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083.
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Phuong N Ho
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Majid Khansefid
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Claire P Prowse-Wilkins
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083; Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Victoria, Australia, 3010
| | - Leah C Marett
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Victoria, Australia, 3010; Agriculture Victoria, Ellinbank Dairy Centre, Ellinbank, Victoria, Australia, 3821
| | - William J Wales
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Victoria, Australia, 3010; Agriculture Victoria, Ellinbank Dairy Centre, Ellinbank, Victoria, Australia, 3821
| | - Benjamin G Cocks
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia, 3083; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Jennie E Pryce
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia, 3083; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| | - Hans D Daetwyler
- School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, Australia, 3083; Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, Victoria, Australia, 3083
| |
Collapse
|
24
|
van den Berg I, Xiang R, Jenko J, Pausch H, Boussaha M, Schrooten C, Tribout T, Gjuvsland AB, Boichard D, Nordbø Ø, Sanchez MP, Goddard ME. Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds. Genet Sel Evol 2020; 52:37. [PMID: 32635893 PMCID: PMC7339598 DOI: 10.1186/s12711-020-00556-4] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 06/26/2020] [Indexed: 12/14/2022] Open
Abstract
Background Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. Results To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10−8) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. Conclusions Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions.
Collapse
Affiliation(s)
- Irene van den Berg
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.
| | - Ruidong Xiang
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Janez Jenko
- GENO SA, Storhamargata 44, 2317, Hamar, Norway
| | | | - Mekki Boussaha
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Didier Boichard
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Marie-Pierre Sanchez
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Mike E Goddard
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
25
|
Xiang R, van den Berg I, MacLeod IM, Daetwyler HD, Goddard ME. Effect direction meta-analysis of GWAS identifies extreme, prevalent and shared pleiotropy in a large mammal. Commun Biol 2020; 3:88. [PMID: 32111961 PMCID: PMC7048789 DOI: 10.1038/s42003-020-0823-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 02/06/2020] [Indexed: 12/17/2022] Open
Abstract
In genome-wide association studies (GWAS), variants showing consistent effect directions across populations are considered as true discoveries. We model this information in an Effect Direction MEta-analysis (EDME) to quantify pleiotropy using GWAS of 34 Cholesky-decorrelated traits in 44,000+ cattle with sequence variants. The effect-direction agreement between independent bull and cow datasets was used to quantify the false discovery rate by effect direction (FDRed) and the number of affected traits for prioritised variants. Variants with multi-trait p < 1e–6 affected 1∼22 traits with an average of 10 traits. EDME assigns pleiotropic variants to each trait which informs the biology behind complex traits. New pleiotropic loci are identified, including signals from the cattle FTO locus mirroring its bystander effects on human obesity. When validated in the 1000-Bull Genome database, the prioritized pleiotropic variants consistently predicted expected phenotypic differences between dairy and beef cattle. EDME provides robust approaches to control GWAS FDR and quantify pleiotropy. Xiang et al. developed an Effect Direction Meta-analysis (EDME) approach to identify true pleiotropy. They used Cholesky-transformation to decorrelate the traits and identified many pleiotropic variants that consistently predicted phenotypic differences in cattle.
Collapse
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, 3052, Victoria, Australia. .,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia.
| | - Irene van den Berg
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, 3052, Victoria, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3083, Australia
| | - Michael E Goddard
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, 3052, Victoria, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia
| |
Collapse
|
26
|
Hayes BJ, Daetwyler HD. 1000 Bull Genomes Project to Map Simple and Complex Genetic Traits in Cattle: Applications and Outcomes. Annu Rev Anim Biosci 2019; 7:89-102. [PMID: 30508490 DOI: 10.1146/annurev-animal-020518-115024] [Citation(s) in RCA: 208] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The 1000 Bull Genomes Project is a collection of whole-genome sequences from 2,703 individuals capturing a significant proportion of the world's cattle diversity. So far, 84 million single-nucleotide polymorphisms (SNPs) and 2.5 million small insertion deletions have been identified in the collection, a very high level of genetic diversity. The project has greatly accelerated the identification of deleterious mutations for a range of genetic diseases, as well as for embryonic lethals. The rate of identification of causal mutations for complex traits has been slower, reflecting the typically small effect size of these mutations and the fact that many are likely in as-yet-unannotated regulatory regions. Both the deleterious mutations that have been identified and the mutations associated with complex trait variation have been included in low-cost SNP array designs, and these arrays are being genotyped in tens of thousands of dairy and beef cattle, enabling management of deleterious mutations in these populations as well as genomic selection.
Collapse
Affiliation(s)
- Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland 4067, Australia; .,Agriculture Victoria Research, AgriBio, Bundoora, Victoria 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria Research, AgriBio, Bundoora, Victoria 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria 3083, Australia
| |
Collapse
|
27
|
Liu S, Fang L, Zhou Y, Santos DJA, Xiang R, Daetwyler HD, Chamberlain AJ, Cole JB, Li CJ, Yu Y, Ma L, Zhang S, Liu GE. Analyses of inter-individual variations of sperm DNA methylation and their potential implications in cattle. BMC Genomics 2019; 20:888. [PMID: 31752687 PMCID: PMC6873545 DOI: 10.1186/s12864-019-6228-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 10/28/2019] [Indexed: 12/18/2022] Open
Abstract
Background DNA methylation has been shown to be involved in many biological processes, including X chromosome inactivation in females, paternal genomic imprinting, and others. Results Based on the correlation patterns of methylation levels of neighboring CpG sites among 28 sperm whole genome bisulfite sequencing (WGBS) data (486 × coverage), we obtained 31,272 methylation haplotype blocks (MHBs). Among them, we defined conserved methylated regions (CMRs), variably methylated regions (VMRs) and highly variably methylated regions (HVMRs) among individuals, and showed that HVMRs might play roles in transcriptional regulation and function in complex traits variation and adaptive evolution by integrating evidence from traditional and molecular quantitative trait loci (QTL), and selection signatures. Using a weighted correlation network analysis (WGCNA), we also detected a co-regulated module of HVMRs that was significantly associated with reproduction traits, and enriched for glycosyltransferase genes, which play critical roles in spermatogenesis and fertilization. Additionally, we identified 46 VMRs significantly associated with reproduction traits, nine of which were regulated by cis-SNPs, implying the possible intrinsic relationships among genomic variations, DNA methylation, and phenotypes. These significant VMRs were co-localized (± 10 kb) with genes related to sperm motility and reproduction, including ZFP36L1, CRISP2 and HGF. We provided further evidence that rs109326022 within a predominant QTL on BTA18 might influence the reproduction traits through regulating the methylation level of nearby genes JOSD2 and ASPDH in sperm. Conclusion In summary, our results demonstrated associations of sperm DNA methylation with reproduction traits, highlighting the potential of epigenomic information in genomic improvement programs for cattle.
Collapse
Affiliation(s)
- Shuli Liu
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.,USDA-ARS, Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705, USA
| | - Lingzhao Fang
- USDA-ARS, Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705, USA.,Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA.,Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Yang Zhou
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Education Ministry of China, Huazhong Agricultural University, Wuhan, 430070, Hubei, China
| | - Daniel J A Santos
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, Victoria, 3052, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3083, Australia
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, 3083, Australia
| | - John B Cole
- USDA-ARS, Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705, USA
| | - Cong-Jun Li
- USDA-ARS, Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705, USA
| | - Ying Yu
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - George E Liu
- USDA-ARS, Animal Genomics and Improvement Laboratory, Beltsville, MD, 20705, USA.
| |
Collapse
|
28
|
Xiang R, Berg IVD, MacLeod IM, Hayes BJ, Prowse-Wilkins CP, Wang M, Bolormaa S, Liu Z, Rochfort SJ, Reich CM, Mason BA, Vander Jagt CJ, Daetwyler HD, Lund MS, Chamberlain AJ, Goddard ME. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc Natl Acad Sci U S A 2019; 116:19398-19408. [PMID: 31501319 PMCID: PMC6765237 DOI: 10.1073/pnas.1904159116] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Many genome variants shaping mammalian phenotype are hypothesized to regulate gene transcription and/or to be under selection. However, most of the evidence to support this hypothesis comes from human studies. Systematic evidence for regulatory and evolutionary signals contributing to complex traits in a different mammalian model is needed. Sequence variants associated with gene expression (expression quantitative trait loci [eQTLs]) and concentration of metabolites (metabolic quantitative trait loci [mQTLs]) and under histone-modification marks in several tissues were discovered from multiomics data of over 400 cattle. Variants under selection and evolutionary constraint were identified using genome databases of multiple species. These analyses defined 30 sets of variants, and for each set, we estimated the genetic variance the set explained across 34 complex traits in 11,923 bulls and 32,347 cows with 17,669,372 imputed variants. The per-variant trait heritability of these sets across traits was highly consistent (r > 0.94) between bulls and cows. Based on the per-variant heritability, conserved sites across 100 vertebrate species and mQTLs ranked the highest, followed by eQTLs, young variants, those under histone-modification marks, and selection signatures. From these results, we defined a Functional-And-Evolutionary Trait Heritability (FAETH) score indicating the functionality and predicted heritability of each variant. In additional 7,551 cattle, the high FAETH-ranking variants had significantly increased genetic variances and genomic prediction accuracies in 3 production traits compared to the low FAETH-ranking variants. The FAETH framework combines the information of gene regulation, evolution, and trait heritability to rank variants, and the publicly available FAETH data provide a set of biological priors for cattle genomic selection worldwide.
Collapse
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia;
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Irene van den Berg
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Benjamin J Hayes
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- Centre for Animal Science, The University of Queensland, St. Lucia, QLD 4067, Australia
| | - Claire P Prowse-Wilkins
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Min Wang
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Sunduimijid Bolormaa
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Zhiqian Liu
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Simone J Rochfort
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Coralie M Reich
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Michael E Goddard
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| |
Collapse
|
29
|
Salavati M, Bush SJ, Palma-Vera S, McCulloch MEB, Hume DA, Clark EL. Elimination of Reference Mapping Bias Reveals Robust Immune Related Allele-Specific Expression in Crossbred Sheep. Front Genet 2019; 10:863. [PMID: 31608110 PMCID: PMC6761296 DOI: 10.3389/fgene.2019.00863] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Accepted: 08/19/2019] [Indexed: 12/13/2022] Open
Abstract
Pervasive allelic variation at both gene and single nucleotide level (SNV) between individuals is commonly associated with complex traits in humans and animals. Allele-specific expression (ASE) analysis, using RNA-Seq, can provide a detailed annotation of allelic imbalance and infer the existence of cis-acting transcriptional regulation. However, variant detection in RNA-Seq data is compromised by biased mapping of reads to the reference DNA sequence. In this manuscript, we describe an unbiased standardized computational pipeline for allele-specific expression analysis using RNA-Seq data, which we have adapted and developed using tools available under open license. The analysis pipeline we present is designed to minimize reference bias while providing accurate profiling of allele-specific expression across tissues and cell types. Using this methodology, we were able to profile pervasive allelic imbalance across tissues and cell types, at both the gene and SNV level, in Texel×Scottish Blackface sheep, using the sheep gene expression atlas data set. ASE profiles were pervasive in each sheep and across all tissue types investigated. However, ASE profiles shared across tissues were limited, and instead, they tended to be highly tissue-specific. These tissue-specific ASE profiles may underlie the expression of economically important traits and could be utilized as weighted SNVs, for example, to improve the accuracy of genomic selection in breeding programs for sheep. An additional benefit of the pipeline is that it does not require parental genotypes and can therefore be applied to other RNA-Seq data sets for livestock, including those available on the Functional Annotation of Animal Genomes (FAANG) data portal. This study is the first global characterization of moderate to extreme ASE in tissues and cell types from sheep. We have applied a robust methodology for ASE profiling to provide both a novel analysis of the multi-dimensional sheep gene expression atlas data set and a foundation for identifying the regulatory and expressed elements of the genome that are driving complex traits in livestock.
Collapse
Affiliation(s)
- Mazdak Salavati
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, United Kingdom
| | - Stephen J. Bush
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, United Kingdom
| | - Sergio Palma-Vera
- Leibniz Institute for Farm Animal Biology (FBN), Institute for Reproductive Biology, Dummerstorf, Germany
| | - Mary E. B. McCulloch
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, United Kingdom
| | - David A. Hume
- Mater Research Institute-University of Queensland, Translational Research Institute, Woolloongabba, QLD, Australia
| | - Emily L. Clark
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, United Kingdom
| |
Collapse
|