1
|
Veltsos P, Kelly JK. The quantitative genetics of gene expression in Mimulus guttatus. PLoS Genet 2024; 20:e1011072. [PMID: 38603726 PMCID: PMC11060551 DOI: 10.1371/journal.pgen.1011072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 04/30/2024] [Accepted: 03/23/2024] [Indexed: 04/13/2024] Open
Abstract
Gene expression can be influenced by genetic variants that are closely linked to the expressed gene (cis eQTLs) and variants in other parts of the genome (trans eQTLs). We created a multiparental mapping population by sampling genotypes from a single natural population of Mimulus guttatus and scored gene expression in the leaves of 1,588 plants. We find that nearly every measured gene exhibits cis regulatory variation (91% have FDR < 0.05). cis eQTLs are usually allelic series with three or more functionally distinct alleles. The cis locus explains about two thirds of the standing genetic variance (on average) but varies among genes and tends to be greatest when there is high indel variation in the upstream regulatory region and high nucleotide diversity in the coding sequence. Despite mapping over 10,000 trans eQTL / affected gene pairs, most of the genetic variance generated by trans acting loci remains unexplained. This implies a large reservoir of trans acting genes with subtle or diffuse effects. Mapped trans eQTLs show lower allelic diversity but much higher genetic dominance than cis eQTLs. Several analyses also indicate that trans eQTLs make a substantial contribution to the genetic correlations in expression among different genes. They may thus be essential determinants of "gene expression modules," which has important implications for the evolution of gene expression and how it is studied by geneticists.
Collapse
Affiliation(s)
- Paris Veltsos
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, United States of America
| | - John K. Kelly
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, United States of America
| |
Collapse
|
2
|
Everman ER, Macdonald SJ. Gene expression variation underlying tissue-specific responses to copper stress in Drosophila melanogaster. G3 (BETHESDA, MD.) 2024; 14:jkae015. [PMID: 38262701 PMCID: PMC11021028 DOI: 10.1093/g3journal/jkae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/04/2024] [Accepted: 01/08/2024] [Indexed: 01/25/2024]
Abstract
Copper is one of a handful of biologically necessary heavy metals that is also a common environmental pollutant. Under normal conditions, copper ions are required for many key physiological processes. However, in excess, copper results in cell and tissue damage ranging in severity from temporary injury to permanent neurological damage. Because of its biological relevance, and because many conserved copper-responsive genes respond to nonessential heavy metal pollutants, copper resistance in Drosophila melanogaster is a useful model system with which to investigate the genetic control of the heavy metal stress response. Because heavy metal toxicity has the potential to differently impact specific tissues, we genetically characterized the control of the gene expression response to copper stress in a tissue-specific manner in this study. We assessed the copper stress response in head and gut tissue of 96 inbred strains from the Drosophila Synthetic Population Resource using a combination of differential expression analysis and expression quantitative trait locus mapping. Differential expression analysis revealed clear patterns of tissue-specific expression. Tissue and treatment specific responses to copper stress were also detected using expression quantitative trait locus mapping. Expression quantitative trait locus associated with MtnA, Mdr49, Mdr50, and Sod3 exhibited both genotype-by-tissue and genotype-by-treatment effects on gene expression under copper stress, illuminating tissue- and treatment-specific patterns of gene expression control. Together, our data build a nuanced description of the roles and interactions between allelic and expression variation in copper-responsive genes, provide valuable insight into the genomic architecture of susceptibility to metal toxicity, and highlight candidate genes for future functional characterization.
Collapse
Affiliation(s)
- Elizabeth R Everman
- School of Biological Sciences, The University of Oklahoma, 730 Van Vleet Oval, Norman, OK 73019, USA
| | - Stuart J Macdonald
- Molecular Biosciences, University of Kansas, 1200 Sunnyside Ave, Lawrence, KS 66045, USA
| |
Collapse
|
3
|
Veltsos P, Kelly JK. The quantitative genetics of gene expression in Mimulus guttatus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.21.568003. [PMID: 38045261 PMCID: PMC10690227 DOI: 10.1101/2023.11.21.568003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Gene expression can be influenced by genetic variants that are closely linked to the expressed gene (cis eQTLs) and variants in other parts of the genome (trans eQTLs). We created a multiparental mapping population by sampling genotypes from a single natural population of Mimulus guttatus and scored gene expression in the leaves of 1,588 plants. We find that nearly every measured gene exhibits cis regulatory variation (91% have FDR < 0.05) and that cis eQTLs are usually allelic series with three or more functionally distinct alleles. The cis locus explains about two thirds of the standing genetic variance (on average) but varies among genes and tends to be greatest when there is high indel variation in the upstream regulatory region and high nucleotide diversity in the coding sequence. Despite mapping over 10,000 trans eQTL / affected gene pairs, most of the genetic variance generated by trans acting loci remains unexplained. This implies a large reservoir of trans acting genes with subtle or diffuse effects. Mapped trans eQTLs show lower allelic diversity but much higher genetic dominance than cis eQTLs. Several analyses also indicate that trans eQTL make a substantial contribution to the genetic correlations in expression among different genes. They may thus be essential determinants of "gene expression modules", which has important implications for the evolution of gene expression and also how it is studied by geneticists.
Collapse
Affiliation(s)
- Paris Veltsos
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA
- Current address: Ecology, Evolution and Genetics Research Group, Biology Department, Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussels, Belgium
| | - John K. Kelly
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA
| |
Collapse
|
4
|
Everman ER, Macdonald SJ. Gene expression variation underlying tissue-specific responses to copper stress in Drosophila melanogaster. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.12.548746. [PMID: 37503205 PMCID: PMC10370140 DOI: 10.1101/2023.07.12.548746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Copper is one of a handful of biologically necessary heavy metals that is also a common environmental pollutant. Under normal conditions, copper ions are required for many key physiological processes. However, in excess, copper quickly results in cell and tissue damage that can range in severity from temporary injury to permanent neurological damage. Because of its biological relevance, and because many conserved copper-responsive genes also respond to other non-essential heavy metal pollutants, copper resistance in Drosophila melanogaster is a useful model system with which to investigate the genetic control of the response to heavy metal stress. Because heavy metal toxicity has the potential to differently impact specific tissues, we genetically characterized the control of the gene expression response to copper stress in a tissue-specific manner in this study. We assessed the copper stress response in head and gut tissue of 96 inbred strains from the Drosophila Synthetic Population Resource (DSPR) using a combination of differential expression analysis and expression quantitative trait locus (eQTL) mapping. Differential expression analysis revealed clear patterns of tissue-specific expression, primarily driven by a more pronounced gene expression response in gut tissue. eQTL mapping of gene expression under control and copper conditions as well as for the change in gene expression following copper exposure (copper response eQTL) revealed hundreds of genes with tissue-specific local cis-eQTL and many distant trans-eQTL. eQTL associated with MtnA, Mdr49, Mdr50, and Sod3 exhibited genotype by environment effects on gene expression under copper stress, illuminating several tissue- and treatment-specific patterns of gene expression control. Together, our data build a nuanced description of the roles and interactions between allelic and expression variation in copper-responsive genes, provide valuable insight into the genomic architecture of susceptibility to metal toxicity, and highlight many candidate genes for future functional characterization.
Collapse
Affiliation(s)
- Elizabeth R Everman
- 1200 Sunnyside Ave, University of Kansas, Molecular Biosciences, Lawrence, KS 66045, USA
- 730 Van Vleet Oval, University of Oklahoma, Biology, Norman, OK 73019, USA
| | - Stuart J Macdonald
- 1200 Sunnyside Ave, University of Kansas, Molecular Biosciences, Lawrence, KS 66045, USA
- 1200 Sunnyside Ave, University of Kansas, Center for Computational Biology, Lawrence, KS 66045, USA
| |
Collapse
|
5
|
Zhang T, Keele GR, Gyuricza IG, Vincent M, Brunton C, Bell TA, Hock P, Shaw GD, Munger SC, de Villena FPM, Ferris MT, Paulo JA, Gygi SP, Churchill GA. Multi-omics analysis identifies drivers of protein phosphorylation. Genome Biol 2023; 24:52. [PMID: 36944993 PMCID: PMC10031968 DOI: 10.1186/s13059-023-02892-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 03/09/2023] [Indexed: 03/23/2023] Open
Abstract
BACKGROUND Phosphorylation of proteins is a key step in the regulation of many cellular processes including activation of enzymes and signaling cascades. The abundance of a phosphorylated peptide (phosphopeptide) is determined by the abundance of its parent protein and the proportion of target sites that are phosphorylated. RESULTS We quantified phosphopeptides, proteins, and transcripts in heart, liver, and kidney tissue samples of mice from 58 strains of the Collaborative Cross strain panel. We mapped ~700 phosphorylation quantitative trait loci (phQTL) across the three tissues and applied genetic mediation analysis to identify causal drivers of phosphorylation. We identified kinases, phosphatases, cytokines, and other factors, including both known and potentially novel interactions between target proteins and genes that regulate site-specific phosphorylation. Our analysis highlights multiple targets of pyruvate dehydrogenase kinase 1 (PDK1), a regulator of mitochondrial function that shows reduced activity in the NZO/HILtJ mouse, a polygenic model of obesity and type 2 diabetes. CONCLUSIONS Together, this integrative multi-omics analysis in genetically diverse CC strains provides a powerful tool to identify regulators of protein phosphorylation. The data generated in this study provides a resource for further exploration.
Collapse
Affiliation(s)
- Tian Zhang
- Harvard Medical School, Boston, MA, 02115, USA
| | | | | | | | | | - Timothy A Bell
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Pablo Hock
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Ginger D Shaw
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | | | - Fernando Pardo-Manuel de Villena
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Martin T Ferris
- Department of Genetics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | | | | | | |
Collapse
|
6
|
Lama J, Srivastav S, Tasnim S, Hubbard D, Hadjipanteli S, Smith BR, Macdonald SJ, Green L, Kelleher ES. Genetic variation in P-element dysgenic sterility is associated with double-strand break repair and alternative splicing of TE transcripts. PLoS Genet 2022; 18:e1010080. [PMID: 36477699 PMCID: PMC9762592 DOI: 10.1371/journal.pgen.1010080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 12/19/2022] [Accepted: 11/02/2022] [Indexed: 12/12/2022] Open
Abstract
The germline mobilization of transposable elements (TEs) by small RNA mediated silencing pathways is conserved across eukaryotes and critical for ensuring the integrity of gamete genomes. However, genomes are recurrently invaded by novel TEs through horizontal transfer. These invading TEs are not targeted by host small RNAs, and their unregulated activity can cause DNA damage in germline cells and ultimately lead to sterility. Here we use hybrid dysgenesis-a sterility syndrome of Drosophila caused by transposition of invading P-element DNA transposons-to uncover host genetic variants that modulate dysgenic sterility. Using a panel of highly recombinant inbred lines of Drosophila melanogaster, we identified two linked quantitative trait loci (QTL) that determine the severity of dysgenic sterility in young and old females, respectively. We show that ovaries of fertile genotypes exhibit increased expression of splicing factors that suppress the production of transposase encoding transcripts, which likely reduces the transposition rate and associated DNA damage. We also show that fertile alleles are associated with decreased sensitivity to double-stranded breaks and enhanced DNA repair, explaining their ability to withstand high germline transposition rates. Together, our work reveals a diversity of mechanisms whereby host genotype modulates the cost of an invading TE, and points to genetic variants that were likely beneficial during the P-element invasion.
Collapse
Affiliation(s)
- Jyoti Lama
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Satyam Srivastav
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Sadia Tasnim
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Donald Hubbard
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Savana Hadjipanteli
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Brittny R. Smith
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Stuart J. Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Llewellyn Green
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Erin S. Kelleher
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| |
Collapse
|
7
|
Macdonald SJ, Long AD. Discovery of malathion resistance QTL in Drosophila melanogaster using a bulked phenotyping approach. G3 (BETHESDA, MD.) 2022; 12:jkac279. [PMID: 36250804 PMCID: PMC9713458 DOI: 10.1093/g3journal/jkac279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 10/09/2022] [Indexed: 12/03/2022]
Abstract
Drosophila melanogaster has proved an effective system with which to understand the evolutionary genetics and molecular mechanisms of insecticide resistance. Insecticide use has left signatures of selection in the fly genome, and both functional and quantitative genetic studies in the system have identified genes and variants associated with resistance. Here, we use D. melanogaster and leverage a bulk phenotyping and pooled sequencing "extreme quantitative trait loci" approach to genetically dissect variation in resistance to malathion, an organophosphate insecticide. We resolve 2 quantitative trait loci, one of which implicates allelic variation at the cytochrome P450 gene Cyp6g1, a strong candidate based on previous work. The second shows no overlap with hits from a previous genome-wide association study for malathion resistance, recapitulating other studies showing that different strategies for complex trait dissection in flies can yield apparently different architectures. Notably, we see no genetic signal at the Ace gene. Ace encodes the target of organophosphate insecticide inhibition, and genome-wide association studies have identified strong Ace-linked associations with resistance in flies. The absence of quantitative trait locus implicating Ace here is most likely because our mapping population does not segregate for several of the known functional polymorphisms impacting resistance at Ace, perhaps because our population is derived from flies collected prior to the widespread use of organophosphate insecticides. Our fundamental approach can be an efficient, powerful strategy to dissect genetic variation in resistance traits. Nonetheless, studies seeking to interrogate contemporary insecticide resistance variation may benefit from deriving mapping populations from more recently collected strains.
Collapse
Affiliation(s)
- Stuart J Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66046, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS 66047, USA
| | - Anthony D Long
- Department of Ecology and Evolutionary Biology, University of California at Irvine, Irvine, CA 92697, USA
| |
Collapse
|
8
|
Macdonald SJ, Cloud-Richardson KM, Sims-West DJ, Long AD. Powerful, efficient QTL mapping in Drosophila melanogaster using bulked phenotyping and pooled sequencing. Genetics 2022; 220:iyab238. [PMID: 35100395 PMCID: PMC8893256 DOI: 10.1093/genetics/iyab238] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 12/19/2021] [Indexed: 01/22/2024] Open
Abstract
Despite the value of recombinant inbred lines for the dissection of complex traits, large panels can be difficult to maintain, distribute, and phenotype. An attractive alternative to recombinant inbred lines for many traits leverages selecting phenotypically extreme individuals from a segregating population, and subjecting pools of selected and control individuals to sequencing. Under a bulked or extreme segregant analysis paradigm, genomic regions contributing to trait variation are revealed as frequency differences between pools. Here, we describe such an extreme quantitative trait locus, or extreme quantitative trait loci, mapping strategy that builds on an existing multiparental population, the Drosophila Synthetic Population Resource, and involves phenotyping and genotyping a population derived by mixing hundreds of Drosophila Synthetic Population Resource recombinant inbred lines. Simulations demonstrate that challenging, yet experimentally tractable extreme quantitative trait loci designs (≥4 replicates, ≥5,000 individuals/replicate, and selecting the 5-10% most extreme animals) yield at least the same power as traditional recombinant inbred line-based quantitative trait loci mapping and can localize variants with sub-centimorgan resolution. We empirically demonstrate the effectiveness of the approach using a 4-fold replicated extreme quantitative trait loci experiment that identifies 7 quantitative trait loci for caffeine resistance. Two mapped extreme quantitative trait loci factors replicate loci previously identified in recombinant inbred lines, 6/7 are associated with excellent candidate genes, and RNAi knock-downs support the involvement of 4 genes in the genetic control of trait variation. For many traits of interest to drosophilists, a bulked phenotyping/genotyping extreme quantitative trait loci design has considerable advantages.
Collapse
Affiliation(s)
- Stuart J Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS 66047, USA
| | | | - Dylan J Sims-West
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045, USA
| | - Anthony D Long
- Department of Ecology and Evolutionary Biology, University of California at Irvine, Irvine, CA 92697, USA
| |
Collapse
|
9
|
Li B, Ritchie MD. From GWAS to Gene: Transcriptome-Wide Association Studies and Other Methods to Functionally Understand GWAS Discoveries. Front Genet 2021; 12:713230. [PMID: 34659337 PMCID: PMC8515949 DOI: 10.3389/fgene.2021.713230] [Citation(s) in RCA: 67] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2021] [Accepted: 07/27/2021] [Indexed: 12/12/2022] Open
Abstract
Since their inception, genome-wide association studies (GWAS) have identified more than a hundred thousand single nucleotide polymorphism (SNP) loci that are associated with various complex human diseases or traits. The majority of GWAS discoveries are located in non-coding regions of the human genome and have unknown functions. The valley between non-coding GWAS discoveries and downstream affected genes hinders the investigation of complex disease mechanism and the utilization of human genetics for the improvement of clinical care. Meanwhile, advances in high-throughput sequencing technologies reveal important genomic regulatory roles that non-coding regions play in the transcriptional activities of genes. In this review, we focus on data integrative bioinformatics methods that combine GWAS with functional genomics knowledge to identify genetically regulated genes. We categorize and describe two types of data integrative methods. First, we describe fine-mapping methods. Fine-mapping is an exploratory approach that calibrates likely causal variants underneath GWAS signals. Fine-mapping methods connect GWAS signals to potentially causal genes through statistical methods and/or functional annotations. Second, we discuss gene-prioritization methods. These are hypothesis generating approaches that evaluate whether genetic variants regulate genes via certain genetic regulatory mechanisms to influence complex traits, including colocalization, mendelian randomization, and the transcriptome-wide association study (TWAS). TWAS is a gene-based association approach that investigates associations between genetically regulated gene expression and complex diseases or traits. TWAS has gained popularity over the years due to its ability to reduce multiple testing burden in comparison to other variant-based analytic approaches. Multiple types of TWAS methods have been developed with varied methodological designs and biological hypotheses over the past 5 years. We dive into discussions of how TWAS methods differ in many aspects and the challenges that different TWAS methods face. Overall, TWAS is a powerful tool for identifying complex trait-associated genes. With the advent of single-cell sequencing, chromosome conformation capture, gene editing technologies, and multiplexing reporter assays, we are expecting a more comprehensive understanding of genomic regulation and genetically regulated genes underlying complex human diseases and traits in the future.
Collapse
Affiliation(s)
- Binglan Li
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, United States.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
10
|
Benowitz KM, Coleman JM, Allan CW, Matzkin LM. Contributions of cis- and trans-Regulatory Evolution to Transcriptomic Divergence across Populations in the Drosophila mojavensis Larval Brain. Genome Biol Evol 2021; 12:1407-1418. [PMID: 32653899 PMCID: PMC7495911 DOI: 10.1093/gbe/evaa145] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/06/2020] [Indexed: 12/22/2022] Open
Abstract
Natural selection on gene expression was originally predicted to result primarily in cis- rather than trans-regulatory evolution, due to the expectation of reduced pleiotropy. Despite this, numerous studies have ascribed recent evolutionary divergence in gene expression predominantly to trans-regulation. Performing RNA-seq on single isofemale lines from genetically distinct populations of the cactophilic fly Drosophila mojavensis and their F1 hybrids, we recapitulated this pattern in both larval brains and whole bodies. However, we demonstrate that improving the measurement of brain expression divergence between populations by using seven additional genotypes considerably reduces the estimate of trans-regulatory contributions to expression evolution. We argue that the finding of trans-regulatory predominance can result from biases due to environmental variation in expression or other sources of noise, and that cis-regulation is likely a greater contributor to transcriptional evolution across D. mojavensis populations. Lastly, we merge these lines of data to identify several previously hypothesized and intriguing novel candidate genes, and suggest that the integration of regulatory and population-level transcriptomic data can provide useful filters for the identification of potentially adaptive genes.
Collapse
Affiliation(s)
| | - Joshua M Coleman
- Department of Entomology, University of Arizona.,Department of Biological Sciences, University of Alabama in Huntsville
| | | | - Luciano M Matzkin
- Department of Entomology, University of Arizona.,Department of Ecology and Evolutionary Biology, University of Arizona.,BIO5 Institute, University of Arizona
| |
Collapse
|
11
|
Crouse WL, Kelada SNP, Valdar W. Inferring the Allelic Series at QTL in Multiparental Populations. Genetics 2020; 216:957-983. [PMID: 33082282 PMCID: PMC7768242 DOI: 10.1534/genetics.120.303393] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 10/12/2020] [Indexed: 12/25/2022] Open
Abstract
Multiparental populations (MPPs) are experimental populations in which the genome of every individual is a mosaic of known founder haplotypes. These populations are useful for detecting quantitative trait loci (QTL) because tests of association can leverage inferred founder haplotype descent. It is difficult, however, to determine how haplotypes at a locus group into distinct functional alleles, termed the allelic series. The allelic series is important because it provides information about the number of causal variants at a QTL and their combined effects. In this study, we introduce a fully Bayesian model selection framework for inferring the allelic series. This framework accounts for sources of uncertainty found in typical MPPs, including the number and composition of functional alleles. Our prior distribution for the allelic series is based on the Chinese restaurant process, a relative of the Dirichlet process, and we leverage its connection to the coalescent to introduce additional prior information about haplotype relatedness via a phylogenetic tree. We evaluate our approach via simulation and apply it to QTL from two MPPs: the Collaborative Cross (CC) and the Drosophila Synthetic Population Resource (DSPR). We find that, although posterior inference of the exact allelic series is often uncertain, we are able to distinguish biallelic QTL from more complex multiallelic cases. Additionally, our allele-based approach improves haplotype effect estimation when the true number of functional alleles is small. Our method, Tree-Based Inference of Multiallelism via Bayesian Regression (TIMBR), provides new insight into the genetic architecture of QTL in MPPs.
Collapse
Affiliation(s)
- Wesley L Crouse
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Samir N P Kelada
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Marsico Lung Institute, University of North Carolina, Chapel Hill, North Carolina 27599
| | - William Valdar
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| |
Collapse
|
12
|
R. Smith B, J. Macdonald S. Dissecting the Genetic Basis of Variation in Drosophila Sleep Using a Multiparental QTL Mapping Resource. Genes (Basel) 2020; 11:genes11030294. [PMID: 32168738 PMCID: PMC7140804 DOI: 10.3390/genes11030294] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 03/07/2020] [Accepted: 03/09/2020] [Indexed: 12/27/2022] Open
Abstract
There is considerable variation in sleep duration, timing and quality in human populations, and sleep dysregulation has been implicated as a risk factor for a range of health problems. Human sleep traits are known to be regulated by genetic factors, but also by an array of environmental and social factors. These uncontrolled, non-genetic effects complicate powerful identification of the loci contributing to sleep directly in humans. The model system, Drosophila melanogaster, exhibits a behavior that shows the hallmarks of mammalian sleep, and here we use a multitiered approach, encompassing high-resolution QTL mapping, expression QTL data, and functional validation with RNAi to investigate the genetic basis of sleep under highly controlled environmental conditions. We measured a battery of sleep phenotypes in >750 genotypes derived from a multiparental mapping panel and identified several, modest-effect QTL contributing to natural variation for sleep. Merging sleep QTL data with a large head transcriptome eQTL mapping dataset from the same population allowed us to refine the list of plausible candidate causative sleep loci. This set includes genes with previously characterized effects on sleep and circadian rhythms, in addition to novel candidates. Finally, we employed adult, nervous system-specific RNAi on the Dopa decarboxylase, dyschronic, and timeless genes, finding significant effects on sleep phenotypes for all three. The genes we resolve are strong candidates to harbor causative, regulatory variation contributing to sleep.
Collapse
Affiliation(s)
- Brittny R. Smith
- Department of Molecular Biosciences, 4043 Haworth Hall, 1200 Sunnyside Avenue, University of Kansas, Lawrence, KS 66045, USA
| | - Stuart J. Macdonald
- Department of Molecular Biosciences, 4043 Haworth Hall, 1200 Sunnyside Avenue, University of Kansas, Lawrence, KS 66045, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS 66047, USA
- Correspondence: ; Tel.: +1-785-864-5362
| |
Collapse
|
13
|
Frochaux MV, Bou Sleiman M, Gardeux V, Dainese R, Hollis B, Litovchenko M, Braman VS, Andreani T, Osman D, Deplancke B. cis-regulatory variation modulates susceptibility to enteric infection in the Drosophila genetic reference panel. Genome Biol 2020; 21:6. [PMID: 31948474 PMCID: PMC6966807 DOI: 10.1186/s13059-019-1912-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 12/05/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Resistance to enteric pathogens is a complex trait at the crossroads of multiple biological processes. We have previously shown in the Drosophila Genetic Reference Panel (DGRP) that resistance to infection is highly heritable, but our understanding of how the effects of genetic variants affect different molecular mechanisms to determine gut immunocompetence is still limited. RESULTS To address this, we perform a systems genetics analysis of the gut transcriptomes from 38 DGRP lines that were orally infected with Pseudomonas entomophila. We identify a large number of condition-specific, expression quantitative trait loci (local-eQTLs) with infection-specific ones located in regions enriched for FOX transcription factor motifs. By assessing the allelic imbalance in the transcriptomes of 19 F1 hybrid lines from a large round robin design, we independently attribute a robust cis-regulatory effect to only 10% of these detected local-eQTLs. However, additional analyses indicate that many local-eQTLs may act in trans instead. Comparison of the transcriptomes of DGRP lines that were either susceptible or resistant to Pseudomonas entomophila infection reveals nutcracker as the only differentially expressed gene. Interestingly, we find that nutcracker is linked to infection-specific eQTLs that correlate with its expression level and to enteric infection susceptibility. Further regulatory analysis reveals one particular eQTL that significantly decreases the binding affinity for the repressor Broad, driving differential allele-specific nutcracker expression. CONCLUSIONS Our collective findings point to a large number of infection-specific cis- and trans-acting eQTLs in the DGRP, including one common non-coding variant that lowers enteric infection susceptibility.
Collapse
Affiliation(s)
- Michael V. Frochaux
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Maroun Bou Sleiman
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Current Address: Laboratory of Integrative Systems Physiology, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Vincent Gardeux
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Riccardo Dainese
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Hollis
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Current Address: Department of Biological Sciences, University of South Carolina, Columbia, South Carolina USA
| | - Maria Litovchenko
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL) and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Virginie S. Braman
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Tommaso Andreani
- Computational Biology and Data Mining Group, Institute of Molecular Biology, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| | - Dani Osman
- Faculty of Sciences III and Azm Center for Research in Biotechnology and its Applications, LBA3B, EDST, Lebanese University, Tripoli, 1300 Lebanon
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
14
|
Williams-Simon PA, Ganesan M, King EG. Learning to collaborate: bringing together behavior and quantitative genomics. J Neurogenet 2020; 34:28-35. [PMID: 31920134 DOI: 10.1080/01677063.2019.1710145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The genetic basis of complex trait like learning and memory have been well studied over the decades. Through those groundbreaking findings, we now have a better understanding about some of the genes and pathways that are involved in learning and/or memory. However, few of these findings identified the naturally segregating variants that are influencing learning and/or memory within populations. In this special issue honoring the legacy of Troy Zars, we review some of the traditional approaches that have been used to elucidate the genetic basis of learning and/or memory, specifically in fruit flies. We highlight some of his contributions to the field, and specifically describe his vision to bring together behavior and quantitative genomics with the aim of expanding our knowledge of the genetic basis of both learning and memory. Finally, we present some of our recent work in this area using a multiparental population (MPP) as a case study and describe the potential of this approach to advance our understanding of neurogenetics.
Collapse
Affiliation(s)
| | - Mathangi Ganesan
- Division of Biological Sciences, University of Missouri, Columbia, MO, USA
| | - Elizabeth G King
- Division of Biological Sciences, University of Missouri, Columbia, MO, USA
| |
Collapse
|
15
|
Thornton KR. Polygenic Adaptation to an Environmental Shift: Temporal Dynamics of Variation Under Gaussian Stabilizing Selection and Additive Effects on a Single Trait. Genetics 2019; 213:1513-1530. [PMID: 31653678 PMCID: PMC6893385 DOI: 10.1534/genetics.119.302662] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 10/21/2019] [Indexed: 11/26/2022] Open
Abstract
Predictions about the effect of natural selection on patterns of linked neutral variation are largely based on models involving the rapid fixation of unconditionally beneficial mutations. However, when phenotypes adapt to a new optimum trait value, the strength of selection on individual mutations decreases as the population adapts. Here, I use explicit forward simulations of a single trait with additive-effect mutations adapting to an "optimum shift." Detectable "hitchhiking" patterns are only apparent if (i) the optimum shifts are large with respect to equilibrium variation for the trait, (ii) mutation rates to large-effect mutations are low, and (iii) large-effect mutations rapidly increase in frequency and eventually reach fixation, which typically occurs after the population reaches the new optimum. For the parameters simulated here, partial sweeps do not appreciably affect patterns of linked variation, even when the mutations are strongly selected. The contribution of new mutations vs. standing variation to fixation depends on the mutation rate affecting trait values. Given the fixation of a strongly selected variant, patterns of hitchhiking are similar on average for the two classes of sweeps because sweeps from standing variation involving large-effect mutations are rare when the optimum shifts. The distribution of effect sizes of new mutations has little effect on the time to reach the new optimum, but reducing the mutational variance increases the magnitude of hitchhiking patterns. In general, populations reach the new optimum prior to the completion of any sweeps, and the times to fixation are longer for this model than for standard models of directional selection. The long fixation times are due to a combination of declining selection pressures during adaptation and the possibility of interference among weakly selected sites for traits with high mutation rates.
Collapse
Affiliation(s)
- Kevin R Thornton
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697
| |
Collapse
|
16
|
Structural variants exhibit widespread allelic heterogeneity and shape variation in complex traits. Nat Commun 2019; 10:4872. [PMID: 31653862 PMCID: PMC6814777 DOI: 10.1038/s41467-019-12884-1] [Citation(s) in RCA: 104] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 09/25/2019] [Indexed: 12/11/2022] Open
Abstract
It has been hypothesized that individually-rare hidden structural variants (SVs) could account for a significant fraction of variation in complex traits. Here we identified more than 20,000 euchromatic SVs from 14 Drosophila melanogaster genome assemblies, of which ~40% are invisible to high specificity short-read genotyping approaches. SVs are common, with 31.5% of diploid individuals harboring a SV in genes larger than 5kb, and 24% harboring multiple SVs in genes larger than 10kb. SV minor allele frequencies are rarer than amino acid polymorphisms, suggesting that SVs are more deleterious. We show that a number of functionally important genes harbor previously hidden structural variants likely to affect complex phenotypes. Furthermore, SVs are overrepresented in candidate genes associated with quantitative trait loci mapped using the Drosophila Synthetic Population Resource. We conclude that SVs are ubiquitous, frequently constitute a heterogeneous allelic series, and can act as rare alleles of large effect.
Collapse
|
17
|
Williams-Simon PA, Posey C, Mitchell S, Ng'oma E, Mrkvicka JA, Zars T, King EG. Multiple genetic loci affect place learning and memory performance in Drosophila melanogaster. GENES, BRAIN, AND BEHAVIOR 2019; 18:e12581. [PMID: 31095869 PMCID: PMC6718298 DOI: 10.1111/gbb.12581] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/11/2019] [Accepted: 05/13/2019] [Indexed: 12/25/2022]
Abstract
Learning and memory are critical functions for all animals, giving individuals the ability to respond to changes in their environment. Within populations, individuals vary, however the mechanisms underlying this variation in performance are largely unknown. Thus, it remains to be determined what genetic factors cause an individual to have high learning ability and what factors determine how well an individual will remember what they have learned. To genetically dissect learning and memory performance, we used the Drosophila synthetic population resource (DSPR), a multiparent mapping resource in the model system Drosophila melanogaster, consisting of a large set of recombinant inbred lines (RILs) that naturally vary in these and other traits. Fruit flies can be trained in a "heat box" to learn to remain on one side of a chamber (place learning) and can remember this (place memory) over short timescales. Using this paradigm, we measured place learning and memory for ~49 000 individual flies from over 700 DSPR RILs. We identified 16 different loci across the genome that significantly affect place learning and/or memory performance, with 5 of these loci affecting both traits. To identify transcriptomic differences associated with performance, we performed RNA-Seq on pooled samples of seven high performing and seven low performing RILs for both learning and memory and identified hundreds of genes with differences in expression in the two sets. Integrating our transcriptomic results with the mapping results allowed us to identify nine promising candidate genes, advancing our understanding of the genetic basis underlying natural variation in learning and memory performance.
Collapse
Affiliation(s)
| | - Christopher Posey
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| | - Samuel Mitchell
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| | - Enoch Ng'oma
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| | - James A Mrkvicka
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| | - Troy Zars
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| | - Elizabeth G King
- Division of Biological Sciences, University of Missouri, Columbia, Missouri
| |
Collapse
|
18
|
A major role for noncoding regulatory mutations in the evolution of enzyme activity. Proc Natl Acad Sci U S A 2019; 116:12383-12389. [PMID: 31152141 PMCID: PMC6589674 DOI: 10.1073/pnas.1904071116] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
This study investigates how evolutionary changes in enzyme activity occur. Multiple species of Drosophila flies have adapted to food with different levels of alcohol. This study uncovers genetic changes responsible for these repeated adaptive events, focusing on the main enzyme responsible for alcohol metabolism, Alcohol dehydrogenase. Better alcohol metabolism could be achieved either through changes to the enzyme itself or through changes in DNA regulatory sequences that affect how many enzyme molecules are produced. In four different cases, it was found that regulatory changes were the most frequent contributors to evolution. These findings have important implications because most studies of enzyme evolution focus exclusively on changes to protein sequence, and thus a significant source of adaptive changes may be overlooked. The quantitative evolution of protein activity is a common phenomenon, yet we know little about any general mechanistic tendencies that underlie it. For example, an increase (or decrease) in enzyme activity may evolve from changes in protein sequence that alter specific activity, or from changes in gene expression that alter the amount of protein produced. The latter in turn could arise via mutations that affect gene transcription, posttranscriptional processes, or copy number. Here, to determine the types of genetic changes underlying the quantitative evolution of protein activity, we dissected the basis of ecologically relevant differences in Alcohol dehydrogenase (Adh) enzyme activity between and within several Drosophila species. By using recombinant Adh transgenes to map the functional divergence of ADH enzyme activity in vivo, we find that amino acid substitutions explain only a minority (0 to 25%) of between- and within-species differences in enzyme activity. Instead, noncoding substitutions that occur across many parts of the gene (enhancer, promoter, and 5′ and 3′ untranslated regions) account for the majority of activity differences. Surprisingly, one substitution in a transcriptional Initiator element has occurred in parallel in two species, indicating that core promoters can be an important natural source of the tuning of gene activity. Furthermore, we show that both regulatory and coding substitutions contribute to fitness (resistance to ethanol toxicity). Although qualitative changes in protein specificity necessarily derive from coding mutations, these results suggest that regulatory mutations may be the primary source of quantitative changes in protein activity, a possibility overlooked in most analyses of protein evolution.
Collapse
|
19
|
Qu W, Gurdziel K, Pique-Regi R, Ruden DM. Lead Modulates trans- and cis-Expression Quantitative Trait Loci (eQTLs) in Drosophila melanogaster Heads. Front Genet 2018; 9:395. [PMID: 30294342 PMCID: PMC6158337 DOI: 10.3389/fgene.2018.00395] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Accepted: 08/30/2018] [Indexed: 11/13/2022] Open
Abstract
Lead exposure has long been one of the most important topics in global public health because it is a potent developmental neurotoxin. Here, an eQTL analysis, which is the genome-wide association analysis of genetic variants with gene expression, was performed. In this analysis, the male heads of 79 Drosophila melanogaster inbred lines from Drosophila Synthetic Population Resource (DSPR) were treated with or without developmental exposure, from hatching to adults, to 250 μM lead acetate [Pb(C2H3O2)2]. The goal was to identify genomic intervals that influence the gene-expression response to lead. After detecting 1798 cis-eQTLs and performing an initial trans-eQTL analysis, we focused our analysis on lead-sensitive "trans-eQTL hotspots," defined as genomic regions that are associated with a cluster of genes in a lead-dependent manner. We noticed that the genes associated with one of the 14 detected trans-eQTL hotspots, Chr 2L: 6,250,000 could be roughly divided into two groups based on their differential expression profile patterns and different categories of function. This trans-eQTL hotspot validates one identified in a previous study using different recombinant inbred lines. The expression of all the associated genes in the trans-eQTL hotspot was visualized with hierarchical clustering analysis. Besides the overall expression profile patterns, the heatmap displayed the segregation of differential parental genetic contributions. This suggested that trans-regulatory regions with different genetic contributions from the parental lines have significantly different expression changes after lead exposure. We believe this study confirms our earlier study, and provides important insights to unravel the genetic variation in lead susceptibility in Drosophila model.
Collapse
Affiliation(s)
- Wen Qu
- Department of Pharmacology, Wayne State University, Detroit, MI, United States
| | - Katherine Gurdziel
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, United States
| | - Roger Pique-Regi
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, United States.,Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, United States
| | - Douglas M Ruden
- Department of Pharmacology, Wayne State University, Detroit, MI, United States.,Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, United States.,Institute of Environmental Health Sciences, Wayne State University, Detroit, MI, United States
| |
Collapse
|
20
|
Angeles-Albores D, Puckett Robinson C, Williams BA, Wold BJ, Sternberg PW. Reconstructing a metazoan genetic pathway with transcriptome-wide epistasis measurements. Proc Natl Acad Sci U S A 2018; 115:E2930-E2939. [PMID: 29531064 PMCID: PMC5879656 DOI: 10.1073/pnas.1712387115] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
RNA-sequencing (RNA-seq) is commonly used to identify genetic modules that respond to perturbations. In single cells, transcriptomes have been used as phenotypes, but this concept has not been applied to whole-organism RNA-seq. Also, quantifying and interpreting epistatic effects using expression profiles remains a challenge. We developed a single coefficient to quantify transcriptome-wide epistasis that reflects the underlying interactions and which can be interpreted intuitively. To demonstrate our approach, we sequenced four single and two double mutants of Caenorhabditis elegans From these mutants, we reconstructed the known hypoxia pathway. In addition, we uncovered a class of 56 genes with HIF-1-dependent expression that have opposite changes in expression in mutants of two genes that cooperate to negatively regulate HIF-1 abundance; however, the double mutant of these genes exhibits suppression epistasis. This class violates the classical model of HIF-1 regulation but can be explained by postulating a role of hydroxylated HIF-1 in transcriptional control.
Collapse
Affiliation(s)
- David Angeles-Albores
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
| | - Carmie Puckett Robinson
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
- Department of Neurology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033
| | - Brian A Williams
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Barbara J Wold
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Paul W Sternberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125;
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
| |
Collapse
|
21
|
Serin EAR, Snoek LB, Nijveen H, Willems LAJ, Jiménez-Gómez JM, Hilhorst HWM, Ligterink W. Construction of a High-Density Genetic Map from RNA-Seq Data for an Arabidopsis Bay-0 × Shahdara RIL Population. Front Genet 2017; 8:201. [PMID: 29259624 PMCID: PMC5723289 DOI: 10.3389/fgene.2017.00201] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Accepted: 11/21/2017] [Indexed: 12/17/2022] Open
Abstract
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches.
Collapse
Affiliation(s)
- Elise A R Serin
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - L B Snoek
- Laboratory of Nematology, Wageningen University, Wageningen, Netherlands.,Theoretical Biology and Bioinformatics, Utrecht University, Utrecht, Netherlands
| | - Harm Nijveen
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands.,Laboratory of Bioinformatics, Wageningen University, Wageningen, Netherlands
| | - Leo A J Willems
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - Jose M Jiménez-Gómez
- Department of Plant Breeding and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany.,Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique, AgroParisTech, Centre National de la Recherche Scientifique, Université Paris-Saclay, Versailles Cedex, France
| | - Henk W M Hilhorst
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| | - Wilco Ligterink
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen University, Wageningen, Netherlands
| |
Collapse
|
22
|
Noble LM, Chelo I, Guzella T, Afonso B, Riccardi DD, Ammerman P, Dayarian A, Carvalho S, Crist A, Pino-Querido A, Shraiman B, Rockman MV, Teotónio H. Polygenicity and Epistasis Underlie Fitness-Proximal Traits in the Caenorhabditis elegans Multiparental Experimental Evolution (CeMEE) Panel. Genetics 2017; 207:1663-1685. [PMID: 29066469 PMCID: PMC5714472 DOI: 10.1534/genetics.117.300406] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 10/10/2017] [Indexed: 01/27/2023] Open
Abstract
Understanding the genetic basis of complex traits remains a major challenge in biology. Polygenicity, phenotypic plasticity, and epistasis contribute to phenotypic variance in ways that are rarely clear. This uncertainty can be problematic for estimating heritability, for predicting individual phenotypes from genomic data, and for parameterizing models of phenotypic evolution. Here, we report an advanced recombinant inbred line (RIL) quantitative trait locus mapping panel for the hermaphroditic nematode Caenorhabditis elegans, the C. elegans multiparental experimental evolution (CeMEE) panel. The CeMEE panel, comprising 507 RILs at present, was created by hybridization of 16 wild isolates, experimental evolution for 140-190 generations, and inbreeding by selfing for 13-16 generations. The panel contains 22% of single-nucleotide polymorphisms known to segregate in natural populations, and complements existing C. elegans mapping resources by providing fine resolution and high nucleotide diversity across > 95% of the genome. We apply it to study the genetic basis of two fitness components, fertility and hermaphrodite body size at time of reproduction, with high broad-sense heritability in the CeMEE. While simulations show that we should detect common alleles with additive effects as small as 5%, at gene-level resolution, the genetic architectures of these traits do not feature such alleles. We instead find that a significant fraction of trait variance, approaching 40% for fertility, can be explained by sign epistasis with main effects below the detection limit. In congruence, phenotype prediction from genomic similarity, while generally poor ([Formula: see text]), requires modeling epistasis for optimal accuracy, with most variance attributed to the rapidly evolving chromosome arms.
Collapse
Affiliation(s)
- Luke M Noble
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York 10003
| | - Ivo Chelo
- Instituto Gulbenkian de Ciência, P-2781-901 Oeiras, Portugal
| | - Thiago Guzella
- Institut de Biologie, École Normale Supérieure, Centre National de la Recherche Scientifique (CNRS) UMR 8197, Institut National de la Santé et de la Recherche Médicale (INSERM) U1024, F-75005 Paris, France
| | - Bruno Afonso
- Instituto Gulbenkian de Ciência, P-2781-901 Oeiras, Portugal
- Institut de Biologie, École Normale Supérieure, Centre National de la Recherche Scientifique (CNRS) UMR 8197, Institut National de la Santé et de la Recherche Médicale (INSERM) U1024, F-75005 Paris, France
| | - David D Riccardi
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York 10003
| | - Patrick Ammerman
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York 10003
| | - Adel Dayarian
- Kavli Institute for Theoretical Physics, University of California, Santa Barbara, California 93106
| | - Sara Carvalho
- Instituto Gulbenkian de Ciência, P-2781-901 Oeiras, Portugal
| | - Anna Crist
- Institut de Biologie, École Normale Supérieure, Centre National de la Recherche Scientifique (CNRS) UMR 8197, Institut National de la Santé et de la Recherche Médicale (INSERM) U1024, F-75005 Paris, France
| | | | - Boris Shraiman
- Kavli Institute for Theoretical Physics, University of California, Santa Barbara, California 93106
- Department of Physics, University of California, Santa Barbara, California 93106
| | - Matthew V Rockman
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York 10003
| | - Henrique Teotónio
- Institut de Biologie, École Normale Supérieure, Centre National de la Recherche Scientifique (CNRS) UMR 8197, Institut National de la Santé et de la Recherche Médicale (INSERM) U1024, F-75005 Paris, France
| |
Collapse
|
23
|
Qu W, Gurdziel K, Pique-Regi R, Ruden DM. Identification of Splicing Quantitative Trait Loci (sQTL) in Drosophila melanogaster with Developmental Lead (Pb 2+) Exposure. Front Genet 2017; 8:145. [PMID: 29114259 PMCID: PMC5660682 DOI: 10.3389/fgene.2017.00145] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Accepted: 09/22/2017] [Indexed: 01/09/2023] Open
Abstract
Lead (Pb) poisoning has been a major public health issue globally and the recent Flint water crisis has drawn nation-wide attention to its effects. To better understand how lead plays a role as a neurotoxin, we utilized the Drosophila melanogaster model to study the genetic effects of lead exposure during development and identified lead-responsive genes. In our previous studies, we have successfully identified hundreds of lead-responsive expression QTLs (eQTLs) by using RNA-seq analysis on heads collected from the Drosophila Synthetic Population Resource. Cis-eQTLs, also known as allele-specific expression (ASE) polymorphisms, are generally single-nucleotide polymorphisms in the promoter regions of genes that affect expression of the gene, such as by inhibiting the binding of transcription factors. Trans-eQTLs are genes that regulate mRNA levels for many genes, and are generally thought to be SNPs in trans-acting transcription or translation factors. In this study, we focused our attention on alternative splicing events that are affected by lead exposure. Splicing QTLs (sQTLs), which can be caused by SNPs that alter splicing or alternative splicing (AS), such as by changing the sequence-specific binding affinity of splicing factors to the pre-mRNA. We applied two methods in search for sQTLs by using RNA-seq data from control and lead-exposed w1118Drosophila heads. First, we used the fraction of reads in a gene that falls in each exon as the phenotype. Second, we directly compared the transcript counts among the various splicing isoforms as the phenotype. Among the 1,236 potential Pb-responsive sQTLs (p < 0.0001, FDR < 0.39), mostly cis-sQTLs, one of the most distinct genes is Dscam1 (Down Syndrome Cell Adhesion Molecule), which has over 30,000 potential alternative splicing isoforms. We have also identified a candidate Pb-responsive trans-sQTL hotspot that appears to regulate 129 genes that are enriched in the “cation channel” gene ontology category, suggesting a model in which alternative splicing of these channels might lead to an increase in the elimination of Pb2+ from the neurons encoding these channels. To our knowledge, this is the first paper that uses sQTL analyses to understand the neurotoxicology of an environmental toxin in any organism, and the first reported discovery of a candidate trans-sQTL hotspot.
Collapse
Affiliation(s)
- Wen Qu
- Laboratory of Epigenomics, Department of Pharmacology, C.S. Mott Center for Human Growth and Development, Wayne State University, Detroit, MI, United States
| | - Katherine Gurdziel
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, United States
| | - Roger Pique-Regi
- Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, United States
| | - Douglas M Ruden
- Laboratory of Epigenomics, Department of Pharmacology, C.S. Mott Center for Human Growth and Development, Wayne State University, Detroit, MI, United States.,Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, United States.,Institute of Environmental Health Sciences, Wayne State University, Detroit, MI, United States
| |
Collapse
|
24
|
Highfill CA, Tran JH, Nguyen SKT, Moldenhauer TR, Wang X, Macdonald SJ. Naturally Segregating Variation at Ugt86Dd Contributes to Nicotine Resistance in Drosophila melanogaster. Genetics 2017; 207:311-325. [PMID: 28743761 PMCID: PMC5586381 DOI: 10.1534/genetics.117.300058] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 07/24/2017] [Indexed: 12/16/2022] Open
Abstract
Identifying the sequence polymorphisms underlying complex trait variation is a key goal of genetics research, since knowing the precise causative molecular events allows insight into the pathways governing trait variation. Genetic analysis of complex traits in model systems regularly starts by constructing QTL maps, but generally fails to identify causative sequence polymorphisms. Previously we mapped a series of QTL contributing to resistance to nicotine in a Drosophila melanogaster multiparental mapping resource and here use a battery of functional tests to resolve QTL to the molecular level. One large-effect QTL resided over a cluster of UDP-glucuronosyltransferases, and quantitative complementation tests using deficiencies eliminating subsets of these detoxification genes revealed allelic variation impacting resistance. RNAseq showed that Ugt86Dd had significantly higher expression in genotypes that are more resistant to nicotine, and anterior midgut-specific RNA interference (RNAi) of this gene reduced resistance. We discovered a segregating 22-bp frameshift deletion in Ugt86Dd, and accounting for the InDel during mapping largely eliminates the QTL, implying the event explains the bulk of the effect of the mapped locus. CRISPR/Cas9 editing of a relatively resistant genotype to generate lesions in Ugt86Dd that recapitulate the naturally occurring putative loss-of-function allele, leads to a large reduction in resistance. Despite this major effect of the deletion, the allele appears to be very rare in wild-caught populations and likely explains only a small fraction of the natural variation for the trait. Nonetheless, this putatively causative coding InDel can be a launchpad for future mechanistic exploration of xenobiotic detoxification.
Collapse
Affiliation(s)
- Chad A Highfill
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
| | - Jonathan H Tran
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
| | - Samantha K T Nguyen
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
| | - Taylor R Moldenhauer
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
| | - Xiaofei Wang
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
| | - Stuart J Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas 66047
- Center for Computational Biology, University of Kansas, Lawrence, Kansas 66047
| |
Collapse
|
25
|
Genetic Dissection of Nutrition-Induced Plasticity in Insulin/Insulin-Like Growth Factor Signaling and Median Life Span in a Drosophila Multiparent Population. Genetics 2017; 206:587-602. [PMID: 28592498 DOI: 10.1534/genetics.116.197780] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2017] [Accepted: 03/13/2017] [Indexed: 11/18/2022] Open
Abstract
The nutritional environments that organisms experience are inherently variable, requiring tight coordination of how resources are allocated to different functions relative to the total amount of resources available. A growing body of evidence supports the hypothesis that key endocrine pathways play a fundamental role in this coordination. In particular, the insulin/insulin-like growth factor signaling (IIS) and target of rapamycin (TOR) pathways have been implicated in nutrition-dependent changes in metabolism and nutrient allocation. However, little is known about the genetic basis of standing variation in IIS/TOR or how diet-dependent changes in expression in this pathway influence phenotypes related to resource allocation. To characterize natural genetic variation in the IIS/TOR pathway, we used >250 recombinant inbred lines (RILs) derived from a multiparental mapping population, the Drosophila Synthetic Population Resource, to map transcript-level QTL of genes encoding 52 core IIS/TOR components in three different nutritional environments [dietary restriction (DR), control (C), and high sugar (HS)]. Nearly all genes, 87%, were significantly differentially expressed between diets, though not always in ways predicted by loss-of-function mutants. We identified cis (i.e., local) expression QTL (eQTL) for six genes, all of which are significant in multiple nutrient environments. Further, we identified trans (i.e., distant) eQTL for two genes, specific to a single nutrient environment. Our results are consistent with many small changes in the IIS/TOR pathways. A discriminant function analysis for the C and DR treatments identified a pattern of gene expression associated with the diet treatment. Mapping the composite discriminant function scores revealed a significant global eQTL within the DR diet. A correlation between the discriminant function scores and the median life span (r = 0.46) provides evidence that gene expression changes in response to diet are associated with longevity in these RILs.
Collapse
|
26
|
Zan Y, Sheng Z, Lillie M, Rönnegård L, Honaker CF, Siegel PB, Carlborg Ö. Artificial Selection Response due to Polygenic Adaptation from a Multilocus, Multiallelic Genetic Architecture. Mol Biol Evol 2017; 34:2678-2689. [DOI: 10.1093/molbev/msx194] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
|
27
|
Ng'oma E, Perinchery AM, King EG. How to get the most bang for your buck: the evolution and physiology of nutrition-dependent resource allocation strategies. Proc Biol Sci 2017; 284:20170445. [PMID: 28637856 PMCID: PMC5489724 DOI: 10.1098/rspb.2017.0445] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 05/23/2017] [Indexed: 12/31/2022] Open
Abstract
All organisms use resources to grow, survive and reproduce. The supply of these resources varies widely across landscapes and time, imposing ultimate constraints on the maximal trait values for allocation-related traits. In this review, we address three key questions fundamental to our understanding of the evolution of allocation strategies and their underlying mechanisms. First, we ask: how diverse are flexible resource allocation strategies among different organisms? We find there are many, varied, examples of flexible strategies that depend on nutrition. However, this diversity is often ignored in some of the best-known cases of resource allocation shifts, such as the commonly observed pattern of lifespan extension under nutrient limitation. A greater appreciation of the wide variety of flexible allocation strategies leads directly to our second major question: what conditions select for different plastic allocation strategies? Here, we highlight the need for additional models that explicitly consider the evolution of phenotypically plastic allocation strategies and empirical tests of the predictions of those models in natural populations. Finally, we consider the question: what are the underlying mechanisms determining resource allocation strategies? Although evolutionary biologists assume differential allocation of resources is a major factor limiting trait evolution, few proximate mechanisms are known that specifically support the model. We argue that an integrated framework can reconcile evolutionary models with proximate mechanisms that appear at first glance to be in conflict with these models. Overall, we encourage future studies to: (i) mimic ecological conditions in which those patterns evolve, and (ii) take advantage of the 'omic' opportunities to produce multi-level data and analytical models that effectively integrate across physiological and evolutionary theory.
Collapse
Affiliation(s)
- Enoch Ng'oma
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Anna M Perinchery
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Elizabeth G King
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
28
|
Complex Coding and Regulatory Polymorphisms in a Restriction Factor Determine the Susceptibility of Drosophila to Viral Infection. Genetics 2017. [PMID: 28630113 PMCID: PMC5560813 DOI: 10.1534/genetics.117.201970] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
It is common to find that major-effect genes are an important cause of variation in susceptibility to infection. Here we have characterized natural variation in a gene called pastrel that explains over half of the genetic variance in susceptibility to the Drosophila C virus (DCV) in populations of Drosophila melanogaster We found extensive allelic heterogeneity, with a sample of seven alleles of pastrel from around the world conferring four phenotypically distinct levels of resistance. By modifying candidate SNPs in transgenic flies, we show that the largest effect is caused by an amino acid polymorphism that arose when an ancestral threonine was mutated to alanine, greatly increasing resistance to DCV. Overexpression of the ancestral, susceptible allele provides strong protection against DCV; indicating that this mutation acted to improve an existing restriction factor. The pastrel locus also contains complex structural variation and cis-regulatory polymorphisms altering gene expression. We find that higher expression of pastrel is associated with increased survival after DCV infection. To understand why this variation is maintained in populations, we investigated genetic variation surrounding the amino acid variant that is causing flies to be resistant. We found no evidence of natural selection causing either recent changes in allele frequency or geographical variation in frequency, suggesting that this is an old polymorphism that has been maintained at a stable frequency. Overall, our data demonstrate how complex genetic variation at a single locus can control susceptibility to a virulent natural pathogen.
Collapse
|
29
|
Affiliation(s)
- Ian M Ehrenreich
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089-2910
| |
Collapse
|
30
|
The Beavis Effect in Next-Generation Mapping Panels in Drosophila melanogaster. G3-GENES GENOMES GENETICS 2017; 7:1643-1652. [PMID: 28592647 PMCID: PMC5473746 DOI: 10.1534/g3.117.041426] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
A major goal in the analysis of complex traits is to partition the observed genetic variation in a trait into components due to individual loci and perhaps variants within those loci. However, in both QTL mapping and genetic association studies, the estimated percent variation attributable to a QTL is upwardly biased conditional on it being discovered. This bias was first described in two-way QTL mapping experiments by William Beavis, and has been referred to extensively as “the Beavis effect.” The Beavis effect is likely to occur in multiparent population (MPP) panels as well as collections of sequenced lines used for genome-wide association studies (GWAS). However, the strength of the Beavis effect is unknown—and often implicitly assumed to be negligible—when “hits” are obtained from an association panel consisting of hundreds of inbred lines tested across millions of SNPs, or in multiparent mapping populations where mapping involves fitting a complex statistical model with several d.f. at thousands of genetic intervals. To estimate the size of the effect in more complex panels, we performed simulations of both biallelic and multiallelic QTL in two major Drosophila melanogaster mapping panels, the GWAS-based Drosophila Genetic Reference Panel (DGRP), and the MPP the Drosophila Synthetic Population Resource (DSPR). Our results show that overestimation is determined most strongly by sample size and is only minimally impacted by the mapping design. When < 100, 200, 500, and 1000 lines are employed, the variance attributable to hits is inflated by factors of 6, 3, 1.5, and 1.1, respectively, for a QTL that truly contributes 5% to the variation in the trait. This overestimation indicates that QTL could be difficult to validate in follow-up replication experiments where additional individuals are examined. Further, QTL could be difficult to cross-validate between the two Drosophila resources. We provide guidelines for: (1) the sample sizes necessary to accurately estimate the percent variance to an identified QTL, (2) the conditions under which one is likely to replicate a mapped QTL in a second study using the same mapping population, and (3) the conditions under which a QTL mapped in one mapping panel is likely to replicate in the other (DGRP and DSPR).
Collapse
|
31
|
Loci Contributing to Boric Acid Toxicity in Two Reference Populations of Drosophila melanogaster. G3-GENES GENOMES GENETICS 2017; 7:1631-1641. [PMID: 28592646 PMCID: PMC5473745 DOI: 10.1534/g3.117.041418] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Populations maintain considerable segregating variation in the response to toxic, xenobiotic compounds. To identify variants associated with resistance to boric acid, a commonly-used household insecticide with a poorly understood mechanism of action, we assayed thousands of individuals from hundreds of strains. Using the Drosophila Synthetic Population Resource (DSPR), a multi-parental population (MPP) of inbred genotypes, we mapped six QTL to short genomic regions containing few protein-coding genes (3–188), allowing us to identify plausible candidate genes underlying resistance to boric acid toxicity. One interval contains multiple genes from the cytochrome P450 family, and we show that ubiquitous RNAi of one of these genes, Cyp9b2, markedly reduces resistance to the toxin. Resistance to boric acid is positively correlated with caffeine resistance. The two phenotypes additionally share a pair of QTL, potentially suggesting a degree of pleiotropy in the genetic control of resistance to these two distinct xenobiotics. Finally, we screened the Drosophila Genetic Reference Panel (DGRP) in an attempt to identify sequence variants within mapped QTL that are associated with boric acid resistance. The approach was largely unsuccessful, with only one QTL showing any associations at QTL-specific 20% False Discovery Rate (FDR) thresholds. Nonetheless, these associations point to a potential candidate gene that can be targeted in future validation efforts. Although the mapping data resulting from the two reference populations do not clearly overlap, our work provides a starting point for further genetic dissection of the processes underlying boric acid toxicity in insects.
Collapse
|
32
|
Affiliation(s)
- Ian M Ehrenreich
- Molecular and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, California 90089-2910
| |
Collapse
|
33
|
Josephs EB, Stinchcombe JR, Wright SI. What can genome-wide association studies tell us about the evolutionary forces maintaining genetic variation for quantitative traits? THE NEW PHYTOLOGIST 2017; 214:21-33. [PMID: 28211582 DOI: 10.1111/nph.14410] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 11/14/2016] [Indexed: 05/27/2023]
Abstract
Contents 21 I. 21 II. 22 III. 24 IV. 25 V. 29 30 References 30 SUMMARY: Understanding the evolutionary forces that shape genetic variation within species has long been a goal of evolutionary biology. Integrating data for the genetic architecture of traits from genome-wide association mapping studies (GWAS) along with the development of new population genetic methods for identifying selection in sequence data may allow us to evaluate the roles of mutation-selection balance and balancing selection in shaping genetic variation at various scales. Here, we review the theoretical predictions for genetic architecture and additional signals of selection on genomic sequence for the loci that affect traits. Next, we review how plant GWAS have tested for the signatures of various selective scenarios. Limited evidence to date suggests that within-population variation is maintained primarily by mutation-selection balance while variation across the landscape is the result of local adaptation. However, there are a number of inherent biases in these interpretations. We highlight these challenges and suggest ways forward to further understanding of the maintenance of variation.
Collapse
Affiliation(s)
- Emily B Josephs
- Department of Evolution and Ecology, University of California, Davis, One Shields Avenue, Davis, CA, 95616, USA
| | - John R Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St., Toronto, ON, M5S 3B2, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St., Toronto, ON, M5S 3B2, Canada
| |
Collapse
|
34
|
Moreno-Moral A, Pesce F, Behmoaras J, Petretto E. Systems Genetics as a Tool to Identify Master Genetic Regulators in Complex Disease. Methods Mol Biol 2017; 1488:337-362. [PMID: 27933533 DOI: 10.1007/978-1-4939-6427-7_16] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Systems genetics stems from systems biology and similarly employs integrative modeling approaches to describe the perturbations and phenotypic effects observed in a complex system. However, in the case of systems genetics the main source of perturbation is naturally occurring genetic variation, which can be analyzed at the systems-level to explain the observed variation in phenotypic traits. In contrast with conventional single-variant association approaches, the success of systems genetics has been in the identification of gene networks and molecular pathways that underlie complex disease. In addition, systems genetics has proven useful in the discovery of master trans-acting genetic regulators of functional networks and pathways, which in many cases revealed unexpected gene targets for disease. Here we detail the central components of a fully integrated systems genetics approach to complex disease, starting from assessment of genetic and gene expression variation, linking DNA sequence variation to mRNA (expression QTL mapping), gene regulatory network analysis and mapping the genetic control of regulatory networks. By summarizing a few illustrative (and successful) examples, we highlight how different data-modeling strategies can be effectively integrated in a systems genetics study.
Collapse
Affiliation(s)
- Aida Moreno-Moral
- Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore
| | - Francesco Pesce
- National Heart and Lung Institute, Faculty of Medicine, Imperial College London, Hammersmith Campus, Imperial Centre for Translational and Experimental Medicine, London, UK
| | - Jacques Behmoaras
- Centre for Complement and Inflammation Research, Imperial College London, Hammersmith Hospital, Du Cane Road, London, W12 0NN, UK
| | - Enrico Petretto
- Duke-NUS Medical School, 8 College Road, Singapore, 169857, Singapore.
| |
Collapse
|
35
|
Wheeler HE, Shah KP, Brenner J, Garcia T, Aquino-Michaels K, Cox NJ, Nicolae DL, Im HK. Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues. PLoS Genet 2016; 12:e1006423. [PMID: 27835642 PMCID: PMC5106030 DOI: 10.1371/journal.pgen.1006423] [Citation(s) in RCA: 98] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 10/12/2016] [Indexed: 11/19/2022] Open
Abstract
Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR < 0.1) in the DGN whole blood cohort. However, current sample sizes (n ≤ 922) do not allow us to compute distal h2. Bayesian Sparse Linear Mixed Model (BSLMM) analysis provides strong evidence that the genetic contribution to local expression traits is dominated by a handful of genetic variants rather than by the collective contribution of a large number of variants each of modest size. In other words, the local architecture of gene expression traits is sparse rather than polygenic across all 40 tissues (from DGN and GTEx) examined. This result is confirmed by the sparsity of optimal performing gene expression predictors via elastic net modeling. To further explore the tissue context specificity, we decompose the expression traits into cross-tissue and tissue-specific components using a novel Orthogonal Tissue Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).
Collapse
Affiliation(s)
- Heather E. Wheeler
- Department of Biology, Loyola University Chicago, Chicago, Illinois, United States of America
- Department of Computer Science, Loyola University Chicago, Chicago, Illinois, United States of America
| | - Kaanan P. Shah
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
| | - Jonathon Brenner
- Department of Computer Science, Loyola University Chicago, Chicago, Illinois, United States of America
| | - Tzintzuni Garcia
- Center for Research Informatics, University of Chicago, Chicago, Illinois, United States of America
| | - Keston Aquino-Michaels
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
| | | | - Nancy J. Cox
- Division of Genetic Medicine, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Dan L. Nicolae
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
36
|
Branco AT, Schilling L, Silkaitis K, Dowling DK, Lemos B. Reproductive activity triggers accelerated male mortality and decreases lifespan: genetic and gene expression determinants in Drosophila. Heredity (Edinb) 2016; 118:221-228. [PMID: 27731328 DOI: 10.1038/hdy.2016.89] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2016] [Revised: 06/01/2016] [Accepted: 07/15/2016] [Indexed: 02/07/2023] Open
Abstract
Reproduction and aging evolved to be intimately associated. Experimental selection for early-life reproduction drives the evolution of decreased longevity in Drosophila whereas experimental selection for increased longevity leads to changes in reproduction. Although life history theory offers hypotheses to explain these relationships, the genetic architecture and molecular mechanisms underlying reproduction-longevity associations remain a matter of debate. Here we show that mating triggers accelerated mortality in males and identify hundreds of genes that are modulated upon mating in the fruit fly Drosophila melanogaster. Interrogation of genome-wide gene expression in virgin and recently mated males revealed coherent responses, with biological processes that are upregulated (testis-specific gene expression) or downregulated (metabolism and mitochondria-related functions) upon mating. Furthermore, using a panel of genotypes from the Drosophila Synthetic Population Resource (DSPR) as a source of naturally occurring genetic perturbation, we uncover abundant variation in longevity and reproduction-induced mortality among genotypes. Genotypes displayed more than fourfold variation in longevity and reproduction-induced mortality that can be traced to variation in specific segments of the genome. The data reveal individual variation in sensitivity to reproduction and physiological processes that are enhanced and suppressed upon mating. These results raise the prospect that variation in longevity and age-related traits could be traced to processes that coordinate germline and somatic function.
Collapse
Affiliation(s)
- A T Branco
- Molecular and Integrative Physiological Sciences Program, Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - L Schilling
- Molecular and Integrative Physiological Sciences Program, Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - K Silkaitis
- Molecular and Integrative Physiological Sciences Program, Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - D K Dowling
- School of Biological Sciences, Monash University, Clayton, Victoria, Australia
| | - B Lemos
- Molecular and Integrative Physiological Sciences Program, Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
37
|
Cogni R, Cao C, Day JP, Bridson C, Jiggins FM. The genetic architecture of resistance to virus infection in Drosophila. Mol Ecol 2016; 25:5228-5241. [PMID: 27460507 PMCID: PMC5082504 DOI: 10.1111/mec.13769] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Revised: 07/03/2016] [Accepted: 07/05/2016] [Indexed: 12/18/2022]
Abstract
Variation in susceptibility to infection has a substantial genetic component in natural populations, and it has been argued that selection by pathogens may result in it having a simpler genetic architecture than many other quantitative traits. This is important as models of host-pathogen co-evolution typically assume resistance is controlled by a small number of genes. Using the Drosophila melanogaster multiparent advanced intercross, we investigated the genetic architecture of resistance to two naturally occurring viruses, the sigma virus and DCV (Drosophila C virus). We found extensive genetic variation in resistance to both viruses. For DCV resistance, this variation is largely caused by two major-effect loci. Sigma virus resistance involves more genes - we mapped five loci, and together these explained less than half the genetic variance. Nonetheless, several of these had a large effect on resistance. Models of co-evolution typically assume strong epistatic interactions between polymorphisms controlling resistance, but we were only able to detect one locus that altered the effect of the main effect loci we had mapped. Most of the loci we mapped were probably at an intermediate frequency in natural populations. Overall, our results are consistent with major-effect genes commonly affecting susceptibility to infectious diseases, with DCV resistance being a near-Mendelian trait.
Collapse
Affiliation(s)
- Rodrigo Cogni
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK.
- Department of Ecology, University of São Paulo, São Paulo, 05508-900, Brazil.
| | - Chuan Cao
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Jonathan P Day
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Calum Bridson
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - Francis M Jiggins
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| |
Collapse
|
38
|
Buffering of Genetic Regulatory Networks in Drosophila melanogaster. Genetics 2016; 203:1177-90. [PMID: 27194752 DOI: 10.1534/genetics.116.188797] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 05/17/2016] [Indexed: 01/01/2023] Open
Abstract
Regulatory variation in gene expression can be described by cis- and trans-genetic components. Here we used RNA-seq data from a population panel of Drosophila melanogaster test crosses to compare allelic imbalance (AI) in female head tissue between mated and virgin flies, an environmental change known to affect transcription. Indeed, 3048 exons (1610 genes) are differentially expressed in this study. A Bayesian model for AI, with an intersection test, controls type I error. There are ∼200 genes with AI exclusively in mated or virgin flies, indicating an environmental component of expression regulation. On average 34% of genes within a cross and 54% of all genes show evidence for genetic regulation of transcription. Nearly all differentially regulated genes are affected in cis, with an average of 63% of expression variation explained by the cis-effects. Trans-effects explain 8% of the variance in AI on average and the interaction between cis and trans explains an average of 11% of the total variance in AI. In both environments cis- and trans-effects are compensatory in their overall effect, with a negative association between cis- and trans-effects in 85% of the exons examined. We hypothesize that the gene expression level perturbed by cis-regulatory mutations is compensated through trans-regulatory mechanisms, e.g., trans and cis by trans-factors buffering cis-mutations. In addition, when AI is detected in both environments, cis-mated, cis-virgin, and trans-mated-trans-virgin estimates are highly concordant with 99% of all exons positively correlated with a median correlation of 0.83 for cis and 0.95 for trans We conclude that the gene regulatory networks (GRNs) are robust and that trans-buffering explains robustness.
Collapse
|
39
|
Kamkina P, Snoek LB, Grossmann J, Volkers RJM, Sterken MG, Daube M, Roschitzki B, Fortes C, Schlapbach R, Roth A, von Mering C, Hengartner MO, Schrimpf SP, Kammenga JE. Natural Genetic Variation Differentially Affects the Proteome and Transcriptome in Caenorhabditis elegans. Mol Cell Proteomics 2016; 15:1670-80. [PMID: 26944343 DOI: 10.1074/mcp.m115.052548] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Indexed: 11/06/2022] Open
Abstract
Natural genetic variation is the raw material of evolution and influences disease development and progression. An important question is how this genetic variation translates into variation in protein abundance. To analyze the effects of the genetic background on gene and protein expression in the nematode Caenorhabditis elegans, we quantitatively compared the two genetically highly divergent wild-type strains N2 and CB4856. Gene expression was analyzed by microarray assays, and proteins were quantified using stable isotope labeling by amino acids in cell culture. Among all transcribed genes, we found 1,532 genes to be differentially transcribed between the two wild types. Of the total 3,238 quantified proteins, 129 proteins were significantly differentially expressed between N2 and CB4856. The differentially expressed proteins were enriched for genes that function in insulin-signaling and stress-response pathways, underlining strong divergence of these pathways in nematodes. The protein abundance of the two wild-type strains correlates more strongly than protein abundance versus transcript abundance within each wild type. Our findings indicate that in C. elegans only a fraction of the changes in protein abundance can be explained by the changes in mRNA abundance. These findings corroborate with the observations made across species.
Collapse
Affiliation(s)
- Polina Kamkina
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland; §Ph.D. Program in Molecular Life Sciences Zurich, 8057 Zurich, Switzerland
| | - L Basten Snoek
- ‖Laboratory of Nematology, Wageningen University, Wageningen 6708 PB, The Netherlands
| | - Jonas Grossmann
- **Functional Genomics Center Zurich, University of Zurich and Swiss Federal Institute of Technology Zurich, 8057 Zurich, Switzerland
| | - Rita J M Volkers
- ‖Laboratory of Nematology, Wageningen University, Wageningen 6708 PB, The Netherlands
| | - Mark G Sterken
- ‖Laboratory of Nematology, Wageningen University, Wageningen 6708 PB, The Netherlands
| | - Michael Daube
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Bernd Roschitzki
- **Functional Genomics Center Zurich, University of Zurich and Swiss Federal Institute of Technology Zurich, 8057 Zurich, Switzerland
| | - Claudia Fortes
- **Functional Genomics Center Zurich, University of Zurich and Swiss Federal Institute of Technology Zurich, 8057 Zurich, Switzerland
| | - Ralph Schlapbach
- **Functional Genomics Center Zurich, University of Zurich and Swiss Federal Institute of Technology Zurich, 8057 Zurich, Switzerland
| | - Alexander Roth
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Christian von Mering
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Michael O Hengartner
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Sabine P Schrimpf
- From the ‡Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland;
| | - Jan E Kammenga
- ‖Laboratory of Nematology, Wageningen University, Wageningen 6708 PB, The Netherlands;
| |
Collapse
|
40
|
Zhao X, Bergland AO, Behrman EL, Gregory BD, Petrov DA, Schmidt PS. Global Transcriptional Profiling of Diapause and Climatic Adaptation in Drosophila melanogaster. Mol Biol Evol 2016; 33:707-20. [PMID: 26568616 PMCID: PMC5009998 DOI: 10.1093/molbev/msv263] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Wild populations of the model organism Drosophila melanogaster experience highly heterogeneous environments over broad geographical ranges as well as over seasonal and annual timescales. Diapause is a primary adaptation to environmental heterogeneity, and in D. melanogaster the propensity to enter diapause varies predictably with latitude and season. Here we performed global transcriptomic profiling of naturally occurring variation in diapause expression elicited by short day photoperiod and moderately low temperature in two tissue types associated with neuroendocrine and endocrine signaling, heads, and ovaries. We show that diapause in D. melanogaster is an actively regulated phenotype at the transcriptional level, suggesting that diapause is not a simple physiological or reproductive quiescence. Differentially expressed genes and pathways are highly distinct in heads and ovaries, demonstrating that the diapause response is not uniform throughout the soma and suggesting that it may be comprised of functional modules associated with specific tissues. Genes downregulated in heads of diapausing flies are significantly enriched for clinally varying single nucleotide polymorphism (SNPs) and seasonally oscillating SNPs, consistent with the hypothesis that diapause is a driving phenotype of climatic adaptation. We also show that chromosome location-based coregulation of gene expression is present in the transcriptional regulation of diapause. Taken together, these results demonstrate that diapause is a complex phenotype actively regulated in multiple tissues, and support the hypothesis that natural variation in diapause propensity underlies adaptation to spatially and temporally varying selective pressures.
Collapse
Affiliation(s)
- Xiaqing Zhao
- Department of Biology, University of Pennsylvania
| | | | | | | | | | | |
Collapse
|
41
|
Najarro MA, Hackett JL, Smith BR, Highfill CA, King EG, Long AD, Macdonald SJ. Identifying Loci Contributing to Natural Variation in Xenobiotic Resistance in Drosophila. PLoS Genet 2015; 11:e1005663. [PMID: 26619284 PMCID: PMC4664282 DOI: 10.1371/journal.pgen.1005663] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 10/21/2015] [Indexed: 12/12/2022] Open
Abstract
Natural populations exhibit a great deal of interindividual genetic variation in the response to toxins, exemplified by the variable clinical efficacy of pharmaceutical drugs in humans, and the evolution of pesticide resistant insects. Such variation can result from several phenomena, including variable metabolic detoxification of the xenobiotic, and differential sensitivity of the molecular target of the toxin. Our goal is to genetically dissect variation in the response to xenobiotics, and characterize naturally-segregating polymorphisms that modulate toxicity. Here, we use the Drosophila Synthetic Population Resource (DSPR), a multiparent advanced intercross panel of recombinant inbred lines, to identify QTL (Quantitative Trait Loci) underlying xenobiotic resistance, and employ caffeine as a model toxic compound. Phenotyping over 1,700 genotypes led to the identification of ten QTL, each explaining 4.5-14.4% of the broad-sense heritability for caffeine resistance. Four QTL harbor members of the cytochrome P450 family of detoxification enzymes, which represent strong a priori candidate genes. The case is especially strong for Cyp12d1, with multiple lines of evidence indicating the gene causally impacts caffeine resistance. Cyp12d1 is implicated by QTL mapped in both panels of DSPR RILs, is significantly upregulated in the presence of caffeine, and RNAi knockdown robustly decreases caffeine tolerance. Furthermore, copy number variation at Cyp12d1 is strongly associated with phenotype in the DSPR, with a trend in the same direction observed in the DGRP (Drosophila Genetic Reference Panel). No additional plausible causative polymorphisms were observed in a full genomewide association study in the DGRP, or in analyses restricted to QTL regions mapped in the DSPR. Just as in human populations, replicating modest-effect, naturally-segregating causative variants in an association study framework in flies will likely require very large sample sizes.
Collapse
Affiliation(s)
- Michael A. Najarro
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Jennifer L. Hackett
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Brittny R. Smith
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Chad A. Highfill
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
| | - Elizabeth G. King
- Division of Biological Sciences, University of Missouri, Columbia, Missouri, United States of America
| | - Anthony D. Long
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, United States of America
| | - Stuart J. Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, United States of America
- Center for Computational Biology, University of Kansas, Lawrence, Kansas, United States of America
- * E-mail:
| |
Collapse
|
42
|
Fear JM, Arbeitman MN, Salomon MP, Dalton JE, Tower J, Nuzhdin SV, McIntyre LM. The Wright stuff: reimagining path analysis reveals novel components of the sex determination hierarchy in Drosophila melanogaster. BMC SYSTEMS BIOLOGY 2015; 9:53. [PMID: 26335107 PMCID: PMC4558766 DOI: 10.1186/s12918-015-0200-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 08/20/2015] [Indexed: 11/10/2022]
Abstract
BACKGROUND The Drosophila sex determination hierarchy is a classic example of a transcriptional regulatory hierarchy, with sex-specific isoforms regulating morphology and behavior. We use a structural equation modeling approach, leveraging natural genetic variation from two studies on Drosophila female head tissues--DSPR collection (596 F1-hybrids from crosses between DSPR sub-populations) and CEGS population (75 F1-hybrids from crosses between DGRP/Winters lines to a reference strain w1118)--to expand understanding of the sex hierarchy gene regulatory network (GRN). This approach is completely generalizable to any natural population, including humans. RESULTS We expanded the sex hierarchy GRN adding novel links among genes, including a link from fruitless (fru) to Sex-lethal (Sxl) identified in both populations. This link is further supported by the presence of fru binding sites in the Sxl locus. 754 candidate genes were added to the pathway, including the splicing factors male-specific lethal 2 and Rm62 as downstream targets of Sxl which are well-supported links in males. Independent studies of doublesex and transformer mutants support many additions, including evidence for a link between the sex hierarchy and metabolism, via Insulin-like receptor. CONCLUSIONS The genes added in the CEGS population were enriched for genes with sex-biased splicing and components of the spliceosome. A common goal of molecular biologists is to expand understanding about regulatory interactions among genes. Using natural alleles we can not only identify novel relationships, but using supervised approaches can order genes into a regulatory hierarchy. Combining these results with independent large effect mutation studies, allows clear candidates for detailed molecular follow-up to emerge.
Collapse
Affiliation(s)
- Justin M Fear
- Department of Molecular Genetics and Microbiology, University of Florida, CGRC Room 116, PO Box 100266, FL 32610-0266, Gainesville, FL, USA.
| | | | - Matthew P Salomon
- Molecular and Computational Biology, University of California, Los Angeles, CA, USA.
| | - Justin E Dalton
- Biomedical Science, Florida State University, Tallahassee, FL, USA.
| | - John Tower
- Molecular and Computational Biology, University of California, Los Angeles, CA, USA.
| | - Sergey V Nuzhdin
- Molecular and Computational Biology, University of California, Los Angeles, CA, USA.
| | - Lauren M McIntyre
- Department of Molecular Genetics and Microbiology, University of Florida, CGRC Room 116, PO Box 100266, FL 32610-0266, Gainesville, FL, USA.
| |
Collapse
|
43
|
Huang BE, Verbyla KL, Verbyla AP, Raghavan C, Singh VK, Gaur P, Leung H, Varshney RK, Cavanagh CR. MAGIC populations in crops: current status and future prospects. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2015; 128:999-1017. [PMID: 25855139 DOI: 10.1007/s00122-015-2506-0] [Citation(s) in RCA: 129] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 03/20/2015] [Indexed: 05/20/2023]
Abstract
MAGIC populations present novel challenges and opportunities in crops due to their complex pedigree structure. They offer great potential both for dissecting genomic structure and for improving breeding populations. The past decade has seen the rise of multiparental populations as a study design offering great advantages for genetic studies in plants. The genetic diversity of multiple parents, recombined over several generations, generates a genetic resource population with large phenotypic diversity suitable for high-resolution trait mapping. While there are many variations on the general design, this review focuses on populations where the parents have all been inter-mated, typically termed Multi-parent Advanced Generation Intercrosses (MAGIC). Such populations have already been created in model animals and plants, and are emerging in many crop species. However, there has been little consideration of the full range of factors which create novel challenges for design and analysis in these populations. We will present brief descriptions of large MAGIC crop studies currently in progress to motivate discussion of population construction, efficient experimental design, and genetic analysis in these populations. In addition, we will highlight some recent achievements and discuss the opportunities and advantages to exploit the unique structure of these resources post-QTL analysis for gene discovery.
Collapse
Affiliation(s)
- B Emma Huang
- Digital Productivity and Agriculture Flagships, CSIRO, Dutton Park, QLD, 4102, Australia,
| | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Chen J, Nolte V, Schlötterer C. Temperature stress mediates decanalization and dominance of gene expression in Drosophila melanogaster. PLoS Genet 2015; 11:e1004883. [PMID: 25719753 PMCID: PMC4342254 DOI: 10.1371/journal.pgen.1004883] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 11/10/2014] [Indexed: 11/18/2022] Open
Abstract
The regulatory architecture of gene expression remains an area of active research. Here, we studied how the interplay of genetic and environmental variation affects gene expression by exposing Drosophila melanogaster strains to four different developmental temperatures. At 18°C we observed almost complete canalization with only very few allelic effects on gene expression. In contrast, at the two temperature extremes, 13°C and 29°C a large number of allelic differences in gene expression were detected due to both cis- and trans-regulatory effects. Allelic differences in gene expression were mainly dominant, but for up to 62% of the genes the dominance swapped between 13 and 29°C. Our results are consistent with stabilizing selection causing buffering of allelic expression variation in non-stressful environments. We propose that decanalization of gene expression in stressful environments is not only central to adaptation, but may also contribute to genetic disorders in human populations.
Collapse
Affiliation(s)
- Jun Chen
- Institut für Populationsgenetik, Vienna, Austria
| | - Viola Nolte
- Institut für Populationsgenetik, Vienna, Austria
| | | |
Collapse
|
45
|
Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nat Rev Genet 2015; 16:197-212. [DOI: 10.1038/nrg3891] [Citation(s) in RCA: 684] [Impact Index Per Article: 68.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
46
|
Long AD, Macdonald SJ, King EG. Dissecting complex traits using the Drosophila Synthetic Population Resource. Trends Genet 2014; 30:488-95. [PMID: 25175100 DOI: 10.1016/j.tig.2014.07.009] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Revised: 07/28/2014] [Accepted: 07/28/2014] [Indexed: 11/25/2022]
Abstract
For most complex traits we have a poor understanding of the positions, phenotypic effects, and population frequencies of the underlying genetic variants contributing to their variation. Recently, several groups have developed multi-parent advanced intercross mapping panels in different model organisms in an attempt to improve our ability to characterize causative genetic variants. These panels are powerful and are particularly well suited to the dissection of phenotypic variation generated by rare alleles and loci segregating multiple functional alleles. We describe studies using one such panel, the Drosophila Synthetic Population Resource (DSPR), and the implications for our understanding of the genetic basis of complex traits. In particular, we note that many loci of large effect appear to be multiallelic. If multiallelism is a general rule, analytical approaches designed to identify multiallelic variants should be a priority for both genome-wide association studies (GWASs) and multi-parental panels.
Collapse
Affiliation(s)
- Anthony D Long
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA.
| | - Stuart J Macdonald
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045, USA
| | - Elizabeth G King
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|