1
|
Mackay TFC, Anholt RRH. Pleiotropy, epistasis and the genetic architecture of quantitative traits. Nat Rev Genet 2024:10.1038/s41576-024-00711-3. [PMID: 38565962 DOI: 10.1038/s41576-024-00711-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/14/2024] [Indexed: 04/04/2024]
Abstract
Pleiotropy (whereby one genetic polymorphism affects multiple traits) and epistasis (whereby non-linear interactions between genetic polymorphisms affect the same trait) are fundamental aspects of the genetic architecture of quantitative traits. Recent advances in the ability to characterize the effects of polymorphic variants on molecular and organismal phenotypes in human and model organism populations have revealed the prevalence of pleiotropy and unexpected shared molecular genetic bases among quantitative traits, including diseases. By contrast, epistasis is common between polymorphic loci associated with quantitative traits in model organisms, such that alleles at one locus have different effects in different genetic backgrounds, but is rarely observed for human quantitative traits and common diseases. Here, we review the concepts and recent inferences about pleiotropy and epistasis, and discuss factors that contribute to similarities and differences between the genetic architecture of quantitative traits in model organisms and humans.
Collapse
Affiliation(s)
- Trudy F C Mackay
- Center for Human Genetics, Clemson University, Greenwood, SC, USA.
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA.
| | - Robert R H Anholt
- Center for Human Genetics, Clemson University, Greenwood, SC, USA.
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA.
| |
Collapse
|
2
|
Tyler AL, Emerson J, El Kassaby B, Wells AE, Philip VM, Carter GW. The Combined Analysis of Pleiotropy and Epistasis (CAPE). Methods Mol Biol 2021; 2212:55-67. [PMID: 33733350 DOI: 10.1007/978-1-0716-0947-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Epistasis, or gene-gene interaction, contributes substantially to trait variation in organisms ranging from yeast to humans, and modeling epistasis directly is critical to understanding the genotype-phenotype map. However, inference of genetic interactions is challenging compared to inference of individual allele effects due to low statistical power. Furthermore, genetic interactions can appear inconsistent across different quantitative traits, presenting a challenge for the interpretation of detected interactions. Here we present a method called the Combined Analysis of Pleiotropy and Epistasis (CAPE) that combines information across multiple quantitative traits to infer directed epistatic interactions. By combining information across multiple traits, CAPE not only increases power to detect genetic interactions but also interprets these interactions across traits to identify a single interaction that is consistent across all observed data. This method generates informative, interpretable interaction networks that explain how variants interact with each other to influence groups of related traits. This method could potentially be used to link genetic variants to gene expression, physiological endophenotypes, and higher-level disease traits.
Collapse
|
3
|
Xu F, Ashbrook DG, Gao J, Starlard-Davenport A, Zhao W, Miller DB, O'Callaghan JP, Williams RW, Jones BC, Lu L. Genome-wide transcriptome architecture in a mouse model of Gulf War Illness. Brain Behav Immun 2020; 89:209-223. [PMID: 32574576 PMCID: PMC7787136 DOI: 10.1016/j.bbi.2020.06.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 05/18/2020] [Accepted: 06/11/2020] [Indexed: 12/31/2022] Open
Abstract
Gulf War Illness (GWI) is thought to be a chronic neuroimmune disorder caused by in-theater exposure during the 1990-1991 Gulf War. There is a consensus that the illness is caused by exposure to insecticides and nerve agent toxicants. However, the heterogeneity in both development of disease and clinical outcomes strongly suggests a genetic contribution. Here, we modeled GWI in 30 BXD recombinant inbred mouse strains with a combined treatment of corticosterone (CORT) and diisopropyl fluorophosphate (DFP). We quantified transcriptomes from 409 prefrontal cortex samples. Compared to the untreated and DFP treated controls, the combined treatment significantly activated pathways such as cytokine-cytokine receptor interaction and TNF signaling pathway. Protein-protein interaction analysis defined 6 subnetworks for CORT + DFP, with the key regulators being Cxcl1, Il6, Ccnb1, Tnf, Agt, and Itgam. We also identified 21 differentially expressed genes having significant QTLs related to CORT + DFP, but without evidence for untreated and DFP treated controls, suggesting regions of the genome specifically involved in the response to CORT + DFP. We identified Adamts9 as a potential contributor to response to CORT + DFP and found links to symptoms of GWI. Furthermore, we observed a significant effect of CORT + DFP treatment on the relative proportion of myelinating oligodendrocytes, with a QTL on Chromosome 5. We highlight three candidates, Magi2, Sema3c, and Gnai1, based on their high expression in the brain and oligodendrocyte. In summary, our results show significant genetic effects of the CORT + DFP treatment, which mirrors gene and protein expression changes seen in GWI sufferers, providing insight into the disease and a testbed for future interventions.
Collapse
Affiliation(s)
- Fuyi Xu
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - David G Ashbrook
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Jun Gao
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA; Institute of Animal Husbandry and Veterinary Science, Shanghai Academy of Agricultural Sciences, Shanghai 201106, China
| | - Athena Starlard-Davenport
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Wenyuan Zhao
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Diane B Miller
- Toxicology and Molecular Biology Branch, Health Effects Laboratory Division, Centers for Disease Control and Prevention, National Institute for Occupational Safety and Health, Morgantown, WV 26505, USA
| | - James P O'Callaghan
- Molecular Neurotoxicology Laboratory, Centers for Disease Control and Prevention, National Institute for Occupational Safety and Health, Morgantown, WV 26505, USA
| | - Robert W Williams
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Byron C Jones
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA.
| | - Lu Lu
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA.
| |
Collapse
|
4
|
Genetic Interactions Affect Lung Function in Patients with Systemic Sclerosis. G3-GENES GENOMES GENETICS 2020; 10:151-163. [PMID: 31694854 PMCID: PMC6945038 DOI: 10.1534/g3.119.400775] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Scleroderma, or systemic sclerosis (SSc), is an autoimmune disease characterized by progressive fibrosis of the skin and internal organs. The most common cause of death in people with SSc is lung disease, but the pathogenesis of lung disease in SSc is insufficiently understood to devise specific treatment strategies. Developing targeted treatments requires not only the identification of molecular processes involved in SSc-associated lung disease, but also understanding of how these processes interact to drive pathology. One potentially powerful approach is to identify alleles that interact genetically to influence lung outcomes in patients with SSc. Analysis of interactions, rather than individual allele effects, has the potential to delineate molecular interactions that are important in SSc-related lung pathology. However, detecting genetic interactions, or epistasis, in human cohorts is challenging. Large numbers of variants with low minor allele frequencies, paired with heterogeneous disease presentation, reduce power to detect epistasis. Here we present an analysis that increases power to detect epistasis in human genome-wide association studies (GWAS). We tested for genetic interactions influencing lung function and autoantibody status in a cohort of 416 SSc patients. Using Matrix Epistasis to filter SNPs followed by the Combined Analysis of Pleiotropy and Epistasis (CAPE), we identified a network of interacting alleles influencing lung function in patients with SSc. In particular, we identified a three-gene network comprising WNT5A, RBMS3, and MSI2, which in combination influenced multiple pulmonary pathology measures. The associations of these genes with lung outcomes in SSc are novel and high-confidence. Furthermore, gene coexpression analysis suggested that the interactions we identified are tissue-specific, thus differentiating SSc-related pathogenic processes in lung from those in skin.
Collapse
|
5
|
Mahoney JM, Mills JD, Muhlebner A, Noebels J, Potschka H, Simonato M, Kobow K. 2017 WONOEP appraisal: Studying epilepsy as a network disease using systems biology approaches. Epilepsia 2019; 60:1045-1053. [DOI: 10.1111/epi.15216] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Revised: 04/17/2019] [Accepted: 04/17/2019] [Indexed: 12/15/2022]
Affiliation(s)
- John M. Mahoney
- Department of Neurological Sciences Department of Computer Science University of Vermont Larner College of Medicine Burlington Vermont
| | - James D. Mills
- Department of (Neuro)Pathology Amsterdam University Medical CenterUniversity of Amsterdam Amsterdam The Netherlands
| | - Angelika Muhlebner
- Department of (Neuro)Pathology Amsterdam University Medical CenterUniversity of Amsterdam Amsterdam The Netherlands
| | - Jeffrey Noebels
- Department of Neurology Baylor College of Medicine Houston Texas
| | - Heidrun Potschka
- Institute of Pharmacology, Toxicology, and Pharmacy Ludwig Maximilian University of Munich Munich Germany
| | - Michele Simonato
- Department of Medical Sciences University of Ferrara and School of Medicine University Vita‐Salute San Raffaele Milan Italy
| | - Katja Kobow
- Department of Neuropathology Universitätsklinikum ErlangenFriedrich‐Alexander University Erlangen‐Nürnberg Erlangen Germany
| |
Collapse
|
6
|
Protein Moonlighting Revealed by Noncatalytic Phenotypes of Yeast Enzymes. Genetics 2017; 208:419-431. [PMID: 29127264 DOI: 10.1534/genetics.117.300377] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Accepted: 11/06/2017] [Indexed: 12/19/2022] Open
Abstract
A single gene can partake in several biological processes, and therefore gene deletions can lead to different-sometimes unexpected-phenotypes. However, it is not always clear whether such pleiotropy reflects the loss of a unique molecular activity involved in different processes or the loss of a multifunctional protein. Here, using Saccharomyces cerevisiae metabolism as a model, we systematically test the null hypothesis that enzyme phenotypes depend on a single annotated molecular function, namely their catalysis. We screened a set of carefully selected genes by quantifying the contribution of catalysis to gene deletion phenotypes under different environmental conditions. While most phenotypes were explained by loss of catalysis, slow growth was readily rescued by a catalytically inactive protein in about one-third of the enzymes tested. Such noncatalytic phenotypes were frequent in the Alt1 and Bat2 transaminases and in the isoleucine/valine biosynthetic enzymes Ilv1 and Ilv2, suggesting novel "moonlighting" activities in these proteins. Furthermore, differential genetic interaction profiles of gene deletion and catalytic mutants indicated that ILV1 is functionally associated with regulatory processes, specifically to chromatin modification. Our systematic study shows that gene loss phenotypes and their genetic interactions are frequently not driven by the loss of an annotated catalytic function, underscoring the moonlighting nature of cellular metabolism.
Collapse
|
7
|
Dissecting genetic architecture of startle response in Drosophila melanogaster using multi-omics information. Sci Rep 2017; 7:12367. [PMID: 28959013 PMCID: PMC5620086 DOI: 10.1038/s41598-017-11676-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 08/24/2017] [Indexed: 01/01/2023] Open
Abstract
Startle behavior is important for survival, and abnormal startle responses are related to several neurological diseases. Drosophila melanogaster provides a powerful system to investigate the genetic underpinnings of variation in startle behavior. Since mechanically induced, startle responses and environmental conditions can be readily quantified and precisely controlled. The 156 wild-derived fully sequenced lines of the Drosophila Genetic Reference Panel (DGRP) were used to identify SNPs and transcripts associated with variation in startle behavior. The results validated highly significant effects of 33 quantitative trait SNPs (QTSs) and 81 quantitative trait transcripts (QTTs) directly associated with phenotypic variation of startle response. We also detected QTT variation controlled by 20 QTSs (tQTSs) and 73 transcripts (tQTTs). Association mapping based on genomic and transcriptomic data enabled us to construct a complex genetic network that underlies variation in startle behavior. Based on principles of evolutionary conservation, human orthologous genes could be superimposed on this network. This study provided both genetic and biological insights into the variation of startle response behavior of Drosophila melanogaster, and highlighted the importance of genetic network to understand the genetic architecture of complex traits.
Collapse
|
8
|
Epistatic Networks Jointly Influence Phenotypes Related to Metabolic Disease and Gene Expression in Diversity Outbred Mice. Genetics 2017; 206:621-639. [PMID: 28592500 PMCID: PMC5499176 DOI: 10.1534/genetics.116.198051] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 04/03/2017] [Indexed: 12/20/2022] Open
Abstract
In this study, Tyler et al. analyzed the complex genetic architecture of metabolic disease-related traits using the Diversity Outbred mouse population Genetic studies of multidimensional phenotypes can potentially link genetic variation, gene expression, and physiological data to create multi-scale models of complex traits. The challenge of reducing these data to specific hypotheses has become increasingly acute with the advent of genome-scale data resources. Multi-parent populations derived from model organisms provide a resource for developing methods to understand this complexity. In this study, we simultaneously modeled body composition, serum biomarkers, and liver transcript abundances from 474 Diversity Outbred mice. This population contained both sexes and two dietary cohorts. Transcript data were reduced to functional gene modules with weighted gene coexpression network analysis (WGCNA), which were used as summary phenotypes representing enriched biological processes. These module phenotypes were jointly analyzed with body composition and serum biomarkers in a combined analysis of pleiotropy and epistasis (CAPE), which inferred networks of epistatic interactions between quantitative trait loci that affect one or more traits. This network frequently mapped interactions between alleles of different ancestries, providing evidence of both genetic synergy and redundancy between haplotypes. Furthermore, a number of loci interacted with sex and diet to yield sex-specific genetic effects and alleles that potentially protect individuals from the effects of a high-fat diet. Although the epistatic interactions explained small amounts of trait variance, the combination of directional interactions, allelic specificity, and high genomic resolution provided context to generate hypotheses for the roles of specific genes in complex traits. Our approach moves beyond the cataloging of single loci to infer genetic networks that map genetic etiology by simultaneously modeling all phenotypes.
Collapse
|
9
|
Vonesch SC, Lamparter D, Mackay TFC, Bergmann S, Hafen E. Genome-Wide Analysis Reveals Novel Regulators of Growth in Drosophila melanogaster. PLoS Genet 2016; 12:e1005616. [PMID: 26751788 PMCID: PMC4709145 DOI: 10.1371/journal.pgen.1005616] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Accepted: 09/28/2015] [Indexed: 12/21/2022] Open
Abstract
Organismal size depends on the interplay between genetic and environmental factors. Genome-wide association (GWA) analyses in humans have implied many genes in the control of height but suffer from the inability to control the environment. Genetic analyses in Drosophila have identified conserved signaling pathways controlling size; however, how these pathways control phenotypic diversity is unclear. We performed GWA of size traits using the Drosophila Genetic Reference Panel of inbred, sequenced lines. We find that the top associated variants differ between traits and sexes; do not map to canonical growth pathway genes, but can be linked to these by epistasis analysis; and are enriched for genes and putative enhancers. Performing GWA on well-studied developmental traits under controlled conditions expands our understanding of developmental processes underlying phenotypic diversity. Genetic studies in Drosophila have elucidated conserved signaling pathways and environmental factors that together control organismal size. In humans, hundreds of genes are associated with height variation, but these associations have not been performed in a controlled environment. As a result we are still lacking an understanding of the mechanisms creating size variability within a species. Here, under carefully controlled environmental conditions, we identify naturally occurring genetic variants that are associated with size diversity in Drosophila. We identify a cluster of associations close to the kek1 locus, a well-characterized growth regulator, but otherwise find that most variants are located in or close to genes that do not belong to the conserved pathways but may interact with these in a biological network. We validate 33 novel growth regulatory genes that participate in diverse cellular processes, most notably cellular metabolism and cell polarity. This study is the first genome-wide association analysis of natural variants underlying size in Drosophila and our results complement the knowledge we have accumulated on this trait from mutational studies of single genes.
Collapse
Affiliation(s)
| | - David Lamparter
- Department of Medical Genetics, University of Lausanne, Lausanne, Switzerland
| | - Trudy F. C. Mackay
- Department of Biological Sciences, Program in Genetics, W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Sven Bergmann
- Department of Medical Genetics, University of Lausanne, Lausanne, Switzerland
| | - Ernst Hafen
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
- * E-mail:
| |
Collapse
|
10
|
Tyler AL, McGarr TC, Beyer BJ, Frankel WN, Carter GW. A genetic interaction network model of a complex neurological disease. GENES BRAIN AND BEHAVIOR 2014; 13:831-40. [PMID: 25251056 PMCID: PMC4241132 DOI: 10.1111/gbb.12178] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Revised: 08/26/2014] [Accepted: 09/18/2014] [Indexed: 12/05/2022]
Abstract
Absence epilepsy (AE) is a complex, heritable disease characterized by a brief disruption of normal behavior and accompanying spike wave discharges (SWD) on the electroencephalogram. Only a handful of genes has been definitively associated with AE in humans and rodent models. Most studies suggest that genetic interactions play a large role in the etiology and severity of AE, but mapping and understanding their architecture remains a challenge, requiring new computational approaches. Here we use Combined Analysis of Pleiotropy and Epistasis (CAPE) to detect and interpret genetic interactions in a meta-population derived from three C3H x B6 strain crosses, each of which is fixed for a different SWD-causing mutation. Although each mutation causes SWD through a different molecular mechanism, the phenotypes caused by each mutation are exacerbated on the C3H genetic background compared with B6, suggesting common modifiers. By combining information across two phenotypic measures – SWD duration and frequency – CAPE revealed a large, directed genetic network consisting of suppressive and enhancing interactions between loci on 10 chromosomes. These results illustrate the power of CAPE in identifying novel modifier loci and interactions in a complex neurological disease, towards a more comprehensive view of its underlying genetic architecture.
Collapse
Affiliation(s)
- A L Tyler
- The Jackson Laboratory, Bar Harbor, ME, USA
| | | | | | | | | |
Collapse
|
11
|
Schramm SJ, Jayaswal V, Goel A, Li SS, Yang YH, Mann GJ, Wilkins MR. Molecular interaction networks for the analysis of human disease: utility, limitations, and considerations. Proteomics 2014; 13:3393-405. [PMID: 24166987 DOI: 10.1002/pmic.201200570] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 09/11/2013] [Accepted: 10/07/2013] [Indexed: 01/01/2023]
Abstract
High-throughput '-omics' data can be combined with large-scale molecular interaction networks, for example, protein-protein interaction networks, to provide a unique framework for the investigation of human molecular biology. Interest in these integrative '-omics' methods is growing rapidly because of their potential to understand complexity and association with disease; such approaches have a focus on associations between phenotype and "network-type." The potential of this research is enticing, yet there remain a series of important considerations. Here, we discuss interaction data selection, data quality, the relative merits of using data from large high-throughput studies versus a meta-database of smaller literature-curated studies, and possible issues of sociological or inspection bias in interaction data. Other work underway, especially international consortia to establish data formats, quality standards and address data redundancy, and the improvements these efforts are making to the field, is also evaluated. We present options for researchers intending to use large-scale molecular interaction networks as a functional context for protein or gene expression data, including microRNAs, especially in the context of human disease.
Collapse
Affiliation(s)
- Sarah-Jane Schramm
- Sydney Medical School, Westmead Millennium Institute for Medical Research, The University of Sydney, Sydney, NSW, Australia; Melanoma Institute Australia, Sydney, NSW, Australia
| | | | | | | | | | | | | |
Collapse
|
12
|
PHILIP VIVEKM, TYLER ANNAL, CARTER GREGORYW. Dissection of complex gene expression using the combined analysis of pleiotropy and epistasis. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2014:200-11. [PMID: 24297548 PMCID: PMC3900022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Global transcript expression experiments are commonly used to investigate the biological processes that underlie complex traits. These studies can exhibit complex patterns of pleiotropy when trans-acting genetic factors influence overlapping sets of multiple transcripts. Dissecting these patterns into biological modules with distinct genetic etiology can provide models of how genetic variants affect specific processes that contribute to a trait. Here we identify transcript modules associated with pleiotropic genetic factors and apply genetic interaction analysis to disentangle the regulatory architecture in a mouse intercross study of kidney function. The method, called the combined analysis of pleiotropy and epistasis (CAPE), has been previously used to model genetic networks for multiple physiological traits. It simultaneously models multiple phenotypes to identify direct genetic influences as well as influences mediated through genetic interactions. We first identify candidate trans expression quantitative trait loci (eQTL) and the transcripts potentially affected. We then clustered the transcripts into modules of co-expressed genes, from which we compute summary module phenotypes. Finally, we applied CAPE to map the network of interacting module QTL (modQTL) affecting the gene modules. The resulting network mapped how multiple modQTL both directly and indirectly affect modules associated with metabolic functions and biosynthetic processes. This work demonstrates how the integration of pleiotropic signals in gene expression data can be used to infer a complex hypothesis of how multiple loci interact to co-regulate transcription programs, thereby providing additional constraints to prioritize validation experiments.
Collapse
|
13
|
Tyler AL, Lu W, Hendrick JJ, Philip VM, Carter GW. CAPE: an R package for combined analysis of pleiotropy and epistasis. PLoS Comput Biol 2013; 9:e1003270. [PMID: 24204223 PMCID: PMC3808451 DOI: 10.1371/journal.pcbi.1003270] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 08/15/2013] [Indexed: 11/18/2022] Open
Abstract
Contemporary genetic studies are revealing the genetic complexity of many traits in humans and model organisms. Two hallmarks of this complexity are epistasis, meaning gene-gene interaction, and pleiotropy, in which one gene affects multiple phenotypes. Understanding the genetic architecture of complex traits requires addressing these phenomena, but interpreting the biological significance of epistasis and pleiotropy is often difficult. While epistasis reveals dependencies between genetic variants, it is often unclear how the activity of one variant is specifically modifying the other. Epistasis found in one phenotypic context may disappear in another context, rendering the genetic interaction ambiguous. Pleiotropy can suggest either redundant phenotype measures or gene variants that affect multiple biological processes. Here we present an R package, R/cape, which addresses these interpretation ambiguities by implementing a novel method to generate predictive and interpretable genetic networks that influence quantitative phenotypes. R/cape integrates information from multiple related phenotypes to constrain models of epistasis, thereby enhancing the detection of interactions that simultaneously describe all phenotypes. The networks inferred by R/cape are readily interpretable in terms of directed influences that indicate suppressive and enhancing effects of individual genetic variants on other variants, which in turn account for the variance in quantitative traits. We demonstrate the utility of R/cape by analyzing a mouse backcross, thereby discovering novel epistatic interactions influencing phenotypes related to obesity and diabetes. R/cape is an easy-to-use, platform-independent R package and can be applied to data from both genetic screens and a variety of segregating populations including backcrosses, intercrosses, and natural populations. The package is freely available under the GPL-3 license at http://cran.r-project.org/web/packages/cape.
Collapse
Affiliation(s)
- Anna L. Tyler
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
| | - Wei Lu
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
- Department of Electrical and Computer Engineering, Duke University, Durham, North Carolina, United States of America
| | | | - Vivek M. Philip
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
| | - Gregory W. Carter
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
- * E-mail:
| |
Collapse
|