51
|
Eser P, Wachutka L, Maier KC, Demel C, Boroni M, Iyer S, Cramer P, Gagneur J. Determinants of RNA metabolism in the Schizosaccharomyces pombe genome. Mol Syst Biol 2016; 12:857. [PMID: 26883383 PMCID: PMC4770384 DOI: 10.15252/msb.20156526] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
To decrypt the regulatory code of the genome, sequence elements must be defined that determine the kinetics of RNA metabolism and thus gene expression. Here, we attempt such decryption in an eukaryotic model organism, the fission yeast S. pombe. We first derive an improved genome annotation that redefines borders of 36% of expressed mRNAs and adds 487 non‐coding RNAs (ncRNAs). We then combine RNA labeling in vivo with mathematical modeling to obtain rates of RNA synthesis and degradation for 5,484 expressed RNAs and splicing rates for 4,958 introns. We identify functional sequence elements in DNA and RNA that control RNA metabolic rates and quantify the contributions of individual nucleotides to RNA synthesis, splicing, and degradation. Our approach reveals distinct kinetics of mRNA and ncRNA metabolism, separates antisense regulation by transcription interference from RNA interference, and provides a general tool for studying the regulatory code of genomes.
Collapse
Affiliation(s)
- Philipp Eser
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Leonhard Wachutka
- Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Kerstin C Maier
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Carina Demel
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Mariana Boroni
- Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Srignanakshi Iyer
- Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Patrick Cramer
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | - Julien Gagneur
- Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Munich, Germany
| |
Collapse
|
52
|
Unravelling the human genome-phenome relationship using phenome-wide association studies. Nat Rev Genet 2016; 17:129-45. [PMID: 26875678 DOI: 10.1038/nrg.2015.36] [Citation(s) in RCA: 182] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Advances in genotyping technology have, over the past decade, enabled the focused search for common genetic variation associated with human diseases and traits. With the recently increased availability of detailed phenotypic data from electronic health records and epidemiological studies, the impact of one or more genetic variants on the phenome is starting to be characterized both in clinical and population-based settings using phenome-wide association studies (PheWAS). These studies reveal a number of challenges that will need to be overcome to unlock the full potential of PheWAS for the characterization of the complex human genome-phenome relationship.
Collapse
|
53
|
Cenik C, Cenik ES, Byeon GW, Grubert F, Candille SI, Spacek D, Alsallakh B, Tilgner H, Araya CL, Tang H, Ricci E, Snyder MP. Integrative analysis of RNA, translation, and protein levels reveals distinct regulatory variation across humans. Genome Res 2015; 25:1610-21. [PMID: 26297486 PMCID: PMC4617958 DOI: 10.1101/gr.193342.115] [Citation(s) in RCA: 146] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2015] [Accepted: 08/20/2015] [Indexed: 11/24/2022]
Abstract
Elucidating the consequences of genetic differences between humans is essential for understanding phenotypic diversity and personalized medicine. Although variation in RNA levels, transcription factor binding, and chromatin have been explored, little is known about global variation in translation and its genetic determinants. We used ribosome profiling, RNA sequencing, and mass spectrometry to perform an integrated analysis in lymphoblastoid cell lines from a diverse group of individuals. We find significant differences in RNA, translation, and protein levels suggesting diverse mechanisms of personalized gene expression control. Combined analysis of RNA expression and ribosome occupancy improves the identification of individual protein level differences. Finally, we identify genetic differences that specifically modulate ribosome occupancy—many of these differences lie close to start codons and upstream ORFs. Our results reveal a new level of gene expression variation among humans and indicate that genetic variants can cause changes in protein levels through effects on translation.
Collapse
Affiliation(s)
- Can Cenik
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Elif Sarinay Cenik
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Gun W Byeon
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Fabian Grubert
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Sophie I Candille
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Damek Spacek
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Bilal Alsallakh
- Institute of Software Technology and Interactive Systems, Vienna University of Technology, A-140 Vienna, Austria
| | - Hagen Tilgner
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Carlos L Araya
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Hua Tang
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Emiliano Ricci
- RNA Therapeutics Institute, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA; CIRI, International Center for Infectiology Research, Eukaryotic and Viral Translation Team, Université de Lyon, INSERM U1111, Lyon, 69634, France
| | - Michael P Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| |
Collapse
|
54
|
Zhou X, Cain CE, Myrthil M, Lewellen N, Michelini K, Davenport ER, Stephens M, Pritchard JK, Gilad Y. Epigenetic modifications are associated with inter-species gene expression variation in primates. Genome Biol 2015; 15:547. [PMID: 25468404 PMCID: PMC4290387 DOI: 10.1186/s13059-014-0547-3] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2014] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Changes in gene regulation have long been thought to play an important role in evolution and speciation, especially in primates. Over the past decade, comparative genomic studies have revealed extensive inter-species differences in gene expression levels, yet we know much less about the extent to which regulatory mechanisms differ between species. RESULTS To begin addressing this gap, we perform a comparative epigenetic study in primate lymphoblastoid cell lines, to query the contribution of RNA polymerase II and four histone modifications, H3K4me1, H3K4me3, H3K27ac, and H3K27me3, to inter-species variation in gene expression levels. We find that inter-species differences in mark enrichment near transcription start sites are significantly more often associated with inter-species differences in the corresponding gene expression level than expected by chance alone. Interestingly, we also find that first-order interactions among the five marks, as well as chromatin states, do not markedly contribute to the degree of association between the marks and inter-species variation in gene expression levels, suggesting that the marginal effects of the five marks dominate this contribution. CONCLUSIONS Our observations suggest that epigenetic modifications are substantially associated with changes in gene expression levels among primates and may represent important molecular mechanisms in primate evolution.
Collapse
|
55
|
Gibson G, Powell JE, Marigorta UM. Expression quantitative trait locus analysis for translational medicine. Genome Med 2015; 7:60. [PMID: 26110023 PMCID: PMC4479075 DOI: 10.1186/s13073-015-0186-7] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Expression quantitative trait locus analysis has emerged as an important component of efforts to understand how genetic polymorphisms influence disease risk and is poised to make contributions to translational medicine. Here we review how expression quantitative trait locus analysis is aiding the identification of which gene(s) within regions of association are causal for a disease or phenotypic trait; the narrowing down of the cell types or regulators involved in the etiology of disease; the characterization of drivers and modifiers of cancer; and our understanding of how different environments and cellular contexts can modify gene expression. We also introduce the concept of transcriptional risk scores as a means of refining estimates of individual liability to disease based on targeted profiling of the transcripts that are regulated by polymorphisms jointly associated with disease and gene expression.
Collapse
Affiliation(s)
- Greg Gibson
- Center for Integrative Genomics, School of Biology, Georgia Institute of Technology, Atlanta, GA 30332 USA
| | - Joseph E Powell
- Centre for Neurogenetics and Statistical Genomics, Queensland Brain Institute, University of Queensland, St Lucia, Brisbane, QLD 4072 Australia ; The Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072 Australia
| | - Urko M Marigorta
- Center for Integrative Genomics, School of Biology, Georgia Institute of Technology, Atlanta, GA 30332 USA
| |
Collapse
|
56
|
del Rosario RCH, Poschmann J, Rouam SL, Png E, Khor CC, Hibberd ML, Prabhakar S. Sensitive detection of chromatin-altering polymorphisms reveals autoimmune disease mechanisms. Nat Methods 2015; 12:458-64. [PMID: 25799442 DOI: 10.1038/nmeth.3326] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2014] [Accepted: 02/06/2015] [Indexed: 12/30/2022]
Abstract
Most disease associations detected by genome-wide association studies (GWAS) lie outside coding genes, but very few have been mapped to causal regulatory variants. Here, we present a method for detecting regulatory quantitative trait loci (QTLs) that does not require genotyping or whole-genome sequencing. The method combines deep, long-read chromatin immunoprecipitation-sequencing (ChIP-seq) with a statistical test that simultaneously scores peak height correlation and allelic imbalance: the genotype-independent signal correlation and imbalance (G-SCI) test. We performed histone acetylation ChIP-seq on 57 human lymphoblastoid cell lines and used the resulting reads to call 500,066 single-nucleotide polymorphisms de novo within regulatory elements. The G-SCI test annotated 8,764 of these as histone acetylation QTLs (haQTLs)—an order of magnitude larger than the set of candidates detected by expression QTL analysis. Lymphoblastoid haQTLs were highly predictive of autoimmune disease mechanisms. Thus, our method facilitates large-scale regulatory variant detection in any moderately sized cohort for which functional profiling data can be generated, thereby simplifying identification of causal variants within GWAS loci.
Collapse
Affiliation(s)
| | - Jeremie Poschmann
- Computational and Systems Biology Group, Genome Institute of Singapore, Singapore
| | - Sigrid Laure Rouam
- Computational and Systems Biology Group, Genome Institute of Singapore, Singapore
| | - Eileen Png
- Infectious Diseases Group, Genome Institute of Singapore, Singapore
| | - Chiea Chuen Khor
- 1] Human Genetics Group, Genome Institute of Singapore, Singapore. [2] Singapore Eye Research Institute, Singapore. [3] Department of Opthalmology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Martin Lloyd Hibberd
- 1] Infectious Diseases Group, Genome Institute of Singapore, Singapore. [2] Department of Pathogen Molecular Biology, London School of Hygiene &Tropical Medicine, London, UK
| | - Shyam Prabhakar
- Computational and Systems Biology Group, Genome Institute of Singapore, Singapore
| |
Collapse
|
57
|
Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nat Rev Genet 2015; 16:197-212. [DOI: 10.1038/nrg3891] [Citation(s) in RCA: 684] [Impact Index Per Article: 68.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
58
|
Zhang W, Spector TD, Deloukas P, Bell JT, Engelhardt BE. Predicting genome-wide DNA methylation using methylation marks, genomic position, and DNA regulatory elements. Genome Biol 2015; 16:14. [PMID: 25616342 PMCID: PMC4389802 DOI: 10.1186/s13059-015-0581-9] [Citation(s) in RCA: 131] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2013] [Accepted: 01/02/2015] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Recent assays for individual-specific genome-wide DNA methylation profiles have enabled epigenome-wide association studies to identify specific CpG sites associated with a phenotype. Computational prediction of CpG site-specific methylation levels is critical to enable genome-wide analyses, but current approaches tackle average methylation within a locus and are often limited to specific genomic regions. RESULTS We characterize genome-wide DNA methylation patterns, and show that correlation among CpG sites decays rapidly, making predictions solely based on neighboring sites challenging. We built a random forest classifier to predict methylation levels at CpG site resolution using features including neighboring CpG site methylation levels and genomic distance, co-localization with coding regions, CpG islands (CGIs), and regulatory elements from the ENCODE project. Our approach achieves 92% prediction accuracy of genome-wide methylation levels at single-CpG-site precision. The accuracy increases to 98% when restricted to CpG sites within CGIs and is robust across platform and cell-type heterogeneity. Our classifier outperforms other types of classifiers and identifies features that contribute to prediction accuracy: neighboring CpG site methylation, CGIs, co-localized DNase I hypersensitive sites, transcription factor binding sites, and histone modifications were found to be most predictive of methylation levels. CONCLUSIONS Our observations of DNA methylation patterns led us to develop a classifier to predict DNA methylation levels at CpG site resolution with high accuracy. Furthermore, our method identified genomic features that interact with DNA methylation, suggesting mechanisms involved in DNA methylation modification and regulation, and linking diverse epigenetic processes.
Collapse
Affiliation(s)
- Weiwei Zhang
- Department of Molecular Genetics and Microbiology, Duke University, Durham, NC, USA.
| | - Tim D Spector
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK.
| | - Panos Deloukas
- William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK.
- Princess Al-Jawhara Al-Brahim Centre of Excellence in Research of Hereditary Disorders (PACER-HD), King Abdulaziz University, Jeddah, 21589, Saudi Arabia.
| | - Jordana T Bell
- Department of Twin Research and Genetic Epidemiology, King's College London, London, UK.
| | | |
Collapse
|
59
|
Abstract
It is now well established that noncoding regulatory variants play a central role in the genetics of common diseases and in evolution. However, until recently, we have known little about the mechanisms by which most regulatory variants act. For instance, what types of functional elements in DNA, RNA, or proteins are most often affected by regulatory variants? Which stages of gene regulation are typically altered? How can we predict which variants are most likely to impact regulation in a given cell type? Recent studies, in many cases using quantitative trait loci (QTL)-mapping approaches in cell lines or tissue samples, have provided us with considerable insight into the properties of genetic loci that have regulatory roles. Such studies have uncovered novel biochemical regulatory interactions and led to the identification of previously unrecognized regulatory mechanisms. We have learned that genetic variation is often directly associated with variation in regulatory activities (namely, we can map regulatory QTLs, not just expression QTLs [eQTLs]), and we have taken the first steps towards understanding the causal order of regulatory events (for example, the role of pioneer transcription factors). Yet, in most cases, we still do not know how to interpret overlapping combinations of regulatory interactions, and we are still far from being able to predict how variation in regulatory mechanisms is propagated through a chain of interactions to eventually result in changes in gene expression profiles.
Collapse
Affiliation(s)
- Athma A. Pai
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Jonathan K. Pritchard
- Departments of Genetics and Biology, and Howard Hughes Medical Institute; Stanford University, Stanford, California, United States of America
- * E-mail: (JKP); (YG)
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (JKP); (YG)
| |
Collapse
|
60
|
Battle A, Khan Z, Wang SH, Mitrano A, Ford MJ, Pritchard JK, Gilad Y. Genomic variation. Impact of regulatory variation from RNA to protein. Science 2014; 347:664-7. [PMID: 25657249 DOI: 10.1126/science.1260793] [Citation(s) in RCA: 335] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The phenotypic consequences of expression quantitative trait loci (eQTLs) are presumably due to their effects on protein expression levels. Yet the impact of genetic variation, including eQTLs, on protein levels remains poorly understood. To address this, we mapped genetic variants that are associated with eQTLs, ribosome occupancy (rQTLs), or protein abundance (pQTLs). We found that most QTLs are associated with transcript expression levels, with consequent effects on ribosome and protein levels. However, eQTLs tend to have significantly reduced effect sizes on protein levels, which suggests that their potential impact on downstream phenotypes is often attenuated or buffered. Additionally, we identified a class of cis QTLs that affect protein abundance with little or no effect on messenger RNA or ribosome levels, which suggests that they may arise from differences in posttranslational regulation.
Collapse
Affiliation(s)
- Alexis Battle
- Department of Genetics, Stanford University, Stanford, CA 94305, USA. Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| | - Zia Khan
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Sidney H Wang
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Amy Mitrano
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Michael J Ford
- MS Bioworks, LLC, 3950 Varsity Drive, Ann Arbor, MI 48108, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA 94305, USA. Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA. Department of Biology, Stanford University, Stanford, CA 94305, USA.
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
61
|
Popadin K, Gutierrez-Arcelus M, Lappalainen T, Buil A, Steinberg J, Nikolaev S, Lukowski S, Bazykin G, Seplyarskiy V, Ioannidis P, Zdobnov E, Dermitzakis E, Antonarakis S. Gene age predicts the strength of purifying selection acting on gene expression variation in humans. Am J Hum Genet 2014; 95:660-74. [PMID: 25480033 DOI: 10.1016/j.ajhg.2014.11.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 11/10/2014] [Indexed: 10/24/2022] Open
Abstract
Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci [cis-eQTLs]) and protein levels (cis protein QTLs [cis-pQTLs]), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.
Collapse
|
62
|
Das J, Podder S, Ghosh TC. Insights into the miRNA regulations in human disease genes. BMC Genomics 2014; 15:1010. [PMID: 25416156 PMCID: PMC4256923 DOI: 10.1186/1471-2164-15-1010] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2014] [Accepted: 11/11/2014] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND MicroRNAs are a class of short non-coding RNAs derived from either cellular or viral transcripts that act post-transcriptionally to regulate mRNA stability and translation. In recent days, increasing numbers of miRNAs have been shown to be involved in the development and progression of a variety of diseases. We, therefore, intend to enumerate miRNA targets in several known disease classes to explore the degree of miRNA regulations on them which is unexplored till date. RESULTS Here, we noticed that miRNA hits in cancer genes are remarkably higher than other diseases in human. Our observation suggests that UTRs and the transcript length of cancer related genes have a significant contribution in higher susceptibility to miRNA regulation. Moreover, gene duplication, mRNA stability, AREScores and evolutionary rate were likely to have implications for more miRNA targeting on cancer genes. Consequently, the regression analysis have confirmed that the AREScores plays most important role in detecting miRNA targets on disease genes. Interestingly, we observed that epigenetic modifications like CpG methylation and histone modification are less effective than miRNA regulations in controlling the gene expression of cancer genes. CONCLUSIONS The intrinsic properties of cancer genes studied here, for higher miRNA targeting will enhance the knowledge on cancer gene regulation.
Collapse
Affiliation(s)
| | - Soumita Podder
- Bioinformatics Centre, Bose Institute, P 1/12, C,I,T, Scheme VII M, Kolkata 700 054, India.
| | | |
Collapse
|
63
|
Zhang L, Xiao A, Ruggeri J, Bacares R, Somar J, Melo S, Figueiredo J, Simões-Correia J, Seruca R, Shah MA. The germline CDH1 c.48 G>C substitution contributes to cancer predisposition through generation of a pro-invasive mutation. Mutat Res 2014; 770:106-11. [PMID: 25771876 DOI: 10.1016/j.mrfmmm.2014.10.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Revised: 09/25/2014] [Accepted: 10/01/2014] [Indexed: 01/08/2023]
Abstract
Mutation screening of CDH1 is a standard of care for patients who meet criteria for Hereditary Diffuse Gastric Cancer (HDGC). In this setting, the classification of the sequence variants found in CDH1 is a critical step for risk management of patients with HDGC. In this report, we describe a germline CDH1 c.48 G>C variant found in a 21 year old woman and her living great uncle, who were both diagnosed with gastric cancer and belong to a family with high incidence of this type of cancer. This variant occurs at the last nucleotide of exon 1 and presumably results in a Gln-to-His change at codon 16 (Q16H). We used cloning strategies to evaluate the effects on mRNA stability and found that 5/27 and 0/17 clones have the "C" mutant allele in patient and her great uncle, respectively. In vitro functional studies revealed that the germline missense mutant (Q16H) had a pro-invasive cell behavior. Both results (functional and clinical) support the conclusion that the CDH1 c.48 G>C (Q16H) variant contributes to HDGC through the generation of a pathogenic missense mutation with loss of anti-invasive function.
Collapse
Affiliation(s)
- Liying Zhang
- Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA.
| | | | | | - Ruben Bacares
- Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Joshua Somar
- Department of Pathology, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Soraia Melo
- IPATIMUP - Institute of Molecular Pathology and Immunology of the University of Porto, 4200-465 Porto, Portugal
| | - Joana Figueiredo
- IPATIMUP - Institute of Molecular Pathology and Immunology of the University of Porto, 4200-465 Porto, Portugal
| | - Joana Simões-Correia
- IPATIMUP - Institute of Molecular Pathology and Immunology of the University of Porto, 4200-465 Porto, Portugal; Centre of Ophthalmology and Vision Sciences, IBILI - Faculty of Medicine, University of Coimbra, 3000-548 Coimbra, Portugal
| | - Raquel Seruca
- IPATIMUP - Institute of Molecular Pathology and Immunology of the University of Porto, 4200-465 Porto, Portugal; Faculty of Medicine, University of Porto, 4200-319 Porto, Portugal
| | - Manish A Shah
- Department of Medicine, Weill Cornell Medical College, New York, NY 10065, USA
| |
Collapse
|
64
|
Abstract
Gene expression levels are determined by the balance between rates of mRNA transcription and decay, and genetic variation in either of these processes can result in heritable differences in transcript abundance. Although the genetics of gene expression has been a subject of intense interest, the contribution of heritable variation in mRNA decay rates to gene expression variation has received far less attention. To this end, we developed a novel statistical framework and measured allele-specific differences in mRNA decay rates in a diploid yeast hybrid created by mating two genetically diverse parental strains. We estimate that 31% of genes exhibit allelic differences in mRNA decay rates, of which 350 can be identified at a false discovery rate of 10%. Genes with significant allele-specific differences in mRNA decay rates have higher levels of polymorphism compared to other genes, with all gene regions contributing to allelic differences in mRNA decay rates. Strikingly, we find widespread evidence for compensatory evolution, such that variants influencing transcriptional initiation and decay have opposite effects, suggesting that steady-state gene expression levels are subject to pervasive stabilizing selection. Our results demonstrate that heritable differences in mRNA decay rates are widespread and are an important target for natural selection to maintain or fine-tune steady-state gene expression levels.
Collapse
Affiliation(s)
- Jennifer M Andrie
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Jon Wakefield
- Department of Statistics, University of Washington, Seattle, Washington 98195, USA
| | - Joshua M Akey
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA;
| |
Collapse
|
65
|
Pai AA, Gilad Y. Comparative studies of gene regulatory mechanisms. Curr Opin Genet Dev 2014; 29:68-74. [PMID: 25215415 DOI: 10.1016/j.gde.2014.08.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 08/04/2014] [Accepted: 08/23/2014] [Indexed: 01/03/2023]
Abstract
It has become increasingly clear that changes in gene regulation have played an important role in adaptive evolution both between and within species. Over the past five years, comparative studies have moved beyond simple characterizations of differences in gene expression levels within and between species to studying variation in regulatory mechanisms. We still know relatively little about the precise chain of events that lead to most regulatory adaptations, but we have taken significant steps towards understanding the relative importance of changes in different mechanisms of gene regulatory evolution. In this review, we first discuss insights from comparative studies in model organisms, where the available experimental toolkit is extensive. We then focus on a few recent comparative studies in primates, where the limited feasibility of experimental manipulation dictates the approaches that can be used to study gene regulatory evolution.
Collapse
Affiliation(s)
- Athma A Pai
- Department of Biology, Massachusetts Institute of Technology, United States
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, United States.
| |
Collapse
|
66
|
When genetics meets epigenetics: deciphering the mechanisms controlling inter-individual variation in immune responses to infection. Curr Opin Immunol 2014; 29:119-26. [PMID: 24981784 DOI: 10.1016/j.coi.2014.06.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 05/31/2014] [Accepted: 06/04/2014] [Indexed: 11/23/2022]
Abstract
The response of host immune cells to microbial stimuli is dependent on robust and coordinated gene expression programs involving the transcription of thousands of genes. The dysregulation of such regulatory programs is likely to significantly contribute to the marked differences in susceptibility to infectious diseases observed among individuals and between human populations. Although the specific factors leading to a dysfunctional immune response to infection remain largely unknown, we are increasingly appreciating the importance of genetic variants in altering the expression levels of immune-related genes, possibly via epigenetic changes. This review describes how recent technological advances have profoundly contributed to our current understanding of the genetic architecture and the epigenetic rules controlling immune responses to infectious agents and how genetic and epigenetic data can be combined to unravel the mechanisms associated with host variation in transcriptional responses to infection.
Collapse
|
67
|
Zhang X, Gierman HJ, Levy D, Plump A, Dobrin R, Goring HHH, Curran JE, Johnson MP, Blangero J, Kim SK, O’Donnell CJ, Emilsson V, Johnson AD. Synthesis of 53 tissue and cell line expression QTL datasets reveals master eQTLs. BMC Genomics 2014; 15:532. [PMID: 24973796 PMCID: PMC4102726 DOI: 10.1186/1471-2164-15-532] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 06/18/2014] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Gene expression genetic studies in human tissues and cells identify cis- and trans-acting expression quantitative trait loci (eQTLs). These eQTLs provide insights into regulatory mechanisms underlying disease risk. However, few studies systematically characterized eQTL results across cell and tissues types. We synthesized eQTL results from >50 datasets, including new primary data from human brain, peripheral plaque and kidney samples, in order to discover features of human eQTLs. RESULTS We find a substantial number of robust cis-eQTLs and far fewer trans-eQTLs consistent across tissues. Analysis of 45 full human GWAS scans indicates eQTLs are enriched overall, and above nSNPs, among positive statistical signals in genetic mapping studies, and account for a significant fraction of the strongest human trait effects. Expression QTLs are enriched for gene centricity, higher population allele frequencies, in housekeeping genes, and for coincidence with regulatory features, though there is little evidence of 5' or 3' positional bias. Several regulatory categories are not enriched including microRNAs and their predicted binding sites and long, intergenic non-coding RNAs. Among the most tissue-ubiquitous cis-eQTLs, there is enrichment for genes involved in xenobiotic metabolism and mitochondrial function, suggesting these eQTLs may have adaptive origins. Several strong eQTLs (CDK5RAP2, NBPFs) coincide with regions of reported human lineage selection. The intersection of new kidney and plaque eQTLs with related GWAS suggest possible gene prioritization. For example, butyrophilins are now linked to arterial pathogenesis via multiple genetic and expression studies. Expression QTL and GWAS results are made available as a community resource through the NHLBI GRASP database [http://apps.nhlbi.nih.gov/grasp/]. CONCLUSIONS Expression QTLs inform the interpretation of human trait variability, and may account for a greater fraction of phenotypic variability than protein-coding variants. The synthesis of available tissue eQTL data highlights many strong cis-eQTLs that may have important biologic roles and could serve as positive controls in future studies. Our results indicate some strong tissue-ubiquitous eQTLs may have adaptive origins in humans. Efforts to expand the genetic, splicing and tissue coverage of known eQTLs will provide further insights into human gene regulation.
Collapse
Affiliation(s)
- Xiaoling Zhang
- />Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA USA
| | - Hinco J Gierman
- />Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305 USA
| | - Daniel Levy
- />Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA USA
| | - Andrew Plump
- />Sanofi Aventis Pharmaceuticals, Bridgewater, NJ 08807 USA
| | - Radu Dobrin
- />Johnson & Johnson Pharmaceutical Research and Development, Radnor, PA 19477 USA
| | - Harald HH Goring
- />Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX 78227 USA
| | - Joanne E Curran
- />Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX 78227 USA
| | - Matthew P Johnson
- />Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX 78227 USA
| | - John Blangero
- />Department of Genetics, Texas Biomedical Research Institute, San Antonio, TX 78227 USA
| | - Stuart K Kim
- />Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305 USA
| | - Christopher J O’Donnell
- />Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA USA
- />Division of Cardiology, Massachusetts General Hospital, Boston, MA 02114 USA
| | | | - Andrew D Johnson
- />Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA USA
| |
Collapse
|
68
|
Stark AL, Hause RJ, Gorsic LK, Antao NN, Wong SS, Chung SH, Gill DF, Im HK, Myers JL, White KP, Jones RB, Dolan ME. Protein quantitative trait loci identify novel candidates modulating cellular response to chemotherapy. PLoS Genet 2014; 10:e1004192. [PMID: 24699359 PMCID: PMC3974641 DOI: 10.1371/journal.pgen.1004192] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2013] [Accepted: 01/07/2014] [Indexed: 11/24/2022] Open
Abstract
Annotating and interpreting the results of genome-wide association studies (GWAS) remains challenging. Assigning function to genetic variants as expression quantitative trait loci is an expanding and useful approach, but focuses exclusively on mRNA rather than protein levels. Many variants remain without annotation. To address this problem, we measured the steady state abundance of 441 human signaling and transcription factor proteins from 68 Yoruba HapMap lymphoblastoid cell lines to identify novel relationships between inter-individual protein levels, genetic variants, and sensitivity to chemotherapeutic agents. Proteins were measured using micro-western and reverse phase protein arrays from three independent cell line thaws to permit mixed effect modeling of protein biological replicates. We observed enrichment of protein quantitative trait loci (pQTLs) for cellular sensitivity to two commonly used chemotherapeutics: cisplatin and paclitaxel. We functionally validated the target protein of a genome-wide significant trans-pQTL for its relevance in paclitaxel-induced apoptosis. GWAS overlap results of drug-induced apoptosis and cytotoxicity for paclitaxel and cisplatin revealed unique SNPs associated with the pharmacologic traits (at p<0.001). Interestingly, GWAS SNPs from various regions of the genome implicated the same target protein (p<0.0001) that correlated with drug induced cytotoxicity or apoptosis (p≤0.05). Two genes were functionally validated for association with drug response using siRNA: SMC1A with cisplatin response and ZNF569 with paclitaxel response. This work allows pharmacogenomic discovery to progress from the transcriptome to the proteome and offers potential for identification of new therapeutic targets. This approach, linking targeted proteomic data to variation in pharmacologic response, can be generalized to other studies evaluating genotype-phenotype relationships and provide insight into chemotherapeutic mechanisms. The central dogma of biology explains that DNA is transcribed to mRNA that is further translated into protein. Many genome-wide studies have implicated genetic variation that influences gene expression and that ultimately affect downstream complex traits including response to drugs. However, because of technical limitations, few studies have evaluated the contribution of genetic variation on protein expression and ensuing effects on downstream phenotypes. To overcome this challenge, we used a novel technology to simultaneously measure the baseline expression of 441 proteins in lymphoblastoid cell lines and compared them with publicly available genetic data. To further illustrate the utility of this approach, we compared protein-level measurements with chemotherapeutic induced apoptosis and cell-growth inhibition data. This study demonstrates the importance of using protein information to understand the functional consequences of genetic variants identified in genome-wide association studies. This protein data set will also have broad utility for understanding the relationship between other genome-wide studies of complex traits.
Collapse
Affiliation(s)
- Amy L. Stark
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Ronald J. Hause
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Ben May Department for Cancer Research, The University of Chicago, Chicago, Illinois, United States of America
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Lidija K. Gorsic
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Nirav N. Antao
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Shan S. Wong
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Sophie H. Chung
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Daniel F. Gill
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Hae K. Im
- Department of Health Studies, The University of Chicago, Chicago, Illinois, United States of America
| | - Jamie L. Myers
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
| | - Kevin P. White
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America
| | - Richard Baker Jones
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Ben May Department for Cancer Research, The University of Chicago, Chicago, Illinois, United States of America
- Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Committee on Clinical Pharmacology and Pharmacogenomics, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (RBJ); (MED)
| | - M. Eileen Dolan
- Department of Medicine, The University of Chicago, Chicago, Illinois, United States of America
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Committee on Clinical Pharmacology and Pharmacogenomics, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (RBJ); (MED)
| |
Collapse
|
69
|
Abstract
FOXP3(+) regulatory T (Treg) cells enforce immune self-tolerance and homeostasis, and variation in some aspects of Treg function may contribute to human autoimmune diseases. Here, we analyzed population-level Treg variability by performing genome-wide expression profiling of CD4(+) Treg and conventional CD4(+) T (Tconv) cells from 168 donors, healthy or with established type-1 diabetes (T1D) or type-2 diabetes (T2D), in relation to genetic and immunologic screening. There was a range of variability in Treg signature transcripts, some almost invariant, others more variable, with more extensive variability for genes that control effector function (ENTPD1, FCRL1) than for lineage-specification factors like FOXP3 or IKZF2. Network analysis of Treg signature genes identified coregulated clusters that respond similarly to genetic and environmental variation in Treg and Tconv cells, denoting qualitative differences in otherwise shared regulatory circuits whereas other clusters are coregulated in Treg, but not Tconv, cells, suggesting Treg-specific regulation of genes like CTLA4 or DUSP4. Dense genotyping identified 110 local genetic variants (cis-expression quantitative trait loci), some of which are specifically active in Treg, but not Tconv, cells. The Treg signature became sharper with age and with increasing body-mass index, suggesting a tuning of Treg function with repertoire selection and/or chronic inflammation. Some Treg signature transcripts correlated with FOXP3 mRNA and/or protein, suggesting transcriptional or posttranslational regulatory relationships. Although no single transcript showed significant association to diabetes, overall expression of the Treg signature was subtly perturbed in T1D, but not T2D, patients.
Collapse
|
70
|
Abstract
Systems genetics is an approach to understand the flow of biological information that underlies complex traits. It uses a range of experimental and statistical methods to quantitate and integrate intermediate phenotypes, such as transcript, protein or metabolite levels, in populations that vary for traits of interest. Systems genetics studies have provided the first global view of the molecular architecture of complex traits and are useful for the identification of genes, pathways and networks that underlie common human diseases. Given the urgent need to understand how the thousands of loci that have been identified in genome-wide association studies contribute to disease susceptibility, systems genetics is likely to become an increasingly important approach to understanding both biology and disease.
Collapse
Affiliation(s)
- Mete Civelek
- 1] Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles. [2] Department of Human Genetics, University of California, Los Angeles. [3] Department of Medicine, A2-237 Center for Health Sciences, University of California, Los Angeles, California 90095-1679, USA
| | - Aldons J Lusis
- 1] Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles. [2] Department of Human Genetics, University of California, Los Angeles. [3] Department of Medicine, A2-237 Center for Health Sciences, University of California, Los Angeles, California 90095-1679, USA
| |
Collapse
|
71
|
Sun M, Schwalb B, Pirkl N, Maier KC, Schenk A, Failmezger H, Tresch A, Cramer P. Global analysis of eukaryotic mRNA degradation reveals Xrn1-dependent buffering of transcript levels. Mol Cell 2013; 52:52-62. [PMID: 24119399 DOI: 10.1016/j.molcel.2013.09.010] [Citation(s) in RCA: 189] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2013] [Revised: 06/30/2013] [Accepted: 09/06/2013] [Indexed: 02/08/2023]
Abstract
The rates of mRNA synthesis and degradation determine cellular mRNA levels and can be monitored by comparative dynamic transcriptome analysis (cDTA) that uses nonperturbing metabolic RNA labeling. Here we present cDTA data for 46 yeast strains lacking genes involved in mRNA degradation and metabolism. In these strains, changes in mRNA degradation rates are generally compensated by changes in mRNA synthesis rates, resulting in a buffering of mRNA levels. We show that buffering of mRNA levels requires the RNA exonuclease Xrn1. The buffering is rapidly established when mRNA synthesis is impaired, but is delayed when mRNA degradation is impaired, apparently due to Xrn1-dependent transcription repressor induction. Cluster analysis of the data defines the general mRNA degradation machinery, reveals different substrate preferences for the two mRNA deadenylase complexes Ccr4-Not and Pan2-Pan3, and unveils an interwoven cellular mRNA surveillance network.
Collapse
Affiliation(s)
- Mai Sun
- Gene Center Munich and Department of Biochemistry, Center for Integrated Protein Science CIPSM, Ludwig-Maximilians-Universität München, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | | | | | | | | | | | | | | |
Collapse
|
72
|
Horvatovich P, Franke L, Bischoff R. Proteomic studies related to genetic determinants of variability in protein concentrations. J Proteome Res 2013; 13:5-14. [PMID: 24237071 DOI: 10.1021/pr400765y] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Genetic variation has multiple effects on the proteome. It may influence the expression level of proteins, modify their sequences through single nucleotide polymorphisms, the occurrence of allelic variants, or alternative splicing (ASP) events. This perspective paper summarizes the major effects of genetic variability on protein expression and isoforms and provides an overview of proteomics techniques and methods that allow studying the effects of genetic variability at different levels of the proteome. The paper provides an overview of recent quantitative trait loci studies performed to explore the effect of genetic variation on protein expression (pQTL). Finally it gives a perspective view on advances in proteomics technology and the role of the Chromosome-Centric Human Proteome Project (C-HPP) by creating large-scale resources that may facilitate performing more comprehensive pQTL experiments in the future.
Collapse
Affiliation(s)
- Péter Horvatovich
- Analytical Biochemistry, Department of Pharmacy, University of Groningen , A. Deusinglaan 1, 9713 AV Groningen, The Netherlands
| | | | | |
Collapse
|
73
|
Additive, epistatic, and environmental effects through the lens of expression variability QTL in a twin cohort. Genetics 2013; 196:413-25. [PMID: 24298061 DOI: 10.1534/genetics.113.157503] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
The expression of a gene can vary across individuals in the general population, as well as between monozygotic twins. This variable expression is assumed to be due to the influence of both genetic and nongenetic factors. Yet little evidence supporting this assumption has been obtained from empirical data. In this study, we used expression data from a large twin cohort to investigate the influences of genetic and nongenetic factors on variable gene expression. We focused on a set of expression variability QTL (evQTL)--i.e., genetic loci associated with the variance, as opposed to the mean, of gene expression. We identified evQTL for 99, 56, and 79 genes in lymphoblastoid cell lines, skin, and fat, respectively. The differences in gene expression, measured by the relative mean difference (RMD), tended to be larger between pairs of dizygotic (DZ) twins than between pairs of monozygotic (MZ) twins, showing that genetic background influenced the expression variability. Furthermore, a more profound RMD was observed between pairs of MZ twins whose genotypes were associated with greater expression variability than the RMD found between pairs of MZ twins whose genotypes were associated with smaller expression variability. This suggests that nongenetic (e.g., environmental) factors contribute to the variable expression. Lastly, we demonstrated that the formation of evQTL is likely due to partial linkages between eQTL SNPs that are additively associated with the mean of gene expression; in most cases, no epistatic effect is involved. Our findings have implications for understanding divergent sources of gene expression variability.
Collapse
|
74
|
Nguyen TT, Seoighe C. Integrative analysis of mRNA expression and half-life data reveals trans-acting genetic variants associated with increased expression of stable transcripts. PLoS One 2013; 8:e79627. [PMID: 24260269 PMCID: PMC3832542 DOI: 10.1371/journal.pone.0079627] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 10/03/2013] [Indexed: 11/19/2022] Open
Abstract
Genetic variation in gene expression makes an important contribution to phenotypic variation and susceptibility to disease. Recently, a subset of cis-acting expression quantitative loci (eQTLs) has been found to result from polymorphisms that affect RNA stability. Here we carried out a search for trans-acting variants that influence RNA stability. We first demonstrate that differences in the activity of trans-acting factors that stabilize RNA can be detected by comparing the expression levels of long-lived (stable) and short-lived (unstable) transcripts in high-throughput gene expression experiments. Using gene expression microarray data generated from eight HapMap3 populations, we calculated the relative expression ranks of long-lived transcripts versus short-lived transcripts in each sample. Treating this as a quantitative trait, we applied genome-wide association and identified a single nucleotide polymorphism (SNP), rs6137010, on chromosome 20p13 with which it is strongly associated in two Asian populations (p = 4×10−10 in CHB – Han Chinese from Beijing; p = 1×10−4 in JPT – Japanese from Tokyo). This SNP is a cis-eQTL for SNRPB in CHB and JPT but not in the other six HapMap3 populations. SNRPB is a core component of the spliceosome, and has previously been shown to affect the expression of many RNA processing factors. We propose that a cis-eQTL of SNRPB may be directly responsible for inter-individual variation in relative expression of long-lived versus short-lived transcript in Asian populations. In support of this hypothesis, knockdown of SNRPB results in a significant reduction in the relative expression of long-lived versus short-lived transcripts. Samples with higher relative expression of long-lived transcripts also had higher relative expression of coding compared to non-coding RNA and of RNA from housekeeping compared to non-housekeeping genes, due to the lower decay rates of coding RNAs, particularly those that perform housekeeping functions, compared to non-coding RNAs.
Collapse
Affiliation(s)
- Thong T. Nguyen
- School of Mathematics, Statistics & Applied Mathematics, National University of Ireland, Galway, Ireland
| | - Cathal Seoighe
- School of Mathematics, Statistics & Applied Mathematics, National University of Ireland, Galway, Ireland
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Anzio Road, Observatory, Cape Town, South Africa
- * E-mail:
| |
Collapse
|
75
|
Edwards SL, Beesley J, French JD, Dunning AM. Beyond GWASs: illuminating the dark road from association to function. Am J Hum Genet 2013; 93:779-97. [PMID: 24210251 PMCID: PMC3824120 DOI: 10.1016/j.ajhg.2013.10.012] [Citation(s) in RCA: 591] [Impact Index Per Article: 49.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Indexed: 12/15/2022] Open
Abstract
Genome-wide association studies (GWASs) have enabled the discovery of common genetic variation contributing to normal and pathological traits and clinical drug responses, but recognizing the precise targets of these associations is now the major challenge. Here, we review recent approaches to the functional follow-up of GWAS loci, including fine mapping of GWAS signal(s), prioritization of putative functional SNPs by the integration of genetic epidemiological and bioinformatic methods, and in vitro and in vivo experimental verification of predicted molecular mechanisms for identifying the targeted genes. The majority of GWAS-identified variants fall in noncoding regions of the genome. Therefore, this review focuses on strategies for assessing likely mechanisms affected by noncoding variants; such mechanisms include transcriptional regulation, noncoding RNA function, and epigenetic regulation. These approaches have already accelerated progress from genetic studies to biological knowledge and might ultimately guide the development of prognostic, preventive, and therapeutic measures.
Collapse
Affiliation(s)
- Stacey L Edwards
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland 4029, Australia; School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland 4072, Australia.
| | | | | | | |
Collapse
|
76
|
Variance heterogeneity in Saccharomyces cerevisiae expression data: trans-regulation and epistasis. PLoS One 2013; 8:e79507. [PMID: 24223957 PMCID: PMC3817098 DOI: 10.1371/journal.pone.0079507] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 09/30/2013] [Indexed: 11/26/2022] Open
Abstract
Here, we describe the results from the first variance heterogeneity Genome Wide Association Study (VGWAS) on yeast expression data. Using this forward genetics approach, we show that the genetic regulation of gene-expression in the budding yeast, Saccharomyces cerevisiae, includes mechanisms that can lead to variance heterogeneity in the expression between genotypes. Additionally, we performed a mean effect association study (GWAS). Comparing the mean and variance heterogeneity analyses, we find that the mean expression level is under genetic regulation from a larger absolute number of loci but that a higher proportion of the variance controlling loci were trans-regulated. Both mean and variance regulating loci cluster in regulatory hotspots that affect a large number of phenotypes; a single variance-controlling locus, mapping close to DIA2, was found to be involved in more than 10% of the significant associations. It has been suggested in the literature that variance-heterogeneity between the genotypes might be due to genetic interactions. We therefore screened the multi-locus genotype-phenotype maps for several traits where multiple associations were found, for indications of epistasis. Several examples of two and three locus genetic interactions were found to involve variance-controlling loci, with reports from the literature corroborating the functional connections between the loci. By using a new analytical approach to re-analyze a powerful existing dataset, we are thus able to both provide novel insights to the genetic mechanisms involved in the regulation of gene-expression in budding yeast and experimentally validate epistasis as an important mechanism underlying genetic variance-heterogeneity between genotypes.
Collapse
|
77
|
McVicker G, van de Geijn B, Degner JF, Cain CE, Banovich NE, Raj A, Lewellen N, Myrthil M, Gilad Y, Pritchard JK. Identification of genetic variants that affect histone modifications in human cells. Science 2013; 342:747-9. [PMID: 24136359 DOI: 10.1126/science.1242429] [Citation(s) in RCA: 335] [Impact Index Per Article: 27.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Histone modifications are important markers of function and chromatin state, yet the DNA sequence elements that direct them to specific genomic locations are poorly understood. Here, we identify hundreds of quantitative trait loci, genome-wide, that affect histone modification or RNA polymerase II (Pol II) occupancy in Yoruba lymphoblastoid cell lines (LCLs). In many cases, the same variant is associated with quantitative changes in multiple histone marks and Pol II, as well as in deoxyribonuclease I sensitivity and nucleosome positioning. Transcription factor binding site polymorphisms are correlated overall with differences in local histone modification, and we identify specific transcription factors whose binding leads to histone modification in LCLs. Furthermore, variants that affect chromatin at distal regulatory sites frequently also direct changes in chromatin and gene expression at associated promoters.
Collapse
Affiliation(s)
- Graham McVicker
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
78
|
Ohler U. Using machine learning to identify disease-relevant regulatory RNAs. Proc Natl Acad Sci U S A 2013; 110:15516-15517. [PMID: 24046375 PMCID: PMC3785730 DOI: 10.1073/pnas.1315199110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023] Open
Affiliation(s)
- Uwe Ohler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany
| |
Collapse
|
79
|
Kessler T, Hache H, Wierling C. Integrative analysis of cancer-related signaling pathways. Front Physiol 2013; 4:124. [PMID: 23760067 PMCID: PMC3671203 DOI: 10.3389/fphys.2013.00124] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2012] [Accepted: 05/12/2013] [Indexed: 12/11/2022] Open
Abstract
Identification and classification of cancer types and subtypes is a major issue in current cancer research. Whole genome expression profiling of cancer tissues is often the basis for such subtype classifications of tumors and different signatures for individual cancer types have been described. However, the search for best performing discriminatory gene-expression signatures covering more than one cancer type remains a relevant topic in cancer research as such a signature would help understanding the common changes in signaling networks in these disease types. In this work, we explore the idea of a top down approach for sample stratification based on a module-based network of cancer relevant signaling pathways. For assembly of this network, we consider several of the most established cancer pathways. We evaluate our sample stratification approach using expression data of human breast and ovarian cancer signatures. We show that our approach performs equally well to previously reported methods besides providing the advantage to classify different cancer types. Furthermore, it allows to identify common changes in network module activity of those cancer samples.
Collapse
Affiliation(s)
- Thomas Kessler
- Systems Biology Group, Department Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Hendrik Hache
- Systems Biology Group, Department Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Christoph Wierling
- Systems Biology Group, Department Vertebrate Genomics, Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
80
|
Abstract
Identification and functional interpretation of gene regulatory variants is a major focus of modern genomics. The application of genetic mapping to molecular and cellular traits has enabled the detection of regulatory variation on genome-wide scales and revealed an enormous diversity of regulatory architecture in humans and other species. In this review I summarise the insights gained and questions raised by a decade of genetic mapping of gene expression variation. I discuss recent extensions of this approach using alternative molecular phenotypes that have revealed some of the biological mechanisms that drive gene expression variation between individuals. Finally, I highlight outstanding problems and future directions for development.
Collapse
|
81
|
Variation and genetic control of protein abundance in humans. Nature 2013; 499:79-82. [PMID: 23676674 PMCID: PMC3789121 DOI: 10.1038/nature12223] [Citation(s) in RCA: 286] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2012] [Accepted: 04/26/2013] [Indexed: 12/21/2022]
Abstract
Gene expression differs among both individuals and populations and is thought to be a major determinant of phenotypic variation. Although variation and genetic loci responsible for RNA expression levels have been analyzed extensively in human populations1–5, our knowledge is limited regarding the differences in human protein abundance and their genetic basis. Variation in mRNA expression is not a perfect surrogate for protein expression because the latter is influenced by a battery of post-transcriptional regulatory mechanisms, and, empirically, the correlation between protein and mRNA levels is generally modest6,7. Here we used isobaric tandem mass tag (TMT)-based quantitative mass spectrometry to determine relative protein levels of 5953 genes in lymphoblastoid cell lines (LCLs) from 95 diverse individuals genotyped in the HapMap Project8,9. We found that protein levels are heritable molecular phenotypes that exhibit considerable variation between individuals, populations, and sexes. Levels of specific sets of proteins involved in the same biological process co-vary among individuals, indicating that these processes are tightly regulated at the protein level. We identified cis-pQTLs (protein quantitative trait loci), including variants not detected by previous transcriptome studies. This study demonstrates the feasibility of high throughput human proteome quantification which, when integrated with DNA variation and transcriptome information, adds a new dimension to the characterization of gene expression regulation.
Collapse
|
82
|
|
83
|
Deneke C, Lipowsky R, Valleriani A. Complex degradation processes lead to non-exponential decay patterns and age-dependent decay rates of messenger RNA. PLoS One 2013; 8:e55442. [PMID: 23408982 PMCID: PMC3569439 DOI: 10.1371/journal.pone.0055442] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Accepted: 12/23/2012] [Indexed: 11/18/2022] Open
Abstract
Experimental studies on mRNA stability have established several, qualitatively distinct decay patterns for the amount of mRNA within the living cell. Furthermore, a variety of different and complex biochemical pathways for mRNA degradation have been identified. The central aim of this paper is to bring together both the experimental evidence about the decay patterns and the biochemical knowledge about the multi-step nature of mRNA degradation in a coherent mathematical theory. We first introduce a mathematical relationship between the mRNA decay pattern and the lifetime distribution of individual mRNA molecules. This relationship reveals that the mRNA decay patterns at steady state expression level must obey a general convexity condition, which applies to any degradation mechanism. Next, we develop a theory, formulated as a Markov chain model, that recapitulates some aspects of the multi-step nature of mRNA degradation. We apply our theory to experimental data for yeast and explicitly derive the lifetime distribution of the corresponding mRNAs. Thereby, we show how to extract single-molecule properties of an mRNA, such as the age-dependent decay rate and the residual lifetime. Finally, we analyze the decay patterns of the whole translatome of yeast cells and show that yeast mRNAs can be grouped into three broad classes that exhibit three distinct decay patterns. This paper provides both a method to accurately analyze non-exponential mRNA decay patterns and a tool to validate different models of degradation using decay data.
Collapse
Affiliation(s)
- Carlus Deneke
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, Potsdam, Germany
| | - Reinhard Lipowsky
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, Potsdam, Germany
| | - Angelo Valleriani
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, Potsdam, Germany
- * E-mail:
| |
Collapse
|
84
|
Niemitz E. RNA decay QTL. Nat Genet 2012. [DOI: 10.1038/ng.2488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
85
|
Genetic variation contributing to mRNA decay. Nat Rev Genet 2012. [DOI: 10.1038/nrg3378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|