1
|
Park D, Cenik C. Long-read RNA sequencing reveals allele-specific N 6-methyladenosine modifications. Genome Res 2025; 35:999-1011. [PMID: 39472020 PMCID: PMC12047277 DOI: 10.1101/gr.279270.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 10/23/2024] [Indexed: 11/06/2024]
Abstract
Long-read sequencing technology enables highly accurate detection of allele-specific RNA expression, providing insights into the effects of genetic variation on splicing and RNA abundance. Furthermore, the ability to directly sequence RNA enables the detection of RNA modifications in tandem with ascertaining the allelic origin of each molecule. Here, we leverage these advantages to determine allele-biased patterns of N 6-methyladenosine (m6A) modifications in native mRNA. We used human and mouse cells with known genetic variants to assign the allelic origin of each mRNA molecule combined with a supervised machine learning model to detect read-level m6A modification ratios. Our analyses reveal the importance of sequences adjacent to the DRACH motif in determining m6A deposition, in addition to allelic differences that directly alter the motif. Moreover, we discover allele-specific m6A modification events with no genetic variants in close proximity to the differentially modified nucleotide, demonstrating the unique advantage of using long-reads and surpassing the capabilities of antibody-based short-read approaches. This technological advance will further our understanding of the role of genetics in determining mRNA modifications.
Collapse
Affiliation(s)
- Dayea Park
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas 78712, USA
| | - Can Cenik
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
2
|
Zhou H, Clark E, Guan D, Lagarrigue S, Fang L, Cheng H, Tuggle CK, Kapoor M, Wang Y, Giuffra E, Egidy G. Comparative Genomics and Epigenomics of Transcriptional Regulation. Annu Rev Anim Biosci 2025; 13:73-98. [PMID: 39565835 DOI: 10.1146/annurev-animal-111523-102217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
Transcriptional regulation in response to diverse physiological cues involves complicated biological processes. Recent initiatives that leverage whole genome sequencing and annotation of regulatory elements significantly contribute to our understanding of transcriptional gene regulation. Advances in the data sets available for comparative genomics and epigenomics can identify evolutionarily constrained regulatory variants and shed light on noncoding elements that influence transcription in different tissues and developmental stages across species. Most epigenomic data, however, are generated from healthy subjects at specific developmental stages. To bridge the genotype-phenotype gap, future research should focus on generating multidimensional epigenomic data under diverse physiological conditions. Farm animal species offer advantages in terms of feasibility, cost, and experimental design for such integrative analyses in comparison to humans. Deep learning modeling and cutting-edge technologies in sequencing and functional screening and validation also provide great promise for better understanding transcriptional regulation in this dynamic field.
Collapse
Affiliation(s)
- Huaijun Zhou
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | - Emily Clark
- The Roslin Institute, University of Edinburgh, Edinburgh, Midlothian, United Kingdom;
| | - Dailu Guan
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark;
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Muskan Kapoor
- Department of Animal Science, Iowa State University, Ames, Iowa, USA; ,
| | - Ying Wang
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Giorgia Egidy
- GABI, AgroParisTech, INRAE, Jouy-en-Josas, France; ,
| |
Collapse
|
3
|
Park D, Cenik C. Long-read RNA sequencing reveals allele-specific N 6-methyladenosine modifications. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.08.602538. [PMID: 39026828 PMCID: PMC11257478 DOI: 10.1101/2024.07.08.602538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Long-read sequencing technology enables highly accurate detection of allele-specific RNA expression, providing insights into the effects of genetic variation on splicing and RNA abundance. Furthermore, the ability to directly sequence RNA promises the detection of RNA modifications in tandem with ascertaining the allelic origin of each molecule. Here, we leverage these advantages to determine allele-biased patterns of N6-methyladenosine (m6A) modifications in native mRNA. We utilized human and mouse cells with known genetic variants to assign allelic origin of each mRNA molecule combined with a supervised machine learning model to detect read-level m6A modification ratios. Our analyses revealed the importance of sequences adjacent to the DRACH-motif in determining m6A deposition, in addition to allelic differences that directly alter the motif. Moreover, we discovered allele-specific m6A modification (ASM) events with no genetic variants in close proximity to the differentially modified nucleotide, demonstrating the unique advantage of using long reads and surpassing the capabilities of antibody-based short-read approaches. This technological advancement promises to advance our understanding of the role of genetics in determining mRNA modifications.
Collapse
Affiliation(s)
- Dayea Park
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| | - Can Cenik
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| |
Collapse
|
4
|
Lagattuta KA, Park HL, Rumker L, Ishigaki K, Nathan A, Raychaudhuri S. The genetic basis of autoimmunity seen through the lens of T cell functional traits. Nat Commun 2024; 15:1204. [PMID: 38331990 PMCID: PMC10853555 DOI: 10.1038/s41467-024-45170-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 01/15/2024] [Indexed: 02/10/2024] Open
Abstract
Autoimmune disease heritability is enriched in T cell-specific regulatory regions of the genome. Modern-day T cell datasets now enable association studies between single nucleotide polymorphisms (SNPs) and a myriad of molecular phenotypes, including chromatin accessibility, gene expression, transcriptional programs, T cell antigen receptor (TCR) amino acid usage, and cell state abundances. Such studies have identified hundreds of quantitative trait loci (QTLs) in T cells that colocalize with genetic risk for autoimmune disease. The key challenge facing immunologists today lies in synthesizing these results toward a unified understanding of the autoimmune T cell: which genes, cell states, and antigens drive tissue destruction?
Collapse
Affiliation(s)
- Kaitlyn A Lagattuta
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Hannah L Park
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Laurie Rumker
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Kazuyoshi Ishigaki
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Laboratory for Human Immunogenetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Aparna Nathan
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Soumya Raychaudhuri
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
- Division of Rheumatology, Inflammation, and Immunity, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
5
|
Akhtar F, Ruiz JH, Liu YG, Resendez RG, Feliers D, Morales LD, Diaz-Badillo A, Lehman DM, Arya R, Lopez-Alvarenga JC, Blangero J, Duggirala R, Mummidi S. Functional characterization of the disease-associated CCL2 rs1024611G-rs13900T haplotype: The role of the RNA-binding protein HuR. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.31.564937. [PMID: 37961304 PMCID: PMC10635030 DOI: 10.1101/2023.10.31.564937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
CC-chemokine ligand 2 (CCL2) is involved in the pathogenesis of several diseases associated with monocyte/macrophage recruitment, such as HIV-associated neurocognitive disorder (HAND), tuberculosis, and atherosclerosis. The rs1024611 (alleles:A>G; G is the risk allele) polymorphism in the CCL2 cis-regulatory region is associated with increased CCL2 expression in vitro and ex vivo, leukocyte mobilization in vivo, and deleterious disease outcomes. However, the molecular basis for the rs1024611-associated differential CCL2 expression remains poorly characterized. It is conceivable that genetic variant(s) in linkage disequilibrium (LD) with rs1024611 could mediate such effects. Previously, we used rs13900 (alleles:_C>T) in the CCL2 3' untranslated region (3' UTR) that is in perfect LD with rs1024611 to demonstrate allelic expression imbalance (AEI) of CCL2 in heterozygous individuals. Here we tested the hypothesis that the rs13900 could modulate CCL2 expression by altering mRNA turnover and/or translatability. The rs13900 T allele conferred greater stability to the CCL2 transcript when compared to the rs13900 C allele. The rs13900 T allele also had increased binding to Human Antigen R (HuR), an RNA-binding protein, in vitro and ex vivo. The rs13900 alleles imparted differential activity to reporter vectors and influenced the translatability of the reporter transcript. We further demonstrated a role for HuR in mediating allele-specific effects on CCL2 expression in overexpression and silencing studies. The presence of the rs1024611G-rs13900T conferred a distinct transcriptomic signature related to inflammation and immunity. Our studies suggest that the differential interactions of HuR with rs13900 could modulate CCL2 expression and explain the interindividual differences in CCL2-mediated disease susceptibility.
Collapse
Affiliation(s)
- Feroz Akhtar
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Joselin Hernandez Ruiz
- Utah Center for Genetic Discovery, Department of Human Genetics, University of Utah, Salt Lake City, Utah, USA
| | - Ya-Guang Liu
- Department of Pathology, School of Medicine, University of Texas Health San Antonio, San Antonio, Texas, USA
| | - Roy G. Resendez
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Denis Feliers
- Department of Medicine, School of Medicine, University of Texas Health San Antonio, San Antonio, Texas, USA
| | - Liza D. Morales
- South Texas Diabetes and Obesity Institute, Department of Genetics, School of Medicine, University of Texas Rio Grane Valley, Brownsville, USA
| | - Alvaro Diaz-Badillo
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Donna M. Lehman
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Rector Arya
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Juan Carlos Lopez-Alvarenga
- Department of Population Health and Biostatistics, School of Medicine, University of Texas Rio Grande Valley, Harlingen, Texas, USA
| | - John Blangero
- South Texas Diabetes and Obesity Institute, Department of Genetics, School of Medicine, University of Texas Rio Grane Valley, Brownsville, USA
| | - Ravindranath Duggirala
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| | - Srinivas Mummidi
- Department of Health and Behavioral Sciences, Texas A&M University- San Antonio, Texas, USA
| |
Collapse
|
6
|
Adams DM, Reay WR, Cairns MJ. Multiomic prioritisation of risk genes for anorexia nervosa. Psychol Med 2023; 53:6754-6762. [PMID: 36803885 PMCID: PMC10600818 DOI: 10.1017/s0033291723000235] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 01/12/2023] [Accepted: 01/23/2023] [Indexed: 02/22/2023]
Abstract
BACKGROUND Anorexia nervosa (AN) is a psychiatric disorder associated with marked morbidity. Whilst AN genetic studies could identify novel treatment targets, integration of functional genomics data, including transcriptomics and proteomics, would assist to disentangle correlated signals and reveal causally associated genes. METHODS We used models of genetically imputed expression and splicing from 14 tissues, leveraging mRNA, protein, and mRNA alternative splicing weights to identify genes, proteins, and transcripts, respectively, associated with AN risk. This was accomplished through transcriptome, proteome, and spliceosome-wide association studies, followed by conditional analysis and finemapping to prioritise candidate causal genes. RESULTS We uncovered 134 genes for which genetically predicted mRNA expression was associated with AN after multiple-testing correction, as well as four proteins and 16 alternatively spliced transcripts. Conditional analysis of these significantly associated genes on other proximal association signals resulted in 97 genes independently associated with AN. Moreover, probabilistic finemapping further refined these associations and prioritised putative causal genes. The gene WDR6, for which increased genetically predicted mRNA expression was correlated with AN, was strongly supported by both conditional analyses and finemapping. Pathway analysis of genes revealed by finemapping identified the pathway regulation of immune system process (overlapping genes = MST1, TREX1, PRKAR2A, PROS1) as statistically overrepresented. CONCLUSIONS We leveraged multiomic datasets to genetically prioritise novel risk genes for AN. Multiple-lines of evidence support that WDR6 is associated with AN, whilst other prioritised genes were enriched within immune related pathways, further supporting the role of the immune system in AN.
Collapse
Affiliation(s)
- Danielle M. Adams
- School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia
- Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
| | - William R. Reay
- School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia
- Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
| | - Murray J. Cairns
- School of Biomedical Sciences and Pharmacy, Centre for Complex Disease Neurobiology and Precision Medicine, The University of Newcastle, Callaghan, NSW, Australia
- Precision Medicine Research Program, Hunter Medical Research Institute, Newcastle, NSW, Australia
| |
Collapse
|
7
|
D'Sa K, Guelfi S, Vandrovcova J, Reynolds RH, Zhang D, Hardy J, Botía JA, Weale ME, Taliun SAG, Small KS, Ryten M. Analysis of subcellular RNA fractions demonstrates significant genetic regulation of gene expression in human brain post-transcriptionally. Sci Rep 2023; 13:13874. [PMID: 37620324 PMCID: PMC10449874 DOI: 10.1038/s41598-023-40324-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 08/08/2023] [Indexed: 08/26/2023] Open
Abstract
Gaining insight into the genetic regulation of gene expression in human brain is key to the interpretation of genome-wide association studies for major neurological and neuropsychiatric diseases. Expression quantitative trait loci (eQTL) analyses have largely been used to achieve this, providing valuable insights into the genetic regulation of steady-state RNA in human brain, but not distinguishing between molecular processes regulating transcription and stability. RNA quantification within cellular fractions can disentangle these processes in cell types and tissues which are challenging to model in vitro. We investigated the underlying molecular processes driving the genetic regulation of gene expression specific to a cellular fraction using allele-specific expression (ASE). Applying ASE analysis to genomic and transcriptomic data from paired nuclear and cytoplasmic fractions of anterior prefrontal cortex, cerebellar cortex and putamen tissues from 4 post-mortem neuropathologically-confirmed control human brains, we demonstrate that a significant proportion of genetic regulation of gene expression occurs post-transcriptionally in the cytoplasm, with genes undergoing this form of regulation more likely to be synaptic. These findings have implications for understanding the structure of gene expression regulation in human brain, and importantly the interpretation of rapidly growing single-nucleus brain RNA-sequencing and eQTL datasets, where cytoplasm-specific regulatory events could be missed.
Collapse
Affiliation(s)
- Karishma D'Sa
- Department of Neurodegenerative Disease, University College London, London, WC1N 3BG, UK
- Department of Medical & Molecular Genetics, School of Medical Sciences, King's College London, Guy's Hospital, London, SE1 1UL, UK
- Department of Clinical and Movement Neurosciences, University College London, London, WC1N 3BG, UK
| | - Sebastian Guelfi
- Department of Neurodegenerative Disease, University College London, London, WC1N 3BG, UK
- Verge Genomics, Tower Pl, South San Francisco, CA, 94080, USA
| | - Jana Vandrovcova
- Dept of Neuromuscular Disease, UCL Queen Square Institute of Neurology, London, WC1N 3BG, UK
| | - Regina H Reynolds
- Great Ormond Street Institute of Child Health, Genetics and Genomic Medicine, University College London, London, WC1N 1EH, UK
| | - David Zhang
- Great Ormond Street Institute of Child Health, Genetics and Genomic Medicine, University College London, London, WC1N 1EH, UK
| | - John Hardy
- Department of Neurodegenerative Disease, University College London, London, WC1N 3BG, UK
- UK Dementia Research Institute at University College London, London, WC1N 3BG, UK
| | - Juan A Botía
- Great Ormond Street Institute of Child Health, Genetics and Genomic Medicine, University College London, London, WC1N 1EH, UK
- Departamento de Ingeniería de la Información y las Comunicaciones, Universidad de Murcia, 30100, Murcia, Spain
| | - Michael E Weale
- Department of Medical & Molecular Genetics, School of Medical Sciences, King's College London, Guy's Hospital, London, SE1 1UL, UK
- Genomics Plc, Oxford, OX1 1JD, UK
| | - Sarah A Gagliano Taliun
- Department of Medicine, Université de Montréal, Montréal, QC, H3T 1J4, Canada
- Montréal Heart Institute, Montréal, QC, H1T 1C8, Canada
- Department of Neurosciences, Université de Montréal, Montréal, QC, H3T 1J4, Canada
| | - Kerrin S Small
- Department of Twin Research and Genetic Epidemiology, King's College London, London, SE1 7EH, UK
| | - Mina Ryten
- Great Ormond Street Institute of Child Health, Genetics and Genomic Medicine, University College London, London, WC1N 1EH, UK.
- NIHR Great Ormond Street Hospital Biomedical Research Centre, University College London, London, WC1N 3JH, UK.
| |
Collapse
|
8
|
Wienecke AN, Barry ML, Pollard DA. Natural variation in codon bias and mRNA folding strength interact synergistically to modify protein expression in Saccharomyces cerevisiae. Genetics 2023; 224:iyad113. [PMID: 37310925 PMCID: PMC10411576 DOI: 10.1093/genetics/iyad113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 04/10/2023] [Accepted: 05/15/2023] [Indexed: 06/15/2023] Open
Abstract
Codon bias and mRNA folding strength (mF) are hypothesized molecular mechanisms by which polymorphisms in genes modify protein expression. Natural patterns of codon bias and mF across genes as well as effects of altering codon bias and mF suggest that the influence of these 2 mechanisms may vary depending on the specific location of polymorphisms within a transcript. Despite the central role codon bias and mF may play in natural trait variation within populations, systematic studies of how polymorphic codon bias and mF relate to protein expression variation are lacking. To address this need, we analyzed genomic, transcriptomic, and proteomic data for 22 Saccharomyces cerevisiae isolates, estimated protein accumulation for each allele of 1,620 genes as the log of protein molecules per RNA molecule (logPPR), and built linear mixed-effects models associating allelic variation in codon bias and mF with allelic variation in logPPR. We found that codon bias and mF interact synergistically in a positive association with logPPR, and this interaction explains almost all the effects of codon bias and mF. We examined how the locations of polymorphisms within transcripts influence their effects and found that codon bias primarily acts through polymorphisms in domain-encoding and 3' coding sequences, while mF acts most significantly through coding sequences with weaker effects from untranslated regions. Our results present the most comprehensive characterization to date of how polymorphisms in transcripts influence protein expression.
Collapse
Affiliation(s)
- Anastacia N Wienecke
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Margaret L Barry
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
| | - Daniel A Pollard
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
| |
Collapse
|
9
|
Connally NJ, Nazeen S, Lee D, Shi H, Stamatoyannopoulos J, Chun S, Cotsapas C, Cassa CA, Sunyaev SR. The missing link between genetic association and regulatory function. eLife 2022; 11:e74970. [PMID: 36515579 PMCID: PMC9842386 DOI: 10.7554/elife.74970] [Citation(s) in RCA: 76] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 12/02/2022] [Indexed: 12/15/2022] Open
Abstract
The genetic basis of most traits is highly polygenic and dominated by non-coding alleles. It is widely assumed that such alleles exert small regulatory effects on the expression of cis-linked genes. However, despite the availability of gene expression and epigenomic datasets, few variant-to-gene links have emerged. It is unclear whether these sparse results are due to limitations in available data and methods, or to deficiencies in the underlying assumed model. To better distinguish between these possibilities, we identified 220 gene-trait pairs in which protein-coding variants influence a complex trait or its Mendelian cognate. Despite the presence of expression quantitative trait loci near most GWAS associations, by applying a gene-based approach we found limited evidence that the baseline expression of trait-related genes explains GWAS associations, whether using colocalization methods (8% of genes implicated), transcription-wide association (2% of genes implicated), or a combination of regulatory annotations and distance (4% of genes implicated). These results contradict the hypothesis that most complex trait-associated variants coincide with homeostatic expression QTLs, suggesting that better models are needed. The field must confront this deficit and pursue this 'missing regulation.'
Collapse
Affiliation(s)
- Noah J Connally
- Department of Biomedical Informatics, Harvard Medical SchoolBostonUnited States
- Brigham and Women’s Hospital, Division of Genetics, Harvard Medical SchoolBostonUnited States
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
| | - Sumaiya Nazeen
- Department of Biomedical Informatics, Harvard Medical SchoolBostonUnited States
- Brigham and Women’s Hospital, Division of Genetics, Harvard Medical SchoolBostonUnited States
- Brigham and Women’s Hospital, Department of Neurology, Harvard Medical SchoolBostonUnited States
| | - Daniel Lee
- Department of Biomedical Informatics, Harvard Medical SchoolBostonUnited States
- Brigham and Women’s Hospital, Division of Genetics, Harvard Medical SchoolBostonUnited States
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
| | - Huwenbo Shi
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
- Department of Epidemiology, Harvard T.H. Chan School of Public HealthBostonUnited States
| | | | - Sung Chun
- Division of Pulmonary Medicine, Boston Children’s HospitalBostonUnited States
| | - Chris Cotsapas
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
- Department of Neurology, Yale Medical SchoolNew HavenUnited States
- Department of Genetics, Yale Medical SchoolNew HavenUnited States
| | - Christopher A Cassa
- Brigham and Women’s Hospital, Division of Genetics, Harvard Medical SchoolBostonUnited States
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
| | - Shamil R Sunyaev
- Department of Biomedical Informatics, Harvard Medical SchoolBostonUnited States
- Brigham and Women’s Hospital, Division of Genetics, Harvard Medical SchoolBostonUnited States
- Program in Medical and Population Genetics, Broad Institute of MIT and HarvardCambridgeUnited States
| |
Collapse
|
10
|
Agarwal V, Kelley DR. The genetic and biochemical determinants of mRNA degradation rates in mammals. Genome Biol 2022; 23:245. [PMID: 36419176 PMCID: PMC9684954 DOI: 10.1186/s13059-022-02811-x] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 11/02/2022] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND Degradation rate is a fundamental aspect of mRNA metabolism, and the factors governing it remain poorly characterized. Understanding the genetic and biochemical determinants of mRNA half-life would enable more precise identification of variants that perturb gene expression through post-transcriptional gene regulatory mechanisms. RESULTS We establish a compendium of 39 human and 27 mouse transcriptome-wide mRNA decay rate datasets. A meta-analysis of these data identified a prevalence of technical noise and measurement bias, induced partially by the underlying experimental strategy. Correcting for these biases allowed us to derive more precise, consensus measurements of half-life which exhibit enhanced consistency between species. We trained substantially improved statistical models based upon genetic and biochemical features to better predict half-life and characterize the factors molding it. Our state-of-the-art model, Saluki, is a hybrid convolutional and recurrent deep neural network which relies only upon an mRNA sequence annotated with coding frame and splice sites to predict half-life (r=0.77). The key novel principle learned by Saluki is that the spatial positioning of splice sites, codons, and RNA-binding motifs within an mRNA is strongly associated with mRNA half-life. Saluki predicts the impact of RNA sequences and genetic mutations therein on mRNA stability, in agreement with functional measurements derived from massively parallel reporter assays. CONCLUSIONS Our work produces a more robust ground truth for transcriptome-wide mRNA half-lives in mammalian cells. Using these revised measurements, we trained Saluki, a model that is over 50% more accurate in predicting half-life from sequence than existing models. Saluki succinctly captures many of the known determinants of mRNA half-life and can be rapidly deployed to predict the functional consequences of arbitrary mutations in the transcriptome.
Collapse
Affiliation(s)
- Vikram Agarwal
- Calico Life Sciences LLC, South San Francisco, CA, 94080, USA.
- Present Address: mRNA Center of Excellence, Sanofi Pasteur Inc., Waltham, MA, 02451, USA.
| | - David R Kelley
- Calico Life Sciences LLC, South San Francisco, CA, 94080, USA.
| |
Collapse
|
11
|
Han J, Lee C. Antagonistic regulatory effects of a single cis-acting expression quantitative trait locus between transcription and translation of the MRPL43 gene. BMC Genom Data 2022; 23:42. [PMID: 35659240 PMCID: PMC9167510 DOI: 10.1186/s12863-022-01057-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 05/23/2022] [Indexed: 01/01/2023] Open
Abstract
Background Heterogeneity of expression quantitative trait locus (eQTL) effects have been shown across gene expression processes. Knowledge on how to produce the heterogeneity is quite limited. This study aims to examine fluctuations in differential gene expression by alleles of sequence variants across expression processes. Results Genome-wide eQTL analyses with transcriptome-wide gene expression data revealed 20 cis-acting eQTLs associated simultaneously with mRNA expression, ribosome occupancy, and protein abundance. A 97 kb-long eQTL signal for mitochondrial ribosomal protein L43 (MRPL43) covered the gene, showing a heterogeneous effect size on gene products across expression stages. One allele of the eQTL was associated with increased mRNA expression and ribosome occupancy but decreased protein abundance. We examined the heterogeneity and found that the eQTL can be attributed to the independent functions of three nucleotide variants, with a strong linkage. NC_000010.11:g.100987606G > T, upstream of MRPL43, may regulate the binding affinity of transcription factors. NC_000010.11:g.100986746C > G, 3 bp from an MRPL43 splice donor site, may alter the splice site. NC_000010.11:g.100978794A > G, in the isoform with a long 3′-UTR, may strengthen the binding affinity of the microRNA. Individuals with the TGG haplotype at these three variants had higher levels of mRNA expression and ribosome occupancy than individuals with the GCA haplotype but lower protein levels, producing the flipped effect throughout the expression process. Conclusions These findings suggest that multiple functional variants in a linkage exert their regulatory functions at different points in the gene expression process, producing a complexity of single eQTLs. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-022-01057-7.
Collapse
Affiliation(s)
- Jooyeon Han
- Department of Bioinformatics and Life Science, Soongsil University, Seoul, 06978, South Korea
| | - Chaeyoung Lee
- Department of Bioinformatics and Life Science, Soongsil University, Seoul, 06978, South Korea.
| |
Collapse
|
12
|
Olayinka OA, O'Neill NK, Farrer LA, Wang G, Zhang X. Molecular Quantitative Trait Locus Mapping in Human Complex Diseases. Curr Protoc 2022; 2:e426. [PMID: 35587224 PMCID: PMC9186089 DOI: 10.1002/cpz1.426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Mapping quantitative trait loci (QTLs) for molecular traits from chromatin to metabolites (i.e., xQTLs) provides insight into the locations and effect modes of genetic variants that influence these molecular phenotypes and the propagation of functional consequences of each variant. xQTL studies indirectly interrogate the functional landscape of the molecular basis of complex diseases, including the impact of non-coding regulatory variants, the tissue specificity of regulatory elements, and their contribution to disease by integrating with genome-wide association studies (GWAS). We summarize a variety of molecular xQTL studies in human tissues and cells. In addition, using the Alzheimer's Disease Sequencing Project (ADSP) as an example, we describe the ADSP xQTL project, a collaborative effort across the ADSP Functional Genomics Consortium (ADSP-FGC). The project's ultimate goal is a reference map of Alzheimer's-related QTLs using existing datasets from multiple omics layers to help us study the consequences of genetic variants identified in the ADSP. xQTL studies enable the identification of the causal genes and pathways in GWAS loci, which will likely aid in the discovery of novel biomarkers and therapeutic targets for complex diseases. © 2022 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Oluwatosin A Olayinka
- Bioinformatics Program, Boston University, Boston, Massachusetts
- Department of Medicine (Biomedical Genetics), Boston University School of Medicine, Boston, Massachusetts
| | - Nicholas K O'Neill
- Bioinformatics Program, Boston University, Boston, Massachusetts
- Department of Medicine (Biomedical Genetics), Boston University School of Medicine, Boston, Massachusetts
| | - Lindsay A Farrer
- Bioinformatics Program, Boston University, Boston, Massachusetts
- Department of Medicine (Biomedical Genetics), Boston University School of Medicine, Boston, Massachusetts
- Department of Neurology, Boston University School of Medicine, Boston, Massachusetts
- Department of Ophthalmology, Boston University School of Medicine, Boston, Massachusetts
- Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts
- Department of Epidemiology, Boston University School of Public Health, Boston, Massachusetts
| | - Gao Wang
- Department of Neurology, Columbia University, New York, New York
- Gertrude H. Sergievsky Center, Columbia University, New York, New York
| | - Xiaoling Zhang
- Bioinformatics Program, Boston University, Boston, Massachusetts
- Department of Medicine (Biomedical Genetics), Boston University School of Medicine, Boston, Massachusetts
- Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts
| |
Collapse
|
13
|
Bossini-Castillo L, Glinos DA, Kunowska N, Golda G, Lamikanra AA, Spitzer M, Soskic B, Cano-Gamez E, Smyth DJ, Cattermole C, Alasoo K, Mann A, Kundu K, Lorenc A, Soranzo N, Dunham I, Roberts DJ, Trynka G. Immune disease variants modulate gene expression in regulatory CD4 + T cells. CELL GENOMICS 2022; 2:None. [PMID: 35591976 PMCID: PMC9010307 DOI: 10.1016/j.xgen.2022.100117] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 11/02/2021] [Accepted: 03/15/2022] [Indexed: 12/30/2022]
Abstract
Identifying cellular functions dysregulated by disease-associated variants could implicate novel pathways for drug targeting or modulation in cell therapies. However, follow-up studies can be challenging if disease-relevant cell types are difficult to sample. Variants associated with immune diseases point toward the role of CD4+ regulatory T cells (Treg cells). We mapped genetic regulation (quantitative trait loci [QTL]) of gene expression and chromatin activity in Treg cells, and we identified 133 colocalizing loci with immune disease variants. Colocalizations of immune disease genome-wide association study (GWAS) variants with expression QTLs (eQTLs) controlling the expression of CD28 and STAT5A, involved in Treg cell activation and interleukin-2 (IL-2) signaling, support the contribution of Treg cells to the pathobiology of immune diseases. Finally, we identified seven known drug targets suitable for drug repurposing and suggested 63 targets with drug tractability evidence among the GWAS signals that colocalized with Treg cell QTLs. Our study is the first in-depth characterization of immune disease variant effects on Treg cell gene expression modulation and dysregulation of Treg cell function.
Collapse
Affiliation(s)
| | - Dafni A. Glinos
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- New York Genome Center, New York, NY, USA
| | - Natalia Kunowska
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Gosia Golda
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Abigail A. Lamikanra
- NHS Blood and Transplant, Oxford, UK
- BRC Haematology Theme, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Michaela Spitzer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Blagoje Soskic
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Eddie Cano-Gamez
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Deborah J. Smyth
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | | | - Kaur Alasoo
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Alice Mann
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Kousik Kundu
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Anna Lorenc
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Nicole Soranzo
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - David J. Roberts
- NHS Blood and Transplant, Oxford, UK
- BRC Haematology Theme, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Gosia Trynka
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| |
Collapse
|
14
|
Li JR, Tang M, Li Y, Amos CI, Cheng C. Genetic variants associated mRNA stability in lung. BMC Genomics 2022; 23:196. [PMID: 35272635 PMCID: PMC8915503 DOI: 10.1186/s12864-022-08405-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 02/21/2022] [Indexed: 12/04/2022] Open
Abstract
Background Expression quantitative trait loci (eQTLs) analyses have been widely used to identify genetic variants associated with gene expression levels to understand what molecular mechanisms underlie genetic traits. The resultant eQTLs might affect the expression of associated genes through transcriptional or post-transcriptional regulation. In this study, we attempt to distinguish these two types of regulation by identifying genetic variants associated with mRNA stability of genes (stQTLs). Results Here, we presented a computational framework that takes advantage of recently developed methods to infer the mRNA stability of genes based on RNA-seq data and performed association analysis to identify stQTLs. Using the Genotype-Tissue Expression (GTEx) lung RNA-Seq data, we identified a total of 142,801 stQTLs for 3942 genes and 186,132 eQTLs for 4751 genes from 15,122,700 genetic variants for 13,476 genes on the autosomes, respectively. Interestingly, our results indicated that stQTLs were enriched in the CDS and 3’UTR regions, while eQTLs are enriched in the CDS, 3’UTR, 5’UTR, and upstream regions. We also found that stQTLs are more likely than eQTLs to overlap with RNA binding protein (RBP) and microRNA (miRNA) binding sites. Our analyses demonstrate that simultaneous identification of stQTLs and eQTLs can provide more mechanistic insight on the association between genetic variants and gene expression levels. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08405-y.
Collapse
Affiliation(s)
- Jian-Rong Li
- Department of Medicine, Baylor College of Medicine, Houston, TX, USA.,Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA
| | - Mabel Tang
- Department of BioSciences, Biochemistry and Cell Biology, Rice University, Houston, TX, USA
| | - Yafang Li
- Department of Medicine, Baylor College of Medicine, Houston, TX, USA.,Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA.,Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Christopher I Amos
- Department of Medicine, Baylor College of Medicine, Houston, TX, USA.,Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA.,Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Chao Cheng
- Department of Medicine, Baylor College of Medicine, Houston, TX, USA. .,Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA. .,Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
15
|
Towards the Genetic Architecture of Complex Gene Expression Traits: Challenges and Prospects for eQTL Mapping in Humans. Genes (Basel) 2022; 13:genes13020235. [PMID: 35205280 PMCID: PMC8871770 DOI: 10.3390/genes13020235] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 01/21/2022] [Accepted: 01/25/2022] [Indexed: 12/10/2022] Open
Abstract
The discovery of expression quantitative trait loci (eQTLs) and their target genes (eGenes) has not only compensated for the limitations of genome-wide association studies for complex phenotypes but has also provided a basis for predicting gene expression. Efforts have been made to develop analytical methods in statistical genetics, a key discipline in eQTL analysis. In particular, mixed model– and deep learning–based analytical methods have been extremely beneficial in mapping eQTLs and predicting gene expression. Nevertheless, we still face many challenges associated with eQTL discovery. Here, we discuss two key aspects of these challenges: 1, the complexity of eTraits with various factors such as polygenicity and epistasis and 2, the voluminous work required for various types of eQTL profiles. The properties and prospects of statistical methods, including the mixed model method, Bayesian inference, the deep learning method, and the integration method, are presented as future directions for eQTL discovery. This review will help expedite the design and use of efficient methods for eQTL discovery and eTrait prediction.
Collapse
|
16
|
Keele GR, Zhang T, Pham DT, Vincent M, Bell TA, Hock P, Shaw GD, Paulo JA, Munger SC, Pardo-Manuel de Villena F, Ferris MT, Gygi SP, Churchill GA. Regulation of protein abundance in genetically diverse mouse populations. CELL GENOMICS 2021; 1:100003. [PMID: 36212994 PMCID: PMC9536773 DOI: 10.1016/j.xgen.2021.100003] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 04/01/2021] [Accepted: 05/10/2021] [Indexed: 12/12/2022]
Abstract
Genetically diverse mouse populations are powerful tools for characterizing the regulation of the proteome and its relationship to whole-organism phenotypes. We used mass spectrometry to profile and quantify the abundance of 6,798 proteins in liver tissue from mice of both sexes across 58 Collaborative Cross (CC) inbred strains. We previously collected liver proteomics data from the related Diversity Outbred (DO) mice and their founder strains. We show concordance across the proteomics datasets despite being generated from separate experiments, allowing comparative analysis. We map protein abundance quantitative trait loci (pQTLs), identifying 1,087 local and 285 distal in the CC mice and 1,706 local and 414 distal in the DO mice. We find that regulatory effects on individual proteins are conserved across the mouse populations, in particular for local genetic variation and sex differences. In comparison, proteins that form complexes are often co-regulated, displaying varying genetic architectures, and overall show lower heritability and map fewer pQTLs. We have made this resource publicly available to enable quantitative analyses of the regulation of the proteome.
Collapse
Affiliation(s)
| | - Tian Zhang
- Harvard Medical School, Boston, MA 02115, USA
| | - Duy T. Pham
- The Jackson Laboratory, Bar Harbor, ME 04609, USA
| | | | - Timothy A. Bell
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Pablo Hock
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Ginger D. Shaw
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | | | - Fernando Pardo-Manuel de Villena
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Martin T. Ferris
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | | |
Collapse
|
17
|
Gruhl F, Janich P, Kaessmann H, Gatfield D. Circular RNA repertoires are associated with evolutionarily young transposable elements. eLife 2021; 10:67991. [PMID: 34542406 PMCID: PMC8516420 DOI: 10.7554/elife.67991] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 09/19/2021] [Indexed: 12/29/2022] Open
Abstract
Circular RNAs (circRNAs) are found across eukaryotes and can function in post-transcriptional gene regulation. Their biogenesis through a circle-forming backsplicing reaction is facilitated by reverse-complementary repetitive sequences promoting pre-mRNA folding. Orthologous genes from which circRNAs arise, overall contain more strongly conserved splice sites and exons than other genes, yet it remains unclear to what extent this conservation reflects purifying selection acting on the circRNAs themselves. Our analyses of circRNA repertoires from five species representing three mammalian lineages (marsupials, eutherians: rodents, primates) reveal that surprisingly few circRNAs arise from orthologous exonic loci across all species. Even the circRNAs from orthologous loci are associated with young, recently active and species-specific transposable elements, rather than with common, ancient transposon integration events. These observations suggest that many circRNAs emerged convergently during evolution - as a byproduct of splicing in orthologs prone to transposon insertion. Overall, our findings argue against widespread functional circRNA conservation.
Collapse
Affiliation(s)
- Franziska Gruhl
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Peggy Janich
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland.,Krebsforschung Schweiz, Bern, Switzerland
| | - Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany
| | - David Gatfield
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
18
|
Abstract
Saccharomyces cerevisiae rewires its transcriptional output to survive stressful environments, such as nitrogen scarcity under fermentative conditions. Although divergence in nitrogen metabolism among natural yeast populations has been reported, the impact of regulatory genetic variants modulating gene expression and nitrogen consumption remains to be investigated. Here, we employed an F1 hybrid from two contrasting S. cerevisiae strains, providing a controlled genetic environment to map cis factors involved in the divergence of gene expression regulation in response to nitrogen scarcity. We used a dual approach to obtain genome-wide allele-specific profiles of chromatin accessibility, transcription factor binding, and gene expression through ATAC-seq (assay for transposase accessible chromatin) and RNA-seq (transcriptome sequencing). We observed large variability in allele-specific expression and accessibility between the two genetic backgrounds, with a third of these differences specific to a deficient nitrogen environment. Furthermore, we discovered events of allelic bias in gene expression correlating with allelic bias in transcription factor binding solely under nitrogen scarcity, where the majority of these transcription factors orchestrates the nitrogen catabolite repression regulatory pathway and demonstrates a cis × environment-specific response. Our approach allowed us to find cis variants modulating gene expression, chromatin accessibility, and allelic differences in transcription factor binding in response to low nitrogen culture conditions. IMPORTANCE Historically, coding variants were prioritized when searching for causal mechanisms driving adaptation of natural populations to stressful environments. However, the recent focus on noncoding variants demonstrated their ubiquitous role in adaptation. Here, we performed genome-wide regulatory variation profiles between two divergent yeast strains when facing nitrogen nutritional stress. The open chromatin availability of several regulatory regions changes in response to nitrogen scarcity. Importantly, we describe regulatory events that deviate between strains. Our results demonstrate a widespread variation in gene expression regulation between naturally occurring populations in response to stressful environments.
Collapse
|
19
|
Teran NA, Nachun DC, Eulalio T, Ferraro NM, Smail C, Rivas MA, Montgomery SB. Nonsense-mediated decay is highly stable across individuals and tissues. Am J Hum Genet 2021; 108:1401-1408. [PMID: 34216550 DOI: 10.1016/j.ajhg.2021.06.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 06/09/2021] [Indexed: 10/21/2022] Open
Abstract
Precise interpretation of the effects of rare protein-truncating variants (PTVs) is important for accurate determination of variant impact. Current methods for assessing the ability of PTVs to induce nonsense-mediated decay (NMD) focus primarily on the position of the variant in the transcript. We used RNA sequencing of the Genotype Tissue Expression v.8 cohort to compute the efficiency of NMD using allelic imbalance for 2,320 rare (genome aggregation database minor allele frequency ≤ 1%) PTVs across 809 individuals in 49 tissues. We created an interpretable predictive model using penalized logistic regression in order to evaluate the comprehensive influence of variant annotation, tissue, and inter-individual variation on NMD. We found that variant position, allele frequency, the inclusion of ultra-rare and singleton variants, and conservation were predictive of allelic imbalance. Furthermore, we found that NMD effects were highly concordant across tissues and individuals. Due to this high consistency, we demonstrate in silico that utilizing peripheral tissues or cell lines provides accurate prediction of NMD for PTVs.
Collapse
|
20
|
Floc'hlay S, Wong ES, Zhao B, Viales RR, Thomas-Chollier M, Thieffry D, Garfield DA, Furlong EEM. Cis-acting variation is common across regulatory layers but is often buffered during embryonic development. Genome Res 2021; 31:211-224. [PMID: 33310749 PMCID: PMC7849415 DOI: 10.1101/gr.266338.120] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 12/09/2020] [Indexed: 12/14/2022]
Abstract
Precise patterns of gene expression are driven by interactions between transcription factors, regulatory DNA sequences, and chromatin. How DNA mutations affecting any one of these regulatory "layers" are buffered or propagated to gene expression remains unclear. To address this, we quantified allele-specific changes in chromatin accessibility, histone modifications, and gene expression in F1 embryos generated from eight Drosophila crosses at three embryonic stages, yielding a comprehensive data set of 240 samples spanning multiple regulatory layers. Genetic variation (allelic imbalance) impacts gene expression more frequently than chromatin features, with metabolic and environmental response genes being most often affected. Allelic imbalance in cis-regulatory elements (enhancers) is common and highly heritable, yet its functional impact does not generally propagate to gene expression. When it does, genetic variation impacts RNA levels through two alternative mechanisms involving either H3K4me3 or chromatin accessibility and H3K27ac. Changes in RNA are more predictive of variation in H3K4me3 than vice versa, suggesting a role for H3K4me3 downstream from transcription. The impact of a substantial proportion of genetic variation is consistent across embryonic stages, with 50% of allelic imbalanced features at one stage being also imbalanced at subsequent developmental stages. Crucially, buffering, as well as the magnitude and evolutionary impact of genetic variants, is influenced by regulatory complexity (i.e., number of enhancers regulating a gene), with transcription factors being most robust to cis-acting, but most influenced by trans-acting, variation.
Collapse
Affiliation(s)
- Swann Floc'hlay
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - Emily S Wong
- Molecular, Structural and Computational Biology Division, Victor Chang Cardiac Research Institute, Darlinghurst, New South Wales 2010, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, New South Wales 2052, Australia
| | - Bingqing Zhao
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Rebecca R Viales
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Morgane Thomas-Chollier
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
- Institut Universitaire de France (IUF), 75005 Paris, France
| | - Denis Thieffry
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - David A Garfield
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| |
Collapse
|
21
|
Robles-Espinoza CD, Mohammadi P, Bonilla X, Gutierrez-Arcelus M. Allele-specific expression: applications in cancer and technical considerations. Curr Opin Genet Dev 2021; 66:10-19. [PMID: 33383480 PMCID: PMC7985293 DOI: 10.1016/j.gde.2020.10.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 10/26/2020] [Accepted: 10/31/2020] [Indexed: 11/18/2022]
Abstract
Allele-specific gene expression can influence disease traits. Non-coding germline genetic variants that alter regulatory elements can cause allele-specific gene expression and contribute to cancer susceptibility. In tumors, both somatic copy number alterations and somatic single nucleotide variants have been shown to lead to allele-specific expression of genes, many of which are considered drivers of tumor growth. Here, we review recent studies revealing the pervasive presence of this phenomenon in cancer susceptibility and progression. Furthermore, we underscore the importance of careful experimental design and computational analysis for accurate allelic expression quantification and avoidance of false positives. Finally, we discuss additional methodological challenges encountered in cancer studies and in the burgeoning field of single-cell transcriptomics.
Collapse
Affiliation(s)
- Carla Daniela Robles-Espinoza
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Campus Juriquilla, Boulevard Juriquilla 3001, Santiago de Querétaro 76230, Mexico; Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
| | - Pejman Mohammadi
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, USA; Scripps Translational Science Institute, The Scripps Research Institute, La Jolla, CA, USA
| | - Ximena Bonilla
- Department of Computer Science, ETH Zurich, Universitätsstr. 6, 8092 Zürich, Switzerland; Swiss Institute of Bioinformatics, Quartier Sorge - Bâtiment Amphipôle, Lausanne 1015, Switzerland; University Hospital Zurich, Rämistrasse 100, 8091 Zürich, Switzerland
| | - Maria Gutierrez-Arcelus
- Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Division of Rheumatology, Inflammation and Immunity, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, USA; Division of Immunology, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
| |
Collapse
|
22
|
Molecular and evolutionary processes generating variation in gene expression. Nat Rev Genet 2020; 22:203-215. [PMID: 33268840 DOI: 10.1038/s41576-020-00304-w] [Citation(s) in RCA: 143] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/21/2020] [Indexed: 12/18/2022]
Abstract
Heritable variation in gene expression is common within and between species. This variation arises from mutations that alter the form or function of molecular gene regulatory networks that are then filtered by natural selection. High-throughput methods for introducing mutations and characterizing their cis- and trans-regulatory effects on gene expression (particularly, transcription) are revealing how different molecular mechanisms generate regulatory variation, and studies comparing these mutational effects with variation seen in the wild are teasing apart the role of neutral and non-neutral evolutionary processes. This integration of molecular and evolutionary biology allows us to understand how the variation in gene expression we see today came to be and to predict how it is most likely to evolve in the future.
Collapse
|
23
|
Patel RK, West JD, Jiang Y, Fogarty EA, Grimson A. Robust partitioning of microRNA targets from downstream regulatory changes. Nucleic Acids Res 2020; 48:9724-9746. [PMID: 32821933 PMCID: PMC7515711 DOI: 10.1093/nar/gkaa687] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 07/19/2020] [Accepted: 08/08/2020] [Indexed: 11/14/2022] Open
Abstract
The biological impact of microRNAs (miRNAs) is determined by their targets, and robustly identifying direct miRNA targets remains challenging. Existing methods suffer from high false-positive rates and are unable to effectively differentiate direct miRNA targets from downstream regulatory changes. Here, we present an experimental and computational framework to deconvolute post-transcriptional and transcriptional changes using a combination of RNA-seq and PRO-seq. This novel approach allows us to systematically profile the regulatory impact of a miRNA. We refer to this approach as CARP: Combined Analysis of RNA-seq and PRO-seq. We apply CARP to multiple miRNAs and show that it robustly distinguishes direct targets from downstream changes, while greatly reducing false positives. We validate our approach using Argonaute eCLIP-seq and ribosome profiling, demonstrating that CARP defines a comprehensive repertoire of targets. Using this approach, we identify miRNA-specific activity of target sites within the open reading frame. Additionally, we show that CARP facilitates the dissection of complex changes in gene regulatory networks triggered by miRNAs and identification of transcription factors that mediate downstream regulatory changes. Given the robustness of the approach, CARP would be particularly suitable for dissecting miRNA regulatory networks in vivo.
Collapse
Affiliation(s)
- Ravi K Patel
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
- Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, New York 14853, USA
| | - Jessica D West
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
- Graduate Field of Biochemistry, Molecular and Cell Biology, Cornell University, Ithaca, New York 14853, USA
| | - Ya Jiang
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
- Graduate Field of Genetics, Genomics, and Development, Cornell University, Ithaca, New York 14853, USA
| | - Elizabeth A Fogarty
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Andrew Grimson
- To whom correspondence should be addressed. Tel: +1 607 254 1307; Fax: +1 607 254 1307;
| |
Collapse
|
24
|
Zhang Z, Luo K, Zou Z, Qiu M, Tian J, Sieh L, Shi H, Zou Y, Wang G, Morrison J, Zhu AC, Qiao M, Li Z, Stephens M, He X, He C. Genetic analyses support the contribution of mRNA N 6-methyladenosine (m 6A) modification to human disease heritability. Nat Genet 2020; 52:939-949. [PMID: 32601472 PMCID: PMC7483307 DOI: 10.1038/s41588-020-0644-z] [Citation(s) in RCA: 127] [Impact Index Per Article: 25.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 05/11/2020] [Indexed: 12/18/2022]
Abstract
N6-methyladenosine (m6A) plays important roles in regulating messenger RNA processing. Despite rapid progress in this field, little is known about the genetic determinants of m6A modification and their role in common diseases. In this study, we mapped the quantitative trait loci (QTLs) of m6A peaks in 60 Yoruba (YRI) lymphoblastoid cell lines. We found that m6A QTLs are largely independent of expression and splicing QTLs and are enriched with binding sites of RNA-binding proteins, RNA structure-changing variants and transcriptional features. Joint analysis of the QTLs of m6A and related molecular traits suggests that the downstream effects of m6A are heterogeneous and context dependent. We identified proteins that mediate m6A effects on translation. Through integration with data from genome-wide association studies, we show that m6A QTLs contribute to the heritability of various immune and blood-related traits at levels comparable to splicing QTLs and roughly half of expression QTLs. By leveraging m6A QTLs in a transcriptome-wide association study framework, we identified putative risk genes of these traits.
Collapse
Affiliation(s)
- Zijie Zhang
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Kaixuan Luo
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
| | - Zhongyu Zou
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Maguanyun Qiu
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Jiakun Tian
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Laura Sieh
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Hailing Shi
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
| | - Yuxin Zou
- Department of Statistics, The University of Chicago, Chicago, IL, USA
| | - Gao Wang
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
| | - Jean Morrison
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
| | - Allen C Zhu
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA
- Medical Scientist Training Program/Committee on Cancer Biology, The University of Chicago, Chicago, IL, USA
| | - Min Qiao
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Zhongshan Li
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Matthew Stephens
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA.
- Department of Statistics, The University of Chicago, Chicago, IL, USA.
| | - Xin He
- Department of Human Genetics, The University of Chicago, Chicago, IL, USA.
| | - Chuan He
- Department of Chemistry and Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, USA.
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL, USA.
- Medical Scientist Training Program/Committee on Cancer Biology, The University of Chicago, Chicago, IL, USA.
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
25
|
Zheng Z, Huang D, Wang J, Zhao K, Zhou Y, Guo Z, Zhai S, Xu H, Cui H, Yao H, Wang Z, Yi X, Zhang S, Sham PC, Li MJ. QTLbase: an integrative resource for quantitative trait loci across multiple human molecular phenotypes. Nucleic Acids Res 2020; 48:D983-D991. [PMID: 31598699 PMCID: PMC6943073 DOI: 10.1093/nar/gkz888] [Citation(s) in RCA: 96] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 09/24/2019] [Accepted: 10/02/2019] [Indexed: 12/20/2022] Open
Abstract
Recent advances in genome sequencing and functional genomic profiling have promoted many large-scale quantitative trait locus (QTL) studies, which connect genotypes with tissue/cell type-specific cellular functions from transcriptional to post-translational level. However, no comprehensive resource can perform QTL lookup across multiple molecular phenotypes and investigate the potential cascade effect of functional variants. We developed a versatile resource, named QTLbase, for interpreting the possible molecular functions of genetic variants, as well as their tissue/cell-type specificity. Overall, QTLbase has five key functions: (i) curating and compiling genome-wide QTL summary statistics for 13 human molecular traits from 233 independent studies; (ii) mapping QTL-relevant tissue/cell types to 78 unified terms according to a standard anatomogram; (iii) normalizing variant and trait information uniformly, yielding >170 million significant QTLs; (iv) providing a rich web client that enables phenome- and tissue-wise visualization; and (v) integrating the most comprehensive genomic features and functional predictions to annotate the potential QTL mechanisms. QTLbase provides a one-stop shop for QTL retrieval and comparison across multiple tissues and multiple layers of molecular complexity, and will greatly help researchers interrogate the biological mechanism of causal variants and guide the direction of functional validation. QTLbase is freely available at http://mulinlab.org/qtlbase.
Collapse
Affiliation(s)
- Zhanye Zheng
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Dandan Huang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Jianhua Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Ke Zhao
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Zhenyang Guo
- School of Biomedical Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Sinan Zhai
- School of Biomedical Engineering, Tianjin Medical University, Tianjin 300070, China
| | - Hang Xu
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Hui Cui
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Hongcheng Yao
- School of Biomedical Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Zhao Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Xianfu Yi
- School of Biomedical Engineering, Tianjin Medical University, Tianjin 300070, China
| | - Shijie Zhang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| | - Pak Chung Sham
- Centre of Genomics Sciences, State Key Laboratory of Brain and Cognitive Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR 999077, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin 300070, China
| |
Collapse
|
26
|
Abstract
Expression quantitative trait locus (eQTL) analysis is a powerful method to understand the association between genetic variant and gene expression; it also has potential impact for the study of transcription medicine for human complex disease. In the past two decades, the researchers focus on studying the eQTL, while more and more evidence shows that the regulatory genetic variants locating noncoding region have strong effect for the gene expression. More and more researchers working on eQTL analysis realize the importance of other types of QTLs beyond eQTL. In this chapter, we will explore some QTLs beyond eQTLs that show the regulatory association with eQTLs and explain the underlying link among these types of QTLs.
Collapse
Affiliation(s)
- Jia Wen
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC, USA.
| | - Conor Nodzak
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC, USA
| | - Xinghua Shi
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC, USA
| |
Collapse
|
27
|
Vandiedonck C. Genetic association of molecular traits: A help to identify causative variants in complex diseases. Clin Genet 2019; 93:520-532. [PMID: 29194587 DOI: 10.1111/cge.13187] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 11/24/2017] [Accepted: 11/27/2017] [Indexed: 12/14/2022]
Abstract
In the past 15 years, major progresses have been made in the understanding of the genetic basis of regulation of gene expression. These new insights have revolutionized our approach to resolve the genetic variation underlying complex diseases. Gene transcript levels were the first expression phenotypes that were studied. They are heritable and therefore amenable to genome-wide association studies. The genetic variants that modulate them are called expression quantitative trait loci. Their study has been extended to other molecular quantitative trait loci (molQTLs) that regulate gene expression at the various levels, from chromatin state to cellular responses. Altogether, these studies have generated a wealth of basic information on the genome-wide patterns of gene expression and their inter-individual variation. Most importantly, molQTLs have become an invaluable asset in the genetic study of complex diseases. Although the identification of the disease-causing variants on the basis of their overlap with molQTLs requires caution, molQTLs can help to prioritize the relevant candidate gene(s) in the disease-associated regions and bring a functional interpretation of the associated variants, therefore, bridging the gap between genotypes and clinical phenotypes.
Collapse
Affiliation(s)
- C Vandiedonck
- Univ Paris Diderot, Sorbonne Paris Cité, Paris, France
| |
Collapse
|
28
|
Hartenian E, Glaunsinger BA. Feedback to the central dogma: cytoplasmic mRNA decay and transcription are interdependent processes. Crit Rev Biochem Mol Biol 2019; 54:385-398. [PMID: 31656086 PMCID: PMC6871655 DOI: 10.1080/10409238.2019.1679083] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 09/13/2019] [Accepted: 10/08/2019] [Indexed: 02/06/2023]
Abstract
Transcription and RNA decay are key determinants of gene expression; these processes are typically considered as the uncoupled beginning and end of the messenger RNA (mRNA) lifecycle. Here we describe the growing number of studies demonstrating interplay between these spatially disparate processes in eukaryotes. Specifically, cells can maintain mRNA levels by buffering against changes in mRNA stability or transcription, and can also respond to virally induced accelerated decay by reducing RNA polymerase II gene expression. In addition to these global responses, there is also evidence that mRNAs containing a premature stop codon can cause transcriptional upregulation of homologous genes in a targeted fashion. In each of these systems, RNA binding proteins (RBPs), particularly those involved in mRNA degradation, are critical for cytoplasmic to nuclear communication. Although their specific mechanistic contributions are yet to be fully elucidated, differential trafficking of RBPs between subcellular compartments are likely to play a central role in regulating this gene expression feedback pathway.
Collapse
Affiliation(s)
- Ella Hartenian
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720
| | - Britt A. Glaunsinger
- Department of Molecular and Cell Biology, University of California, Berkeley, CA 94720
- Department of Plant & Microbial Biology, University of California, Berkeley, CA 94720
- Howard Hughes Medical Institute, Berkeley, CA 94720
| |
Collapse
|
29
|
Rodiño-Janeiro BK, Pardo-Camacho C, Santos J, Martínez C. Mucosal RNA and protein expression as the next frontier in IBS: abnormal function despite morphologically intact small intestinal mucosa. Am J Physiol Gastrointest Liver Physiol 2019; 316:G701-G719. [PMID: 30767681 DOI: 10.1152/ajpgi.00186.2018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Irritable bowel syndrome (IBS) is one of the commonest gastrointestinal disorders. Although long-time considered a pure functional disorder, intense research in past years has rendered a very complex and varied array of observations indicating the presence of structural and molecular abnormalities underlying characteristic motor and sensitive changes and clinical manifestations. Analysis of gene and protein expression in the intestinal mucosa has shed light on the molecular mechanisms implicated in IBS physiopathology. This analysis uncovers constitutive and inductive genetic and epigenetic marks in the small and large intestine that highlight the role of epithelial barrier, immune activation, and mucosal processing of foods and toxins and several new molecular pathways in the origin of IBS. The incorporation of innovative high-throughput techniques into IBS research is beginning to provide new insights into highly structured and interconnected molecular mechanisms modulating gene and protein expression at tissue level. Integration and correlation of these molecular mechanisms with clinical and environmental data applying systems biology/medicine and data mining tools emerge as crucial steps that will allow us to get meaningful and more definitive comprehension of IBS-detailed development and show the real mechanisms and causality of the disease and the way to identify more specific diagnostic biomarkers and effective treatments.
Collapse
Affiliation(s)
- Bruno Kotska Rodiño-Janeiro
- Laboratory of Neuro-Immuno-Gastroenterology, Digestive System Research Unit, Vall d'Hebron Institut de Recerca , Barcelona , Spain.,Department of Gastroenterology, Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona (Facultat de Medicina) , Barcelona , Spain
| | - Cristina Pardo-Camacho
- Laboratory of Neuro-Immuno-Gastroenterology, Digestive System Research Unit, Vall d'Hebron Institut de Recerca , Barcelona , Spain.,Department of Gastroenterology, Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona (Facultat de Medicina) , Barcelona , Spain
| | - Javier Santos
- Laboratory of Neuro-Immuno-Gastroenterology, Digestive System Research Unit, Vall d'Hebron Institut de Recerca , Barcelona , Spain.,Department of Gastroenterology, Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona (Facultat de Medicina) , Barcelona , Spain.,Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas , Madrid , Spain
| | - Cristina Martínez
- Laboratory of Neuro-Immuno-Gastroenterology, Digestive System Research Unit, Vall d'Hebron Institut de Recerca , Barcelona , Spain.,Department of Gastroenterology, Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona (Facultat de Medicina) , Barcelona , Spain
| |
Collapse
|
30
|
Lee C. Bayesian Inference for Mixed Model-Based Genome-Wide Analysis of Expression Quantitative Trait Loci by Gibbs Sampling. Front Genet 2019; 10:199. [PMID: 30967893 PMCID: PMC6438854 DOI: 10.3389/fgene.2019.00199] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Accepted: 02/25/2019] [Indexed: 11/13/2022] Open
Abstract
The importance of expression quantitative trait locus (eQTL) has been emphasized in understanding the genetic basis of cellular activities and complex phenotypes. Mixed models can be employed to effectively identify eQTLs by explaining polygenic effects. In these mixed models, the polygenic effects are considered as random variables, and their variability is explained by the polygenic variance component. The polygenic and residual variance components are first estimated, and then eQTL effects are estimated depending on the variance component estimates within the frequentist mixed model framework. The Bayesian approach to the mixed model-based genome-wide eQTL analysis can also be applied to estimate the parameters that exhibit various benefits. Bayesian inferences on unknown parameters are based on their marginal posterior distributions, and the marginalization of the joint posterior distribution is a challenging task. This problem can be solved by employing a numerical algorithm of integrals called Gibbs sampling as a Markov chain Monte Carlo. This article reviews the mixed model-based Bayesian eQTL analysis by Gibbs sampling. Theoretical and practical issues of Bayesian inference are discussed using a concise description of Bayesian modeling and the corresponding Gibbs sampling. The strengths of Bayesian inference are also discussed. Posterior probability distribution in the Bayesian inference reflects uncertainty in unknown parameters. This factor is useful in the context of eQTL analysis where a sample size is too small to apply the frequentist approach. Bayesian inference based on the posterior that reflects prior knowledge, will be increasingly preferred with the accumulation of eQTL data. Extensive use of the mixed model-based Bayesian eQTL analysis will accelerate understanding of eQTLs exhibiting various regulatory functions.
Collapse
Affiliation(s)
- Chaeyoung Lee
- Department of Bioinformatics and Life Science, Soongsil University, Seoul, South Korea
| |
Collapse
|
31
|
van der Wijst MGP, de Vries DH, Brugge H, Westra HJ, Franke L. An integrative approach for building personalized gene regulatory networks for precision medicine. Genome Med 2018; 10:96. [PMID: 30567569 PMCID: PMC6299585 DOI: 10.1186/s13073-018-0608-4] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Only a small fraction of patients respond to the drug prescribed to treat their disease, which means that most are at risk of unnecessary exposure to side effects through ineffective drugs. This inter-individual variation in drug response is driven by differences in gene interactions caused by each patient's genetic background, environmental exposures, and the proportions of specific cell types involved in disease. These gene interactions can now be captured by building gene regulatory networks, by taking advantage of RNA velocity (the time derivative of the gene expression state), the ability to study hundreds of thousands of cells simultaneously, and the falling price of single-cell sequencing. Here, we propose an integrative approach that leverages these recent advances in single-cell data with the sensitivity of bulk data to enable the reconstruction of personalized, cell-type- and context-specific gene regulatory networks. We expect this approach will allow the prioritization of key driver genes for specific diseases and will provide knowledge that opens new avenues towards improved personalized healthcare.
Collapse
Affiliation(s)
- Monique G P van der Wijst
- Department of Genetics, 5th floor ERIBA building, Antonius Deusinglaan 1, 9713AV Groningen, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Dylan H de Vries
- Department of Genetics, 5th floor ERIBA building, Antonius Deusinglaan 1, 9713AV Groningen, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Harm Brugge
- Department of Genetics, 5th floor ERIBA building, Antonius Deusinglaan 1, 9713AV Groningen, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Harm-Jan Westra
- Department of Genetics, 5th floor ERIBA building, Antonius Deusinglaan 1, 9713AV Groningen, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Lude Franke
- Department of Genetics, 5th floor ERIBA building, Antonius Deusinglaan 1, 9713AV Groningen, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.
| |
Collapse
|
32
|
Rivandi M, Martens JWM, Hollestelle A. Elucidating the Underlying Functional Mechanisms of Breast Cancer Susceptibility Through Post-GWAS Analyses. Front Genet 2018; 9:280. [PMID: 30116257 PMCID: PMC6082943 DOI: 10.3389/fgene.2018.00280] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 07/09/2018] [Indexed: 12/12/2022] Open
Abstract
Genome-wide association studies (GWAS) have identified more than 170 single nucleotide polymorphisms (SNPs) associated with the susceptibility to breast cancer. Together, these SNPs explain 18% of the familial relative risk, which is estimated to be nearly half of the total familial breast cancer risk that is collectively explained by low-risk susceptibility alleles. An important aspect of this success has been the access to large sample sizes through collaborative efforts within the Breast Cancer Association Consortium (BCAC), but also collaborations between cancer association consortia. Despite these achievements, however, understanding of each variant's underlying mechanism and how these SNPs predispose women to breast cancer remains limited and represents a major challenge in the field, particularly since the vast majority of the GWAS-identified SNPs are located in non-coding regions of the genome and are merely tags for the causal variants. In recent years, fine-scale mapping studies followed by functional evaluation of putative causal variants have begun to elucidate the biological function of several GWAS-identified variants. In this review, we discuss the findings and lessons learned from these post-GWAS analyses of 22 risk loci. Identifying the true causal variants underlying breast cancer susceptibility and their function not only provides better estimates of the explained familial relative risk thereby improving polygenetic risk scores (PRSs), it also increases our understanding of the biological mechanisms responsible for causing susceptibility to breast cancer. This will facilitate the identification of further breast cancer risk alleles and the development of preventive medicine for those women at increased risk for developing the disease.
Collapse
Affiliation(s)
- Mahdi Rivandi
- Department of Medical Oncology, Erasmus MC Cancer Institute, Rotterdam, Netherlands.,Department of Modern Sciences and Technologies, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - John W M Martens
- Department of Medical Oncology, Erasmus MC Cancer Institute, Rotterdam, Netherlands.,Cancer Genomics Centre, Utrecht, Netherlands
| | | |
Collapse
|
33
|
Trans-eQTLs identified in whole blood have limited influence on complex disease biology. Eur J Hum Genet 2018; 26:1361-1368. [PMID: 29891877 DOI: 10.1038/s41431-018-0174-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 04/03/2018] [Accepted: 04/26/2018] [Indexed: 12/15/2022] Open
Abstract
Trans-eQTLs have been implicated in complex traits and common diseases, but many were initially identified on the basis of having an effect in cis, and there has been no assessment of the significance of the overlap in relation to chance expectations. Here, we investigated whether trans-expression quantitative trait loci (eQTL) associations identified in whole blood contribute to variance in complex traits by determining (1) whether genome-wide significant (GWS) single-nucleotide polymorphisms (SNPs) were enriched for trans-eQTL (including trans-only eQTL), and (2) whether the genomic regions surrounding associated trans-genes were enriched for statistical associations in the relevant GWAS. On average for a given phenotype, we identify 4.8% of GWS SNPs overlapping with trans-eQTL present in blood, and show that for the majority of these phenotypes, this observation does not exceed that expected by chance. Likewise, we observe no enrichment for genetic associations with the GWAS phenotype in the regions surrounding the linked trans-genes, with the exception of rheumatoid arthritis. Interestingly, the GWS SNPs for each phenotype were consistently more enriched for unique trans-eQTL SNPs than trans-eQTL SNP-probe pairs (p = 4 × 10-7), with schizophrenia the only exception. This relative enrichment for trans-eQTL SNPs over trans-eQTL SNP-probe pairs implies that trait-associated trans-eQTL SNPs in whole blood are less likely to be 'master regulators' than random trans-eQTL SNPs. Taken together, these results suggest little evidence for the role of blood-based trans-eQTL in complex traits and disease, although this may reflect the finite size of currently available data sets and our findings may not hold for trans-eQTLs in more trait-relevant tissues. All software is publically available at https://github.com/IMB-Computational-Genomics-Lab/eqtlOverlapper .
Collapse
|
34
|
Schaefke B, Sun W, Li YS, Fang L, Chen W. The evolution of posttranscriptional regulation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 9:e1485. [PMID: 29851258 DOI: 10.1002/wrna.1485] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Revised: 04/23/2018] [Accepted: 04/26/2018] [Indexed: 12/13/2022]
Abstract
"DNA makes RNA makes protein." After transcription, mRNAs undergo a series of intertwining processes to be finally translated into functional proteins. The "posttranscriptional" regulation (PTR) provides cells an extended option to fine-tune their proteomes. To meet the demands of complex organism development and the appropriate response to environmental stimuli, every step in these processes needs to be finely regulated. Moreover, changes in these regulatory processes are important driving forces underlying the evolution of phenotypic differences across different species. The major PTR mechanisms discussed in this review include the regulation of splicing, polyadenylation, decay, and translation. For alternative splicing and polyadenylation, we mainly discuss their evolutionary dynamics and the genetic changes underlying the regulatory differences in cis-elements versus trans-factors. For mRNA decay and translation, which, together with transcription, determine the cellular RNA or protein abundance, we focus our discussion on how their divergence coordinates with transcriptional changes to shape the evolution of gene expression. Then to highlight the importance of PTR in the evolution of higher complexity, we focus on their roles in two major phenomena during eukaryotic evolution: the evolution of multicellularity and the division of labor between different cell types and tissues; and the emergence of diverse, often highly specialized individual phenotypes, especially those concerning behavior in eusocial insects. This article is categorized under: RNA Evolution and Genomics > RNA and Ribonucleoprotein Evolution Translation > Translation Regulation RNA Processing > Splicing Regulation/Alternative Splicing.
Collapse
Affiliation(s)
- Bernhard Schaefke
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Wei Sun
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Department of Pharmaceutical Chemistry and Cardiovascular Research Institute, University of California San Francisco, San Francisco
| | - Yi-Sheng Li
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Liang Fang
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Medi-X Institute, SUSTech Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| | - Wei Chen
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Medi-X Institute, SUSTech Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
35
|
Sun W, Gao Q, Schaefke B, Hu Y, Chen W. Pervasive allele-specific regulation on RNA decay in hybrid mice. Life Sci Alliance 2018; 1:e201800052. [PMID: 30456349 PMCID: PMC6238540 DOI: 10.26508/lsa.201800052] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Revised: 05/02/2018] [Accepted: 05/03/2018] [Indexed: 02/05/2023] Open
Abstract
Cellular RNA abundance is determined by both RNA transcription and decay. Therefore, change in RNA abundance, which can drive phenotypic diversity between different species, could arise from genetic variants affecting either process. However, previous studies in the evolution of RNA expression have been largely focused on transcription. Here, to globally investigate the effects of cis-regulatory divergence on RNA decay in mammals for the first time, we quantified allele-specific differences in RNA decay rates (ASD) in an F1 hybrid mouse. Out of 8,815 genes with sufficient data, we identified 621 genes exhibiting significant cis-divergence. Systematic analysis of these genes revealed that the genetic variants affecting microRNA binding and RNA secondary structures contribute to the observed divergences. Finally, we demonstrated that although the divergences in RNA abundance were predominantly determined by allelic differences in RNA transcription, most genes with significant ASD did not exhibit significant difference in RNA abundance. For these genes, the apparently compensatory effect between the allelic differences in RNA transcription and ASD suggests that changes in RNA decay could serve as important means to stabilize RNA abundances during mammalian evolution.
Collapse
Affiliation(s)
- Wei Sun
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Laboratory for Functional and Medical Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Qingsong Gao
- Laboratory for Functional and Medical Genomics, Berlin Institute for Medical Systems Biology, Max-Delbrück-Centrum für Molekulare Medizin, Berlin, Germany
| | - Bernhard Schaefke
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Yuhui Hu
- Department of Biology, Southern University of Science and Technology, Shenzhen, China
| | - Wei Chen
- Department of Biology, Southern University of Science and Technology, Shenzhen, China.,Medi-X Institute, SUSTech Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen, China
| |
Collapse
|
36
|
Danko CG, Choate LA, Marks BA, Rice EJ, Wang Z, Chu T, Martins AL, Dukler N, Coonrod SA, Tait Wojno ED, Lis JT, Kraus WL, Siepel A. Dynamic evolution of regulatory element ensembles in primate CD4 + T cells. Nat Ecol Evol 2018; 2:537-548. [PMID: 29379187 PMCID: PMC5957490 DOI: 10.1038/s41559-017-0447-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 12/08/2017] [Indexed: 12/12/2022]
Abstract
How evolutionary changes at enhancers affect the transcription of target genes remains an important open question. Previous comparative studies of gene expression have largely measured the abundance of messenger RNA, which is affected by post-transcriptional regulatory processes, hence limiting inferences about the mechanisms underlying expression differences. Here, we directly measured nascent transcription in primate species, allowing us to separate transcription from post-transcriptional regulation. We used precision run-on and sequencing to map RNA polymerases in resting and activated CD4+ T cells in multiple human, chimpanzee and rhesus macaque individuals, with rodents as outgroups. We observed general conservation in coding and non-coding transcription, punctuated by numerous differences between species, particularly at distal enhancers and non-coding RNAs. Genes regulated by larger numbers of enhancers are more frequently transcribed at evolutionarily stable levels, despite reduced conservation at individual enhancers. Adaptive nucleotide substitutions are associated with lineage-specific transcription and at one locus, SGPP2, we predict and experimentally validate that multiple substitutions contribute to human-specific transcription. Collectively, our findings suggest a pervasive role for evolutionary compensation across ensembles of enhancers that jointly regulate target genes.
Collapse
Affiliation(s)
- Charles G Danko
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA.
- Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA.
| | - Lauren A Choate
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Brooke A Marks
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Edward J Rice
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Zhong Wang
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Tinyi Chu
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
- Graduate Field of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Andre L Martins
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
- Graduate Field of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Noah Dukler
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, USA
| | - Scott A Coonrod
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
- Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Elia D Tait Wojno
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
- Department of Microbiology and Immunology, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - John T Lis
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - W Lee Kraus
- Laboratory of Signaling and Gene Regulation, Cecil H. and Ida Green Center for Reproductive Biology Sciences, University of Texas Southwestern Medical Center, Dallas, TX, USA
- Division of Basic Research, Department of Obstetrics and Gynecology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
37
|
Arabidopsis mRNA decay landscape arises from specialized RNA decay substrates, decapping-mediated feedback, and redundancy. Proc Natl Acad Sci U S A 2018; 115:E1485-E1494. [PMID: 29386391 DOI: 10.1073/pnas.1712312115] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The decay of mRNA plays a vital role in modulating mRNA abundance, which, in turn, influences cellular and organismal processes. In plants and metazoans, three distinct pathways carry out the decay of most cytoplasmic mRNAs: The mRNA decapping complex, which requires the scaffold protein VARICOSE (VCS), removes a protective 5' cap, allowing for 5' to 3' decay via EXORIBONUCLEASE4 (XRN4, XRN1 in metazoans and yeast), and both the exosome and SUPPRESSOR OF VCS (SOV)/DIS3L2 degrade RNAs in the 3' to 5' direction. However, the unique biological contributions of these three pathways, and whether they degrade specialized sets of transcripts, are unknown. In Arabidopsis, the participation of SOV in RNA homeostasis is also unclear, because Arabidopsis sov mutants have a normal phenotype. We carried out mRNA decay analyses in wild-type, sov, vcs, and vcs sov seedlings, and used a mathematical modeling approach to determine decay rates and quantify gene-specific contributions of VCS and SOV to decay. This analysis revealed that VCS (decapping) contributes to decay of 68% of the transcriptome, and, while it initiates degradation of mRNAs with a wide range of decay rates, it especially contributes to decay of short-lived RNAs. Only a few RNAs were clear SOV substrates in that they decayed more slowly in sov mutants. However, 4,506 RNAs showed VCS-dependent feedback in sov that modulated decay rates, and, by inference, transcription, to maintain RNA abundances, suggesting that these RNAs might also be SOV substrates. This feedback was shown to be independent of siRNA activity.
Collapse
|
38
|
Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response. Nat Genet 2018; 50:424-431. [PMID: 29379200 DOI: 10.1038/s41588-018-0046-7] [Citation(s) in RCA: 200] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 12/22/2017] [Indexed: 01/25/2023]
Abstract
Regulatory variants are often context specific, modulating gene expression in a subset of possible cellular states. Although these genetic effects can play important roles in disease, the molecular mechanisms underlying context specificity are poorly understood. Here, we identified shared quantitative trait loci (QTLs) for chromatin accessibility and gene expression in human macrophages exposed to IFNγ, Salmonella and IFNγ plus Salmonella. We observed that ~60% of stimulus-specific expression QTLs with a detectable effect on chromatin altered the chromatin accessibility in naive cells, thus suggesting that they perturb enhancer priming. Such variants probably influence binding of cell-type-specific transcription factors, such as PU.1, which can then indirectly alter the binding of stimulus-specific transcription factors, such as NF-κB or STAT2. Thus, although chromatin accessibility assays are powerful for fine-mapping causal regulatory variants, detecting their downstream effects on gene expression will be challenging, requiring profiling of large numbers of stimulated cellular states and time points.
Collapse
|
39
|
High-resolution mapping of cis-regulatory variation in budding yeast. Proc Natl Acad Sci U S A 2017; 114:E10736-E10744. [PMID: 29183975 DOI: 10.1073/pnas.1717421114] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genetic variants affecting gene-expression levels are a major source of phenotypic variation. The approximate locations of these variants can be mapped as expression quantitative trait loci (eQTLs); however, a major limitation of eQTLs is their low resolution, which precludes investigation of the causal variants and their molecular mechanisms. Here we report RNA-seq and full genome sequences for 85 diverse isolates of the yeast Saccharomyces cerevisiae-including wild, domesticated, and human clinical strains-which allowed us to perform eQTL mapping with 50-fold higher resolution than previously possible. In addition to variants in promoters, we uncovered an important role for variants in 3'UTRs, especially those affecting binding of the PUF family of RNA-binding proteins. The eQTLs are predominantly under negative selection, particularly those affecting essential genes and conserved genes. However, applying the sign test for lineage-specific selection revealed the polygenic up-regulation of dozens of biofilm suppressor genes in strains isolated from human patients, consistent with the key role of biofilms in fungal pathogenicity. In addition, a single variant in the promoter of a biofilm suppressor, NIT3, showed the strongest genome-wide association with clinical origin. Altogether, our results demonstrate the power of high-resolution eQTL mapping in understanding the molecular mechanisms of regulatory variation, as well as the natural selection acting on this variation that drives adaptation to environments, ranging from laboratories to vineyards to the human body.
Collapse
|
40
|
Vockley CM, Barrera A, Reddy TE. Decoding the role of regulatory element polymorphisms in complex disease. Curr Opin Genet Dev 2016; 43:38-45. [PMID: 27984826 DOI: 10.1016/j.gde.2016.10.007] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Accepted: 10/24/2016] [Indexed: 11/29/2022]
Abstract
Genetic variation in gene regulatory elements contributes to diverse human diseases, ranging from rare and severe developmental defects to common and complex diseases such as obesity and diabetes. Early examples of regulatory mechanisms of human diseases involve large chromosomal rearrangements that change the regulatory connections within the genome. Single nucleotide variants in regulatory elements can also contribute to disease, potentially via demonstrated associations with changes in transcription factor binding, enhancer activity, post-translational histone modifications, long-range enhancer-promoter interactions, or RNA polymerase recruitment. Establishing causality between non-coding genetic variants, gene regulation, and disease has recently become more feasible with advances in genome-editing and epigenome-editing technologies. As establishing causal regulatory mechanisms of diseases becomes routine, functional annotation of target genes is likely to emerge as a major bottleneck for translation into patient benefits. In this review, we discuss the history and recent advances in understanding the regulatory mechanisms of human disease, and new challenges likely to be encountered once establishing those mechanisms becomes rote.
Collapse
Affiliation(s)
- Christopher M Vockley
- Department of Cell Biology, Duke University Medical Center, Durham, NC 27710, United States; Department of Biostatistics & Bioinformatics, and Center for Genomic & Computational Biology, Duke University Medical Center, Durham, NC 27710, United States
| | - Alejandro Barrera
- Department of Biostatistics & Bioinformatics, and Center for Genomic & Computational Biology, Duke University Medical Center, Durham, NC 27710, United States
| | - Timothy E Reddy
- Department of Biostatistics & Bioinformatics, and Center for Genomic & Computational Biology, Duke University Medical Center, Durham, NC 27710, United States.
| |
Collapse
|
41
|
The roles of RNA processing in translating genotype to phenotype. NATURE REVIEWS. MOLECULAR CELL BIOLOGY 2016. [PMID: 27847391 DOI: 10.1038/nrm.2016.139.] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A goal of human genetics studies is to determine the mechanisms by which genetic variation produces phenotypic differences that affect human health. Efforts in this respect have previously focused on genetic variants that affect mRNA levels by altering epigenetic and transcriptional regulation. Recent studies show that genetic variants that affect RNA processing are at least equally as common as, and are largely independent from, those variants that affect transcription. We highlight the impact of genetic variation on pre-mRNA splicing and polyadenylation, and on the stability, translation and structure of mRNAs as mechanisms that produce phenotypic traits. These results emphasize the importance of including RNA processing signals in analyses to identify functional variants.
Collapse
|
42
|
Manning KS, Cooper TA. The roles of RNA processing in translating genotype to phenotype. Nat Rev Mol Cell Biol 2016; 18:102-114. [PMID: 27847391 DOI: 10.1038/nrm.2016.139] [Citation(s) in RCA: 141] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
A goal of human genetics studies is to determine the mechanisms by which genetic variation produces phenotypic differences that affect human health. Efforts in this respect have previously focused on genetic variants that affect mRNA levels by altering epigenetic and transcriptional regulation. Recent studies show that genetic variants that affect RNA processing are at least equally as common as, and are largely independent from, those variants that affect transcription. We highlight the impact of genetic variation on pre-mRNA splicing and polyadenylation, and on the stability, translation and structure of mRNAs as mechanisms that produce phenotypic traits. These results emphasize the importance of including RNA processing signals in analyses to identify functional variants.
Collapse
Affiliation(s)
- Kassie S Manning
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, USA.,Integrative Molecular and Biomedical Sciences Program, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Thomas A Cooper
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular Physiology and Biophysics, Baylor College of Medicine, Houston, Texas 77030, USA.,Integrative Molecular and Biomedical Sciences Program, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
43
|
Abstract
We give an overview of experimental and computational methods to estimate RNA metabolism rates genome-wide. We then advocate a local definition of RNA metabolism rate at the level of individual phosphodiester bonds. Rates of formation and disappearance of individual bonds are unambiguously defined, in contrast to rates of complete transcripts. We show that over previous approaches, the recently developed transient transcriptome sequencing (TT-seq) protocol allows for estimation of metabolism rates of individual bonds with least positional bias.
Collapse
Affiliation(s)
- Leonhard Wachutka
- a Department of Informatics , Technical University of Munich, Garching bei München , Germany
| | - Julien Gagneur
- a Department of Informatics , Technical University of Munich, Garching bei München , Germany
| |
Collapse
|
44
|
Taliaferro JM, Lambert NJ, Sudmant PH, Dominguez D, Merkin JJ, Alexis MS, Bazile C, Burge CB. RNA Sequence Context Effects Measured In Vitro Predict In Vivo Protein Binding and Regulation. Mol Cell 2016; 64:294-306. [PMID: 27720642 DOI: 10.1016/j.molcel.2016.08.035] [Citation(s) in RCA: 88] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2016] [Revised: 08/01/2016] [Accepted: 08/30/2016] [Indexed: 10/20/2022]
Abstract
Many RNA binding proteins (RBPs) bind specific RNA sequence motifs, but only a small fraction (∼15%-40%) of RBP motif occurrences are occupied in vivo. To determine which contextual features discriminate between bound and unbound motifs, we performed an in vitro binding assay using 12,000 mouse RNA sequences with the RBPs MBNL1 and RBFOX2. Surprisingly, the strength of binding to motif occurrences in vitro was significantly correlated with in vivo binding, developmental regulation, and evolutionary age of alternative splicing. Multiple lines of evidence indicate that the primary context effect that affects binding in vitro and in vivo is RNA secondary structure. Large-scale combinatorial mutagenesis of unfavorable sequence contexts revealed a consistent pattern whereby mutations that increased motif accessibility improved protein binding and regulatory activity. Our results indicate widespread inhibition of motif binding by local RNA secondary structure and suggest that mutations that alter sequence context commonly affect RBP binding and regulation.
Collapse
Affiliation(s)
- J Matthew Taliaferro
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Nicole J Lambert
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Peter H Sudmant
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Daniel Dominguez
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Jason J Merkin
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Maria S Alexis
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Cassandra Bazile
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Christopher B Burge
- Departments of Biology and Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.,Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| |
Collapse
|
45
|
RNA Sequencing and Genetic Disease. CURRENT GENETIC MEDICINE REPORTS 2016. [DOI: 10.1007/s40142-016-0098-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
46
|
Lappalainen T. Functional genomics bridges the gap between quantitative genetics and molecular biology. Genome Res 2016; 25:1427-31. [PMID: 26430152 PMCID: PMC4579327 DOI: 10.1101/gr.190983.115] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Deep characterization of molecular function of genetic variants in the human genome is becoming increasingly important for understanding genetic associations to disease and for learning to read the regulatory code of the genome. In this paper, I discuss how recent advances in both quantitative genetics and molecular biology have contributed to understanding functional effects of genetic variants, lessons learned from eQTL studies, and future challenges in this field.
Collapse
Affiliation(s)
- Tuuli Lappalainen
- New York Genome Center, New York, New York 10013, USA; Department of Systems Biology, Columbia University, New York, New York 10032, USA
| |
Collapse
|
47
|
Rambout X, Detiffe C, Bruyr J, Mariavelle E, Cherkaoui M, Brohée S, Demoitié P, Lebrun M, Soin R, Lesage B, Guedri K, Beullens M, Bollen M, Farazi TA, Kettmann R, Struman I, Hill DE, Vidal M, Kruys V, Simonis N, Twizere JC, Dequiedt F. The transcription factor ERG recruits CCR4-NOT to control mRNA decay and mitotic progression. Nat Struct Mol Biol 2016; 23:663-72. [PMID: 27273514 DOI: 10.1038/nsmb.3243] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 05/13/2016] [Indexed: 01/08/2023]
Abstract
Control of mRNA levels, a fundamental aspect in the regulation of gene expression, is achieved through a balance between mRNA synthesis and decay. E26-related gene (Erg) proteins are canonical transcription factors whose previously described functions are confined to the control of mRNA synthesis. Here, we report that ERG also regulates gene expression by affecting mRNA stability and identify the molecular mechanisms underlying this function in human cells. ERG is recruited to mRNAs via interaction with the RNA-binding protein RBPMS, and it promotes mRNA decay by binding CNOT2, a component of the CCR4-NOT deadenylation complex. Transcriptome-wide mRNA stability analysis revealed that ERG controls the degradation of a subset of mRNAs highly connected to Aurora signaling, whose decay during S phase is necessary for mitotic progression. Our data indicate that control of gene expression by mammalian transcription factors may follow a more complex scheme than previously anticipated, integrating mRNA synthesis and degradation.
Collapse
Affiliation(s)
- Xavier Rambout
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Cécile Detiffe
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Jonathan Bruyr
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Emeline Mariavelle
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Majid Cherkaoui
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Sylvain Brohée
- BiGRe, Université Libre de Bruxelles (ULB), Bruxelles, Belgium.,Computer Science Department, ULB, Bruxelles, Belgium
| | - Pauline Demoitié
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Marielle Lebrun
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Inflammation, Infection &Immunity, ULg, Liège, Belgium
| | | | - Bart Lesage
- Department of Cellular and Molecular Medicine, University of Leuven (KUL), Leuven, Belgium
| | - Katia Guedri
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Monique Beullens
- Department of Cellular and Molecular Medicine, University of Leuven (KUL), Leuven, Belgium
| | - Mathieu Bollen
- Department of Cellular and Molecular Medicine, University of Leuven (KUL), Leuven, Belgium
| | - Thalia A Farazi
- Howard Hughes Medical Institute, Rockefeller University, New York, New York, USA
| | - Richard Kettmann
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Ingrid Struman
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Cancer, ULg, Liège, Belgium
| | - David E Hill
- Center for Cancer Systems Biology (CCSB), Department of Cancer Biology, Dana-Farber Cancer Institute (DFCI), Boston, Massachusetts, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Department of Cancer Biology, Dana-Farber Cancer Institute (DFCI), Boston, Massachusetts, USA.,Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | | | - Nicolas Simonis
- BiGRe, Université Libre de Bruxelles (ULB), Bruxelles, Belgium
| | - Jean-Claude Twizere
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| | - Franck Dequiedt
- Interdisciplinary Cluster for Applied Genoproteomics (GIGA-R), University of Liège (ULg), Liège, Belgium.,GIGA-Molecular Biology in Diseases, ULg, Liège, Belgium
| |
Collapse
|
48
|
Aw J, Shen Y, Wilm A, Sun M, Lim X, Boon KL, Tapsin S, Chan YS, Tan CP, Sim A, Zhang T, Susanto T, Fu Z, Nagarajan N, Wan Y. In Vivo Mapping of Eukaryotic RNA Interactomes Reveals Principles of Higher-Order Organization and Regulation. Mol Cell 2016; 62:603-17. [DOI: 10.1016/j.molcel.2016.04.028] [Citation(s) in RCA: 226] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2016] [Revised: 04/07/2016] [Accepted: 04/22/2016] [Indexed: 01/01/2023]
|
49
|
Li YI, van de Geijn B, Raj A, Knowles DA, Petti AA, Golan D, Gilad Y, Pritchard JK. RNA splicing is a primary link between genetic variation and disease. Science 2016; 352:600-4. [PMID: 27126046 PMCID: PMC5182069 DOI: 10.1126/science.aad9417] [Citation(s) in RCA: 445] [Impact Index Per Article: 49.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 03/25/2016] [Indexed: 12/14/2022]
Abstract
Noncoding variants play a central role in the genetics of complex traits, but we still lack a full understanding of the molecular pathways through which they act. We quantified the contribution of cis-acting genetic effects at all major stages of gene regulation from chromatin to proteins, in Yoruba lymphoblastoid cell lines (LCLs). About ~65% of expression quantitative trait loci (eQTLs) have primary effects on chromatin, whereas the remaining eQTLs are enriched in transcribed regions. Using a novel method, we also detected 2893 splicing QTLs, most of which have little or no effect on gene-level expression. These splicing QTLs are major contributors to complex traits, roughly on a par with variants that affect gene expression levels. Our study provides a comprehensive view of the mechanisms linking genetic variation to variation in human gene regulation.
Collapse
Affiliation(s)
- Yang I Li
- Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Anil Raj
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - David A Knowles
- Department of Computer Science, Stanford University, Stanford, CA, USA. Department of Radiology, Stanford University, Stanford, CA, USA
| | - Allegra A Petti
- Genome Institute, Washington University in St. Louis, St. Louis, MO, USA
| | - David Golan
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA, USA. Department of Biology, Stanford University, Stanford, CA, USA. Howard Hughes Medical Institute, Stanford University, Stanford, CA, USA.
| |
Collapse
|
50
|
Gutierrez-Arcelus M, Rich SS, Raychaudhuri S. Autoimmune diseases - connecting risk alleles with molecular traits of the immune system. Nat Rev Genet 2016; 17:160-74. [PMID: 26907721 PMCID: PMC4896831 DOI: 10.1038/nrg.2015.33] [Citation(s) in RCA: 166] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Genome-wide strategies have driven the discovery of more than 300 susceptibility loci for autoimmune diseases. However, for almost all loci, understanding of the mechanisms leading to autoimmunity remains limited, and most variants that are likely to be causal are in non-coding regions of the genome. A critical next step will be to identify the in vivo and ex vivo immunophenotypes that are affected by risk variants. To do this, key cell types and cell states that are implicated in autoimmune diseases will need to be defined. Functional genomic annotations from these cell types and states can then be used to resolve candidate genes and causal variants. Together with longitudinal studies, this approach may yield pivotal insights into how autoimmunity is triggered.
Collapse
Affiliation(s)
- Maria Gutierrez-Arcelus
- Division of Genetics, and Division of Rheumatology, Immunology and Allergy, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts 02142, USA
- Partners Center for Personalized Genetic Medicine, Boston, Massachusetts 02115, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, Virginia 22908, USA
| | - Soumya Raychaudhuri
- Division of Genetics, and Division of Rheumatology, Immunology and Allergy, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts 02142, USA
- Partners Center for Personalized Genetic Medicine, Boston, Massachusetts 02115, USA
- Faculty of Medical and Human Sciences, University of Manchester, Manchester M13 9PL, UK
- Department of Medicine, Karolinska Institutet and Karolinska University Hospital Solna, Stockholm SE-171 77, Sweden
| |
Collapse
|