2
|
Webb A, Papp AC, Curtis A, Newman LC, Pietrzak M, Seweryn M, Handelman SK, Rempala GA, Wang D, Graziosa E, Tyndale RF, Lerman C, Kelsoe JR, Mash DC, Sadee W. RNA sequencing of transcriptomes in human brain regions: protein-coding and non-coding RNAs, isoforms and alleles. BMC Genomics 2015; 16:990. [PMID: 26597164 PMCID: PMC4657279 DOI: 10.1186/s12864-015-2207-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 11/12/2015] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND We used RNA sequencing to analyze transcript profiles of ten autopsy brain regions from ten subjects. RNA sequencing techniques were designed to detect both coding and non-coding RNA, splice isoform composition, and allelic expression. Brain regions were selected from five subjects with a documented history of smoking and five non-smokers. Paired-end RNA sequencing was performed on SOLiD instruments to a depth of >40 million reads, using linearly amplified, ribosomally depleted RNA. Sequencing libraries were prepared with both poly-dT and random hexamer primers to detect all RNA classes, including long non-coding (lncRNA), intronic and intergenic transcripts, and transcripts lacking poly-A tails, providing additional data not previously available. The study was designed to generate a database of the complete transcriptomes in brain region for gene network analyses and discovery of regulatory variants. RESULTS Of 20,318 protein coding and 18,080 lncRNA genes annotated from GENCODE and lncipedia, 12 thousand protein coding and 2 thousand lncRNA transcripts were detectable at a conservative threshold. Of the aligned reads, 52 % were exonic, 34 % intronic and 14 % intergenic. A majority of protein coding genes (65 %) was expressed in all regions, whereas ncRNAs displayed a more restricted distribution. Profiles of RNA isoforms varied across brain regions and subjects at multiple gene loci, with neurexin 3 (NRXN3) a prominent example. Allelic RNA ratios deviating from unity were identified in > 400 genes, detectable in both protein-coding and non-coding genes, indicating the presence of cis-acting regulatory variants. Mathematical modeling was used to identify RNAs stably expressed in all brain regions (serving as potential markers for normalizing expression levels), linked to basic cellular functions. An initial analysis of differential expression analysis between smokers and nonsmokers implicated a number of genes, several previously associated with nicotine exposure. CONCLUSIONS RNA sequencing identifies distinct and consistent differences in gene expression between brain regions, with non-coding RNA displaying greater diversity between brain regions than mRNAs. Numerous RNAs exhibit robust allele selective expression, proving a means for discovery of cis-acting regulatory factors with potential clinical relevance.
Collapse
Affiliation(s)
- Amy Webb
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
| | - Audrey C Papp
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
| | - Amanda Curtis
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
| | - Leslie C Newman
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
| | - Maciej Pietrzak
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
- Division of Biostatistics, College of Public Health, and Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, USA.
| | - Michal Seweryn
- Division of Biostatistics, College of Public Health, and Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, USA.
| | - Samuel K Handelman
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
| | - Grzegorz A Rempala
- Division of Biostatistics, College of Public Health, and Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, USA.
| | - Daqing Wang
- Thermo Fisher Scientific, South San Francisco, CA, 94080, USA.
| | - Erica Graziosa
- Thermo Fisher Scientific, South San Francisco, CA, 94080, USA.
| | - Rachel F Tyndale
- Center for Addiction and Mental Health and Departments of Psychiatry and Pharmacology & Toxicology, University of Toronto, Toronto, Ontario, Canada.
| | - Caryn Lerman
- Department of Psychiatry, Annenberg School for Communication, and Abramson Cancer Center, University of Pennsylvania, Philadelphia, PA, USA.
| | - John R Kelsoe
- Department of Psychiatry, Laboratory of Psychiatric Genomics, University of California, San Diego, USA.
- VA San Diego Healthcare System, La Jolla, San Diego, CA, USA.
| | - Deborah C Mash
- Department of Neurology, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA.
| | - Wolfgang Sadee
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
- Departments of Pharmacology, College of Medicine; Colleges of Pharmacy and Environmental Health Sciences, The Ohio State University, Columbus, OH, USA.
- Departments of Psychiatry, College of Medicine; Colleges of Pharmacy and Environmental Health Sciences, The Ohio State University, Columbus, OH, USA.
- Departments of Human Genetics/Internal Medicine, College of Medicine; Colleges of Pharmacy and Environmental Health Sciences, The Ohio State University, 5078 Graves Hall, 333 W. 10th Avenue, Columbus, OH, 43210, USA.
| |
Collapse
|
3
|
Mascarenhas R, Pietrzak M, Smith RM, Webb A, Wang D, Papp AC, Pinsonneault JK, Seweryn M, Rempala G, Sadee W. Allele-Selective Transcriptome Recruitment to Polysomes Primed for Translation: Protein-Coding and Noncoding RNAs, and RNA Isoforms. PLoS One 2015; 10:e0136798. [PMID: 26331722 PMCID: PMC4558023 DOI: 10.1371/journal.pone.0136798] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2015] [Accepted: 08/07/2015] [Indexed: 11/19/2022] Open
Abstract
mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.
Collapse
Affiliation(s)
- Roshan Mascarenhas
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Maciej Pietrzak
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, Ohio, United States of America
| | - Ryan M. Smith
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Amy Webb
- Department of Biomedical Informatics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Danxin Wang
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Audrey C. Papp
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Julia K. Pinsonneault
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Michal Seweryn
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, Ohio, United States of America
- Department of Mathematics and Computer Science, University of Lodz, Lodz, Poland
- Mathematical Biosciences Institute, The Ohio State University, Columbus, Ohio, United States of America
| | - Grzegorz Rempala
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, Ohio, United States of America
- Mathematical Biosciences Institute, The Ohio State University, Columbus, Ohio, United States of America
| | - Wolfgang Sadee
- Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
- Department of Medical Genetics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| |
Collapse
|
4
|
Lu R, Smith RM, Seweryn M, Wang D, Hartmann K, Webb A, Sadee W, Rempala GA. Analyzing allele specific RNA expression using mixture models. BMC Genomics 2015; 16:566. [PMID: 26231172 PMCID: PMC4521363 DOI: 10.1186/s12864-015-1749-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2015] [Accepted: 07/03/2015] [Indexed: 11/10/2022] Open
Abstract
Background Measuring allele-specific RNA expression provides valuable insights into cis-acting genetic and epigenetic regulation of gene expression. Widespread adoption of high-throughput sequencing technologies for studying RNA expression (RNA-Seq) permits measurement of allelic RNA expression imbalance (AEI) at heterozygous single nucleotide polymorphisms (SNPs) across the entire transcriptome, and this approach has become especially popular with the emergence of large databases, such as GTEx. However, the existing binomial-type methods used to model allelic expression from RNA-seq assume a strong negative correlation between reference and variant allele reads, which may not be reasonable biologically. Results Here we propose a new strategy for AEI analysis using RNA-seq data. Under the null hypothesis of no AEI, a group of SNPs (possibly across multiple genes) is considered comparable if their respective total sums of the allelic reads are of similar magnitude. Within each group of “comparable” SNPs, we identify SNPs with AEI signal by fitting a mixture of folded Skellam distributions to the absolute values of read differences. By applying this methodology to RNA-Seq data from human autopsy brain tissues, we identified numerous instances of moderate to strong imbalanced allelic RNA expression at heterozygous SNPs. Findings with SLC1A3 mRNA exhibiting known expression differences are discussed as examples. Conclusion The folded Skellam mixture model searches for SNPs with significant difference between reference and variant allele reads (adjusted for different library sizes), using information from a group of “comparable” SNPs across multiple genes. This model is particularly suitable for performing AEI analysis on genes with few heterozygous SNPs available from RNA-seq, and it can fit over-dispersed read counts without specifying the direction of the correlation between reference and variant alleles. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1749-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rong Lu
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, 43210, USA
| | - Ryan M Smith
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
| | - Michal Seweryn
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, 43210, USA.,Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, 43201, USA
| | - Danxin Wang
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
| | - Katherine Hartmann
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
| | - Amy Webb
- Department of Biomedical Informatics, Program in Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, Columbus, OH, USA
| | - Wolfgang Sadee
- Center for Pharmacogenomics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
| | - Grzegorz A Rempala
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, 43210, USA. .,Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, 43201, USA.
| |
Collapse
|