101
|
Gusic M, Prokisch H. ncRNAs: New Players in Mitochondrial Health and Disease? Front Genet 2020; 11:95. [PMID: 32180794 PMCID: PMC7059738 DOI: 10.3389/fgene.2020.00095] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 01/28/2020] [Indexed: 12/19/2022] Open
Abstract
The regulation of mitochondrial proteome is unique in that its components have origins in both mitochondria and nucleus. With the development of OMICS technologies, emerging evidence indicates an interaction between mitochondria and nucleus based not only on the proteins but also on the non-coding RNAs (ncRNAs). It is now accepted that large parts of the non‐coding genome are transcribed into various ncRNA species. Although their characterization has been a hot topic in recent years, the function of the majority remains unknown. Recently, ncRNA species microRNA (miRNA) and long-non coding RNAs (lncRNA) have been gaining attention as direct or indirect modulators of the mitochondrial proteome homeostasis. These ncRNA can impact mitochondria indirectly by affecting transcripts encoding for mitochondrial proteins in the cytoplasm. Furthermore, reports of mitochondria-localized miRNAs, termed mitomiRs, and lncRNAs directly regulating mitochondrial gene expression suggest the import of RNA to mitochondria, but also transcription from the mitochondrial genome. Interestingly, ncRNAs have been also shown to hide small open reading frames (sORFs) encoding for small functional peptides termed micropeptides, with several examples reported with a role in mitochondria. In this review, we provide a literature overview on ncRNAs and micropeptides found to be associated with mitochondrial biology in the context of both health and disease. Although reported, small study overlap and rare replications by other groups make the presence, transport, and role of ncRNA in mitochondria an attractive, but still challenging subject. Finally, we touch the topic of their potential as prognosis markers and therapeutic targets.
Collapse
Affiliation(s)
- Mirjana Gusic
- Institute of Human Genetics, Helmholtz Zentrum München, Neuherberg, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany.,Institute of Human Genetics, Technical University of Munich, Munich, Germany
| | - Holger Prokisch
- Institute of Human Genetics, Helmholtz Zentrum München, Neuherberg, Germany.,Institute of Human Genetics, Technical University of Munich, Munich, Germany
| |
Collapse
|
102
|
Finkel Y, Schmiedel D, Tai-Schmiedel J, Nachshon A, Winkler R, Dobesova M, Schwartz M, Mandelboim O, Stern-Ginossar N. Comprehensive annotations of human herpesvirus 6A and 6B genomes reveal novel and conserved genomic features. eLife 2020; 9:e50960. [PMID: 31944176 PMCID: PMC6964970 DOI: 10.7554/elife.50960] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 11/27/2019] [Indexed: 12/14/2022] Open
Abstract
Human herpesvirus-6 (HHV-6) A and B are ubiquitous betaherpesviruses, infecting the majority of the human population. They encompass large genomes and our understanding of their protein coding potential is far from complete. Here, we employ ribosome-profiling and systematic transcript-analysis to experimentally define HHV-6 translation products. We identify hundreds of new open reading frames (ORFs), including upstream ORFs (uORFs) and internal ORFs (iORFs), generating a complete unbiased atlas of HHV-6 proteome. By integrating systematic data from the prototypic betaherpesvirus, human cytomegalovirus, we uncover numerous uORFs and iORFs conserved across betaherpesviruses and we show uORFs are enriched in late viral genes. We identified three highly abundant HHV-6 encoded long non-coding RNAs, one of which generates a non-polyadenylated stable intron appearing to be a conserved feature of betaherpesviruses. Overall, our work reveals the complexity of HHV-6 genomes and highlights novel features conserved between betaherpesviruses, providing a rich resource for future functional studies.
Collapse
Affiliation(s)
- Yaara Finkel
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | - Dominik Schmiedel
- The Lautenberg Center for General and Tumor ImmunologyInstitute for Medical Research Israel-Canada, The Hebrew University Hadassah Medical SchoolJerusalemIsrael
| | | | - Aharon Nachshon
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | - Roni Winkler
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | - Martina Dobesova
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | - Michal Schwartz
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | - Ofer Mandelboim
- The Lautenberg Center for General and Tumor ImmunologyInstitute for Medical Research Israel-Canada, The Hebrew University Hadassah Medical SchoolJerusalemIsrael
| | | |
Collapse
|
103
|
Vaklavas C, Blume SW, Grizzle WE. Hallmarks and Determinants of Oncogenic Translation Revealed by Ribosome Profiling in Models of Breast Cancer. Transl Oncol 2020; 13:452-470. [PMID: 31911279 PMCID: PMC6948383 DOI: 10.1016/j.tranon.2019.12.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Revised: 11/28/2019] [Accepted: 12/01/2019] [Indexed: 12/21/2022] Open
Abstract
Gene expression is extensively and dynamically modulated at the level of translation. How cancer cells prioritize the translation of certain mRNAs over others from a pool of competing mRNAs remains an open question. Here, we analyze translation in cell line models of breast cancer and normal mammary tissue by ribosome profiling. We identify key recurrent themes of oncogenic translation: higher ribosome occupancy, greater variance of translational efficiencies, and preferential translation of transcriptional regulators and signaling proteins in malignant cells as compared with their nonmalignant counterpart. We survey for candidate RNA interacting proteins that could associate with the 5′untranslated regions of the transcripts preferentially translated in breast tumour cells. We identify SRSF1, a prototypic splicing factor, to have a pervasive direct and indirect impact on translation. In a representative estrogen receptor–positive and estrogen receptor–negative cell line, we find that protein synthesis relies heavily on SRSF1. SRSF1 is predominantly intranuclear. Under certain conditions, SRSF1 translocates from the nucleus to the cytoplasm where it associates with MYC and CDK1 mRNAs and upregulates their internal ribosome entry site–mediated translation. Our results point to a synergy between splicing and translation and unveil how certain RNA-binding proteins modulate the translational landscape in breast cancer.
Collapse
Affiliation(s)
- Christos Vaklavas
- Department of Medicine, Division of Hematology / Oncology, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
| | - Scott W Blume
- Department of Medicine, Division of Hematology / Oncology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - William E Grizzle
- Department of Pathology, O'Neal Comprehensive Cancer Centre, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| |
Collapse
|
104
|
Clauwaert J, Menschaert G, Waegeman W. DeepRibo: a neural network for precise gene annotation of prokaryotes by combining ribosome profiling signal and binding site patterns. Nucleic Acids Res 2019; 47:e36. [PMID: 30753697 PMCID: PMC6451124 DOI: 10.1093/nar/gkz061] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 01/02/2019] [Accepted: 01/30/2019] [Indexed: 12/13/2022] Open
Abstract
Annotation of gene expression in prokaryotes often finds itself corrected due to small variations of the annotated gene regions observed between different (sub)-species. It has become apparent that traditional sequence alignment algorithms, used for the curation of genomes, are not able to map the full complexity of the genomic landscape. We present DeepRibo, a novel neural network utilizing features extracted from ribosome profiling information and binding site sequence patterns that shows to be a precise tool for the delineation and annotation of expressed genes in prokaryotes. The neural network combines recurrent memory cells and convolutional layers, adapting the information gained from both the high-throughput ribosome profiling data and ribosome binding translation initiation sequence region into one model. DeepRibo is designed as a single model trained on a variety of ribosome profiling experiments, used for the identification of open reading frames in prokaryotes without a priori knowledge of the translational landscape. Through extensive validation of the model trained on various sets of data, multiple species sequence similarity, mass spectrometry and Edman degradation verified proteins, the effectiveness of DeepRibo is highlighted.
Collapse
Affiliation(s)
- Jim Clauwaert
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| | - Gerben Menschaert
- Biobix, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| |
Collapse
|
105
|
Reisacher C, Arbibe L. Not lost in host translation: The new roles of long noncoding RNAs in infectious diseases. Cell Microbiol 2019; 21:e13119. [PMID: 31634981 DOI: 10.1111/cmi.13119] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 09/10/2019] [Accepted: 09/17/2019] [Indexed: 12/20/2022]
Abstract
Long non-coding RNAs (lncRNAs) play a central role in the regulation of gene expression. Although they were initially described as mRNA-like transcripts not encoding proteins, global approaches such as ribosome profiling have shown that they frequently associate with ribosomes, opening the possibility that lncRNAs are a source of cryptic translation events with functional roles. Recent studies have shed more light on small ORFs borne by lncRNAs and encoding short peptides potentially involved in infectious immunity. This review outlines the main strategies used to determine the coding potential of lncRNAs and discusses our emerging understanding of the implication of the encoded peptides in infectious diseases.
Collapse
Affiliation(s)
- Caroline Reisacher
- Department of Immunology, Infectiology and Hematology, Institut Necker-Enfants Malades (INEM), INSERM U1151, CNRS UMR 8253, Université Paris Descartes, Paris, France
| | - Laurence Arbibe
- Department of Immunology, Infectiology and Hematology, Institut Necker-Enfants Malades (INEM), INSERM U1151, CNRS UMR 8253, Université Paris Descartes, Paris, France
| |
Collapse
|
106
|
Mudge JM, Jungreis I, Hunt T, Gonzalez JM, Wright JC, Kay M, Davidson C, Fitzgerald S, Seal R, Tweedie S, He L, Waterhouse RM, Li Y, Bruford E, Choudhary JS, Frankish A, Kellis M. Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci. Genome Res 2019; 29:2073-2087. [PMID: 31537640 PMCID: PMC6886504 DOI: 10.1101/gr.246462.118] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 09/09/2019] [Indexed: 12/15/2022]
Abstract
The most widely appreciated role of DNA is to encode protein, yet the exact portion of the human genome that is translated remains to be ascertained. We previously developed PhyloCSF, a widely used tool to identify evolutionary signatures of protein-coding regions using multispecies genome alignments. Here, we present the first whole-genome PhyloCSF prediction tracks for human, mouse, chicken, fly, worm, and mosquito. We develop a workflow that uses machine learning to predict novel conserved protein-coding regions and efficiently guide their manual curation. We analyze more than 1000 high-scoring human PhyloCSF regions and confidently add 144 conserved protein-coding genes to the GENCODE gene set, as well as additional coding regions within 236 previously annotated protein-coding genes, and 169 pseudogenes, most of them disabled after primates diverged. The majority of these represent new discoveries, including 70 previously undetected protein-coding genes. The novel coding genes are additionally supported by single-nucleotide variant evidence indicative of continued purifying selection in the human lineage, coding-exon splicing evidence from new GENCODE transcripts using next-generation transcriptomic data sets, and mass spectrometry evidence of translation for several new genes. Our discoveries required simultaneous comparative annotation of other vertebrate genomes, which we show is essential to remove spurious ORFs and to distinguish coding from pseudogene regions. Our new coding regions help elucidate disease-associated regions by revealing that 118 GWAS variants previously thought to be noncoding are in fact protein altering. Altogether, our PhyloCSF data sets and algorithms will help researchers seeking to interpret these genomes, while our new annotations present exciting loci for further experimental characterization.
Collapse
Affiliation(s)
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Toby Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Jose Manuel Gonzalez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - James C Wright
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Mike Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Claire Davidson
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Stephen Fitzgerald
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Ruth Seal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Susan Tweedie
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Liang He
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Robert M Waterhouse
- Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Yue Li
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Elspeth Bruford
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom.,Department of Haematology, University of Cambridge, Cambridge CB2 0PT, United Kingdom
| | - Jyoti S Choudhary
- Functional Proteomics, Division of Cancer Biology, Institute of Cancer Research, London SW7 3RP, United Kingdom
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, Massachusetts 02139, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
107
|
Wu HYL, Song G, Walley JW, Hsu PY. The Tomato Translational Landscape Revealed by Transcriptome Assembly and Ribosome Profiling. PLANT PHYSIOLOGY 2019; 181:367-380. [PMID: 31248964 PMCID: PMC6716236 DOI: 10.1104/pp.19.00541] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Accepted: 06/10/2019] [Indexed: 05/14/2023]
Abstract
Recent applications of translational control in Arabidopsis (Arabidopsis thaliana) highlight the potential power of manipulating mRNA translation for crop improvement. However, to what extent translational regulation is conserved between Arabidopsis and other species is largely unknown, and the translatome of most crops remains poorly studied. Here, we combined de novo transcriptome assembly and ribosome profiling to study global mRNA translation in tomato (Solanum lycopersicum) roots. Exploiting features corresponding to active translation, we discovered widespread unannotated translation events, including 1,329 upstream open reading frames (uORFs) within the 5' untranslated regions of annotated coding genes and 354 small ORFs (sORFs) among unannotated transcripts. uORFs may repress translation of their downstream main ORFs, whereas sORFs may encode signaling peptides. Besides evolutionarily conserved sORFs, we uncovered 96 Solanaceae-specific sORFs, revealing the importance of studying translatomes directly in crops. Proteomic analysis confirmed that some of the unannotated ORFs generate stable proteins in planta. In addition to defining the translatome, our results reveal the global regulation by uORFs and microRNAs. Despite diverging over 100 million years ago, many translational features are well conserved between Arabidopsis and tomato. Thus, our approach provides a high-throughput method to discover unannotated ORFs, elucidates evolutionarily conserved and unique translational features, and identifies regulatory mechanisms hidden in a crop genome.
Collapse
Affiliation(s)
- Hsin-Yen Larry Wu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824
| | - Gaoyuan Song
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, Iowa 50011
| | - Justin W Walley
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, Iowa 50011
| | - Polly Yingshan Hsu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824
| |
Collapse
|
108
|
Xu Z, Hu L, Shi B, Geng S, Xu L, Wang D, Lu ZJ. Ribosome elongating footprints denoised by wavelet transform comprehensively characterize dynamic cellular translation events. Nucleic Acids Res 2019; 46:e109. [PMID: 29945224 PMCID: PMC6182183 DOI: 10.1093/nar/gky533] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 05/31/2018] [Indexed: 02/06/2023] Open
Abstract
Translation is dynamically regulated during cell development and stress response. In order to detect actively translated open reading frames (ORFs) and dynamic cellular translation events, we have developed a computational method, RiboWave, to process ribosome profiling data. RiboWave utilizes wavelet transform to denoise the original signal by extracting 3-nt periodicity of ribosomes and precisely locate their footprint denoted as Periodic Footprint P-site (PF P-site). Such high-resolution footprint is found to capture the full track of actively elongating ribosomes, from which translational landscape can be explicitly characterized. We compare RiboWave with several published methods, like RiboTaper, ORFscore and RibORF, and found that RiboWave outperforms them in both accuracy and usage when defining actively translated ORFs. Moreover, we show that PF P-site derived by RiboWave shows superior performance in characterizing the dynamics and complexity of cellular translatome by accurately estimating the abundance of protein levels, assessing differential translation and identifying dynamic translation frameshift.
Collapse
Affiliation(s)
- Zhiyu Xu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Long Hu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Binbin Shi
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - SiSi Geng
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Longchen Xu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Dong Wang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Zhi J Lu
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
109
|
Michel AM, Kiniry SJ, O'Connor PBF, Mullan JP, Baranov PV. GWIPS-viz: 2018 update. Nucleic Acids Res 2019; 46:D823-D830. [PMID: 28977460 PMCID: PMC5753223 DOI: 10.1093/nar/gkx790] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 08/29/2017] [Indexed: 12/15/2022] Open
Abstract
The GWIPS-viz browser (http://gwips.ucc.ie/) is an on-line genome browser which is tailored for exploring ribosome profiling (Ribo-seq) data. Since its publication in 2014, GWIPS-viz provides Ribo-seq data for an additional 14 genomes bringing the current total to 23. The integration of new Ribo-seq data has been automated thereby increasing the number of available tracks to 1792, a 10-fold increase in the last three years. The increase is particularly substantial for data derived from human sources. Following user requests, we added the functionality to download these tracks in bigWig format. We also incorporated new types of data (e.g. TCP-seq) as well as auxiliary tracks from other sources that help with the interpretation of Ribo-seq data. Improvements in the visualization of the data have been carried out particularly for bacterial genomes where the Ribo-seq data are now shown in a strand specific manner. For higher eukaryotic datasets, we provide characteristics of individual datasets using the RUST program which includes the triplet periodicity, sequencing biases and relative inferred A-site dwell times. This information can be used for assessing the quality of Ribo-seq datasets. To improve the power of the signal, we aggregate Ribo-seq data from several studies into Global aggregate tracks for each genome.
Collapse
Affiliation(s)
- Audrey M Michel
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | | | - James P Mullan
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| |
Collapse
|
110
|
Verbruggen S, Ndah E, Van Criekinge W, Gessulat S, Kuster B, Wilhelm M, Van Damme P, Menschaert G. PROTEOFORMER 2.0: Further Developments in the Ribosome Profiling-assisted Proteogenomic Hunt for New Proteoforms. Mol Cell Proteomics 2019; 18:S126-S140. [PMID: 31040227 PMCID: PMC6692777 DOI: 10.1074/mcp.ra118.001218] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 04/30/2019] [Indexed: 12/20/2022] Open
Abstract
PROTEOFORMER is a pipeline that enables the automated processing of data derived from ribosome profiling (RIBO-seq, i.e. the sequencing of ribosome-protected mRNA fragments). As such, genome-wide ribosome occupancies lead to the delineation of data-specific translation product candidates and these can improve the mass spectrometry-based identification. Since its first publication, different upgrades, new features and extensions have been added to the PROTEOFORMER pipeline. Some of the most important upgrades include P-site offset calculation during mapping, comprehensive data pre-exploration, the introduction of two alternative proteoform calling strategies and extended pipeline output features. These novelties are illustrated by analyzing ribosome profiling data of human HCT116 and Jurkat data. The different proteoform calling strategies are used alongside one another and in the end combined together with reference sequences from UniProt. Matching mass spectrometry data are searched against this extended search space with MaxQuant. Overall, besides annotated proteoforms, this pipeline leads to the identification and validation of different categories of new proteoforms, including translation products of up- and downstream open reading frames, 5' and 3' extended and truncated proteoforms, single amino acid variants, splice variants and translation products of so-called noncoding regions. Further, proof-of-concept is reported for the improvement of spectrum matching by including Prosit, a deep neural network strategy that adds extra fragmentation spectrum intensity features to the analysis. In the light of ribosome profiling-driven proteogenomics, it is shown that this allows validating the spectrum matches of newly identified proteoforms with elevated stringency. These updates and novel conclusions provide new insights and lessons for the ribosome profiling-based proteogenomic research field. More practical information on the pipeline, raw code, the user manual (README) and explanations on the different modes of availability can be found at the GitHub repository of PROTEOFORMER: https://github.com/Biobix/proteoformer.
Collapse
Affiliation(s)
- Steven Verbruggen
- BioBix, Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modeling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
| | - Elvis Ndah
- BioBix, Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modeling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium; VIB-UGent Center for Medical Biotechnology, Ghent, Belgium
| | - Wim Van Criekinge
- BioBix, Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modeling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium
| | - Siegfried Gessulat
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Munich, Germany; SAP SE, Potsdam, Germany
| | - Bernhard Kuster
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Munich, Germany
| | - Mathias Wilhelm
- Chair of Proteomics and Bioanalytics, Technical University of Munich, Munich, Germany
| | - Petra Van Damme
- VIB-UGent Center for Medical Biotechnology, Ghent, Belgium; Department of Biochemistry and Microbiology, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Gerben Menschaert
- BioBix, Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modeling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.
| |
Collapse
|
111
|
Xiao Z, Huang R, Xing X, Chen Y, Deng H, Yang X. De novo annotation and characterization of the translatome with ribosome profiling data. Nucleic Acids Res 2019. [PMID: 29538776 PMCID: PMC6007384 DOI: 10.1093/nar/gky179] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
By capturing and sequencing the RNA fragments protected by translating ribosomes, ribosome profiling provides snapshots of translation at subcodon resolution. The growing needs for comprehensive annotation and characterization of the context-dependent translatomes are calling for an efficient and unbiased method to accurately recover the signal of active translation from the ribosome profiling data. Here we present our new method, RiboCode, for such purpose. Being tested with simulated and real ribosome profiling data, and validated with cell type-specific QTI-seq and mass spectrometry data, RiboCode exhibits superior efficiency, sensitivity, and accuracy for de novo annotation of the translatome, which covers various types of ORFs in the previously annotated coding and non-coding regions. As an example, RiboCode was applied to assemble the context-specific translatomes of yeast under normal and stress conditions. Comparisons among these translatomes revealed stress-activated novel upstream and downstream ORFs, some of which are associated with translational dysregulations of the annotated main ORFs under the stress conditions.
Collapse
Affiliation(s)
- Zhengtao Xiao
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Rongyao Huang
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Xudong Xing
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China.,Joint Graduate Program of Peking-Tsinghua-National Institute of Biological Science, Tsinghua University, Beijing 100084, China
| | - Yuling Chen
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Haiteng Deng
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Xuerui Yang
- MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing 100084, China.,School of Life Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
112
|
McGillivray P, Ault R, Pawashe M, Kitchen R, Balasubramanian S, Gerstein M. A comprehensive catalog of predicted functional upstream open reading frames in humans. Nucleic Acids Res 2019; 46:3326-3338. [PMID: 29562350 PMCID: PMC6283423 DOI: 10.1093/nar/gky188] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 03/14/2018] [Indexed: 01/09/2023] Open
Abstract
Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans.
Collapse
Affiliation(s)
- Patrick McGillivray
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA
| | - Russell Ault
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA
| | - Mayur Pawashe
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA
| | - Robert Kitchen
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Suganthi Balasubramanian
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA.,Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Mark Gerstein
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, CT 06520, USA.,Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA.,Department of Computer Science, Yale University, New Haven, CT 06520 USA
| |
Collapse
|
113
|
Wang H, Wang Y, Xie Z. Computational resources for ribosome profiling: from database to Web server and software. Brief Bioinform 2019; 20:144-155. [PMID: 28968766 DOI: 10.1093/bib/bbx093] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Indexed: 01/04/2023] Open
Abstract
Ribosome profiling is emerging as a powerful technique that enables genome-wide investigation of in vivo translation at sub-codon resolution. The increasing application of ribosome profiling in recent years has achieved remarkable progress toward understanding the composition, regulation and mechanism of translation. This benefits from not only the awesome power of ribosome profiling but also an extensive range of computational resources available for ribosome profiling. At present, however, a comprehensive review on these resources is still lacking. Here, we survey the recent computational advances guided by ribosome profiling, with a focus on databases, Web servers and software tools for storing, visualizing and analyzing ribosome profiling data. This review is intended to provide experimental and computational biologists with a reference to make appropriate choices among existing resources for the question at hand.
Collapse
Affiliation(s)
- Hongwei Wang
- Zhongshan Ophthalmic Center, Sun Yat-sen University
| | - Yan Wang
- Zhongshan Ophthalmic Center, Sun Yat-sen University
| | - Zhi Xie
- Zhongshan Ophthalmic Center, Sun Yat-sen University
| |
Collapse
|
114
|
McDermott BT, Peffers MJ, McDonagh B, Tew SR. Translational regulation contributes to the secretory response of chondrocytic cells following exposure to interleukin-1β. J Biol Chem 2019; 294:13027-13039. [PMID: 31300557 PMCID: PMC6721953 DOI: 10.1074/jbc.ra118.006865] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 06/12/2019] [Indexed: 01/18/2023] Open
Abstract
Osteoarthritis is a chronic disease characterized by the loss of articular cartilage in synovial joints through a process of extracellular matrix destruction that is strongly associated with inflammatory stimuli. Chondrocytes undergo changes to their protein translational capacity during osteoarthritis, but a study of how disease-relevant signals affect chondrocyte protein translation at the transcriptomic level has not previously been performed. In this study, we describe how the inflammatory cytokine interleukin 1-β (IL-1β) rapidly affects protein translation in the chondrocytic cell line SW1353. Using ribosome profiling we demonstrate that IL-1β induced altered translation of inflammatory-associated transcripts such as NFKB1, TNFAIP2, MMP13, CCL2, and CCL7, as well as a number of ribosome-associated transcripts, through differential translation and the use of multiple open reading frames. Proteomic analysis of the cellular layer and the conditioned media of these cells identified changes in a number of the proteins that were differentially translated. Translationally regulated secreted proteins included a number of chemokines and cytokines, underlining the rapid, translationally mediated inflammatory cascade that is initiated by IL-1β. Although fewer cellular proteins were found to be regulated in both ribosome profiling and proteomic data sets, we did find increased levels of SOD2, indicative of redox changes within SW1353 cells being modulated at the translational level. In conclusion, we have produced combined ribosome profiling and proteomic data sets that provide a valuable resource in understanding the processes that occur during cytokine stimulation of chondrocytic cells.
Collapse
Affiliation(s)
- Benjamin T McDermott
- Department of Musculoskeletal Biology, Institute of Ageing and Chronic Disease, University of Liverpool, William Henry Duncan Building, 6 West Derby Street, Liverpool L7 8TX, United Kingdom.
| | - Mandy J Peffers
- Department of Musculoskeletal Biology, Institute of Ageing and Chronic Disease, University of Liverpool, William Henry Duncan Building, 6 West Derby Street, Liverpool L7 8TX, United Kingdom
| | - Brian McDonagh
- Department of Physiology, School of Medicine, National University of Ireland (NUI), Galway H91 TK33, Ireland
| | - Simon R Tew
- Department of Musculoskeletal Biology, Institute of Ageing and Chronic Disease, University of Liverpool, William Henry Duncan Building, 6 West Derby Street, Liverpool L7 8TX, United Kingdom
| |
Collapse
|
115
|
Singh K, Lin J, Zhong Y, Burčul A, Mohan P, Jiang M, Sun L, Yong-Gonzalez V, Viale A, Cross JR, Hendrickson RC, Rätsch G, Ouyang Z, Wendel HG. c-MYC regulates mRNA translation efficiency and start-site selection in lymphoma. J Exp Med 2019; 216:1509-1524. [PMID: 31142587 PMCID: PMC6605752 DOI: 10.1084/jem.20181726] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 02/22/2019] [Accepted: 04/19/2019] [Indexed: 12/24/2022] Open
Abstract
The oncogenic c-MYC (MYC) transcription factor has broad effects on gene expression and cell behavior. We show that MYC alters the efficiency and quality of mRNA translation into functional proteins. Specifically, MYC drives the translation of most protein components of the electron transport chain in lymphoma cells, and many of these effects are independent from proliferation. Specific interactions of MYC-sensitive RNA-binding proteins (e.g., SRSF1/RBM42) with 5'UTR sequence motifs mediate many of these changes. Moreover, we observe a striking shift in translation initiation site usage. For example, in low-MYC conditions, lymphoma cells initiate translation of the CD19 mRNA from a site in exon 5. This results in the truncation of all extracellular CD19 domains and facilitates escape from CD19-directed CAR-T cell therapy. Together, our findings reveal MYC effects on the translation of key metabolic enzymes and immune receptors in lymphoma cells.
Collapse
Affiliation(s)
- Kamini Singh
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Jianan Lin
- The Jackson Laboratory for Genomic Medicine, Farmington, CT
- Department of Biomedical Engineering, University of Connecticut, Storrs, CT
| | - Yi Zhong
- Computational Biology Department, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Antonija Burčul
- Computational Biology Department, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Prathibha Mohan
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Man Jiang
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Liping Sun
- Integrated Genomics Operation, Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Vladimir Yong-Gonzalez
- Donald B. and Catherine C. Marron Cancer Metabolism Center, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Agnes Viale
- Integrated Genomics Operation, Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Justin R Cross
- Donald B. and Catherine C. Marron Cancer Metabolism Center, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Ronald C Hendrickson
- Proteomics and Microchemistry, Memorial Sloan- Kettering Cancer Center, New York, NY
| | - Gunnar Rätsch
- Computational Biology Department, Memorial Sloan Kettering Cancer Center, New York, NY
- Biomedical Informatics, Department of Computer Science, Swiss Federal Institute of Technology, Zürich, Switzerland
| | - Zhengqing Ouyang
- The Jackson Laboratory for Genomic Medicine, Farmington, CT
- Department of Genetics and Genome Sciences and Institute for System Genomics, University of Connecticut Health Center, Farmington, CT
| | - Hans-Guido Wendel
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY
| |
Collapse
|
116
|
The Translational Landscape of the Human Heart. Cell 2019; 178:242-260.e29. [DOI: 10.1016/j.cell.2019.05.010] [Citation(s) in RCA: 272] [Impact Index Per Article: 45.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 03/01/2019] [Accepted: 05/06/2019] [Indexed: 12/22/2022]
|
117
|
Zou Q, Xiao Z, Huang R, Wang X, Wang X, Zhao H, Yang X. Survey of the translation shifts in hepatocellular carcinoma with ribosome profiling. Theranostics 2019; 9:4141-4155. [PMID: 31281537 PMCID: PMC6592166 DOI: 10.7150/thno.35033] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 04/27/2019] [Indexed: 12/11/2022] Open
Abstract
Despite the critical position of translation in the multilevel gene expression regulation program, high-resolution and genome-wide view of the landscape of RNA translation in solid tumors is still limited. Methods: With a ribosome profiling procedure optimized for solid tissue samples, we profiled the translatomes of liver tumors and their adjacent noncancerous normal liver tissues from 10 patients with hepatocellular carcinoma (HCC). A set of bioinformatics tools was then applied to these data for the mining of novel insights into the translation shifts in HCC. Results: This is the first translatome data resource for dissecting dysregulated translation in HCC at the sub-codon resolution. Based on our data, quantitative comparisons of mRNA translation rates yielded the genes and processes that were subjected to patient specific or universal dysregulations of translation efficiencies in tumors. For example, multiple proteins involved in extracellular matrix organization exhibited significant translational upregulation in tumors. We then experimentally validated the tumor-promoting functions of two such genes as examples: AGRN and VWA1. In addition, the data was also used for de novo annotation of the translatomes in tumors and normal tissues, including multiple types of novel non-canonical small ORFs, which would be a resource for further functional studies. Conclusions: The present study generates the first survey of the HCC translatome with ribosome profiling, which is an insightful data resource for dissecting the translatome shift in liver cancer, at sub-codon resolution.
Collapse
|
118
|
Rodriguez CM, Chun SY, Mills RE, Todd PK. Translation of upstream open reading frames in a model of neuronal differentiation. BMC Genomics 2019; 20:391. [PMID: 31109297 PMCID: PMC6528255 DOI: 10.1186/s12864-019-5775-1] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 05/07/2019] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Upstream open reading frames (uORFs) initiate translation within mRNA 5' leaders, and have the potential to alter main coding sequence (CDS) translation on transcripts in which they reside. Ribosome profiling (RP) studies suggest that translating ribosomes are pervasive within 5' leaders across model systems. However, the significance of this observation remains unclear. To explore a role for uORF usage in a model of neuronal differentiation, we performed RP on undifferentiated and differentiated human neuroblastoma cells. RESULTS Using a spectral coherence algorithm (SPECtre), we identify 4954 consistently translated uORFs across 31% of all neuroblastoma transcripts. These uORFs predominantly utilize non-AUG initiation codons and exhibit translational efficiencies (TE) comparable to annotated coding regions. On a population basis, the global impact of both AUG and non-AUG initiated uORFs on basal CDS translation were small, even when analysis is limited to conserved and consistently translated uORFs. However, uORFs did alter the translation of a subset of genes, including the Diamond-Blackfan Anemia associated ribosomal gene RPS24. With retinoic acid induced differentiation, we observed an overall positive correlation in translational shifts between uORF/CDS pairs. However, CDSs downstream of uORFs show smaller shifts in TE with differentiation relative to CDSs without a predicted uORF, suggesting that uORF translation buffers cell state dependent fluctuations in CDS translation. CONCLUSION This work provides insights into the dynamic relationships and potential regulatory functions of uORF/CDS pairs in a model of neuronal differentiation.
Collapse
Affiliation(s)
- Caitlin M Rodriguez
- Department of Neurology, University of Michigan, Ann Arbor, MI, USA
- Neuroscience Graduate Program, University of Michigan, Ann Arbor, MI, USA
| | - Sang Y Chun
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, 48109, USA.
| | - Peter K Todd
- Department of Neurology, University of Michigan, Ann Arbor, MI, USA.
- VA Ann Arbor Healthcare System, Ann Arbor, MI, USA.
| |
Collapse
|
119
|
Ingolia NT, Hussmann JA, Weissman JS. Ribosome Profiling: Global Views of Translation. Cold Spring Harb Perspect Biol 2019; 11:cshperspect.a032698. [PMID: 30037969 DOI: 10.1101/cshperspect.a032698] [Citation(s) in RCA: 187] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The translation of messenger RNA (mRNA) into protein and the folding of the resulting protein into an active form are prerequisites for virtually every cellular process and represent the single largest investment of energy by cells. Ribosome profiling-based approaches have revolutionized our ability to monitor every step of protein synthesis in vivo, allowing one to measure the rate of protein synthesis across the proteome, annotate the protein coding capacity of genomes, monitor localized protein synthesis, and explore cotranslational folding and targeting. The rich and quantitative nature of ribosome profiling data provides an unprecedented opportunity to explore and model complex cellular processes. New analytical techniques and improved experimental protocols will provide a deeper understanding of the factors controlling translation speed and its impact on protein function and cell physiology as well as the role of ribosomal RNA and mRNA modifications in regulating translation.
Collapse
Affiliation(s)
- Nicholas T Ingolia
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720
| | - Jeffrey A Hussmann
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, California 94158.,Howard Hughes Medical Institute, San Francisco, California 94158
| | - Jonathan S Weissman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, California 94158.,Howard Hughes Medical Institute, San Francisco, California 94158
| |
Collapse
|
120
|
Weaver J, Mohammad F, Buskirk AR, Storz G. Identifying Small Proteins by Ribosome Profiling with Stalled Initiation Complexes. mBio 2019; 10:e02819-18. [PMID: 30837344 PMCID: PMC6401488 DOI: 10.1128/mbio.02819-18] [Citation(s) in RCA: 126] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Accepted: 01/24/2019] [Indexed: 11/20/2022] Open
Abstract
Small proteins consisting of 50 or fewer amino acids have been identified as regulators of larger proteins in bacteria and eukaryotes. Despite the importance of these molecules, the total number of small proteins remains unknown because conventional annotation pipelines usually exclude small open reading frames (smORFs). We previously identified several dozen small proteins in the model organism Escherichia coli using theoretical bioinformatic approaches based on sequence conservation and matches to canonical ribosome binding sites. Here, we present an empirical approach for discovering new proteins, taking advantage of recent advances in ribosome profiling in which antibiotics are used to trap newly initiated 70S ribosomes at start codons. This approach led to the identification of many novel initiation sites in intergenic regions in E. coli We tagged 41 smORFs on the chromosome and detected protein synthesis for all but three. Not only are the corresponding genes intergenic but they are also found antisense to other genes, in operons, and overlapping other open reading frames (ORFs), some impacting the translation of larger downstream genes. These results demonstrate the utility of this method for identifying new genes, regardless of their genomic context.IMPORTANCE Proteins comprised of 50 or fewer amino acids have been shown to interact with and modulate the functions of larger proteins in a range of organisms. Despite the possible importance of small proteins, the true prevalence and capabilities of these regulators remain unknown as the small size of the proteins places serious limitations on their identification, purification, and characterization. Here, we present a ribosome profiling approach with stalled initiation complexes that led to the identification of 38 new small proteins.
Collapse
Affiliation(s)
- Jeremy Weaver
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland, USA
| | - Fuad Mohammad
- Department of Molecular Biology and Genetics, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Allen R Buskirk
- Department of Molecular Biology and Genetics, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Gisela Storz
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland, USA
| |
Collapse
|
121
|
Abstract
INTRODUCTION Small open reading frames (sORFs) with potential protein-coding capacity have been disclosed in various transcripts, including long noncoding RNAs (LncRNAs), mRNAs (5'-upstream, coding domain, and 3'-downstream), circular RNAs, pri-miRNAs, and ribosomal RNAs (rRNAs). Recent characterization of several sORF-encoded peptides (SEPs or micropeptides) revealed their important roles in many fundamental biological processes in a broad range of species from yeast to human. The success in the mining of micropeptides attributes to the advanced bioinformatics and high-throughput sequencing techniques. Areas covered: sORFs and SEPs were overlooked for their tiny size and the difficulty of identification by bioinformatics analyses. With more and more sORFs and SEPs have been identified, this field has attracted more attention. This review covers recent advances in the strategies for the detection and identification of sORFs and SEPs. Expert commentary: The advantages and drawbacks of the strategies for detection and identification of sORFs and SEPs are discussed, as well as the techniques that are used to decipher the roles of micropeptides in organisms are described.
Collapse
Affiliation(s)
- Xinqiang Yin
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,b The Basic Medical School , North Sichuan Medical College , Nanchong , China
| | - Yuanyuan Jing
- c Department of Preventive Medicine , North Sichuan Medical College , Nanchong , China
| | - Hanmei Xu
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,d State Key Laboratory of Natural Medicines, Ministry of Education , China Pharmaceutical University , Nanjing , China
| |
Collapse
|
122
|
Wei J, Kishton RJ, Angel M, Conn CS, Dalla-Venezia N, Marcel V, Vincent A, Catez F, Ferré S, Ayadi L, Marchand V, Dersh D, Gibbs JS, Ivanov IP, Fridlyand N, Couté Y, Diaz JJ, Qian SB, Staudt LM, Restifo NP, Yewdell JW. Ribosomal Proteins Regulate MHC Class I Peptide Generation for Immunosurveillance. Mol Cell 2019; 73:1162-1173.e5. [PMID: 30712990 DOI: 10.1016/j.molcel.2018.12.020] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 10/29/2018] [Accepted: 12/21/2018] [Indexed: 02/07/2023]
Abstract
The MHC class I antigen presentation system enables T cell immunosurveillance of cancers and viruses. A substantial fraction of the immunopeptidome derives from rapidly degraded nascent polypeptides (DRiPs). By knocking down each of the 80 ribosomal proteins, we identified proteins that modulate peptide generation without altering source protein expression. We show that 60S ribosomal proteins L6 (RPL6) and RPL28, which are adjacent on the ribosome, play opposite roles in generating an influenza A virus-encoded peptide. Depleting RPL6 decreases ubiquitin-dependent peptide presentation, whereas depleting RPL28 increases ubiquitin-dependent and -independent peptide presentation. 40S ribosomal protein S28 (RPS28) knockdown increases total peptide supply in uninfected cells by increasing DRiP synthesis from non-canonical translation of "untranslated" regions and non-AUG start codons and sensitizes tumor cells for T cell targeting. Our findings raise the possibility of modulating immunosurveillance by pharmaceutical targeting ribosomes.
Collapse
MESH Headings
- Animals
- Antigen Presentation
- Cell Line, Tumor
- Coculture Techniques
- HEK293 Cells
- Histocompatibility Antigens Class I/biosynthesis
- Histocompatibility Antigens Class I/immunology
- Host-Pathogen Interactions
- Humans
- Immunologic Surveillance
- Influenza A virus/immunology
- Influenza A virus/pathogenicity
- Melanoma/immunology
- Melanoma/metabolism
- Mice, Inbred C57BL
- Mice, Transgenic
- Ribosomal Proteins/genetics
- Ribosomal Proteins/metabolism
- Ribosome Subunits, Large, Eukaryotic/genetics
- Ribosome Subunits, Large, Eukaryotic/metabolism
- Ribosome Subunits, Small, Eukaryotic/genetics
- Ribosome Subunits, Small, Eukaryotic/metabolism
- Skin Neoplasms/immunology
- Skin Neoplasms/metabolism
- T-Lymphocytes/immunology
- T-Lymphocytes/metabolism
- T-Lymphocytes/virology
Collapse
Affiliation(s)
- Jiajie Wei
- Laboratory of Viral Diseases, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD 20892, USA.
| | | | - Matthew Angel
- Laboratory of Viral Diseases, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD 20892, USA
| | - Crystal S Conn
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Nicole Dalla-Venezia
- University of Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Center Léon Bérard, Center de Recherche en Cancérologie de Lyon, Lyon, 69373 Cedex 08, France
| | - Virginie Marcel
- University of Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Center Léon Bérard, Center de Recherche en Cancérologie de Lyon, Lyon, 69373 Cedex 08, France
| | - Anne Vincent
- University of Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Center Léon Bérard, Center de Recherche en Cancérologie de Lyon, Lyon, 69373 Cedex 08, France
| | - Frédéric Catez
- University of Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Center Léon Bérard, Center de Recherche en Cancérologie de Lyon, Lyon, 69373 Cedex 08, France
| | - Sabrina Ferré
- University of Grenoble Alpes, CEA, INSERM, BIG-BGE, 38000 Grenoble, France
| | - Lilia Ayadi
- Next-Generation Sequencing Core Facility, UMS2008 IBSLor CNRS-INSERM-University of Lorraine, 54505 Vandoeuvre-les-Nancy, France; Laboratory IMoPA, UMR7365 CNRS-University of Lorraine, 54505 Vandoeuvre-les-Nancy, France
| | - Virginie Marchand
- Next-Generation Sequencing Core Facility, UMS2008 IBSLor CNRS-INSERM-University of Lorraine, 54505 Vandoeuvre-les-Nancy, France; Laboratory IMoPA, UMR7365 CNRS-University of Lorraine, 54505 Vandoeuvre-les-Nancy, France
| | - Devin Dersh
- Laboratory of Viral Diseases, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD 20892, USA
| | - James S Gibbs
- Laboratory of Viral Diseases, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD 20892, USA
| | - Ivaylo P Ivanov
- Laboratory of Gene Regulation and Development, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892, USA
| | - Nathan Fridlyand
- Laboratory of Translational Biology, School of Biosciences and Biotechnology, University of Camerino, Camerino MC 62032, Italy
| | - Yohann Couté
- University of Grenoble Alpes, CEA, INSERM, BIG-BGE, 38000 Grenoble, France
| | - Jean-Jacques Diaz
- University of Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Center Léon Bérard, Center de Recherche en Cancérologie de Lyon, Lyon, 69373 Cedex 08, France
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Louis M Staudt
- Lymphoid Malignancies Branch, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - Nicholas P Restifo
- National Cancer Institute, NIH, Bethesda, MD 20892, USA; Center for Cell-Based Therapy, Center for Cancer Research, NIH, Bethesda, MD 20892, USA
| | - Jonathan W Yewdell
- Laboratory of Viral Diseases, National Institute of Allergy and Infectious Diseases, NIH, Bethesda, MD 20892, USA.
| |
Collapse
|
123
|
Translation of Small Open Reading Frames: Roles in Regulation and Evolutionary Innovation. Trends Genet 2018; 35:186-198. [PMID: 30606460 DOI: 10.1016/j.tig.2018.12.003] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 12/07/2018] [Indexed: 01/01/2023]
Abstract
The translatome can be defined as the sum of the RNA sequences that are translated into proteins in the cell by the ribosomal machinery. Until recently, it was generally assumed that the translatome was essentially restricted to evolutionary conserved proteins encoded by the set of annotated protein-coding genes. However, it has become increasingly clear that it also includes small regulatory open reading frames (ORFs), functional micropeptides, de novo proteins, and the pervasive translation of likely nonfunctional proteins. Many of these ORFs have been discovered thanks to the development of ribosome profiling, a technique to sequence ribosome-protected RNA fragments. To fully capture the diversity of translated ORFs, we propose a comprehensive classification that includes the new types of translated ORFs in addition to standard proteins.
Collapse
|
124
|
Zhu S, Wang J, He Y, Meng N, Yan GR. Peptides/Proteins Encoded by Non-coding RNA: A Novel Resource Bank for Drug Targets and Biomarkers. Front Pharmacol 2018; 9:1295. [PMID: 30483132 PMCID: PMC6243196 DOI: 10.3389/fphar.2018.01295] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Accepted: 10/22/2018] [Indexed: 12/13/2022] Open
Abstract
Non-coding RNAs (ncRNAs) are defined as RNA molecules that do not encode proteins, but recent evidence has proven that peptides/proteins encoded by ncRNAs do indeed exist and usually contain less than 100 amino acids. These peptides/proteins play an important role in regulating tumor energy metabolism, epithelial to mesenchymal transition of cancer cells, the stability of the c-Myc oncoprotein, and the ubiquitination and degradation of proliferating cell nuclear antigen (PCNA). These peptides/proteins represent promising drug targets for fighting against tumor growth or biomarkers for predicting the prognosis of cancer patients. In this review, we summarize the characteristics of peptides/proteins that have recently been identified as putative ncRNA translation products and their outlook for small molecule peptide drugs, drug targets, and biomarkers.
Collapse
Affiliation(s)
- Song Zhu
- Biomedicine Research Center, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.,Key Laboratory of Protein Modification and Degradation, Guangzhou Medical University, Guangzhou, China
| | - Jizhong Wang
- Biomedicine Research Center, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.,Key Laboratory of Protein Modification and Degradation, Guangzhou Medical University, Guangzhou, China
| | - Yutian He
- Biomedicine Research Center, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.,Key Laboratory of Protein Modification and Degradation, Guangzhou Medical University, Guangzhou, China
| | - Nan Meng
- Biomedicine Research Center, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.,Key Laboratory of Protein Modification and Degradation, Guangzhou Medical University, Guangzhou, China
| | - Guang-Rong Yan
- Biomedicine Research Center, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.,Key Laboratory of Protein Modification and Degradation, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
125
|
Zhou Y, Koelling N, Fenwick AL, McGowan SJ, Calpena E, Wall SA, Smithson SF, Wilkie AO, Twigg SR. Disruption of TWIST1 translation by 5' UTR variants in Saethre-Chotzen syndrome. Hum Mutat 2018; 39:1360-1365. [PMID: 30040876 PMCID: PMC6175480 DOI: 10.1002/humu.23598] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 07/15/2018] [Accepted: 07/22/2018] [Indexed: 11/24/2022]
Abstract
Saethre-Chotzen syndrome (SCS), one of the most common forms of syndromic craniosynostosis (premature fusion of the cranial sutures), results from haploinsufficiency of TWIST1, caused by deletions of the entire gene or loss-of-function variants within the coding region. To determine whether non-coding variants also contribute to SCS, we screened 14 genetically undiagnosed SCS patients using targeted capture sequencing, and identified novel single nucleotide variants (SNVs) in the 5' untranslated region (UTR) of TWIST1 in two unrelated SCS cases. We show experimentally that these variants, which create translation start sites in the TWIST1 leader sequence, reduce translation from the main open reading frame (mORF). This is the first demonstration that non-coding SNVs of TWIST1 can cause SCS, and highlights the importance of screening the 5' UTR in clinically diagnosed SCS patients without a coding mutation. Similar 5' UTR variants, particularly of haploinsufficient genes, may represent an under-ascertained cause of monogenic disease.
Collapse
Affiliation(s)
- Yan Zhou
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| | - Nils Koelling
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| | - Aimée L. Fenwick
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| | - Simon J. McGowan
- Analysis, Visualisation and Informatics Group, MRC WIMM Centre for Computational BiologyMRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| | - Eduardo Calpena
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| | - Steven A. Wall
- Craniofacial Unit, Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Foundation Trust, John Radcliffe HospitalUniversity of OxfordOxfordUK
| | - Sarah F. Smithson
- Department of Clinical Genetics, St Michaels Hospital & School of Clinical SciencesUniversity of BristolBristolUK
| | - Andrew O.M. Wilkie
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
- Craniofacial Unit, Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Foundation Trust, John Radcliffe HospitalUniversity of OxfordOxfordUK
| | - Stephen R.F. Twigg
- Clinical Genetics Group, MRC Weatherall Institute of Molecular MedicineUniversity of OxfordOxfordUK
| |
Collapse
|
126
|
Kumari R, Michel AM, Baranov PV. PausePred and Rfeet: webtools for inferring ribosome pauses and visualizing footprint density from ribosome profiling data. RNA (NEW YORK, N.Y.) 2018; 24:1297-1304. [PMID: 30049792 PMCID: PMC6140459 DOI: 10.1261/rna.065235.117] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 07/23/2018] [Indexed: 05/25/2023]
Abstract
The process of translation is characterized by irregularities in the local decoding rates of specific mRNA codons. This includes the occurrences of long pauses that can take place when ribosomes decode certain peptide sequences, encounter strong RNA secondary structures, or decode "hungry" codons. Examples are known where such pausing or stalling is used for regulating protein synthesis. This can be achieved at the level of translation via direct alteration of ribosome progression through mRNA or by altering mRNA stability via NoGo decay. Ribosome pausing has also been implicated in the cotranslational folding of proteins. Ribosome profiling data often are used for inferring the locations of ribosome pauses. However, no dedicated online software is available for this purpose. Here we present PausePred (https://pausepred.ucc.ie/), which can be used to infer ribosome pauses from ribosome profiling (Ribo-seq) data. Peaks of ribosome footprint density are scored based on their magnitude relative to the background density within the surrounding area. The scoring allows the comparison of peaks across the transcriptome or genome. In addition to the score, PausePred reports the coordinates of the pause, the footprint density at the pause site, and the surrounding nucleotide sequence. The pauses can be visualized in the context of Ribo-seq and RNA-seq density plots generated for specific transcripts or genomic regions with the Rfeet tool. PausePred does not require input on the location of protein coding ORFs (although gene annotations can be optionally supplied). As a result, it can be used universally and its output does not depend on ever evolving annotations.
Collapse
Affiliation(s)
- Romika Kumari
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| | - Audrey M Michel
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, Western Gateway Building, University College Cork, Cork, Ireland
| |
Collapse
|
127
|
Ji Z. RibORF: Identifying Genome-Wide Translated Open Reading Frames Using Ribosome Profiling. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 2018; 124:e67. [PMID: 30178897 PMCID: PMC6168376 DOI: 10.1002/cpmb.67] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Ribosome profiling identifies RNA fragments associated with translating ribosomes. The technology provides an opportunity to examine genome-wide translation events at single-nucleotide resolution and in an unbiased manner. Here I present a computational pipeline named RibORF to systematically identify translated open reading frames (ORFs), based on read distribution features representing active translation, including 3-nt periodicity and uniformness across codons. Analyses using the computational tool revealed pervasive translation in putative 'noncoding' regions, such as lncRNAs, pseudogenes, and 5'UTRs. The computational tool is useful for studying functional roles of non-canonical translation events in various biological processes. © 2018 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Zhe Ji
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA. Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL 60208, USA
| |
Collapse
|
128
|
Lauria F, Tebaldi T, Bernabò P, Groen EJN, Gillingwater TH, Viero G. riboWaltz: Optimization of ribosome P-site positioning in ribosome profiling data. PLoS Comput Biol 2018; 14:e1006169. [PMID: 30102689 PMCID: PMC6112680 DOI: 10.1371/journal.pcbi.1006169] [Citation(s) in RCA: 133] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 08/28/2018] [Accepted: 04/30/2018] [Indexed: 12/13/2022] Open
Abstract
Ribosome profiling is a powerful technique used to study translation at the genome-wide level, generating unique information concerning ribosome positions along RNAs. Optimal localization of ribosomes requires the proper identification of the ribosome P-site in each ribosome protected fragment, a crucial step to determine the trinucleotide periodicity of translating ribosomes, and draw correct conclusions concerning where ribosomes are located. To determine the P-site within ribosome footprints at nucleotide resolution, the precise estimation of its offset with respect to the protected fragment is necessary. Here we present riboWaltz, an R package for calculation of optimal P-site offsets, diagnostic analysis and visual inspection of ribosome profiling data. Compared to existing tools, riboWaltz shows improved accuracies for P-site estimation and neat ribosome positioning in multiple case studies. riboWaltz was implemented in R and is available as an R package at https://github.com/LabTranslationalArchitectomics/RiboWaltz.
Collapse
Affiliation(s)
- Fabio Lauria
- Institute of Biophysics, CNR Unit at Trento, Trento, Italy
- * E-mail: (FL); (GV)
| | - Toma Tebaldi
- Centre for Integrative Biology, University of Trento, Trento, Italy
| | - Paola Bernabò
- Institute of Biophysics, CNR Unit at Trento, Trento, Italy
| | - Ewout J. N. Groen
- Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
- Centre for Integrative Physiology, University of Edinburgh, Edinburgh, United Kingdom
| | - Thomas H. Gillingwater
- Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
- Centre for Integrative Physiology, University of Edinburgh, Edinburgh, United Kingdom
| | - Gabriella Viero
- Institute of Biophysics, CNR Unit at Trento, Trento, Italy
- * E-mail: (FL); (GV)
| |
Collapse
|
129
|
Zhang H, Dou S, He F, Luo J, Wei L, Lu J. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. PLoS Biol 2018; 16:e2003903. [PMID: 30028832 PMCID: PMC6070289 DOI: 10.1371/journal.pbio.2003903] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 08/01/2018] [Accepted: 07/03/2018] [Indexed: 11/19/2022] Open
Abstract
Upstream open reading frames (uORFs) play important roles in regulating the main coding DNA sequences (CDSs) via translational repression. Despite their prevalence in the genomes, uORFs are overall discriminated against by natural selection. However, it remains unclear why in the genomes there are so many uORFs more conserved than expected under the assumption of neutral evolution. Here, we generated genome-wide maps of translational efficiency (TE) at the codon level throughout the life cycle of Drosophila melanogaster. We identified 35,735 uORFs that were expressed, and 32,224 (90.2%) of them showed evidence of ribosome occupancy during Drosophila development. The ribosome occupancy of uORFs is determined by genomic features, such as optimized sequence contexts around their start codons, a shorter distance to CDSs, and higher coding potentials. Our population genomic analysis suggests the segregating mutations that create or disrupt uORFs are overall deleterious in D. melanogaster. However, we found for the first time that many (68.3% of) newly fixed uORFs that are associated with ribosomes in D. melanogaster are driven by positive Darwinian selection. Our findings also suggest that uORFs play a vital role in controlling the translational program in Drosophila. Moreover, we found that many uORFs are transcribed or translated in a developmental stage-, sex-, or tissue-specific manner, suggesting that selective transcription or translation of uORFs could potentially modulate the TE of the downstream CDSs during Drosophila development. Upstream open reading frames (uORFs) in the 5′ untranslated regions (UTRs) of messenger RNAs can potentially inhibit translation of the downstream regions that encode proteins by sequestering protein-making machinery the ribosome. Moreover, mutations that destroy existing uORFs or create new ones are known to cause human disease. Although mutations that create new uORFs are generally deleterious and are selected against, many uORFs are evolutionarily conserved across eukaryotic species. To resolve this dilemma, we used extensive mRNA-Seq and ribosome profiling to generate high-resolution genome-wide maps of ribosome occupancy and translational efficiency (TE) during the life cycle of the fruit fly D. melanogaster. This allowed us to identify the sequence features of uORFs that influence their ability to associate with ribosomes. We demonstrate for the first time that the majority of the newly fixed uORFs in D. melanogaster, especially the translated ones, are under positive Darwinian selection. We also show that uORFs exert widespread repressive effects on the translation of the downstream protein-coding region. We find that many uORFs are transcribed or translated in a developmental stage-, sex-, or tissue-specific manner. Our results suggest that during Drosophila development, changes in the TE of uORFs, as well as the inclusion/exclusion of uORFs, are frequently exploited to inversely influence the translation of the downstream protein-coding regions. Our study provides novel insights into the molecular mechanisms and functional consequences of uORF-mediated regulation.
Collapse
Affiliation(s)
- Hong Zhang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Shengqian Dou
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Feng He
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Junjie Luo
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Liping Wei
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- * E-mail:
| |
Collapse
|
130
|
Dermit M, Dodel M, Mardakheh FK. Methods for monitoring and measurement of protein translation in time and space. MOLECULAR BIOSYSTEMS 2018; 13:2477-2488. [PMID: 29051942 PMCID: PMC5795484 DOI: 10.1039/c7mb00476a] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Regulation of protein translation constitutes a crucial step in control of gene expression. Here we review recent methods for system-wide monitoring and measurement of protein translation.
Regulation of protein translation constitutes a crucial step in control of gene expression. In comparison to transcriptional regulation, however, translational control has remained a significantly under-studied layer of gene expression. This trend is now beginning to shift thanks to recent advances in next-generation sequencing, proteomics, and microscopy based methodologies which allow accurate monitoring of protein translation rates, from single target messenger RNA molecules to genome-wide scale studies. In this review, we summarize these recent advances, and discuss how they are enabling researchers to study translational regulation in a wide variety of in vitro and in vivo biological systems, with unprecedented depth and spatiotemporal resolution.
Collapse
Affiliation(s)
- Maria Dermit
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, Charterhouse Square, London EC1M 6BQ, UK.
| | - Martin Dodel
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, Charterhouse Square, London EC1M 6BQ, UK.
| | - Faraz K Mardakheh
- Centre for Molecular Oncology, Barts Cancer Institute, Queen Mary University of London, John Vane Science Centre, Charterhouse Square, London EC1M 6BQ, UK.
| |
Collapse
|
131
|
Lim CS, T. Wardell SJ, Kleffmann T, Brown CM. The exon-intron gene structure upstream of the initiation codon predicts translation efficiency. Nucleic Acids Res 2018; 46:4575-4591. [PMID: 29684192 PMCID: PMC5961209 DOI: 10.1093/nar/gky282] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 03/28/2018] [Accepted: 04/06/2018] [Indexed: 12/16/2022] Open
Abstract
Introns in mRNA leaders are common in complex eukaryotes, but often overlooked. These introns are spliced out before translation, leaving exon-exon junctions in the mRNA leaders (leader EEJs). Our multi-omic approach shows that the number of leader EEJs inversely correlates with the main protein translation, as does the number of upstream open reading frames (uORFs). Across the five species studied, the lowest levels of translation were observed for mRNAs with both leader EEJs and uORFs (29%). This class of mRNAs also have ribosome footprints on uORFs, with strong triplet periodicity indicating uORF translation. Furthermore, the positions of both leader EEJ and uORF are conserved between human and mouse. Thus, the uORF, in combination with leader EEJ predicts lower expression for nearly one-third of eukaryotic proteins.
Collapse
Affiliation(s)
- Chun Shen Lim
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Samuel J T. Wardell
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Torsten Kleffmann
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Chris M Brown
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| |
Collapse
|
132
|
Battling for Ribosomes: Translational Control at the Forefront of the Antiviral Response. J Mol Biol 2018; 430:1965-1992. [PMID: 29746850 DOI: 10.1016/j.jmb.2018.04.040] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2018] [Revised: 04/24/2018] [Accepted: 04/27/2018] [Indexed: 01/05/2023]
Abstract
In the early stages of infection, gaining control of the cellular protein synthesis machinery including its ribosomes is the ultimate combat objective for a virus. To successfully replicate, viruses unequivocally need to usurp and redeploy this machinery for translation of their own mRNA. In response, the host triggers global shutdown of translation while paradoxically allowing swift synthesis of antiviral proteins as a strategy to limit collateral damage. This fundamental conflict at the level of translational control defines the outcome of infection. As part of this special issue on molecular mechanisms of early virus-host cell interactions, we review the current state of knowledge regarding translational control during viral infection with specific emphasis on protein kinase RNA-activated and mammalian target of rapamycin-mediated mechanisms. We also describe recent technological advances that will allow unprecedented insight into how viruses and host cells battle for ribosomes.
Collapse
|
133
|
Eastman G, Smircich P, Sotelo-Silveira JR. Following Ribosome Footprints to Understand Translation at a Genome Wide Level. Comput Struct Biotechnol J 2018; 16:167-176. [PMID: 30069283 PMCID: PMC6066590 DOI: 10.1016/j.csbj.2018.04.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 04/06/2018] [Accepted: 04/10/2018] [Indexed: 12/11/2022] Open
Abstract
Protein translation is a key step in gene expression. The development of Ribosome Profiling has allowed the global analysis of this process at sub-codon resolution. In the last years the method has been applied to several models ranging from bacteria to mammalian cells yielding a surprising amount of insight on the mechanism and the regulation of translation. In this review we describe the key aspects of the experimental protocol and comment on the main conclusions raised in different models.
Collapse
Affiliation(s)
- Guillermo Eastman
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Av. Italia 3318, Montevideo, CP 11600, Uruguay
| | - Pablo Smircich
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Av. Italia 3318, Montevideo, CP 11600, Uruguay
- Laboratory of Molecular Interactions, Facultad de Ciencias, Universidad de la República, Iguá 4225, Montevideo, CP 11400, Uruguay
| | - José R. Sotelo-Silveira
- Department of Genomics, Instituto de Investigaciones Biológicas Clemente Estable, MEC, Av. Italia 3318, Montevideo, CP 11600, Uruguay
- Department of Cell and Molecular Biology, Facultad de Ciencias, Universidad de la República, Iguá 4225, Montevideo, CP 11400, Uruguay
| |
Collapse
|
134
|
Finkel Y, Stern‐Ginossar N, Schwartz M. Viral Short ORFs and Their Possible Functions. Proteomics 2018; 18:e1700255. [PMID: 29150926 PMCID: PMC7167739 DOI: 10.1002/pmic.201700255] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Revised: 11/06/2017] [Indexed: 12/30/2022]
Abstract
Definition of functional genomic elements is one of the greater challenges of the genomic era. Traditionally, putative short open reading frames (sORFs) coding for less than 100 amino acids were disregarded due to computational and experimental limitations; however, it has become clear over the past several years that translation of sORFs is pervasive and serves diverse functions. The development of ribosome profiling, allowing identification of translated sequences genome wide, revealed wide spread, previously unidentified translation events. New computational methodologies as well as improved mass spectrometry approaches also contributed to the task of annotating translated sORFs in different organisms. Viruses are of special interest due to the selective pressure on their genome size, their rapid and confining evolution, and the potential contribution of novel peptides to the host immune response. Indeed, many functional viral sORFs were characterized to date, and ribosome profiling analyses suggest that this may be the tip of the iceberg. Our computational analyses of sORFs identified by ribosome profiling in DNA viruses demonstrate that they may be enriched in specific features implying that at least some of them are functional. Combination of systematic genome editing strategies with synthetic tagging will take us into the next step-elucidation of the biological relevance and function of this intriguing class of molecules.
Collapse
Affiliation(s)
- Yaara Finkel
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| | | | - Michal Schwartz
- Department of Molecular GeneticsWeizmann Institute of ScienceRehovotIsrael
| |
Collapse
|
135
|
Erhard F, Halenius A, Zimmermann C, L’Hernault A, Kowalewski DJ, Weekes MP, Stevanovic S, Zimmer R, Dölken L. Improved Ribo-seq enables identification of cryptic translation events. Nat Methods 2018; 15:363-366. [PMID: 29529017 PMCID: PMC6152898 DOI: 10.1038/nmeth.4631] [Citation(s) in RCA: 140] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 12/22/2017] [Indexed: 02/06/2023]
Abstract
Ribosome profiling has been used to predict thousands of short open reading frames (sORFs) in eukaryotic cells, but it suffers from substantial levels of noise. PRICE (https://github.com/erhard-lab/price) is a computational method that models experimental noise to enable researchers to accurately resolve overlapping sORFs and noncanonical translation initiation. We experimentally validated translation using major histocompatibility complex class I (MHC I) peptidomics and observed that sORF-derived peptides efficiently enter the MHC I presentation pathway and thus constitute a substantial fraction of the antigen repertoire.
Collapse
Affiliation(s)
- Florian Erhard
- Institute for Informatics, Ludwig-Maximilians-Universität München, Amalienstraße 17, 80333 München, Germany
- Institute for Virology and Immunobiology, Julius-Maximilians-Universität Würzburg, Versbacher Straße 7, 97078 Würzburg, Germany
| | - Anne Halenius
- Institute of Virology, Medical Center, University of Freiburg, Hermann-Herder-Straße 11, 79104 Freiburg, Germany
- Faculty of Medicine, University of Freiburg, Hermann-Herder-Straße 11, 79104 Freiburg, Germany
| | - Cosima Zimmermann
- Institute of Virology, Medical Center, University of Freiburg, Hermann-Herder-Straße 11, 79104 Freiburg, Germany
- Faculty of Medicine, University of Freiburg, Hermann-Herder-Straße 11, 79104 Freiburg, Germany
| | - Anne L’Hernault
- AstraZeneca UK Ltd, Innovative Medicines & Early Development, Cambridge Science Park, Milton Road, Cambridge, CB4 0WG, UK
| | - Daniel J. Kowalewski
- Interfaculty Institute for Cell Biology, Department of Immunology, University of Tübingen Auf der Morgenstelle 15, 72076 Tübingen, Germany
- Immatics Biotechnologies GmbH, Tübingen, Germany
| | - Michael P. Weekes
- Cambridge Institute for Medical Research, University of Cambridge, Hills Road, CB20XY Cambridge, United Kingdom
| | - Stefan Stevanovic
- Interfaculty Institute for Cell Biology, Department of Immunology, University of Tübingen Auf der Morgenstelle 15, 72076 Tübingen, Germany
| | - Ralf Zimmer
- Institute for Informatics, Ludwig-Maximilians-Universität München, Amalienstraße 17, 80333 München, Germany
| | - Lars Dölken
- Institute for Virology and Immunobiology, Julius-Maximilians-Universität Würzburg, Versbacher Straße 7, 97078 Würzburg, Germany
| |
Collapse
|
136
|
Yeasmin F, Yada T, Akimitsu N. Micropeptides Encoded in Transcripts Previously Identified as Long Noncoding RNAs: A New Chapter in Transcriptomics and Proteomics. Front Genet 2018; 9:144. [PMID: 29922328 PMCID: PMC5996887 DOI: 10.3389/fgene.2018.00144] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 04/09/2018] [Indexed: 11/13/2022] Open
Abstract
Integrative analysis using omics-based technologies results in the identification of a large number of putative short open reading frames (sORFs) with protein-coding capacity within transcripts previously identified as long noncoding RNAs (lncRNAs) or transcripts of unknown function (TUFs). sORFs were previously overlooked because of their diminutive size and the difficulty of identification by bioinformatics analyses. There is now growing evidence of the existence of potentially functional micropeptides produced from sORFs within cells of diverse species. Recent characterization of a few of these revealed their significant divergent roles in many fundamental biological processes, where some also show important relationships with pathogenesis. Recent works therefore provide new insights for exploring the wealth of information that may lie within sORF-encoded short proteins. Here, we summarize the current progress and view of micropeptides encoded in sORFs of protein-coding genes.
Collapse
Affiliation(s)
- Fouzia Yeasmin
- Isotope Science Centre, The University of Tokyo, Tokyo, Japan
| | - Tetsushi Yada
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Fukuoka, Japan
| | | |
Collapse
|
137
|
Brunet MA, Levesque SA, Hunting DJ, Cohen AA, Roucou X. Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship. Genome Res 2018; 28:609-624. [PMID: 29626081 PMCID: PMC5932603 DOI: 10.1101/gr.230938.117] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 03/27/2018] [Indexed: 12/12/2022]
Abstract
Technological advances promise unprecedented opportunities for whole exome sequencing and proteomic analyses of populations. Currently, data from genome and exome sequencing or proteomic studies are searched against reference genome annotations. This provides the foundation for research and clinical screening for genetic causes of pathologies. However, current genome annotations substantially underestimate the proteomic information encoded within a gene. Numerous studies have now demonstrated the expression and function of alternative (mainly small, sometimes overlapping) ORFs within mature gene transcripts. This has important consequences for the correlation of phenotypes and genotypes. Most alternative ORFs are not yet annotated because of a lack of evidence, and this absence from databases precludes their detection by standard proteomic methods, such as mass spectrometry. Here, we demonstrate how current approaches tend to overlook alternative ORFs, hindering the discovery of new genetic drivers and fundamental research. We discuss available tools and techniques to improve identification of proteins from alternative ORFs and finally suggest a novel annotation system to permit a more complete representation of the transcriptomic and proteomic information contained within a gene. Given the crucial challenge of distinguishing functional ORFs from random ones, the suggested pipeline emphasizes both experimental data and conservation signatures. The addition of alternative ORFs in databases will render identification less serendipitous and advance the pace of research and genomic knowledge. This review highlights the urgent medical and research need to incorporate alternative ORFs in current genome annotations and thus permit their inclusion in hypotheses and models, which relate phenotypes and genotypes.
Collapse
Affiliation(s)
- Marie A Brunet
- Biochemistry Department, Université de Sherbrooke, Quebec J1E 4K8, Canada.,Groupe de recherche PRIMUS, Department of Family and Emergency Medicine, Quebec J1H 5N4, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec G1V 0A6, Canada
| | - Sébastien A Levesque
- Pediatric Department, Centre Hospitalier de l'Université de Sherbrooke, Quebec J1H 5N4, Canada
| | - Darel J Hunting
- Department of Nuclear Medicine & Radiobiology, Université de Sherbrooke, Quebec J1H 5N4, Canada
| | - Alan A Cohen
- Groupe de recherche PRIMUS, Department of Family and Emergency Medicine, Quebec J1H 5N4, Canada
| | - Xavier Roucou
- Biochemistry Department, Université de Sherbrooke, Quebec J1E 4K8, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec G1V 0A6, Canada
| |
Collapse
|
138
|
Lu TC, Leu JY, Lin WC. A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts. Mol Biol Evol 2018; 34:2823-2838. [PMID: 28981695 PMCID: PMC5850716 DOI: 10.1093/molbev/msx210] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network.
Collapse
Affiliation(s)
- Tzu-Chiao Lu
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan.,Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan.,Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| | - Jun-Yi Leu
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan.,Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Wen-Chang Lin
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan.,Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
139
|
Abstract
The nuclear RNA exosome is an essential and versatile machinery that regulates maturation and degradation of a huge plethora of RNA species. The past two decades have witnessed remarkable progress in understanding the whole picture of its RNA substrates and the structural basis of its functions. In addition to the exosome itself, recent studies focusing on associated co-factors have been elucidating how the exosome is directed towards specific substrates. Moreover, it has been gradually realized that loss-of-function of exosome subunits affect multiple biological processes such as the DNA damage response, R-loop resolution, maintenance of genome integrity, RNA export, translation and cell differentiation. In this review, we summarize the current knowledge of the mechanisms of nuclear exosome-mediated RNA metabolism and discuss their physiological significance.
Collapse
|
140
|
Tzani I, Monger C, Kelly P, Barron N, Kelly RM, Clarke C. Understanding biopharmaceutical production at single nucleotide resolution using ribosome footprint profiling. Curr Opin Biotechnol 2018; 53:182-190. [PMID: 29471208 DOI: 10.1016/j.copbio.2018.01.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 01/29/2018] [Accepted: 01/31/2018] [Indexed: 01/06/2023]
Abstract
Biopharmaceuticals such as monoclonal antibodies have revolutionised the treatment of a variety of diseases. The production of recombinant therapeutic proteins, however, remains expensive due to the manufacturing complexity of mammalian expression systems and the regulatory burden associated with administrating these medicines to patients in a safe and efficacious manner. In recent years, academic and industrial groups have begun to develop a greater understanding of the biology of host cell lines, such as Chinese hamster ovary (CHO) cells and utilise that information for process development and cell line engineering. In this review, we focus on ribosome footprint profiling (RiboSeq), an exciting next generation sequencing (NGS) method that provides genome-wide information on translation, and discuss how its application can transform our understanding of therapeutic protein production.
Collapse
Affiliation(s)
- Ioanna Tzani
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co., Dublin, Ireland
| | - Craig Monger
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co., Dublin, Ireland
| | - Paul Kelly
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co., Dublin, Ireland
| | - Niall Barron
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co., Dublin, Ireland
| | - Ronan M Kelly
- Bioprocess Research and Development, Eli Lilly and Company, LTC-North, 1200 Kentucky Avenue, Indianapolis, IN 46225, United States
| | - Colin Clarke
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co., Dublin, Ireland.
| |
Collapse
|
141
|
Li Q, Ahsan MA, Chen H, Xue J, Chen M. Discovering Putative Peptides Encoded from Noncoding RNAs in Ribosome Profiling Data of Arabidopsis thaliana. ACS Synth Biol 2018; 7:655-663. [PMID: 29376339 DOI: 10.1021/acssynbio.7b00386] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Most noncoding RNAs are considered by their expression at low levels and as having a limited phylogenetic distribution in the cytoplasm, indicating that they may be only involved in specific biological processes. However, recent studies showed the protein-coding potential of ncRNAs, indicating that they might be a source of some special proteins. Although there are increasing noncoding RNAs identified to be able to code proteins, it is challenging to distinguish coding RNAs from previously annotated ncRNAs, and to detect the proteins from their translation. In this article, we designed a pipeline to identify these noncoding RNAs in Arabidopsis thaliana from three NCBI GEO data sets with coding potential and predict their translation products. 31 311 noncoding RNAs were predicted to be translated into peptides, and they showed lower conservation rate than common proteins. In addition, we built an interaction network between these peptides and annotated Arabidopsis proteins using BIPS, which included 69 peptides from noncoding RNAs. Peptides in the interaction network showed different characteristics from other noncoding RNA-derived peptides, and they participated in several crucial biological processes, such as photorespiration and stress-responses. All the information of putative ncPEPs and their interaction with proteins predicted above are finally integrated in a database, PncPEPDB ( http://bis.zju.edu.cn/PncPEPDB ). These results showed that peptides derived from noncoding RNAs may play important roles in noncoding RNA regulation, which provided another hypothesis that noncoding RNA may regulate the metabolism via their translation products.
Collapse
Affiliation(s)
- Qilin Li
- Department
of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Md. Asif Ahsan
- Department
of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Hongjun Chen
- Department
of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Jitong Xue
- Department
of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
- James
D. Watson Institute of Genome Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ming Chen
- Department
of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
- James
D. Watson Institute of Genome Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
142
|
Laumont CM, Perreault C. Exploiting non-canonical translation to identify new targets for T cell-based cancer immunotherapy. Cell Mol Life Sci 2018; 75:607-621. [PMID: 28823056 PMCID: PMC11105255 DOI: 10.1007/s00018-017-2628-4] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 08/03/2017] [Accepted: 08/16/2017] [Indexed: 01/11/2023]
Abstract
Cryptic MHC I-associated peptides (MAPs) are produced via two mechanisms: translation of protein-coding genes in non-canonical reading frames and translation of allegedly non-coding sequences. In general, cryptic MAPs are coded by relatively short open reading frames whose translation can be regulated at the level of initiation, elongation or termination. In contrast to conventional MAPs, the processing of cryptic MAPs is frequently proteasome independent. The existence of cryptic MAPs derived from allegedly non-coding regions enlarges the scope of CD8 T cell immunosurveillance from a mere ~2% to as much as ~75% of the human genome. Considering that 99% of cancer-specific mutations are located in those allegedly non-coding regions, cryptic MAPs could furthermore represent a particularly rich source of tumor-specific antigens. However, extensive proteogenomic analyses will be required to determine the breath as well as the temporal and spatial plasticity of the cryptic MAP repertoire in normal and neoplastic cells.
Collapse
Affiliation(s)
- Céline M Laumont
- Institute for Research in Immunology and Cancer, Université de Montréal, Station Centre-Ville, PO Box 6128, Montreal, QC, H3C 3J7, Canada
- Department of Medicine, Faculty of Medicine, Université de Montréal, Station Centre-Ville, PO Box 6128, Montreal, QC, H3C 3J7, Canada
| | - Claude Perreault
- Institute for Research in Immunology and Cancer, Université de Montréal, Station Centre-Ville, PO Box 6128, Montreal, QC, H3C 3J7, Canada.
- Department of Medicine, Faculty of Medicine, Université de Montréal, Station Centre-Ville, PO Box 6128, Montreal, QC, H3C 3J7, Canada.
- Division of Hematology, Hôpital Maisonneuve-Rosemont, 5415 de l'Assomption Boulevard, Montreal, QC, H1T 2M4, Canada.
| |
Collapse
|
143
|
Spealman P, Naik AW, May GE, Kuersten S, Freeberg L, Murphy RF, McManus J. Conserved non-AUG uORFs revealed by a novel regression analysis of ribosome profiling data. Genome Res 2017; 28:214-222. [PMID: 29254944 PMCID: PMC5793785 DOI: 10.1101/gr.221507.117] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 12/11/2017] [Indexed: 12/14/2022]
Abstract
Upstream open reading frames (uORFs), located in transcript leaders (5' UTRs), are potent cis-acting regulators of translation and mRNA turnover. Recent genome-wide ribosome profiling studies suggest that thousands of uORFs initiate with non-AUG start codons. Although intriguing, these non-AUG uORF predictions have been made without statistical control or validation; thus, the importance of these elements remains to be demonstrated. To address this, we took a comparative genomics approach to study AUG and non-AUG uORFs. We mapped transcription leaders in multiple Saccharomyces yeast species and applied a novel machine learning algorithm (uORF-seqr) to ribosome profiling data to identify statistically significant uORFs. We found that AUG and non-AUG uORFs are both frequently found in Saccharomyces yeasts. Although most non-AUG uORFs are found in only one species, hundreds have either conserved sequence or position within Saccharomyces uORFs initiating with UUG are particularly common and are shared between species at rates similar to that of AUG uORFs. However, non-AUG uORFs are translated less efficiently than AUG-uORFs and are less subject to removal via alternative transcription initiation under normal growth conditions. These results suggest that a subset of non-AUG uORFs may play important roles in regulating gene expression.
Collapse
Affiliation(s)
- Pieter Spealman
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Armaghan W Naik
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Gemma E May
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | | | | | - Robert F Murphy
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA.,Computational Biology Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Joel McManus
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|
144
|
Gao FB, Richter JD, Cleveland DW. Rethinking Unconventional Translation in Neurodegeneration. Cell 2017; 171:994-1000. [PMID: 29149615 DOI: 10.1016/j.cell.2017.10.042] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 10/16/2017] [Accepted: 10/25/2017] [Indexed: 11/27/2022]
Abstract
Eukaryotic translation is tightly regulated to ensure that protein production occurs at the right time and place. Recent studies on abnormal repeat proteins, especially in age-dependent neurodegenerative diseases caused by nucleotide repeat expansion, have highlighted or identified two forms of unconventional translation initiation: usage of AUG-like sites (near cognates) or repeat-associated non-AUG (RAN) translation. We discuss how repeat proteins may differ due to not just unconventional initiation, but also ribosomal frameshifting and/or imperfect repeat DNA replication, expansion, and repair, and we highlight how research on translation of repeats may uncover insights into the biology of translation and its contribution to disease.
Collapse
Affiliation(s)
- Fen-Biao Gao
- Department of Neurology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Joel D Richter
- Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, MA, 01605 USA.
| | - Don W Cleveland
- Ludwig Institute for Cancer Research, University of California at San Diego, La Jolla, CA 92093, USA; Department of Neurosciences, University of California at San Diego, La Jolla, CA 92093, USA; Department of Cellular and Molecular Medicine, University of California at San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
145
|
Hassan MA, Vasquez JJ, Guo-Liang C, Meissner M, Nicolai Siegel T. Comparative ribosome profiling uncovers a dominant role for translational control in Toxoplasma gondii. BMC Genomics 2017; 18:961. [PMID: 29228904 PMCID: PMC5725899 DOI: 10.1186/s12864-017-4362-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 12/01/2017] [Indexed: 11/17/2022] Open
Abstract
Background The lytic cycle of the protozoan parasite Toxoplasma gondii, which involves a brief sojourn in the extracellular space, is characterized by defined transcriptional profiles. For an obligate intracellular parasite that is shielded from the cytosolic host immune factors by a parasitophorous vacuole, the brief entry into the extracellular space is likely to exert enormous stress. Due to its role in cellular stress response, we hypothesize that translational control plays an important role in regulating gene expression in Toxoplasma during the lytic cycle. Unlike transcriptional profiles, insights into genome-wide translational profiles of Toxoplasma gondii are lacking. Methods We have performed genome-wide ribosome profiling, coupled with high throughput RNA sequencing, in intracellular and extracellular Toxoplasma gondii parasites to investigate translational control during the lytic cycle. Results Although differences in transcript abundance were mostly mirrored at the translational level, we observed significant differences in the abundance of ribosome footprints between the two parasite stages. Furthermore, our data suggest that mRNA translation in the parasite is potentially regulated by mRNA secondary structure and upstream open reading frames. Conclusion We show that most of the Toxoplasma genes that are dysregulated during the lytic cycle are translationally regulated. Electronic supplementary material The online version of this article (10.1186/s12864-017-4362-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Musa A Hassan
- Division of Infection and Immunity, The Roslin Institute, University of Edinburgh, Edinburgh, UK. .,The Centre for Tropical Livestock Genetics and Health, The Roslin Institute, University of Edinburgh, Edinburgh, UK.
| | - Juan J Vasquez
- Research Centre for Infectious Diseases, University of Wuerzburg, Wuerzburg, 97080, Germany.,Present address: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Chew Guo-Liang
- Computational Biology Program, Basic Sciences and Public Health Sciences Division, Fred Hutchinson Cancer Research Centre, Seattle, WA, 98105, USA
| | - Markus Meissner
- Wellcome Centre for Molecular Parasitology, University of Glasgow, Glasgow, UK.,Department of Veterinary Sciences, Experimental Parasitology, Ludwig-Maximilians-Universität, München, 80802, Munich, Germany
| | - T Nicolai Siegel
- Research Centre for Infectious Diseases, University of Wuerzburg, Wuerzburg, 97080, Germany.,Department of Veterinary Sciences, Experimental Parasitology, Ludwig-Maximilians-Universität, München, 80802, Munich, Germany.,Biomedical Center Munich, Physiological Chemistry, Ludwig-Maximilians-Universität München, Planegg-Martinsried, 82152, Germany
| |
Collapse
|
146
|
Abstract
Peptides encoded by short open reading frames (sORFs) are usually defined as peptides ≤100 aa long. Usually sORFs were ignored by automatic genome annotation programs due to the high probability of false discovery. However, improved computational tools along with a high-throughput RIBO-seq approach identified a myriad of translated sORFs. Their importance becomes evident as we are gaining experimental validation of their diverse cellular functions. This Review examines various computational and experimental approaches of sORFs identification as well as provides the summary of our current knowledge of their functional roles in cells.
Collapse
Affiliation(s)
- Anastasia Chugunova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Tsimafei Navalayeu
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia
| | - Olga Dontsova
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| | - Petr Sergiev
- Lomonosov Moscow State University , Department of Chemistry and A.N. Belozersky Institute of Physico-Chemical Biology, Moscow 119992, Russia.,Skolkovo Institute of Science and Technology , Skolkovo, Moscow Region 143025, Russia
| |
Collapse
|
147
|
Ndah E, Jonckheere V, Giess A, Valen E, Menschaert G, Van Damme P. REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes. Nucleic Acids Res 2017; 45:e168. [PMID: 28977509 PMCID: PMC5714196 DOI: 10.1093/nar/gkx758] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2017] [Accepted: 08/17/2017] [Indexed: 12/13/2022] Open
Abstract
Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames.
Collapse
Affiliation(s)
- Elvis Ndah
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium.,Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, B-9000 Ghent, Belgium
| | - Veronique Jonckheere
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| | - Adam Giess
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway
| | - Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway.,Sars International Centre for Marine Molecular Biology, University of Bergen, 5008 Bergen, Norway
| | - Gerben Menschaert
- Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, B-9000 Ghent, Belgium
| | - Petra Van Damme
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium.,Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
| |
Collapse
|
148
|
Abstract
This review by Kearse and Wilusz discusses the profound impact of non-AUG start codons in eukaryotic translation. It describes how misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and how modulation of non-AUG usage may represent a novel therapeutic strategy. Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states.
Collapse
Affiliation(s)
- Michael G Kearse
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, 19104 USA
| | - Jeremy E Wilusz
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, 19104 USA
| |
Collapse
|
149
|
Zhang P, He D, Xu Y, Hou J, Pan BF, Wang Y, Liu T, Davis CM, Ehli EA, Tan L, Zhou F, Hu J, Yu Y, Chen X, Nguyen TM, Rosen JM, Hawke DH, Ji Z, Chen Y. Genome-wide identification and differential analysis of translational initiation. Nat Commun 2017; 8:1749. [PMID: 29170441 PMCID: PMC5701008 DOI: 10.1038/s41467-017-01981-8] [Citation(s) in RCA: 102] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2017] [Accepted: 10/31/2017] [Indexed: 01/28/2023] Open
Abstract
Translation is principally regulated at the initiation stage. The development of the translation initiation (TI) sequencing (TI-seq) technique has enabled the global mapping of TIs and revealed unanticipated complex translational landscapes in metazoans. Despite the wide adoption of TI-seq, there is no computational tool currently available for analyzing TI-seq data. To fill this gap, we develop a comprehensive toolkit named Ribo-TISH, which allows for detecting and quantitatively comparing TIs across conditions from TI-seq data. Ribo-TISH can also predict novel open reading frames (ORFs) from regular ribosome profiling (rRibo-seq) data and outperform several established methods in both computational efficiency and prediction accuracy. Applied to published TI-seq/rRibo-seq data sets, Ribo-TISH uncovers a novel signature of elevated mitochondrial translation during amino-acid deprivation and predicts novel ORFs in 5'UTRs, long noncoding RNAs, and introns. These successful applications demonstrate the power of Ribo-TISH in extracting biological insights from TI-seq/rRibo-seq data.
Collapse
Affiliation(s)
- Peng Zhang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Dandan He
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Yi Xu
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Jiakai Hou
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Bih-Fang Pan
- Proteomics and Metabolomics Facility, and Department of Systems Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Yunfei Wang
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Tao Liu
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY, 14203, USA
| | | | - Erik A Ehli
- Avera Institute for Human Genetics, Sioux Falls, SD, 57108, USA
| | - Lin Tan
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Feng Zhou
- Liver Cancer Institute, Zhongshan Hospital, Key Laboratory of Carcinogenesis and Cancer Invasion, Minister of Education, and Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Jian Hu
- Department of Cancer Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77054, USA
| | - Yonghao Yu
- Department of Biochemistry, The University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Xi Chen
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Tuan M Nguyen
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA
- Program in Translational Biology and Molecular Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Jeffrey M Rosen
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA
| | - David H Hawke
- Proteomics and Metabolomics Facility, and Department of Systems Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Zhe Ji
- Department of Biological Chemistry and Molecular and Pharmacology, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Yiwen Chen
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
| |
Collapse
|
150
|
Ransohoff JD, Wei Y, Khavari PA. The functions and unique features of long intergenic non-coding RNA. Nat Rev Mol Cell Biol 2017; 19:143-157. [PMID: 29138516 DOI: 10.1038/nrm.2017.104] [Citation(s) in RCA: 965] [Impact Index Per Article: 120.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Long intergenic non-coding RNA (lincRNA) genes have diverse features that distinguish them from mRNA-encoding genes and exercise functions such as remodelling chromatin and genome architecture, RNA stabilization and transcription regulation, including enhancer-associated activity. Some genes currently annotated as encoding lincRNAs include small open reading frames (smORFs) and encode functional peptides and thus may be more properly classified as coding RNAs. lincRNAs may broadly serve to fine-tune the expression of neighbouring genes with remarkable tissue specificity through a diversity of mechanisms, highlighting our rapidly evolving understanding of the non-coding genome.
Collapse
Affiliation(s)
- Julia D Ransohoff
- Program in Epithelial Biology, Stanford University School of Medicine, California 94305, USA
| | - Yuning Wei
- Program in Epithelial Biology, Stanford University School of Medicine, California 94305, USA
| | - Paul A Khavari
- Program in Epithelial Biology, Stanford University School of Medicine, California 94305, USA.,Veterans Affairs Palo Alto Healthcare System, Palo Alto, California 94304, USA
| |
Collapse
|