1
Cornelissen FMG, He Z, Ciputra E, de Haas RR, Beumer‐Chuwonpad A, Noske D, Vandertop WP, Piersma SR, Jiménez CR, Murre C, Westerman BA. The translatome of glioblastoma. Mol Oncol 2025; 19:716-740. [PMID: 39417309 PMCID: PMC11887679 DOI: 10.1002/1878-0261.13743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2024] [Revised: 07/17/2024] [Accepted: 07/19/2024] [Indexed: 10/19/2024]
Abstract
Glioblastoma (GB), the most common and aggressive brain tumor, demonstrates intrinsic resistance to current therapies, resulting in poor clinical outcomes. Cancer progression can be partially attributed to the deregulation of protein translation mechanisms that drive cancer cell growth. In this study, we present the translatome landscape of GB as a valuable data resource. Eight patient-derived GB sphere cultures (GSCs) were analyzed using ribosome profiling and messenger RNA (mRNA) sequencing. We investigated inter-cell-line differences through differential expression analysis at both the translatome and transcriptome levels. Translational changes post-radiotherapy were assessed at 30 and 60 min. The translation of non-coding RNAs (ncRNAs) was validated using in-house and public mass spectrometry (MS) data, whereas RNA expression was confirmed by quantitative PCR (qPCR). Our findings demonstrate that ribosome sequencing provides more detailed information than MS or transcriptional analyses. Transcriptional similarities among GSCs correlate with translational similarities, aligning with previously defined subtypes such as proneural and mesenchymal. Additionally, we identified a broad spectrum of open reading frame types in both coding and non-coding mRNA regions, including long non-coding RNAs (lncRNAs) and pseudogenes undergoing active translation. Translation of ncRNAs into peptides was independently confirmed by in-house data and external MS data. We also observed that translational regulation of histones (downregulated) and splicing factors (upregulated) occurs in response to radiotherapy. These data offer new insights into genome-wide protein synthesis, identifying translationally regulated genes and alternative translation initiation sites in GB under normal and radiotherapeutic conditions, providing a rich resource for GB research. Further functional validation of differentially expressed genes after radiotherapy is needed. 
Understanding translational control in GB can reveal mechanistic insights and identify currently unknown biomarkers, ultimately enhancing the diagnosis and treatment of this aggressive brain cancer.
Affiliation(s)
- Fleur M. G. Cornelissen
  - Department of Molecular Biology, University of California, San Diego, La Jolla, CA, USA
  - Department of Neurosurgery, Amsterdam UMC, Location VUMC, Cancer Center, Amsterdam, The Netherlands
- Zhaoren He
  - Department of Molecular Biology, University of California, San Diego, La Jolla, CA, USA
- Edward Ciputra
  - Department of Neurosurgery, Amsterdam UMC, Location VUMC, Cancer Center, Amsterdam, The Netherlands
- Richard R. de Haas
  - OncoProteomics Laboratory, Cancer Center Amsterdam, Amsterdam UMC, The Netherlands
- David Noske
  - Department of Neurosurgery, Amsterdam UMC, Location VUMC, Cancer Center, Amsterdam, The Netherlands
- W. Peter Vandertop
  - Department of Neurosurgery, Amsterdam UMC, Location VUMC, Cancer Center, Amsterdam, The Netherlands
- Sander R. Piersma
  - OncoProteomics Laboratory, Cancer Center Amsterdam, Amsterdam UMC, The Netherlands
- Connie R. Jiménez
  - OncoProteomics Laboratory, Cancer Center Amsterdam, Amsterdam UMC, The Netherlands
- Cornelis Murre
  - Department of Molecular Biology, University of California, San Diego, La Jolla, CA, USA
- Bart A. Westerman
  - Department of Neurosurgery, Amsterdam UMC, Location VUMC, Cancer Center, Amsterdam, The Netherlands
2
Naidu P, Holford M. Microscopic marvels: Decoding the role of micropeptides in innate immunity. Immunology 2024; 173:605-621. [PMID: 39188052 DOI: 10.1111/imm.13850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/11/2024] [Accepted: 07/30/2024] [Indexed: 08/28/2024]
Abstract
The innate immune response is under selection pressures from changing environments and pathogens. While sequence evolution can be studied by comparing rates of amino acid mutations within and between species, how a gene's birth and death contribute to the evolution of immunity is less known. Short open reading frames, once regarded as untranslated or transcriptional noise, can often produce micropeptides of <100 amino acids with a wide array of biological functions. Some micropeptide sequences are well conserved, whereas others have no evolutionary conservation, potentially representing new functional compounds that arise from species-specific adaptations. To date, few reports have described the discovery of novel micropeptides of the innate immune system. The diversity of immune-related micropeptides is a blind spot for gene and functional annotation. Immune-related micropeptides represent a potential reservoir of untapped compounds for understanding and treating disease. This review consolidates what is currently known about the evolution and function of innate immune-related micropeptides to facilitate their investigation.
Affiliation(s)
- Praveena Naidu
  - Graduate Center, Programs in Biology, Biochemistry, Chemistry, City University of New York, New York, New York, USA
  - Department of Chemistry and Biochemistry, City University of New York, Hunter College, Belfer Research Building, New York, New York, USA
- Mandë Holford
  - Graduate Center, Programs in Biology, Biochemistry, Chemistry, City University of New York, New York, New York, USA
  - Department of Chemistry and Biochemistry, City University of New York, Hunter College, Belfer Research Building, New York, New York, USA
  - American Museum of Natural History, Invertebrate Zoology, Sackler Institute for Comparative Genomics, New York, New York, USA
  - Weill Cornell Medicine, Department of Biochemistry, New York, New York, USA
3
Tress ML. The rapid degradation of translated upstream regions points to an inefficient translation initiation process. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.25.625198. [PMID: 39651291 PMCID: PMC11623489 DOI: 10.1101/2024.11.25.625198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/11/2024]
Abstract
Large-scale experimental analyses find ever more abundant evidence of translation from start codons upstream of the canonical start site. This translation either generates entirely new proteins (from novel upstream open reading frames) or produces isoforms with extended N-terminals when the novel start codon is in frame. Most extended N-terminals are likely to just add a disordered region to the canonical protein isoform, but some may also block the recognition of the signal peptide, causing the isoform to accumulate in the incorrect cellular compartment. This analysis finds evidence that upstream translations that would interfere with signal peptides are detected in expected quantities in ribosome profiling experiments, but that the equivalent N-terminally extended protein isoforms are significantly reduced in multiple proteomics experiments. This suggests that these isoforms are likely to be degraded shortly after translation by the ubiquitination pathway, thus preventing the build-up of potentially harmful proteins with hydrophobic regions in the cytoplasm. In addition, this is further evidence that most of the transcripts translated from upstream start sites are the result of an inefficient translation initiation process. This has implications for the annotation of proteins given the huge numbers of upstream translations that are being detected in large-scale experiments.
4
Tufail MA, Jordan B, Hadjeras L, Gelhausen R, Cassidy L, Habenicht T, Gutt M, Hellwig L, Backofen R, Tholey A, Sharma CM, Schmitz RA. Uncovering the small proteome of Methanosarcina mazei using Ribo-seq and peptidomics under different nitrogen conditions. Nat Commun 2024; 15:8659. [PMID: 39370430 PMCID: PMC11456600 DOI: 10.1038/s41467-024-53008-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/08/2023] [Accepted: 09/25/2024] [Indexed: 10/08/2024]
Abstract
The mesophilic methanogenic archaeal model organism Methanosarcina mazei strain Gö1 is crucial for climate and environmental research due to its ability to produce methane. Here, we establish a Ribo-seq protocol for M. mazei strain Gö1 under two growth conditions (nitrogen sufficiency and limitation). The translation of 93 previously annotated and 314 unannotated small ORFs, coding for proteins ≤ 70 amino acids, is predicted with high confidence based on Ribo-seq data. LC-MS analysis validates the translation for 62 annotated small ORFs and 26 unannotated small ORFs. Epitope tagging followed by immunoblotting analysis confirms the translation of 13 out of 16 selected unannotated small ORFs. A comprehensive differential transcription and translation analysis reveals that 29 of 314 unannotated small ORFs are differentially regulated in response to nitrogen availability at the transcriptional and 49 at the translational level. A high number of reported small RNAs are emerging as dual-function RNAs, including sRNA154, the central regulatory small RNA of nitrogen metabolism. Several unannotated small ORFs are conserved in Methanosarcina species and overproducing several (small ORF encoded) small proteins suggests key physiological functions. Overall, the comprehensive analysis opens an avenue to elucidate the function(s) of multitudinous small proteins and dual-function RNAs in M. mazei.
Affiliation(s)
- Britta Jordan
  - Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Lydia Hadjeras
  - Institute of Molecular Infection Biology, University of Würzburg, 97080, Würzburg, Germany
- Rick Gelhausen
  - Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110, Freiburg, Germany
- Liam Cassidy
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Kiel University, 24105, Kiel, Germany
- Tim Habenicht
  - Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Miriam Gutt
  - Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Lisa Hellwig
  - Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
- Rolf Backofen
  - Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110, Freiburg, Germany
- Andreas Tholey
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Kiel University, 24105, Kiel, Germany
- Cynthia M Sharma
  - Institute of Molecular Infection Biology, University of Würzburg, 97080, Würzburg, Germany
- Ruth A Schmitz
  - Institute for General Microbiology, Kiel University, 24118, Kiel, Germany
5
Su X, Shi C, Liu F, Tan M, Wang Y, Zhu L, Chen Y, Yu M, Wang X, Liu J, Liu Y, Lin W, Fang Z, Sun Q, Zhou T, Lin A. HMPA: a pioneering framework for the noncanonical peptidome from discovery to functional insights. Brief Bioinform 2024; 25:bbae510. [PMID: 39413795 PMCID: PMC11483136 DOI: 10.1093/bib/bbae510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2024] [Revised: 09/01/2024] [Accepted: 09/30/2024] [Indexed: 10/18/2024]
Abstract
Advancements in peptidomics have revealed numerous small open reading frames with coding potential and revealed that some of these micropeptides are closely related to human cancer. However, the systematic analysis and integration from sequence to structure and function remains largely undeveloped. Here, as a solution, we built a workflow for the collection and analysis of proteomic data, transcriptomic data, and clinical outcomes for cancer-associated micropeptides using publicly available datasets from large cohorts. We initially identified 19 586 novel micropeptides by reanalyzing proteomic profile data from 3753 samples across 8 cancer types. Further quantitative analysis of these micropeptides, along with associated clinical data, identified 3065 that were dysregulated in cancer, with 370 of them showing a strong association with prognosis. Moreover, we employed a deep learning framework to construct a micropeptide-protein interaction network for further bioinformatics analysis, revealing that micropeptides are involved in multiple biological processes as bioactive molecules. Taken together, our atlas provides a benchmark for high-throughput prediction and functional exploration of micropeptides, providing new insights into their biological mechanisms in cancer. The HMPA is freely available at http://hmpa.zju.edu.cn.
Affiliation(s)
- Xinwan Su
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Chengyu Shi
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Fangzhou Liu
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Manman Tan
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Ying Wang
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Linyu Zhu
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Yu Chen
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Meng Yu
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Xinyi Wang
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
- Jian Liu
  - Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, 718 East Haizhou Rd., Haining, Zhejiang 314400, China
- Yang Liu
  - Institute of Immunology, Zhejiang University School of Medicine, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310009, China
- Weiqiang Lin
  - International School of Medicine, International Institutes of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, No. N1, Shangcheng Avenue, Yiwu, Zhejiang 322000, China
- Zhaoyuan Fang
  - Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, 718 East Haizhou Rd., Haining, Zhejiang 314400, China
  - The Second Affiliated Hospital, Zhejiang University School of Medicine, 88 Jiefang Road, Shangcheng District, Hangzhou, Zhejiang 310000, China
- Qiang Sun
  - International School of Medicine, International Institutes of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, No. N1, Shangcheng Avenue, Yiwu, Zhejiang 322000, China
- Tianhua Zhou
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Department of Cell Biology and Program in Molecular Cell Biology, Zhejiang University School of Medicine, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5S 1A8, Canada
- Aifu Lin
  - MOE Laboratory of Biosystem Homeostasis and Protection, College of Life Sciences, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Cancer Center, Zhejiang University, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
  - Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310000, China
  - International School of Medicine, International Institutes of Medicine, The 4th Affiliated Hospital of Zhejiang University School of Medicine, No. N1, Shangcheng Avenue, Yiwu, Zhejiang 322000, China
  - Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, 828 Zhongxing Road, Xitang District, Jiashan, Zhejiang, 314100, China
  - Key Laboratory for Cell and Gene Engineering of Zhejiang Province, 866 Yuhangtang Road, West Lake District, Hangzhou, Zhejiang 310058, China
6
Fernandez SG, Ferguson L, Ingolia NT. Ribosome rescue factor PELOTA modulates translation start site choice for C/EBPα protein isoforms. Life Sci Alliance 2024; 7:e202302501. [PMID: 38803235 PMCID: PMC11109482 DOI: 10.26508/lsa.202302501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/28/2023] [Revised: 04/15/2024] [Accepted: 04/16/2024] [Indexed: 05/29/2024]
Abstract
Translation initiation at alternative start sites can dynamically control the synthesis of two or more functionally distinct protein isoforms from a single mRNA. Alternate isoforms of the developmental transcription factor CCAAT/enhancer-binding protein α (C/EBPα) produced from different start sites exert opposing effects during myeloid cell development. This choice between alternative start sites depends on sequence features of the CEBPA transcript, including a regulatory uORF, but the molecular basis is not fully understood. Here, we identify the factors that affect C/EBPα isoform choice using a sensitive and quantitative two-color fluorescent reporter coupled with CRISPRi screening. Our screen uncovered a role of the ribosome rescue factor PELOTA (PELO) in promoting the expression of the longer C/EBPα isoform by directly removing inhibitory unrecycled ribosomes and through indirect effects mediated by the mechanistic target of rapamycin kinase. Our work uncovers further links between ribosome recycling and translation reinitiation that regulate a key transcription factor, with implications for normal hematopoiesis and leukemogenesis.
Affiliation(s)
- Samantha G Fernandez
  - Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
- Lucas Ferguson
  - Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
  - Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
- Nicholas T Ingolia
  - Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
  - Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
7
Halasz H, Malekos E, Covarrubias S, Yitiz S, Montano C, Sudek L, Katzman S, Liu SJ, Horlbeck MA, Namvar L, Weissman JS, Carpenter S. CRISPRi screens identify the lncRNA, LOUP, as a multifunctional locus regulating macrophage differentiation and inflammatory signaling. Proc Natl Acad Sci U S A 2024; 121:e2322524121. [PMID: 38781216 PMCID: PMC11145268 DOI: 10.1073/pnas.2322524121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/20/2023] [Accepted: 04/16/2024] [Indexed: 05/25/2024]
Abstract
Long noncoding RNAs (lncRNAs) account for the largest portion of RNA from the transcriptome, yet most of their functions remain unknown. Here, we performed two independent high-throughput CRISPRi screens to understand the role of lncRNAs in monocyte function and differentiation. The first was a reporter-based screen to identify lncRNAs that regulate TLR4-NFkB signaling in human monocytes, and the second screen identified lncRNAs involved in monocyte to macrophage differentiation. We successfully identified numerous noncoding and protein-coding genes that can positively or negatively regulate inflammation and differentiation. To understand the functional roles of lncRNAs in both processes, we chose to further study the lncRNA LOUP [lncRNA originating from upstream regulatory element of SPI1 (also known as PU.1)], as it emerged as a top hit in both screens. Not only does LOUP regulate its neighboring gene, the myeloid fate-determining factor SPI1, thereby affecting monocyte to macrophage differentiation, but knockdown of LOUP leads to a broad upregulation of NFkB-targeted genes at baseline and upon TLR4-NFkB activation. LOUP also harbors three small open reading frames that are capable of being translated and are responsible for LOUP's ability to negatively regulate TLR4/NFkB signaling. This work emphasizes the value of high-throughput screening to rapidly identify functional lncRNAs in the innate immune system.
Affiliation(s)
- Haley Halasz
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Eric Malekos
  - Department of Biomolecular Engineering, University of California Santa Cruz, CA 95064
- Sergio Covarrubias
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Samira Yitiz
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Christy Montano
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Lisa Sudek
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Sol Katzman
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- S. John Liu
  - Department of Radiation Oncology, University of California, San Francisco, CA 94158
  - Department of Neurological Surgery, University of California, San Francisco, CA 94158
- Max A. Horlbeck
  - Department of Radiation Oncology, University of California, San Francisco, CA 94158
  - Department of Neurological Surgery, University of California, San Francisco, CA 94158
  - Department of Pediatrics, Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
  - Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138
- Leila Namvar
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
- Jonathan S. Weissman
  - Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142
  - HHMI, Chevy Chase, MD 20815
  - David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142
  - Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142
- Susan Carpenter
  - Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA 95064
8
Kim KH, Lee CB. Socialized mitochondria: mitonuclear crosstalk in stress. Exp Mol Med 2024; 56:1033-1042. [PMID: 38689084 PMCID: PMC11148012 DOI: 10.1038/s12276-024-01211-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2023] [Revised: 01/27/2024] [Accepted: 02/07/2024] [Indexed: 05/02/2024]
Abstract
Traditionally, mitochondria are considered sites of energy production. However, recent studies have suggested that mitochondria are signaling organelles that are involved in intracellular interactions with other organelles. Remarkably, stressed mitochondria appear to induce a beneficial response that restores mitochondrial function and cellular homeostasis. These mitochondrial stress-centered signaling pathways have been rapidly elucidated in multiple organisms. In this review, we examine current perspectives on how mitochondria communicate with the rest of the cell, highlighting mitochondria-to-nucleus (mitonuclear) communication under various stresses. Our understanding of mitochondria as signaling organelles may provide new insights into disease susceptibility and lifespan extension.
Affiliation(s)
- Kyung Hwa Kim
  - Department of Health Sciences, The Graduate School of Dong-A University, 840 Hadan-dong, Saha-gu, Busan, 49315, Korea
- Cho Bi Lee
  - Department of Health Sciences, The Graduate School of Dong-A University, 840 Hadan-dong, Saha-gu, Busan, 49315, Korea
9
Wieder N, D'Souza EN, Martin-Geary AC, Lassen FH, Talbot-Martin J, Fernandes M, Chothani SP, Rackham OJL, Schafer S, Aspden JL, MacArthur DG, Davies RW, Whiffin N. Differences in 5'untranslated regions highlight the importance of translational regulation of dosage sensitive genes. Genome Biol 2024; 25:111. [PMID: 38685090 PMCID: PMC11057154 DOI: 10.1186/s13059-024-03248-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/17/2023] [Accepted: 04/15/2024] [Indexed: 05/02/2024]
Abstract
BACKGROUND: Untranslated regions (UTRs) are important mediators of post-transcriptional regulation. The length of UTRs and the composition of regulatory elements within them are known to vary substantially across genes, but little is known about the reasons for this variation in humans. Here, we set out to determine whether this variation, specifically in 5'UTRs, correlates with gene dosage sensitivity.
RESULTS: We investigate 5'UTR length, the number of alternative transcription start sites, the potential for alternative splicing, the number and type of upstream open reading frames (uORFs) and the propensity of 5'UTRs to form secondary structures. We explore how these elements vary by gene tolerance to loss-of-function (LoF; using the LOEUF metric), and in genes where changes in dosage are known to cause disease. We show that LOEUF correlates with 5'UTR length and complexity. Genes that are most intolerant to LoF have longer 5'UTRs, greater TSS diversity, and more upstream regulatory elements than their LoF tolerant counterparts. We show that these differences are evident in disease gene-sets, but not in recessive developmental disorder genes where LoF of a single allele is tolerated.
CONCLUSIONS: Our results confirm the importance of post-transcriptional regulation through 5'UTRs in tight regulation of mRNA and protein levels, particularly for genes where changes in dosage are deleterious and lead to disease. Finally, to support gene-based investigation we release a web-based browser tool, VuTR, that supports exploration of the composition of individual 5'UTRs and the impact of genetic variation within them.
Affiliation(s)
- Nechama Wieder
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Elston N D'Souza
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Alexandra C Martin-Geary
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Frederik H Lassen
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | | | - Maria Fernandes
- Big Data Institute, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Sonia P Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - Owen J L Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore, 169857, Singapore
| | - Julie L Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, United Kingdom
- LeedsOmics, University of Leeds, Leeds, LS2 9JT, United Kingdom
- Astbury Centre of Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, United Kingdom
- Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, NSW, Australia
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, VIC, Australia
- Robert W Davies
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
- Nicola Whiffin
- Big Data Institute, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
10
Cao X, Sun S, Xing J. A Massive Proteogenomic Screen Identifies Thousands of Novel Peptides From the Human "Dark" Proteome. Mol Cell Proteomics 2024; 23:100719. [PMID: 38242438 PMCID: PMC10867589 DOI: 10.1016/j.mcpro.2024.100719] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 05/02/2023] [Revised: 01/01/2024] [Accepted: 01/16/2024] [Indexed: 01/21/2024] Open
Abstract
Although the human gene annotation has been continuously improved over the past 2 decades, numerous studies demonstrated the existence of a "dark proteome", consisting of proteins that were critical for biological processes but not included in widely used gene catalogs. The Genotype-Tissue Expression project generated more than 15,000 RNA-seq datasets from multiple tissues, which modeled 30 million transcripts in the human genome. To provide a resource of high-confidence novel proteins from the dark proteome, we screened 50,000 mass spectrometry runs from over 900 projects to identify proteins translated from the Genotype-Tissue Expression transcript model with proteomic support. We also integrated 3.8 million common genetic variants from the gnomAD database to improve peptide identification. As a result, we identified 170,529 novel peptides with proteomic evidence, of which 6048 passed the strictest standard we defined and were supported by PepQuery. We provided a user-friendly website (https://ncorf.genes.fun/) for researchers to check the evidence of novel peptides from their studies. The findings will improve our understanding of coding genes and facilitate genomic data interpretation in biomedical research.
Affiliation(s)
- Xiaolong Cao
- Department of Anesthesiology, Zhujiang Hospital, Southern Medical University, Guangzhou, Guangdong, China; Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA; Human Genetic Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
- Siqi Sun
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA; Human Genetic Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
- Jinchuan Xing
- Department of Genetics, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA; Human Genetic Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA.
11
Mohsen JJ, Martel AA, Slavoff SA. Microproteins-Discovery, structure, and function. Proteomics 2023; 23:e2100211. [PMID: 37603371 PMCID: PMC10841188 DOI: 10.1002/pmic.202100211] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 07/04/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/22/2023]
Abstract
Advances in proteogenomic technologies have revealed hundreds to thousands of translated small open reading frames (sORFs) that encode microproteins in genomes across evolutionary space. While many microproteins have now been shown to play critical roles in biology and human disease, a majority of recently identified microproteins have little or no experimental evidence regarding their functionality. Computational tools have some limitations for analysis of short, poorly conserved microprotein sequences, so additional approaches are needed to determine the role of each member of this recently discovered polypeptide class. A currently underexplored avenue in the study of microproteins is structure prediction and determination, which delivers a depth of functional information. In this review, we provide a brief overview of microprotein discovery methods, then examine examples of microprotein structures (and, conversely, intrinsic disorder) that have been experimentally determined using crystallography, cryo-electron microscopy, and NMR, which provide insight into their molecular functions and mechanisms. Additionally, we discuss examples of predicted microprotein structures that have provided insight or context regarding their function. Analysis of microprotein structure at the angstrom level, and confirmation of predicted structures, therefore, has potential to identify translated microproteins that are of biological importance and to provide molecular mechanism for their in vivo roles.
Affiliation(s)
- Jessica J. Mohsen
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Alina A. Martel
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Sarah A. Slavoff
- Department of Chemistry, Yale University, New Haven, CT, USA
- Institute of Biomolecular Design and Discovery, Yale University, West Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
12
Fedorovskiy AG, Burakov AV, Terenin IM, Bykov DA, Lashkevich KA, Popenko VI, Makarova NE, Sorokin II, Sukhinina AP, Prassolov VS, Ivanov PV, Dmitriev SE. A Solitary Stalled 80S Ribosome Prevents mRNA Recruitment to Stress Granules. BIOCHEMISTRY. BIOKHIMIIA 2023; 88:1786-1799. [PMID: 38105199 DOI: 10.1134/s000629792311010x] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/23/2022] [Revised: 08/31/2023] [Accepted: 09/11/2023] [Indexed: 12/19/2023]
Abstract
In response to stress stimuli, eukaryotic cells typically suppress protein synthesis. This leads to the release of mRNAs from polysomes, their condensation with RNA-binding proteins, and the formation of non-membrane-bound cytoplasmic compartments called stress granules (SGs). SGs contain 40S but generally lack 60S ribosomal subunits. It is known that cycloheximide, emetine, and anisomycin, the ribosome inhibitors that block the progression of 80S ribosomes along mRNA and stabilize polysomes, prevent SG assembly. Conversely, puromycin, which induces premature termination, releases mRNA from polysomes and stimulates the formation of SGs. The same effect is caused by some translation initiation inhibitors, which lead to polysome disassembly and the accumulation of mRNAs in the form of stalled 48S preinitiation complexes. Based on these and other data, it is believed that the trigger for SG formation is the presence of mRNA with extended ribosome-free segments, which tend to form condensates in the cell. In this study, we evaluated the ability of various small-molecule translation inhibitors to block or stimulate the assembly of SGs under conditions of severe oxidative stress induced by sodium arsenite. Contrary to expectations, we found that ribosome-targeting elongation inhibitors of a specific type, which arrest solitary 80S ribosomes at the beginning of the mRNA coding regions but do not interfere with all subsequent ribosomes in completing translation and leaving the transcripts (such as harringtonine, lactimidomycin, or T-2 toxin), completely prevent the formation of arsenite-induced SGs. These observations suggest that the presence of even a single 80S ribosome on mRNA is sufficient to prevent its recruitment into SGs, and the presence of extended ribosome-free regions of mRNA is not sufficient for SG formation. 
We propose that mRNA entry into SGs may be mediated by specific contacts between RNA-binding proteins and those regions on 40S subunits that remain inaccessible when ribosomes are associated.
Affiliation(s)
- Artem G Fedorovskiy
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Faculty of Materials Science, Lomonosov Moscow State University, Moscow, 119991, Russia
- Anton V Burakov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Ilya M Terenin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Sirius University of Science and Technology, Sirius, Krasnodar Region, 354340, Russia
- Dmitry A Bykov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Department of Biochemistry, Faculty of Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
- Kseniya A Lashkevich
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Vladimir I Popenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
- Nadezhda E Makarova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Ivan I Sorokin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Anastasia P Sukhinina
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119234, Russia
- Vladimir S Prassolov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
- Pavel V Ivanov
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School Boston, MA 02115, USA
- Sergey E Dmitriev
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia.
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119234, Russia
13
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Moritz RL, Deutsch EW, van Heesch S. What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome? Mol Cell Proteomics 2023; 22:100631. [PMID: 37572790 PMCID: PMC10506109 DOI: 10.1016/j.mcpro.2023.100631] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Received: 05/14/2023] [Revised: 07/21/2023] [Accepted: 08/08/2023] [Indexed: 08/14/2023] Open
Abstract
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from ∼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
Affiliation(s)
- John R Prensner
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, USA.
- Leron W Kok
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
- Karl R Clauser
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
- Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
- Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, Agora Center Bugnon 25A, University of Lausanne, Lausanne, Switzerland; Department of Oncology, Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland; Agora Cancer Research Centre, Lausanne, Switzerland
- Robert L Moritz
- Institute for Systems Biology (ISB), Seattle, Washington, USA
- Eric W Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington, USA
14
Inchingolo MA, Diman A, Adamczewski M, Humphreys T, Jaquier-Gubler P, Curran JA. TP53BP1, a dual-coding gene, uses promoter switching and translational reinitiation to express a smORF protein. iScience 2023; 26:106757. [PMID: 37216125 PMCID: PMC10193022 DOI: 10.1016/j.isci.2023.106757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2022] [Revised: 03/07/2023] [Accepted: 04/24/2023] [Indexed: 05/24/2023] Open
Abstract
The complexity of the metazoan proteome is significantly increased by the expression of small proteins (<100 aa) derived from smORFs within lncRNAs, uORFs, 3' UTRs, and reading frames overlapping the CDS. These smORF-encoded proteins (SEPs) have diverse roles, ranging from the regulation of cellular physiology to essential developmental functions. We report the characterization of a new member of this protein family, SEP53BP1, derived from a small internal ORF that overlaps the CDS encoding 53BP1. Its expression is coupled to the utilization of an alternative, cell-type specific promoter coupled to translational reinitiation events mediated by a uORF in the alternative 5' TL of the mRNA. This uORF-mediated reinitiation at an internal ORF is also observed in zebrafish. Interactome studies indicate that the human SEP53BP1 associates with components of the protein turnover pathway, including the proteasome and the TRiC/CCT chaperonin complex, suggesting that it may play a role in cellular proteostasis.
Affiliation(s)
- Marta A. Inchingolo
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Aurélie Diman
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Maxime Adamczewski
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculté de Médecine et Pharmacie, Université Grenoble Alpes, Grenoble, France
- Tom Humphreys
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Pascale Jaquier-Gubler
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Joseph A. Curran
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, Geneva, Switzerland
15
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Deutsch EW, van Heesch S. What can Ribo-seq and proteomics tell us about the non-canonical proteome? BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.16.541049. [PMID: 37292611 PMCID: PMC10245706 DOI: 10.1101/2023.05.16.541049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/10/2023]
Abstract
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of non-canonical sites of ribosome translation outside of the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7,000 non-canonical open reading frames (ORFs) are translated, which, at first glance, has the potential to expand the number of human protein-coding sequences by 30%, from ∼19,500 annotated CDSs to over 26,000. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of non-canonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome, but searching for guidance on how to proceed. Here, we discuss the current state of non-canonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein-coding". In brief: The human genome encodes thousands of non-canonical open reading frames (ORFs) in addition to protein-coding genes. As a nascent field, many questions remain regarding non-canonical ORFs. How many exist? Do they encode proteins? What level of evidence is needed for their verification? Central to these debates has been the advent of ribosome profiling (Ribo-seq) as a method to discern genome-wide ribosome occupancy, and immunopeptidomics as a method to detect peptides that are processed and presented by MHC molecules and not observed in traditional proteomics experiments.
This article provides a synthesis of the current state of non-canonical ORF research and proposes standards for their future investigation and reporting. Highlights: Combined use of Ribo-seq and proteomics-based methods enables optimal confidence in detecting non-canonical ORFs and their protein products. Ribo-seq can provide more sensitive detection of non-canonical ORFs, but data quality and analytical pipelines will impact results. Non-canonical ORF catalogs are diverse and span both high-stringency and low-stringency ORF nominations. A framework for standardized non-canonical ORF evidence will advance the research field.
Affiliation(s)
- John R. Prensner
- Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Leron W. Kok
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
- Karl R. Clauser
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Jonathan M. Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
- Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland
- Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland
- Agora Cancer Research Centre, 1011 Lausanne, Switzerland
- Eric W. Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington 98109, USA
- Sebastiaan van Heesch
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
16
Yang Y, Gatica D, Liu X, Wu R, Kang R, Tang D, Klionsky DJ. Upstream open reading frames mediate autophagy-related protein translation. Autophagy 2023; 19:457-473. [PMID: 35363116 PMCID: PMC9851245 DOI: 10.1080/15548627.2022.2059744] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 01/22/2023] Open
Abstract
Macroautophagy/autophagy, a highly conserved catabolic pathway that maintains proper cellular homeostasis, is stringently regulated by numerous autophagy-related (Atg) proteins. Many studies have investigated autophagy regulation at the transcriptional level; however, relatively little is known about translational control. Here, we report the upstream open reading frame (uORF)-mediated translational control of multiple Atg proteins in Saccharomyces cerevisiae and in human cells. The translation of several essential autophagy regulators in yeast, including Atg13, is suppressed by canonical uORFs under nutrient-rich conditions, and is activated during nitrogen-starvation conditions. We also found that the predicted human ATG4B and ATG12 non-canonical uORFs suppress downstream coding sequence translation. These results demonstrate that uORF-mediated translational control is a widely used mechanism among ATG genes from yeast to human and suggest a model for how some ATG genes bypass the general translational suppression that occurs under stress conditions to maintain a proper level of autophagy. Abbreviations: 5' UTR, 5' untranslated region; Atg, autophagy-related; CDS, coding sequence; Cvt, cytoplasm-to-vacuole targeting; HBSS, Hanks' balanced salt solution; PA, protein A; PE, phosphatidylethanolamine; PIC, preinitiation complex; PtdIns3K, phosphatidylinositol 3-kinase; qRT-PCR, quantitative reverse transcription PCR; Ubl, ubiquitin-like; uORF, upstream open reading frame; WT, wild-type.
Affiliation(s)
- Ying Yang
- Department of Molecular, Cellular and Developmental Biology, and Life Sciences Institute, University of Michigan, Ann Arbor, MI 48109, USA
- Damián Gatica
- Department of Molecular, Cellular and Developmental Biology, and Life Sciences Institute, University of Michigan, Ann Arbor, MI 48109, USA
- Xu Liu
- Department of Molecular, Cellular and Developmental Biology, and Life Sciences Institute, University of Michigan, Ann Arbor, MI 48109, USA
- Runliu Wu
- Department of Surgery, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
- Rui Kang
- Department of Surgery, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
- Daolin Tang
- Department of Surgery, University of Texas Southwestern Medical Center, Dallas, TX 75390, USA
- Daniel J. Klionsky
- Department of Molecular, Cellular and Developmental Biology, and Life Sciences Institute, University of Michigan, Ann Arbor, MI 48109, USA
17
Chen Z, Meng J, Zhao S, Yin C, Luan Y. sORFPred: A Method Based on Comprehensive Features and Ensemble Learning to Predict the sORFs in Plant LncRNAs. Interdiscip Sci 2023; 15:189-201. [PMID: 36705893 DOI: 10.1007/s12539-023-00552-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 07/22/2022] [Revised: 01/11/2023] [Accepted: 01/13/2023] [Indexed: 01/28/2023]
Abstract
Long non-coding RNAs (lncRNAs) are important regulators of biological processes. It has recently been shown that some lncRNAs include small open reading frames (sORFs) that can encode small peptides of no more than 100 amino acids. However, existing methods are commonly applied to human and animal datasets and still suffer from low feature representation capability. Thus, accurate and credible prediction of sORFs with coding ability in plant lncRNAs is imperative. This paper proposes a new method termed sORFPred, in which we design a model named MCSEN by combining multi-scale convolution and Squeeze-and-Excitation Networks to fully mine distinct information embedded in sORFs, integrate and optimize multiple sequence-based and physicochemical feature descriptors, and build a two-layer prediction classifier based on a Bayesian optimization algorithm and Extra Trees. sORFPred has been evaluated on sORF datasets of three species and on an experimentally validated sORF dataset. Results indicate that sORFPred outperforms existing methods and achieves 97.28% accuracy, 97.06% precision, 97.52% recall, and 97.29% F1-score on Arabidopsis thaliana, which shows a significant improvement in prediction performance compared to various conventional shallow machine learning and deep learning models.
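As a quick consistency check on the metrics quoted in this abstract, the sketch below recomputes the F1-score from the reported precision and recall using the standard harmonic-mean definition. The function name `f1_score` is illustrative, and the assumption that the authors used this standard definition is ours; the abstract does not say how the figure was computed.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for Arabidopsis thaliana, in percent.
reported_precision = 97.06
reported_recall = 97.52

f1 = f1_score(reported_precision, reported_recall)
print(f"F1 recomputed from precision and recall: {f1:.2f}%")
```

Under that assumption the recomputed value rounds to the 97.29% given in the abstract, so the three reported figures are mutually consistent.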
Affiliation(s)
- Ziwei Chen
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China; School of Bioengineering, Dalian University of Technology, Dalian, 116024, Liaoning, China
- Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China; School of Bioengineering, Dalian University of Technology, Dalian, 116024, Liaoning, China
- Siyuan Zhao
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China; School of Bioengineering, Dalian University of Technology, Dalian, 116024, Liaoning, China
- Chao Yin
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China; School of Bioengineering, Dalian University of Technology, Dalian, 116024, Liaoning, China
- Yushi Luan
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning, China; School of Bioengineering, Dalian University of Technology, Dalian, 116024, Liaoning, China
18
Wan W, Zhang L, Lin Y, Rao X, Wang X, Hua F, Ying J. Mitochondria-derived peptide MOTS-c: effects and mechanisms related to stress, metabolism and aging. J Transl Med 2023; 21:36. [PMID: 36670507 PMCID: PMC9854231 DOI: 10.1186/s12967-023-03885-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Received: 10/19/2022] [Accepted: 01/11/2023] [Indexed: 01/22/2023] Open
Abstract
MOTS-c is a peptide encoded by the short open reading frame of the mitochondrial 12S rRNA gene. It is significantly expressed in response to stress or exercise and translocated to the nucleus, where it regulates the expression of stress adaptation-related genes with antioxidant response elements (ARE). MOTS-c mainly acts through the Folate-AICAR-AMPK pathway, thereby influencing energy metabolism, insulin resistance, inflammatory response, exercise, aging and aging-related pathologies. Because of the potential role of MOTS-c in maintaining energy and stress homeostasis to promote healthy aging, especially in view of the increasing aging of the global population, it is highly pertinent to summarize the relevant studies. This review summarizes the retrograde signaling of MOTS-c toward the nucleus, the regulation of energy metabolism, stress homeostasis, and aging-related pathological processes, as well as the underlying molecular mechanisms.
Affiliation(s)
- Wei Wan
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
- Lieliang Zhang
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
- Yue Lin
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
- Xiuqing Rao
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
- Xifeng Wang
- Department of Anesthesiology, The First Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China
- Fuzhou Hua
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
- Jun Ying
- Department of Anesthesiology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, Jiangxi, China; Key Laboratory of Anesthesiology of Jiangxi Province, 1# Minde Road, Nanchang, 330006, Jiangxi, People’s Republic of China
19
Filatova A, Reveguk I, Piatkova M, Bessonova D, Kuziakova O, Demakova V, Romanishin A, Fishman V, Imanmalik Y, Chekanov N, Skitchenko R, Barbitoff Y, Kardymon O, Skoblov M. Annotation of uORFs in the OMIM genes allows to reveal pathogenic variants in 5'UTRs. Nucleic Acids Res 2023; 51:1229-1244. [PMID: 36651276 PMCID: PMC9943669 DOI: 10.1093/nar/gkac1247] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/07/2022] [Revised: 11/29/2022] [Accepted: 12/15/2022] [Indexed: 01/19/2023] Open
Abstract
An increasing number of studies emphasize the role of non-coding variants in the development of hereditary diseases. However, the interpretation of such variants in clinical genetic testing still remains a critical challenge due to poor knowledge of their pathogenicity mechanisms. It was previously shown that variants in 5'-untranslated regions (5'UTRs) can lead to hereditary diseases due to disruption of upstream open reading frames (uORFs). Here, we performed a manual annotation of upstream translation initiation sites (TISs) in human disease-associated genes from the OMIM database and revealed ∼4.7 thousand TISs related to uORFs. We compared our TISs with the previous studies and provided a list of 'high confidence' uORFs. Using a luciferase assay, we experimentally validated the translation of uORFs in the ETFDH, PAX9, MAST1, HTT, TTN, GLI2 and COL2A1 genes, as well as the existence of an N-terminal CDS extension in the ZIC2 gene. In addition, we created a tool to annotate the effects of genetic variants located in uORFs. We revealed the variants from the HGMD and ClinVar databases that disrupt uORFs and thereby could lead to Mendelian disorders. We also showed that the distribution of uORF-affecting variants differs between pathogenic and population variants. Finally, drawing on manually curated data, we developed a machine-learning algorithm that allows us to predict the TISs in other human genes.
Affiliation(s)
- Alexandra Filatova
- To whom correspondence should be addressed. Tel: +7 916 335 33 29; Fax: +7 499 324 07 02;
- Ivan Reveguk
- Laboratoire de Biologie Structurale de la Cellule, École Polytechnique, Paris, France
- Maria Piatkova
- Institute of Chemistry, Far Eastern Branch of the Russian Academy of Sciences, Vladivostok, Russia; Institute of High Technologies and Advanced Materials, Far Eastern Federal University, Vladivostok, Russia
- Daria Bessonova
- Medical Center, Far Eastern Federal University, Vladivostok, Russia
- Olga Kuziakova
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, Vladivostok, Russia
- Alexander Romanishin
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, Vladivostok, Russia; Institute of Life Sciences, Immanuel Kant Baltic Federal University, Kaliningrad, Russia
- Veniamin Fishman
- Artificial Intelligence Research Institute, Moscow, Russia; Molecular Mechanisms of Ontogenesis, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Yury Barbitoff
- Bioinformatics Institute, St. Petersburg, Russia; Department of Genomic Medicine, D.O. Ott Research Institute of Obstetrics, Gynaecology, and Reproductology, St. Petersburg, Russia; Dpt. of Genetics and Biotechnology, St. Petersburg State University, St. Petersburg, Russia
- Olga Kardymon
- Artificial Intelligence Research Institute, Moscow, Russia
20
Chothani S, Ho L, Schafer S, Rackham O. Discovering microproteins: making the most of ribosome profiling data. RNA Biol 2023; 20:943-954. [PMID: 38013207 PMCID: PMC10730196 DOI: 10.1080/15476286.2023.2279845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized biological process discovery and understanding. Traditionally, gene models have been confirmed using cDNA sequencing and encoded translated regions inferred using sequence-based detection of start and stop combinations longer than 100 amino-acids to prevent false positives. This has led to small ORFs (smORFs) and their encoded proteins left un-annotated. Ribo-seq allows deciphering translated regions from untranslated irrespective of the length. In this review, we describe the power of Ribo-seq data in detection of smORFs while discussing the major challenge posed by data-quality, -depth and -sparseness in identifying the start and end of smORF translation. In particular, we outline smORF cataloguing efforts in humans and the large differences that have arisen due to variation in data, methods and assumptions. Although current versions of smORF reference sets can already be used as a powerful tool for hypothesis generation, we recommend that future editions should consider these data limitations and adopt unified processing for the community to establish a canonical catalogue of translated smORFs.
Affiliation(s)
- Sonia Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Lena Ho
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- Owen Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
- The Alan Turing Institute, The British Library, London, UK
21
Jürgens L, Wethmar K. The Emerging Role of uORF-Encoded uPeptides and HLA uLigands in Cellular and Tumor Biology. Cancers (Basel) 2022; 14:6031. [PMID: 36551517 PMCID: PMC9776223 DOI: 10.3390/cancers14246031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2022] [Revised: 11/29/2022] [Accepted: 11/30/2022] [Indexed: 12/13/2022] Open
Abstract
Recent technological advances have facilitated the detection of numerous non-canonical human peptides derived from regulatory regions of mRNAs, long non-coding RNAs, and other cryptic transcripts. In this review, we first give an overview of the classification of these novel peptides and summarize recent improvements in their annotation and detection by ribosome profiling, mass spectrometry, and individual experimental analysis. A large fraction of the novel peptides originates from translation at upstream open reading frames (uORFs) that are located within the transcript leader sequence of regular mRNA. In humans, uORF-encoded peptides (uPeptides) have been detected in both healthy and malignantly transformed cells and emerge as important regulators in cellular and immunological pathways. In the second part of the review, we focus on various functional implications of uPeptides. As uPeptides frequently act at the transition of translational regulation and individual peptide function, we describe the mechanistic modes of translational regulation through ribosome stalling, the involvement in cellular programs through protein interaction and complex formation, and their role within the human leukocyte antigen (HLA)-associated immunopeptidome as HLA uLigands. We delineate how malignant transformation may lead to the formation of novel uORFs, uPeptides, or HLA uLigands and explain their potential implication in tumor biology. Ultimately, we speculate on a potential use of uPeptides as peptide drugs and discuss how uPeptides and HLA uLigands may facilitate translational inhibition of oncogenic protein messages and immunotherapeutic approaches in cancer therapy.
Affiliation(s)
- Klaus Wethmar
- University Hospital Münster, Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, 48149 Münster, Germany
22
DAP5 enables main ORF translation on mRNAs with structured and uORF-containing 5' leaders. Nat Commun 2022; 13:7510. [PMID: 36473845 PMCID: PMC9726905 DOI: 10.1038/s41467-022-35019-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 02/17/2021] [Accepted: 11/16/2022] [Indexed: 12/12/2022] Open
Abstract
Half of mammalian transcripts contain short upstream open reading frames (uORFs) that potentially regulate translation of the downstream coding sequence (CDS). The molecular mechanisms governing these events remain poorly understood. Here, we find that the non-canonical initiation factor Death-associated protein 5 (DAP5 or eIF4G2) is required for translation initiation on select transcripts. Using ribosome profiling and luciferase-based reporters coupled with mutational analysis we show that DAP5-mediated translation occurs on messenger RNAs (mRNAs) with long, structure-prone 5' leader sequences and persistent uORF translation. These mRNAs preferentially code for signalling factors such as kinases and phosphatases. We also report that cap/eIF4F- and eIF4A-dependent recruitment of DAP5 to the mRNA facilitates main CDS, but not uORF, translation suggesting a role for DAP5 in translation re-initiation. Our study reveals important mechanistic insights into how a non-canonical translation initiation factor involved in stem cell fate shapes the synthesis of specific signalling factors.
23
Salvagno C, Mandula JK, Rodriguez PC, Cubillos-Ruiz JR. Decoding endoplasmic reticulum stress signals in cancer cells and antitumor immunity. Trends Cancer 2022; 8:930-943. [PMID: 35817701 PMCID: PMC9588488 DOI: 10.1016/j.trecan.2022.06.006] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Received: 04/28/2022] [Revised: 06/09/2022] [Accepted: 06/10/2022] [Indexed: 12/24/2022]
Abstract
The tumor microenvironment (TME) provokes endoplasmic reticulum (ER) stress in malignant cells and infiltrating immune populations. Sensing and responding to ER stress is coordinated by the unfolded protein response (UPR), an integrated signaling pathway governed by three ER stress sensors: activating transcription factor 6 (ATF6), inositol-requiring enzyme 1α (IRE1α), and protein kinase R (PKR)-like ER kinase (PERK). Persistent UPR activation modulates malignant progression, tumor growth, metastasis, and protective antitumor immunity. Hence, therapies targeting ER stress signaling can be harnessed to elicit direct tumor killing and concomitant anticancer immunity. We highlight recent findings on the role of the ER stress responses in onco-immunology, with an emphasis on genetic vulnerabilities that render tumors highly sensitive to therapeutic UPR modulation.
Affiliation(s)
- Camilla Salvagno
- Department of Obstetrics and Gynecology, Weill Cornell Medicine, New York, NY, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
- Jessica K Mandula
- Department of Immunology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
- Paulo C Rodriguez
- Department of Immunology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
- Juan R Cubillos-Ruiz
- Department of Obstetrics and Gynecology, Weill Cornell Medicine, New York, NY, USA; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, USA
24
Manske F, Ogoniak L, Jürgens L, Grundmann N, Makałowski W, Wethmar K. The new uORFdb: integrating literature, sequence, and variation data in a central hub for uORF research. Nucleic Acids Res 2022; 51:D328-D336. [PMID: 36305828 PMCID: PMC9825577 DOI: 10.1093/nar/gkac899] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 09/14/2022] [Revised: 09/28/2022] [Accepted: 10/03/2022] [Indexed: 02/07/2023] Open
Abstract
Upstream open reading frames (uORFs) are initiated by AUG or near-cognate start codons and have been identified in the transcript leader sequences of the majority of eukaryotic transcripts. Functionally, uORFs are implicated in downstream translational regulation of the main protein coding sequence and may serve as a source of non-canonical peptides. Genetic defects in uORF sequences have been linked to the development of various diseases, including cancer. To simplify uORF-related research, the initial release of uORFdb in 2014 provided a comprehensive and manually curated collection of uORF-related literature. Here, we present an updated sequence-based version of uORFdb, accessible at https://www.bioinformatics.uni-muenster.de/tools/uorfdb. The new uORFdb enables users to directly access sequence information, graphical displays, and genetic variation data for over 2.4 million human uORFs. It also includes sequence data of >4.2 million uORFs in 12 additional species. Multiple uORFs can be displayed in transcript- and reading-frame-specific models to visualize the translational context. A variety of filters, sequence-related information, and links to external resources (UCSC Genome Browser, dbSNP, ClinVar) facilitate immediate in-depth analysis of individual uORFs. The database also contains uORF-related somatic variation data obtained from whole-genome sequencing (WGS) analyses of 677 cancer samples collected by the TCGA consortium.
Affiliation(s)
- Felix Manske
- Institute of Bioinformatics, University of Münster, Münster 48149, Germany
- Lynn Ogoniak
- Institute of Bioinformatics, University of Münster, Münster 48149, Germany
- Lara Jürgens
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Münster 48149, Germany
- Norbert Grundmann
- Institute of Bioinformatics, University of Münster, Münster 48149, Germany
- Wojciech Makałowski
- Correspondence may also be addressed to Wojciech Makałowski. Tel: +49 2518353006;
- Klaus Wethmar
- To whom correspondence should be addressed. Tel: +49 2518347587; Fax: +49 2518347588;
25
Gleason AC, Ghadge G, Sonobe Y, Roos RP. Kozak Similarity Score Algorithm Identifies Alternative Translation Initiation Codons Implicated in Cancers. Int J Mol Sci 2022; 23:10564. [PMID: 36142475 PMCID: PMC9506484 DOI: 10.3390/ijms231810564] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 08/23/2022] [Revised: 09/05/2022] [Accepted: 09/08/2022] [Indexed: 11/16/2022] Open
Abstract
Ribosome profiling and mass spectroscopy have identified canonical and noncanonical translation initiation codons (TICs) that are upstream of the main translation initiation site and used to translate oncogenic proteins. There have previously been conflicting reports about the patterns of nucleotides that surround noncanonical TICs. Here, we use a Kozak Similarity Score algorithm to find that nearly all of these TICs have flanking nucleotides closely matching the Kozak sequence. Remarkably, the nucleotides flanking alternative noncanonical TICs are frequently closer to the Kozak sequence than the nucleotides flanking TICs used to translate the gene’s main protein. Of note, the 5′ untranslated region (5′UTR) of cancer-associated genes with an upstream TIC tends to be significantly longer than the same region in genes not associated with cancer. The presence of a longer-than-typical 5′UTR increases the likelihood of ribosome binding to upstream noncanonical TICs, and may be a distinguishing feature of a number of genes overexpressed in cancer. Noncanonical TICs that are located in the 5′UTR, although thought by some to be disadvantageous and suppressed by evolution, may translate oncogenic proteins because of their flanking nucleotides.
26
Malekos E, Carpenter S. Short open reading frame genes in innate immunity: from discovery to characterization. Trends Immunol 2022; 43:741-756. [PMID: 35965152 PMCID: PMC10118063 DOI: 10.1016/j.it.2022.07.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 06/22/2022] [Revised: 07/11/2022] [Accepted: 07/13/2022] [Indexed: 12/27/2022]
Abstract
Next-generation sequencing (NGS) technologies have greatly expanded the size of the known transcriptome. Many newly discovered transcripts are classified as long noncoding RNAs (lncRNAs) which are assumed to affect phenotype through sequence and structure and not via translated protein products despite the vast majority of them harboring short open reading frames (sORFs). Recent advances have demonstrated that the noncoding designation is incorrect in many cases and that sORF-encoded peptides (SEPs) translated from these transcripts are important contributors to diverse biological processes. Interest in SEPs is at an early stage and there is evidence for the existence of thousands of SEPs that are yet unstudied. We hope to pique interest in investigating this unexplored proteome by providing a discussion of SEP characterization generally and describing specific discoveries in innate immunity.
Affiliation(s)
- Eric Malekos
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA; Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Susan Carpenter
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA; Department of Molecular Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, USA
27
Fages-Lartaud M, Tietze L, Elie F, Lale R, Hohmann-Marriott MF. mCherry contains a fluorescent protein isoform that interferes with its reporter function. Front Bioeng Biotechnol 2022; 10:892138. [PMID: 36017355 PMCID: PMC9395592 DOI: 10.3389/fbioe.2022.892138] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 03/08/2022] [Accepted: 06/30/2022] [Indexed: 11/13/2022] Open
Abstract
Fluorescent proteins are essential reporters in cell and molecular biology. Here, we found that red fluorescent proteins possess an alternative translation initiation site that produces a short functional protein isoform in both prokaryotes and eukaryotes. The short isoform creates significant background fluorescence that biases the outcome of expression studies. In this study, we identified the short protein isoform, traced its origin, and determined the extent of the issue within the family of red fluorescent proteins. Our analysis showed that the short isoform defect of the red fluorescent protein family may affect the interpretation of many published studies. We provided a re-engineered mCherry variant that lacks background expression as an improved tool for imaging and protein expression studies.
Affiliation(s)
- Maxime Fages-Lartaud
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim, Norway
- Lisa Tietze
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim, Norway
- Florence Elie
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim, Norway
- Rahmi Lale
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim, Norway
- Martin Frank Hohmann-Marriott
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim, Norway
- United Scientists CORE (Limited), Dunedin, New Zealand
28
Andreev DE, Loughran G, Fedorova AD, Mikhaylova MS, Shatsky IN, Baranov PV. Non-AUG translation initiation in mammals. Genome Biol 2022; 23:111. [PMID: 35534899 PMCID: PMC9082881 DOI: 10.1186/s13059-022-02674-2] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Received: 08/14/2021] [Accepted: 04/14/2022] [Indexed: 12/12/2022] Open
Abstract
Recent proteogenomic studies revealed extensive translation outside of annotated protein coding regions, such as non-coding RNAs and untranslated regions of mRNAs. This non-canonical translation is largely due to start codon plurality within the same RNA. This plurality is often due to the failure of some scanning ribosomes to recognize potential start codons leading to initiation downstream—a process termed leaky scanning. Codons other than AUG (non-AUG) are particularly leaky due to their inefficiency. Here we discuss our current understanding of non-AUG initiation. We argue for a near-ubiquitous role of non-AUG initiation in shaping the dynamic composition of mammalian proteomes.
29
Chiu CW, Li YR, Lin CY, Yeh HH, Liu MJ. Translation initiation landscape profiling reveals hidden open-reading frames required for the pathogenesis of tomato yellow leaf curl Thailand virus. Plant Cell 2022; 34:1804-1821. [PMID: 35080617 PMCID: PMC9048955 DOI: 10.1093/plcell/koac019] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 09/10/2021] [Accepted: 01/06/2022] [Indexed: 05/12/2023]
Abstract
Plant viruses with densely packed genomes employ noncanonical translational strategies to increase the coding capacity for viral function. However, the diverse translational strategies used make it challenging to define the full set of viral genes. Here, using tomato yellow leaf curl Thailand virus (TYLCTHV, genus Begomovirus) as a model system, we identified genes beyond the annotated gene sets by experimentally profiling in vivo translation initiation sites (TISs). We found that unanticipated AUG TISs were prevalent and determined that their usage involves alternative transcriptional and/or translational start sites and is associated with flanking mRNA sequences. Specifically, two downstream in-frame TISs were identified in the viral gene AV2. These TISs were conserved in the begomovirus lineage and led to the translation of different protein isoforms localized to cytoplasmic puncta and at the cell periphery, respectively. In addition, we found translational evidence of an unexplored gene, BV2. BV2 is conserved among TYLCTHV isolates and localizes to the endoplasmic reticulum and plasmodesmata. Mutations of AV2 isoforms and BV2 significantly attenuated disease symptoms in tomato (Solanum lycopersicum). In conclusion, our study pinpointing in vivo TISs untangles the coding complexity of a plant viral genome and, more importantly, illustrates the biological significance of the hidden open-reading frames encoding viral factors for pathogenicity.
Affiliation(s)
- Ching-Wen Chiu
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
- Ya-Ru Li
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
- Cheng-Yuan Lin
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
- Hsin-Hung Yeh
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei 115, Taiwan
30
Nelde A, Flötotto L, Jürgens L, Szymik L, Hubert E, Bauer J, Schliemann C, Kessler T, Lenz G, Rammensee HG, Walz JS, Wethmar K. Upstream open reading frames regulate translation of cancer-associated transcripts and encode HLA-presented immunogenic tumor antigens. Cell Mol Life Sci 2022; 79:171. [PMID: 35239002 PMCID: PMC8894207 DOI: 10.1007/s00018-022-04145-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 10/07/2021] [Revised: 12/21/2021] [Accepted: 01/10/2022] [Indexed: 02/04/2023]
Abstract
BACKGROUND Upstream open reading frames (uORFs) represent translational control elements within eukaryotic transcript leader sequences. Recent data showed that uORFs can encode biologically active proteins and human leukocyte antigen (HLA)-presented peptides in malignant and benign cells, suggesting their potential role in cancer cell development and survival. However, the role of uORFs in translational regulation of cancer-associated transcripts as well as in cancer immune surveillance is still incompletely understood.
METHODS We examined the translational regulatory effect of 29 uORFs in 13 cancer-associated genes by dual-luciferase assays. Cellular expression and localization of uORF-encoded peptides (uPeptides) were investigated by immunoblotting and immunofluorescence-based microscopy. Furthermore, we utilized mass spectrometry-based immunopeptidome analyses in an extensive dataset of primary malignant and benign tissue samples for the identification of naturally presented uORF-derived HLA-presented peptides, screening for more than 2000 uORFs.
RESULTS We provide experimental evidence for similarly effective translational regulation of cancer-associated transcripts through uORFs initiated by either canonical AUG codons or by alternative translation initiation sites (aTISs). We further demonstrate frequent cellular expression and reveal occasional specific cellular localization of uORF-derived peptides, suggesting uPeptide-specific biological implications. Immunopeptidome analyses delineated a set of 125 naturally presented uORF-derived HLA-presented peptides. Comparative immunopeptidome profiling of malignant and benign tissue-derived immunopeptidomes identified several tumor-associated uORF-derived HLA ligands capable of inducing multifunctional T cell responses.
CONCLUSION Our data provide direct evidence for the frequent expression of uPeptides in benign and malignant human tissues, suggesting a potentially widespread function of uPeptides in cancer biology. These findings may inspire novel approaches in direct molecular as well as immunotherapeutic targeting of cancer-associated uORFs and uPeptides.
Affiliation(s)
- Annika Nelde
- Clinical Collaboration Unit Translational Immunology, Department of Internal Medicine, German Cancer Consortium (DKTK), University Hospital Tübingen, Otfried-Müller-Str. 10, 72076, Tübingen, Germany
- Department of Immunology, Institute for Cell Biology, University of Tübingen, 72076, Tübingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, 72076, Tübingen, Germany
- Lea Flötotto
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Lara Jürgens
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Laura Szymik
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Elvira Hubert
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Jens Bauer
- Clinical Collaboration Unit Translational Immunology, Department of Internal Medicine, German Cancer Consortium (DKTK), University Hospital Tübingen, Otfried-Müller-Str. 10, 72076, Tübingen, Germany
- Department of Immunology, Institute for Cell Biology, University of Tübingen, 72076, Tübingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, 72076, Tübingen, Germany
- Christoph Schliemann
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Torsten Kessler
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Georg Lenz
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
- Hans-Georg Rammensee
- Department of Immunology, Institute for Cell Biology, University of Tübingen, 72076, Tübingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, 72076, Tübingen, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Partner Site Tübingen, 72076, Tübingen, Germany
- Juliane S Walz
- Clinical Collaboration Unit Translational Immunology, Department of Internal Medicine, German Cancer Consortium (DKTK), University Hospital Tübingen, Otfried-Müller-Str. 10, 72076, Tübingen, Germany
- Department of Immunology, Institute for Cell Biology, University of Tübingen, 72076, Tübingen, Germany
- Cluster of Excellence iFIT (EXC2180) "Image-Guided and Functionally Instructed Tumor Therapies", University of Tübingen, 72076, Tübingen, Germany
- Dr. Margarete Fischer-Bosch Institute of Clinical Pharmacology, Robert Bosch Center for Tumor Diseases (RBCT), 70376, Stuttgart, Germany
- Klaus Wethmar
- Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, Albert-Schweitzer-Campus 1A, 48149, Münster, Germany
31
Ivanov IP, Saba JA, Fan CM, Wang J, Firth AE, Cao C, Green R, Dever TE. Evolutionarily conserved inhibitory uORFs sensitize Hox mRNA translation to start codon selection stringency. Proc Natl Acad Sci U S A 2022; 119:e2117226119. [PMID: 35217614 PMCID: PMC8892498 DOI: 10.1073/pnas.2117226119] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Received: 09/21/2021] [Accepted: 01/20/2022] [Indexed: 01/15/2023] Open
Abstract
Translation start site selection in eukaryotes is influenced by context nucleotides flanking the AUG codon and by levels of the eukaryotic translation initiation factors eIF1 and eIF5. In a search of mammalian genes, we identified five homeobox (Hox) gene paralogs initiated by AUG codons in conserved suboptimal context as well as 13 Hox genes that contain evolutionarily conserved upstream open reading frames (uORFs) that initiate at AUG codons in poor sequence context. An analysis of published cap analysis of gene expression sequencing (CAGE-seq) data and generated CAGE-seq data for messenger RNAs (mRNAs) from mouse somites revealed that the 5' leaders of Hox mRNAs of interest contain conserved uORFs, are generally much shorter than reported, and lack previously proposed internal ribosome entry site elements. We show that the conserved uORFs inhibit Hox reporter expression and that altering the stringency of start codon selection by overexpressing eIF1 or eIF5 modulates the expression of Hox reporters. We also show that modifying ribosome homeostasis by depleting a large ribosomal subunit protein or treating cells with sublethal concentrations of puromycin leads to lower stringency of start codon selection. Thus, altering global translation can confer gene-specific effects through altered start codon selection stringency.
Affiliation(s)
- Ivaylo P Ivanov
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892
- James A Saba
- HHMI, Johns Hopkins University School of Medicine, Baltimore, MD 21205
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205
- Chen-Ming Fan
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD 21218
- Ji Wang
- Division of Virology, Department of Pathology, University of Cambridge, Cambridge CB2 1QP, United Kingdom
- Andrew E Firth
- Division of Virology, Department of Pathology, University of Cambridge, Cambridge CB2 1QP, United Kingdom
- Chune Cao
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892
- Rachel Green
- HHMI, Johns Hopkins University School of Medicine, Baltimore, MD 21205
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205
- Thomas E Dever
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20892
32
Kute PM, Soukarieh O, Tjeldnes H, Trégouët DA, Valen E. Small Open Reading Frames, How to Find Them and Determine Their Function. Front Genet 2022; 12:796060. [PMID: 35154250 PMCID: PMC8831751 DOI: 10.3389/fgene.2021.796060] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 10/15/2021] [Accepted: 12/30/2021] [Indexed: 12/12/2022] Open
Abstract
Advances in genomics and molecular biology have revealed an abundance of small open reading frames (sORFs) across all types of transcripts. While these sORFs are often assumed to be non-functional, many have been implicated in physiological functions and a significant number of sORFs have been described in human diseases. Thus, sORFs may represent a hidden repository of functional elements that could serve as therapeutic targets. Unlike protein-coding genes, it is not necessarily the encoded peptide of an sORF that enacts its function; sometimes simply the act of translating an sORF might have a regulatory role. Indeed, the most studied sORFs are located in the 5′UTRs of coding transcripts and can have a regulatory impact on the translation of the downstream protein-coding sequence. However, sORFs have also been abundantly identified in non-coding RNAs, including lncRNAs, circular RNAs and ribosomal RNAs, suggesting that sORFs may be diverse in function. Of the many different experimental methods used to discover sORFs, the most commonly used are ribosome profiling and mass spectrometry. These can confirm interactions between transcripts and ribosomes and the production of a peptide, respectively. Extensions to ribosome profiling, which also capture scanning ribosomes, have further made it possible to see how sORFs impact the translation initiation of mRNAs. While high-throughput techniques have made the identification of sORFs less difficult, defining their function, if any, is typically more challenging. Together, the abundance and potential function of many of these sORFs argue for the necessity of including sORFs in gene annotations and systematically characterizing these to understand their potential functional roles. In this review, we will focus on the high-throughput methods used in the detection and characterization of sORFs and discuss techniques for validation and functional characterization.
Affiliation(s)
- Preeti Madhav Kute
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
  - Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
- Omar Soukarieh
  - Department of Molecular Epidemiology Of Vascular and Brain Disorders, INSERM, BPH, U1219, University of Bordeaux, Bordeaux, France
- Håkon Tjeldnes
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- David-Alexandre Trégouët
  - Department of Molecular Epidemiology Of Vascular and Brain Disorders, INSERM, BPH, U1219, University of Bordeaux, Bordeaux, France
- Eivind Valen
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
  - Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
  - *Correspondence: Eivind Valen
|
33
|
Andreev DE, Baranov PV, Milogorodskii A, Rachinskii D. A deterministic model for non-monotone relationship between translation of upstream and downstream open reading frames. Mathematical Medicine and Biology: A Journal of the IMA 2021; 38:490-515. [PMID: 34718568 DOI: 10.1093/imammb/dqab015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 12/30/2020] [Revised: 08/12/2021] [Accepted: 10/06/2021] [Indexed: 01/01/2023]
Abstract
Totally asymmetric simple exclusion process (TASEP) modelling was shown to offer a parsimonious explanation for the experimentally confirmed ability of a single upstream open reading frame (uORF) to upregulate downstream translation during the integrated stress response. As revealed by numerical simulations, the model predicts that reducing the density of scanning ribosomes upstream of certain uORFs increases the flow of ribosomes downstream. To gain a better insight into the mechanism which ensures the non-monotone relation between the upstream and downstream flows, in this work, we propose a phenomenological deterministic model approximating the TASEP model of the translation process. We establish the existence of a stationary solution featuring a decreasing density along the uORF for the deterministic model. Further, we find an explicit non-monotone relation between the upstream ribosome density and the downstream flow for the stationary solution in the limit of increasing uORF length and increasingly leaky initiation. The stationary distribution of the TASEP model, the stationary solution of the deterministic model and the explicit limit are compared numerically.
Affiliation(s)
- D E Andreev
  - Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow, 119991, Russian Federation, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
- P V Baranov
  - University College Cork, College Road, Cork, T12 K8AF, Ireland, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry (RAS), 16/10 Miklukho-Maklay str., Moscow, 117997, Russian Federation
- A Milogorodskii
  - Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow, 119991, Russian Federation, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
- D Rachinskii
  - Department of Mathematical Sciences, The University of Texas at Dallas, 800 W. Campbell Rd, Richardson, TX 75080, USA
|
34
|
Shirokikh NE. Translation complex stabilization on messenger RNA and footprint profiling to study the RNA responses and dynamics of protein biosynthesis in the cells. Crit Rev Biochem Mol Biol 2021; 57:261-304. [PMID: 34852690 DOI: 10.1080/10409238.2021.2006599] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/28/2023]
Abstract
During protein biosynthesis, ribosomes bind to messenger (m)RNA, locate its protein-coding information, and translate the nucleotide triplets sequentially as codons into the corresponding sequence of amino acids, forming proteins. Non-coding mRNA features, such as 5' and 3' untranslated regions (UTRs), start sites or stop codons of different efficiency, stretches of slower or faster code and nascent polypeptide interactions can alter the translation rates transcript-wise. Most of the homeostatic and signal response pathways of the cells converge on individual mRNA control, as well as alter the global translation output. Among the multitude of approaches to study translational control, one of the most powerful is to infer the locations of translational complexes on mRNA based on the mRNA fragments protected by these complexes from endonucleolytic hydrolysis, or footprints. Translation complex profiling by high-throughput sequencing of the footprints makes it possible to quantify the transcript-wise, as well as global, alterations of translation, and to uncover the underlying control mechanisms by attributing footprint locations and sizes to different configurations of the translational complexes. The accuracy of all footprint profiling approaches critically depends on the fidelity of footprint generation, and many methods have emerged to preserve certain or multiple configurations of the translational complexes, often in challenging biological material. In this review, a systematic summary of approaches to stabilize translational complexes on mRNA for footprinting is presented and major findings are discussed. Future directions of translation footprint profiling are outlined, focusing on the fidelity and accuracy of inference of the native in vivo translation complex distribution on mRNA.
Affiliation(s)
- Nikolay E Shirokikh
  - Division of Genome Sciences and Cancer, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
|
35
|
Unraveling the hidden role of a uORF-encoded peptide as a kinase inhibitor of PKCs. Proc Natl Acad Sci U S A 2021; 118:2018899118. [PMID: 34593629 PMCID: PMC8501901 DOI: 10.1073/pnas.2018899118] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Accepted: 08/19/2021] [Indexed: 02/01/2023] Open
Abstract
Approximately 40% of human messenger RNAs (mRNAs) contain upstream open reading frames (uORFs) in their 5' untranslated regions. Some of these uORF sequences, thought to attenuate scanning ribosomes or lead to mRNA degradation, were recently shown to be translated, although the function of the encoded peptides remains unknown. Here, we show a uORF-encoded peptide that exhibits kinase inhibitory functions. This uORF, upstream of the protein kinase C-eta (PKC-η) main ORF, encodes a peptide (uPEP2) containing the typical PKC pseudosubstrate motif present in all PKCs that autoinhibits their kinase activity. We show that uPEP2 directly binds to and selectively inhibits the catalytic activity of novel PKCs but not of classical or atypical PKCs. The endogenous deletion of uORF2 or its overexpression in MCF-7 cells revealed that the endogenously translated uPEP2 reduces the protein levels of PKC-η and other novel PKCs and restricts cell proliferation. Functionally, treatment of breast cancer cells with uPEP2 diminished cell survival and their migration and synergized with chemotherapy by interfering with the response to DNA damage. Furthermore, in a mouse xenograft model of MDA-MB-231 breast cancer, uPEP2 suppressed tumor progression, invasion, and metastasis. Tumor histology showed reduced proliferation, enhanced cell death, and lower protein expression levels of novel PKCs along with diminished phosphorylation of PKC substrates. Hence, our study demonstrates that uORFs may encode biologically active peptides beyond their role as translation regulators of their downstream ORFs. Together, we point to a unique function of a uORF-encoded peptide as a kinase inhibitor, pertinent to cancer therapy.
|
36
|
Dmitriev SE, Vladimirov DO, Lashkevich KA. A Quick Guide to Small-Molecule Inhibitors of Eukaryotic Protein Synthesis. Biochemistry (Moscow) 2021; 85:1389-1421. [PMID: 33280581 PMCID: PMC7689648 DOI: 10.1134/s0006297920110097] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Indexed: 12/14/2022]
Abstract
Eukaryotic ribosome and cap-dependent translation are attractive targets in the antitumor, antiviral, anti-inflammatory, and antiparasitic therapies. Currently, a broad array of small-molecule drugs is known that specifically inhibit protein synthesis in eukaryotic cells. Many of them are well-studied ribosome-targeting antibiotics that block translocation, the peptidyl transferase center or the polypeptide exit tunnel, modulate the binding of translation machinery components to the ribosome, and induce miscoding, premature termination or stop codon readthrough. Such inhibitors are widely used as anticancer, anthelmintic and antifungal agents in medicine, as well as fungicides in agriculture. Chemicals that affect the accuracy of stop codon recognition are promising drugs for the nonsense suppression therapy of hereditary diseases and restoration of tumor suppressor function in cancer cells. Other compounds inhibit aminoacyl-tRNA synthetases, translation factors, and components of translation-associated signaling pathways, including mTOR kinase. Some of them have antidepressant, immunosuppressive and geroprotective properties. Translation inhibitors are also used in research for gene expression analysis by ribosome profiling, as well as in cell culture techniques. In this article, we review well-studied and less known inhibitors of eukaryotic protein synthesis (with the exception of mitochondrial and plastid translation) classified by their targets and briefly describe the action mechanisms of these compounds. We also present a continuously updated database (http://eupsic.belozersky.msu.ru/) that currently contains information on 370 inhibitors of eukaryotic protein synthesis.
Affiliation(s)
- S E Dmitriev
  - Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
  - Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119234, Russia
  - Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
- D O Vladimirov
  - Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, 119234, Russia
- K A Lashkevich
  - Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Moscow, 119234, Russia
|
37
|
Jürgens L, Manske F, Hubert E, Kischka T, Flötotto L, Klaas O, Shabardina V, Schliemann C, Makalowski W, Wethmar K. Somatic Functional Deletions of Upstream Open Reading Frame-Associated Initiation and Termination Codons in Human Cancer. Biomedicines 2021; 9:618. [PMID: 34072580 PMCID: PMC8227997 DOI: 10.3390/biomedicines9060618] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 04/18/2021] [Revised: 05/22/2021] [Accepted: 05/27/2021] [Indexed: 11/16/2022] Open
Abstract
Upstream open reading frame (uORF)-mediated translational control has emerged as an important regulatory mechanism in human health and disease. However, a systematic search for cancer-associated somatic uORF mutations has not been performed. Here, we analyzed the genetic variability at canonical (uAUG) and alternative translational initiation sites (aTISs), as well as the associated upstream termination codons (uStops) in 3394 whole-exome-sequencing datasets from patient samples of breast, colon, lung, prostate, and skin cancer and of acute myeloid leukemia, provided by The Cancer Genome Atlas research network. We found that 66.5% of patient samples were affected by at least one of 5277 recurrent uORF-associated somatic single nucleotide variants altering 446 uAUG, 347 uStop, and 4733 aTIS codons. While twelve uORF variants were detected in all entities, 17 variants occurred in all five types of solid cancer analyzed here. Highest frequencies of individual somatic variants in the TLSs of NBPF20 and CHCHD2 reached 10.1% among LAML and 8.1% among skin cancer patients, respectively. Functional evaluation by dual luciferase reporter assays identified 19 uORF variants causing significant translational deregulation of the associated main coding sequence, ranging from 1.73-fold induction for an AUG.1 > UUG variant in SETD4 to 0.006-fold repression for a CUG.6 > GUG variant in HLA-DRB1. These data suggest that somatic uORF mutations are highly prevalent in human malignancies and that defective translational regulation of protein expression may contribute to the onset or progression of cancer.
Affiliation(s)
- Lara Jürgens
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
- Felix Manske
  - Faculty of Medicine, Institute of Bioinformatics, University of Münster, 48149 Münster, Germany; (F.M.); (T.K.); (W.M.)
- Elvira Hubert
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
- Tabea Kischka
  - Faculty of Medicine, Institute of Bioinformatics, University of Münster, 48149 Münster, Germany; (F.M.); (T.K.); (W.M.)
- Lea Flötotto
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
- Oliver Klaas
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
- Victoria Shabardina
  - Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, 08002 Barcelona, Spain
- Christoph Schliemann
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
- Wojciech Makalowski
  - Faculty of Medicine, Institute of Bioinformatics, University of Münster, 48149 Münster, Germany; (F.M.); (T.K.); (W.M.)
- Klaus Wethmar
  - Department of Medicine A, Hematology, Oncology, Hemostaseology and Pneumology, University Hospital Münster, 48149 Münster, Germany; (L.J.); (E.H.); (L.F.); (O.K.); (C.S.)
  - Correspondence: Tel.: +49-251-8347587; Fax: +49-251-8347588
|
38
|
Giess A, Torres Cleuren YN, Tjeldnes H, Krause M, Bizuayehu TT, Hiensch S, Okon A, Wagner CR, Valen E. Profiling of Small Ribosomal Subunits Reveals Modes and Regulation of Translation Initiation. Cell Rep 2021; 31:107534. [PMID: 32320657 DOI: 10.1016/j.celrep.2020.107534] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Received: 11/08/2019] [Revised: 02/28/2020] [Accepted: 03/27/2020] [Indexed: 12/11/2022] Open
Abstract
Translation initiation is often regarded as the rate-determining step of eukaryotic protein synthesis and key to gene expression control. Despite this centrality, the series of steps involved in this process is poorly understood. Here, we capture the transcriptome-wide occupancy of ribosomes across all stages of translation initiation, enabling us to characterize the transcriptome-wide dynamics of ribosome recruitment to mRNAs, scanning across 5' UTRs and stop codon recognition, in a higher eukaryote. We provide mechanistic evidence for ribosomes attaching to the mRNA by threading the mRNA through the small subunit. Moreover, we identify features that regulate the recruitment and processivity of scanning ribosomes and redefine optimal initiation contexts. Our approach enables deconvoluting translation initiation into separate stages and identifying regulators at each step.
Affiliation(s)
- Adam Giess
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway
- Yamila N Torres Cleuren
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway
- Håkon Tjeldnes
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway
- Maximilian Krause
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway; Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen 5008, Norway
- Senna Hiensch
  - Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen 5008, Norway
- Aniekan Okon
  - Department Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
- Carston R Wagner
  - Department Medicinal Chemistry, University of Minnesota, Minneapolis, MN 55455, USA
- Eivind Valen
  - Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway; Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen 5008, Norway
|
39
|
Popovic R, Celardo I, Yu Y, Costa AC, Loh SHY, Martins LM. Combined Transcriptomic and Proteomic Analysis of Perk Toxicity Pathways. Int J Mol Sci 2021; 22:4598. [PMID: 33925631 PMCID: PMC8124185 DOI: 10.3390/ijms22094598] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 03/13/2021] [Revised: 04/19/2021] [Accepted: 04/23/2021] [Indexed: 12/17/2022] Open
Abstract
In Drosophila, endoplasmic reticulum (ER) stress activates the protein kinase R-like endoplasmic reticulum kinase (dPerk). dPerk can also be activated by defective mitochondria in fly models of Parkinson's disease caused by mutations in pink1 or parkin. The Perk branch of the unfolded protein response (UPR) has emerged as a major toxic process in neurodegenerative disorders causing a chronic reduction in vital proteins and neuronal death. In this study, we combined microarray analysis and quantitative proteomics analysis in adult flies overexpressing dPerk to investigate the relationship between the transcriptional and translational response to dPerk activation. We identified tribbles and Heat shock protein 22 as two novel Drosophila activating transcription factor 4 (dAtf4) regulated transcripts. Using a combined bioinformatics tool kit, we demonstrated that the activation of dPerk leads to translational repression of mitochondrial proteins associated with glutathione and nucleotide metabolism, calcium signalling and iron-sulphur cluster biosynthesis. Further efforts to enhance these translationally repressed dPerk targets might offer protection against Perk toxicity.
Affiliation(s)
- L. Miguel Martins
  - MRC Toxicology Unit, University of Cambridge, Gleeson Building, Tennis Court Road, Cambridge CB2 1QR, UK; (R.P.); (I.C.); (Y.Y.); (A.C.C.); (S.H.Y.L.)
|
40
|
Xu C, Zhang J. Mammalian Alternative Translation Initiation Is Mostly Nonadaptive. Mol Biol Evol 2021; 37:2015-2028. [PMID: 32145028 DOI: 10.1093/molbev/msaa063] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 01/03/2023] Open
Abstract
Alternative translation initiation (ATLI) refers to the existence of multiple translation initiation sites per gene and is a widespread phenomenon in eukaryotes. ATLI is commonly assumed to be advantageous through creating proteome diversity or regulating protein synthesis. We here propose an alternative hypothesis that ATLI arises primarily from nonadaptive initiation errors presumably due to the limited ability of ribosomes to distinguish sequence motifs truly signaling translation initiation from similar sequences. Our hypothesis, but not the adaptive hypothesis, predicts a series of global patterns of ATLI, all of which are confirmed at the genomic scale by quantitative translation initiation sequencing in multiple human and mouse cell lines and tissues. Similarly, although many codons differing from AUG by one nucleotide can serve as start codons, our analysis suggests that using non-AUG start codons is mostly disadvantageous. These and other findings strongly suggest that ATLI predominantly results from molecular error, requiring a major revision of our understanding of the precision and regulation of translation initiation.
Affiliation(s)
- Chuan Xu
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
|
41
|
Zhang H, Wang Y, Wu X, Tang X, Wu C, Lu J. Determinants of genome-wide distribution and evolution of uORFs in eukaryotes. Nat Commun 2021; 12:1076. [PMID: 33597535 PMCID: PMC7889888 DOI: 10.1038/s41467-021-21394-y] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Received: 10/30/2019] [Accepted: 01/20/2021] [Indexed: 01/02/2023] Open
Abstract
Upstream open reading frames (uORFs) play widespread regulatory functions in modulating mRNA translation in eukaryotes, but the principles underlying the genomic distribution and evolution of uORFs remain poorly understood. Here, we analyze ~17 million putative canonical uORFs in 478 eukaryotic species that span most of the extant taxa of eukaryotes. We demonstrate how positive and purifying selection, coupled with differences in effective population size (Ne), has shaped the contents of uORFs in eukaryotes. In addition, gene expression level is important in influencing uORF occurrences across genes in a species. Our analyses suggest that most uORFs might play regulatory roles rather than encode functional peptides. We also show that the Kozak sequence context of uORFs has evolved across eukaryotic clades, and that noncanonical uORFs tend to have weaker suppressive effects than canonical uORFs in translation regulation. This study provides insights into the driving forces underlying uORF evolution in eukaryotes.
Affiliation(s)
- Hong Zhang
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Yirong Wang
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
  - College of Biology, Hunan University, Changsha, China
- Xinkai Wu
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Xiaolu Tang
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Changcheng Wu
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Jian Lu
  - State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
|
42
|
Neville MDC, Kohze R, Erady C, Meena N, Hayden M, Cooper DN, Mort M, Prabakaran S. A platform for curated products from novel open reading frames prompts reinterpretation of disease variants. Genome Res 2021; 31:327-336. [PMID: 33468550 PMCID: PMC7849405 DOI: 10.1101/gr.263202.120] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 03/05/2020] [Accepted: 08/26/2020] [Indexed: 11/29/2022]
Abstract
Recent evidence from proteomics and deep massively parallel sequencing studies has revealed that eukaryotic genomes contain substantial numbers of as-yet-uncharacterized open reading frames (ORFs). We define these uncharacterized ORFs as novel ORFs (nORFs). nORFs in humans are mostly under 100 codons and are found in diverse regions of the genome, including in long noncoding RNAs, pseudogenes, 3' UTRs, 5' UTRs, and alternative reading frames of canonical protein coding exons. There is therefore a pressing need to evaluate the potential functional importance of these unannotated transcripts and proteins in biological pathways and human disease on a larger scale, rather than one at a time. In this study, we outline the creation of a valuable nORFs data set with experimental evidence of translation for the community, use measures of heritability and selection that reveal signals for functional importance, and show the potential implications for functional interpretation of genetic variants in nORFs. Our results indicate that some variants that were previously classified as being benign or of uncertain significance may have to be reinterpreted.
Affiliation(s)
- Matthew D C Neville
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
- Robin Kohze
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
- Chaitanya Erady
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
- Narendra Meena
  - Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra 411008, India
- Matthew Hayden
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
- David N Cooper
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
- Matthew Mort
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
- Sudhakaran Prabakaran
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, United Kingdom
  - Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra 411008, India
  - St Edmund's College, University of Cambridge, Cambridge CB3 0BN, United Kingdom
|
43
|
Meydan S, Klepacki D, Mankin AS, Vázquez-Laslop N. Identification of Translation Start Sites in Bacterial Genomes. Methods Mol Biol 2021; 2252:27-55. [PMID: 33765270 DOI: 10.1007/978-1-0716-1150-0_2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 01/09/2023]
Abstract
The knowledge of translation start sites is crucial for annotation of genes in bacterial genomes. However, systematic mapping of start codons in bacterial genes has mainly relied on predictions based on protein conservation and mRNA sequence features which, although useful, are not always accurate. We recently found that the pleuromutilin antibiotic retapamulin (RET) is a specific inhibitor of translation initiation that traps ribosomes specifically at start codons, and we used it in combination with ribosome profiling to map start codons in the Escherichia coli genome. This genome-wide strategy, which was named Ribo-RET, not only verifies the position of start codons in already annotated genes but also enables identification of previously unannotated open reading frames and reveals the presence of internal start sites within genes. Here, we provide a detailed Ribo-RET protocol for E. coli. Ribo-RET can be adapted for mapping the start codons of the protein-coding sequences in a variety of bacterial species.
Affiliation(s)
- Sezen Meydan
  - National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
- Dorota Klepacki
  - Center for Biomolecular Sciences, University of Illinois at Chicago, Chicago, IL, USA
- Alexander S Mankin
  - Center for Biomolecular Sciences, University of Illinois at Chicago, Chicago, IL, USA
- Nora Vázquez-Laslop
  - Center for Biomolecular Sciences, University of Illinois at Chicago, Chicago, IL, USA
|
44
|
Yoshitomi H, Lee KY, Yao K, Shin SH, Zhang T, Wang Q, Paul S, Roh E, Ryu J, Chen H, Aziz F, Chakraborty A, Bode AM, Dong Z. GSK3β-Mediated Expression of CUG-Translated WT1 Is Critical for Tumor Progression. Cancer Res 2020; 81:945-955. [PMID: 33184107 DOI: 10.1158/0008-5472.can-20-1880] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 06/02/2020] [Revised: 09/29/2020] [Accepted: 11/09/2020] [Indexed: 11/16/2022]
Abstract
The Wilms' tumor 1 (WT1) gene is well known as a chameleon gene. It plays a role as a tumor suppressor in Wilms' tumor but also acts as an oncogene in other cancers. Previously, our group reported that a canonical AUG starting site for the WT1 protein (augWT1) acts as a tumor suppressor, whereas a CUG starting site for the WT1 protein (cugWT1) functions as an oncogene. In this study, we report an oncogenic role of cugWT1 in the AOM/DSS-induced colon cancer mouse model and in a urethane-induced lung cancer model in mice lacking cugWT1. Development of chemically-induced tumors was significantly depressed in cugWT1-deficient mice. Moreover, glycogen synthase kinase 3β promoted phosphorylation of cugWT1 at S64, resulting in ubiquitination and degradation of the cugWT1 associated with the F-box/WD repeat-containing protein 8 (FBXW8). Overall, our findings suggest that inhibition of cugWT1 expression provides a potential candidate target for therapy. SIGNIFICANCE: These findings demonstrate that CUG-translated WT1 plays an oncogenic role in vivo, and GSK3β-mediated phosphorylation of cugWT1 induces its ubiquitination and degradation in concert with FBXW8.
Affiliation(s)
- Hisae Yoshitomi
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Kun Y Lee
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Ke Yao
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Seung Ho Shin
- The Hormel Institute, University of Minnesota, Austin, Minnesota; Department of Food and Nutrition, Gyeongsang National University, Jinju, Republic of Korea; Institute of Agriculture and Life Science, Gyeongsang National University, Jinju, Republic of Korea
| | - Tianshun Zhang
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Qiushi Wang
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Souren Paul
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Eunmiri Roh
- The Hormel Institute, University of Minnesota, Austin, Minnesota; Department of Cosmetic Science, Gwangju Women's University, Gwangju, Republic of Korea
| | - Joohyun Ryu
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Hanyong Chen
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Faisal Aziz
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | | | - Ann M Bode
- The Hormel Institute, University of Minnesota, Austin, Minnesota
| | - Zigang Dong
- College of Medicine, Zhengzhou University, Henan, China.
| |
|
45
|
Szavits-Nossan J, Ciandrini L. Inferring efficiency of translation initiation and elongation from ribosome profiling. Nucleic Acids Res 2020; 48:9478-9490. [PMID: 32821926 PMCID: PMC7515720 DOI: 10.1093/nar/gkaa678] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 06/13/2020] [Revised: 07/29/2020] [Accepted: 08/15/2020] [Indexed: 01/13/2023] Open
Abstract
One of the main goals of ribosome profiling is to quantify the rate of protein synthesis at the level of translation. Here, we develop a method for inferring translation elongation kinetics from ribosome profiling data using recent advances in mathematical modelling of mRNA translation. Our method distinguishes between the elongation rate intrinsic to the ribosome’s stepping cycle and the actual elongation rate that takes into account ribosome interference. This distinction allows us to quantify the extent of ribosomal collisions along the transcript and identify individual codons where ribosomal collisions are likely. When examining ribosome profiling in yeast, we observe that translation initiation and elongation are close to their optima and traffic is minimized at the beginning of the transcript to favour ribosome recruitment. However, we find many individual sites of congestion along the mRNAs where the probability of ribosome interference can reach 50%. Our work provides new measures of translation initiation and elongation efficiencies, emphasizing the importance of rating these two stages of translation separately.
Affiliation(s)
- Juraj Szavits-Nossan
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| | - Luca Ciandrini
- Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ Montpellier, Montpellier 34090, France
| |
|
46
|
Higdon AL, Brar GA. Rules are made to be broken: a "simple" model organism reveals the complexity of gene regulation. Curr Genet 2020; 67:49-56. [PMID: 33130938 DOI: 10.1007/s00294-020-01121-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 09/18/2020] [Revised: 10/14/2020] [Accepted: 10/19/2020] [Indexed: 11/27/2022]
Abstract
Global methods for assaying translation have greatly improved our understanding of the protein-coding capacity of the genome. In particular, it is now possible to perform genome-wide and condition-specific identification of translation initiation sites through modified ribosome profiling methods that selectively capture initiating ribosomes. Here we discuss our recent study applying such an approach to meiotic and mitotic timepoints in the simple eukaryote, budding yeast, as an example of the surprising diversity of protein products-many of which are non-canonical-that can be revealed by such methods. We also highlight several key challenges in studying non-canonical protein isoforms that have precluded their prior systematic discovery. A growing body of work supports expanded use of empirical protein-coding region identification, which can help relieve some of the limitations and biases inherent to traditional genome annotation approaches. Our study also argues for the adoption of less static views of gene identity and a broader framework for considering the translational capacity of the genome.
Affiliation(s)
- Andrea L Higdon
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
| | - Gloria A Brar
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
- Center for Computational Biology, University of California, Berkeley, CA, 94720, USA.
| |
|
47
|
Li YR, Liu MJ. Prevalence of alternative AUG and non-AUG translation initiators and their regulatory effects across plants. Genome Res 2020; 30:1418-1433. [PMID: 32973042 PMCID: PMC7605272 DOI: 10.1101/gr.261834.120] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Received: 01/29/2020] [Accepted: 08/19/2020] [Indexed: 12/11/2022]
Abstract
Translation initiation is a key step determining protein synthesis. Studies have uncovered a number of alternative translation initiation sites (TISs) in mammalian mRNAs and showed their roles in reshaping the proteome. However, the extent to which alternative TISs affect gene expression across plants remains largely unclear. Here, by profiling initiating ribosome positions, we globally identified in vivo TISs in tomato and Arabidopsis and found thousands of genes with more than one TIS. Of the identified TISs, >19% and >20% were located at unannotated AUG and non-AUG sites, respectively. CUG and ACG were the most frequently observed codons at non-AUG TISs, a phenomenon also found in mammals. In addition, although alternative TISs were usually found in both orthologous genes, the TIS sequences were not conserved, suggesting the conservation of alternative initiation mechanisms but flexibility in using TISs. Unlike upstream AUG TISs, the presence of upstream non-AUG TISs was not correlated with the translational repression of main open reading frames, a pattern observed across plants. Also, the generation of proteins with diverse N-terminal regions through the use of alternative TISs contributes to differential subcellular localization, as mutating alternative TISs resulted in the loss of organelle localization. Our findings uncovered the hidden coding potential of plant genomes and, importantly, the constraint and flexibility of translational initiation mechanisms in the regulation of gene expression across plant species.
Affiliation(s)
- Ya-Ru Li
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 741, Taiwan
| | - Ming-Jung Liu
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 741, Taiwan; Agricultural Biotechnology Research Center, Academia Sinica, Taipei 115, Taiwan
| |
|
48
|
Unusually efficient CUG initiation of an overlapping reading frame in POLG mRNA yields novel protein POLGARF. Proc Natl Acad Sci U S A 2020; 117:24936-24946. [PMID: 32958672 DOI: 10.1073/pnas.2001433117] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Indexed: 01/01/2023] Open
Abstract
While near-cognate codons are frequently used for translation initiation in eukaryotes, their efficiencies are usually low (<10% compared to an AUG in optimal context). Here, we describe a rare case of highly efficient near-cognate initiation. A CUG triplet located in the 5' leader of POLG messenger RNA (mRNA) initiates almost as efficiently (∼60 to 70%) as an AUG in optimal context. This CUG directs translation of a conserved 260-triplet-long overlapping open reading frame (ORF), which we call POLGARF (POLG Alternative Reading Frame). Translation of a short upstream ORF 5' of this CUG governs the ratio between POLG (the catalytic subunit of mitochondrial DNA polymerase) and POLGARF synthesized from a single POLG mRNA. Functional investigation of POLGARF suggests a role in extracellular signaling. While unprocessed POLGARF localizes to the nucleoli together with its interacting partner C1QBP, serum stimulation results in rapid cleavage and secretion of a POLGARF C-terminal fragment. Phylogenetic analysis shows that POLGARF evolved ∼160 million years ago due to a mammalian-wide interspersed repeat (MIR) transposition into the 5' leader sequence of the mammalian POLG gene, which became fixed in placental mammals. This discovery of POLGARF unveils a previously undescribed mechanism of de novo protein-coding gene evolution.
|
49
|
uORFs: Important Cis-Regulatory Elements in Plants. Int J Mol Sci 2020; 21:6238. [PMID: 32872304 PMCID: PMC7503886 DOI: 10.3390/ijms21176238] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Received: 07/23/2020] [Revised: 08/20/2020] [Accepted: 08/22/2020] [Indexed: 11/17/2022] Open
Abstract
Gene expression is regulated at many levels, including mRNA transcription, translation, and post-translational modification. Compared with transcriptional regulation, mRNA translational control is a more critical step in gene expression and allows for more rapid changes of encoded protein concentrations in cells. Translation is highly regulated by complex interactions between cis-acting elements and trans-acting factors. Initiation is not only the first phase of translation, but also the core of translational regulation, because it limits the rate of protein synthesis. As potent cis-regulatory elements in eukaryotic mRNAs, upstream open reading frames (uORFs) generally inhibit the translation initiation of downstream major ORFs (mORFs) through ribosome stalling. During the past few years, with the development of RNA-seq and ribosome profiling, functional uORFs have been identified and characterized in many organisms. Here, we review uORF identification, uORF classification, and uORF-mediated translation initiation. More importantly, we summarize the translational regulation of uORFs in plant metabolic pathways, morphogenesis, disease resistance, and nutrient absorption, which opens up an avenue for precisely modulating plant growth and development, as well as environmental adaptation. We also discuss prospective applications of uORFs in plant breeding.
|
50
|
Eisenberg AR, Higdon AL, Hollerer I, Fields AP, Jungreis I, Diamond PD, Kellis M, Jovanovic M, Brar GA. Translation Initiation Site Profiling Reveals Widespread Synthesis of Non-AUG-Initiated Protein Isoforms in Yeast. Cell Syst 2020; 11:145-160.e5. [PMID: 32710835 PMCID: PMC7508262 DOI: 10.1016/j.cels.2020.06.011] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Received: 03/10/2020] [Revised: 05/18/2020] [Accepted: 06/24/2020] [Indexed: 12/27/2022]
Abstract
Genomic analyses in budding yeast have helped define the foundational principles of eukaryotic gene expression. However, in the absence of empirical methods for defining coding regions, these analyses have historically excluded specific classes of possible coding regions, such as those initiating at non-AUG start codons. Here, we applied an experimental approach to globally annotate translation initiation sites in yeast and identified 149 genes with alternative N-terminally extended protein isoforms initiating from near-cognate codons upstream of annotated AUG start codons. These isoforms are produced in concert with canonical isoforms and translated with high specificity, resulting from initiation at only a small subset of possible start codons. The non-AUG initiation driving their production is enriched during meiosis and induced by low eIF5A, which is seen in this context. These findings reveal widespread production of non-canonical protein isoforms and unexpected complexity to the rules by which even a simple eukaryotic genome is decoded.
Affiliation(s)
- Amy R Eisenberg
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Andrea L Higdon
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Ina Hollerer
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Alexander P Fields
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Irwin Jungreis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Paige D Diamond
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Manolis Kellis
- MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Gloria A Brar
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
|