1
|
Ren S, Li Y, Zhou Z. RiboParser/RiboShiny: an integrated platform for comprehensive analysis and visualization of Ribo-seq data. J Genet Genomics 2025:S1673-8527(25)00119-5. [PMID: 40268050 DOI: 10.1016/j.jgg.2025.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 04/16/2025] [Accepted: 04/16/2025] [Indexed: 04/25/2025]
Abstract
Translation is a crucial step in gene expression. Over the past decade, the development and application of Ribosome profiling (Ribo-seq) have significantly advanced our understanding of translational regulation in vivo. However, the analysis and visualization of Ribo-seq data remain challenging. Despite the availability of various analytical pipelines, improvements in comprehensiveness, accuracy, and user-friendliness are still necessary. In this study, we develop RiboParser/RiboShiny, a robust framework for analyzing and visualizing Ribo-seq data. Building on published methods, we optimize ribosome structure-based and start/stop-based models to improve the accuracy and stability of P-site detection, even in species with a high proportion of leaderless transcripts. Leveraging these improvements, RiboParser offers comprehensive analyses, including quality control, gene-level analysis, codon-level analysis, and the analysis of Ribo-seq variants. Meanwhile, RiboShiny provides a user-friendly and adaptable platform for data visualization, facilitating deeper insights into the translational landscape. Furthermore, the integration of standardized genome annotation renders our platform universally applicable to various organisms with sequenced genomes. This framework has the potential to significantly improve the precision and efficiency of Ribo-seq data interpretation, thereby deepening our understanding of translational regulation.
Collapse
Affiliation(s)
- Shuchao Ren
- National Key Laboratory of Agricultural Microbiology, College of Life Science, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Yinan Li
- National Key Laboratory of Agricultural Microbiology, College of Life Science, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Zhipeng Zhou
- National Key Laboratory of Agricultural Microbiology, College of Life Science, Huazhong Agricultural University, Wuhan, Hubei 430070, China.
| |
Collapse
|
2
|
Okubo C, Nakamura M, Sato M, Shichino Y, Mito M, Takashima Y, Iwasaki S, Takahashi K. EIF3D safeguards the homeostasis of key signaling pathways in human primed pluripotency. SCIENCE ADVANCES 2025; 11:eadq5484. [PMID: 40203091 PMCID: PMC11980838 DOI: 10.1126/sciadv.adq5484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Accepted: 03/04/2025] [Indexed: 04/11/2025]
Abstract
Although pluripotent stem cell (PSC) properties, such as differentiation and infinite proliferation, have been well documented within the frameworks of transcription factor networks, epigenomes, and signal transduction, they remain unclear and fragmented. Directing attention toward translational regulation as a bridge between these events can yield additional insights into previously unexplained mechanisms. Our functional CRISPR interference screen-based approach revealed that EIF3D, a translation initiation factor, is crucial for maintaining primed pluripotency. Loss of EIF3D disrupted the balance of pluripotency-associated signaling pathways, thereby compromising primed pluripotency. Moreover, EIF3D ensured robust proliferation by controlling the translation of various p53 regulators, which maintain low p53 activity in the undifferentiated state. In this way, EIF3D-mediated translation contributes to tuning the homeostasis of the primed pluripotency networks, ensuring the maintenance of an undifferentiated state with high proliferative potential. This study provides further insights into the translation network in maintaining pluripotency.
Collapse
Affiliation(s)
- Chikako Okubo
- Department of Life Science Frontiers, Center for iPS Cell Research and Application, Kyoto University, Kyoto, 606-8507, Japan
| | - Michiko Nakamura
- Department of Life Science Frontiers, Center for iPS Cell Research and Application, Kyoto University, Kyoto, 606-8507, Japan
| | - Masae Sato
- Department of Life Science Frontiers, Center for iPS Cell Research and Application, Kyoto University, Kyoto, 606-8507, Japan
| | - Yuichi Shichino
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Saitama, 351-0198, Japan
| | - Mari Mito
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Saitama, 351-0198, Japan
| | - Yasuhiro Takashima
- Department of Life Science Frontiers, Center for iPS Cell Research and Application, Kyoto University, Kyoto, 606-8507, Japan
| | - Shintaro Iwasaki
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Saitama, 351-0198, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8561, Japan
| | - Kazutoshi Takahashi
- Department of Life Science Frontiers, Center for iPS Cell Research and Application, Kyoto University, Kyoto, 606-8507, Japan
| |
Collapse
|
3
|
Fan X, Chang T, Chen C, Hafner M, Wang Z. Analysis of RNA translation with a deep learning architecture provides new insight into translation control. Nucleic Acids Res 2025; 53:gkaf277. [PMID: 40219965 PMCID: PMC11992669 DOI: 10.1093/nar/gkaf277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 02/20/2025] [Accepted: 04/01/2025] [Indexed: 04/14/2025] Open
Abstract
Accurate annotation of coding regions in RNAs is essential for understanding gene translation. We developed a deep neural network to directly predict and analyze translation initiation and termination sites from RNA sequences. Trained with human transcripts, our model learned hidden rules of translation control and achieved a near perfect prediction of canonical translation sites across entire human transcriptome. Surprisingly, this model revealed a new role of codon usage in regulating translation termination, which was experimentally validated. We also identified thousands of new open reading frames in mRNAs or lncRNAs, some of which were confirmed experimentally. The model trained with human mRNAs achieved high prediction accuracy of canonical translation sites in all eukaryotes and good prediction in polycistronic transcripts from prokaryotes or RNA viruses, suggesting a high degree of conservation in translation control. Collectively, we present TranslationAI (https://www.biosino.org/TranslationAI/), a general and efficient deep learning model for RNA translation that generates new insights into the complexity of translation regulation.
Collapse
Affiliation(s)
- Xiaojuan Fan
- Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- RNA Molecular Biology Laboratory, National Institute of Arthritis and Musculoskeletal and Skin Disease, Bethesda, MD 20814, United States
| | - Tiangen Chang
- Laboratory of Cancer Data Science, National Cancer Institute, Bethesda, MD 20814, United States
| | - Chuyun Chen
- Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Markus Hafner
- RNA Molecular Biology Laboratory, National Institute of Arthritis and Musculoskeletal and Skin Disease, Bethesda, MD 20814, United States
| | - Zefeng Wang
- Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- School of Life Science, Southern University of Science and Technology, Shenzhen, Guangdong, 518055, China
| |
Collapse
|
4
|
Zhang T, Li Z, Li J, Peng Y. Small open reading frame-encoded microproteins in cancer: identification, biological functions and clinical significance. Mol Cancer 2025; 24:105. [PMID: 40170020 PMCID: PMC11963466 DOI: 10.1186/s12943-025-02278-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2025] [Accepted: 02/24/2025] [Indexed: 04/03/2025] Open
Abstract
The human genome harbors approximately twenty thousand protein-coding genes, and a significant portion of life science research focuses on elucidating their functions and the underlying mechanisms. Recent studies have revealed that small open reading frame (sORF), originating from non-coding RNAs or the 5' leader sequences of messenger RNAs, can be translated into small peptides called microproteins through cap-dependent or cap-independent mechanisms. These microproteins interact with diverse molecular partners to modulate gene expression at multiple regulatory levels, thereby playing critical roles in various biological processes. Notably, sORF-encoded microproteins exhibit aberrant expression patterns in cancer and are implicated in tumor initiation and progression, expanding our understanding of cancer biology. In this review, we introduce the translational mechanisms and identification methods of microproteins, summarize their dysregulation in cancer and their biological functions in regulating gene expression, and emphasize their roles in driving hallmark events of cancer. Furthermore, we discuss their clinical significance as diagnostic and prognostic biomarkers, as well as therapeutic targets.
Collapse
Affiliation(s)
- Tingting Zhang
- Center for Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Zhang Li
- Center for Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Jiao Li
- Center for Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China.
| | - Yong Peng
- Center for Molecular Oncology, Frontiers Science Center for Disease-related Molecular Network, State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China.
| |
Collapse
|
5
|
Shiraishi T, Matsumoto A. From non-coding to coding: The importance of long non-coding RNA translation in de novo gene birth. Biochim Biophys Acta Gen Subj 2025; 1869:130747. [PMID: 39708923 DOI: 10.1016/j.bbagen.2024.130747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Revised: 12/09/2024] [Accepted: 12/11/2024] [Indexed: 12/23/2024]
Abstract
Recent emerging evidence demonstrates that some long non-coding RNAs (lncRNAs) can indeed be translated into functional polypeptides. These discoveries are pivotal for understanding de novo gene birth, the process by which new genes evolve from previously non-genic regions. In this review, we first introduce key methods, such as Ribo-seq and translation initiation site detection by translation complex analysis, for identifying coding sequences within lncRNAs and highlight examples of functional polypeptides derived from lncRNAs across species. These polypeptides play essential roles in maintaining cellular homeostasis and contribute to pathological processes, including cancer. However, because not all lncRNA-derived polypeptides appear to be functional, these lncRNAs may act as gene reservoirs. We also discuss how lncRNAs change their intracellular localization, how lncRNA-derived polypeptides evade immune surveillance, and how they gradually evolve into typical coding RNAs, providing evidence for the evolutionary model of de novo gene birth.
Collapse
Affiliation(s)
- Taichi Shiraishi
- Division of Biological Science, Graduate School of Science, Nagoya University, Nagoya 464-8602, Japan
| | - Akinobu Matsumoto
- Division of Biological Science, Graduate School of Science, Nagoya University, Nagoya 464-8602, Japan.
| |
Collapse
|
6
|
Chen S, Liu M, Yi W, Li H, Yu Q. Micropeptides derived from long non-coding RNAs: Computational analysis and functional roles in breast cancer and other diseases. Gene 2025; 935:149019. [PMID: 39461573 DOI: 10.1016/j.gene.2024.149019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 10/08/2024] [Accepted: 10/16/2024] [Indexed: 10/29/2024]
Abstract
Long non-coding RNAs (lncRNAs), once thought to be mere transcriptional noise, are now revealing a hidden code. Recent advancements like ribosome sequencing have unveiled that many lncRNAs harbor small open reading frames and can potentially encode functional micropeptides. Emerging research suggests these micropeptides, not the lncRNAs themselves, play crucial roles in regulating homeostasis, inflammation, metabolism, and especially in breast cancer progression. This review delves into the rapidly evolving computational tools used to predict and validate lncRNA-encoded micropeptides. We then explore the diverse functions and mechanisms of action of these micropeptides in breast cancer pathogenesis, with a focus on their roles in various species. Ultimately, this review aims to illuminate the functional landscape of lncRNA-encoded micropeptides and their potential as therapeutic targets in cancer.
Collapse
Affiliation(s)
- Saisai Chen
- Department of Breast Surgery, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei 230031, China
| | - Mengru Liu
- Department of Infection, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230000, China
| | - Weizhen Yi
- Department of Breast Surgery, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei 230031, China
| | - Huagang Li
- Department of Breast Surgery, The First Affiliated Hospital of Anhui University of Traditional Chinese Medicine, Hefei 230031, China
| | - Qingsheng Yu
- Institute of Chinese Medicine Surgery, Anhui Academy of Chinese Medicine, Hefei 230031, China.
| |
Collapse
|
7
|
Tang L, Xu D, Luo L, Ma W, He X, Diao Y, Ke R, Kapranov P. A novel human protein-coding locus identified using a targeted RNA enrichment technique. BMC Biol 2024; 22:273. [PMID: 39593153 PMCID: PMC11590353 DOI: 10.1186/s12915-024-02069-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Accepted: 11/12/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND Accurate and comprehensive genomic annotation, including the full list of protein-coding genes, is vital for understanding the molecular mechanisms of human biology. We have previously shown that the genome contains a multitude of yet hidden functional exons and transcripts, some of which might represent novel mRNAs. These results resonate with those from other groups and strongly argue that two decades after the completion of the first draft of the human genome sequence, the current annotation of human genes and transcripts remains far from being complete. RESULTS Using a targeted RNA enrichment technique, we showed that one of the novel functional exons previously discovered by us and currently annotated as part of a long non-coding RNA, is actually a part of a novel protein-coding gene, InSETG-4, which encodes a novel human protein with no known homologs or motifs. We found that InSETG-4 is induced by various DNA-damaging agents across multiple cell types and therefore might represent a novel component of DNA damage response. Despite its low abundance in bulk cell populations, InSETG-4 exhibited expression restricted to a small fraction of cells, as demonstrated by the amplification-based single-molecule fluorescence in situ hybridization (asmFISH) analysis. CONCLUSIONS This study argues that yet undiscovered human protein-coding genes exist and provides an example of how targeted RNA enrichment techniques can help to fill this major gap in our knowledge of the information encoded in the human genome.
Collapse
Affiliation(s)
- Lu Tang
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Dongyang Xu
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
| | - Lingcong Luo
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Weiyan Ma
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Xiaojie He
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yong Diao
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Rongqin Ke
- School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
| | - Philipp Kapranov
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
8
|
Xiao Y, Ren Y, Hu W, Paliouras AR, Zhang W, Zhong L, Yang K, Su L, Wang P, Li Y, Ma M, Shi L. Long non-coding RNA-encoded micropeptides: functions, mechanisms and implications. Cell Death Discov 2024; 10:450. [PMID: 39443468 PMCID: PMC11499885 DOI: 10.1038/s41420-024-02175-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 08/29/2024] [Accepted: 09/05/2024] [Indexed: 10/25/2024] Open
Abstract
Long non-coding RNAs (lncRNAs) are typically described as RNA transcripts exceeding 200 nucleotides in length, which do not code for proteins. Recent advancements in technology, including ribosome RNA sequencing and ribosome nascent-chain complex sequencing, have demonstrated that many lncRNAs retain small open reading frames and can potentially encode micropeptides. Emerging studies have revealed that these micropeptides, rather than lncRNAs themselves, are responsible for vital functions, including but not limited to regulating homeostasis, managing inflammation and the immune system, moderating metabolism, and influencing tumor progression. In this review, we initially outline the rapidly advancing computational analytical methods and public tools to predict and validate the potential encoding of lncRNAs. We then focus on the diverse functions of micropeptides and their underlying mechanisms in the pathogenesis of disease. This review aims to elucidate the functions of lncRNA-encoded micropeptides and explore their potential applications as therapeutic targets in cancer.
Collapse
Affiliation(s)
- Yinan Xiao
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Yaru Ren
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Wenteng Hu
- Thoracic surgery department, The First Hospital, Lanzhou University, Lanzhou, 730000, PR China
| | | | - Wenyang Zhang
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Linghui Zhong
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Kaixin Yang
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Li Su
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China
| | - Peng Wang
- College of Animal Science and Technology, Hebei North University, Zhangjiakou, 075131, PR China
| | - Yonghong Li
- NHC Key Laboratory of Diagnosis and Therapy of Gastrointestinal Tumor, Gansu Provincial Hospital, Lanzhou, 730000, PR China
| | - Minjie Ma
- Thoracic surgery department, The First Hospital, Lanzhou University, Lanzhou, 730000, PR China
| | - Lei Shi
- RNA Oncology Group, School of Public Health, Lanzhou University, Lanzhou, 730000, PR China.
| |
Collapse
|
9
|
Tzani I, Castro-Rivadeneyra M, Kelly P, Strasser L, Zhang L, Clynes M, Karger BL, Barron N, Bones J, Clarke C. Detection of host cell microprotein impurities in antibody drug products. Nat Commun 2024; 15:8605. [PMID: 39366928 PMCID: PMC11452709 DOI: 10.1038/s41467-024-51870-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 08/21/2024] [Indexed: 10/06/2024] Open
Abstract
Chinese hamster ovary (CHO) cells are used to produce almost 90% of therapeutic monoclonal antibodies (mAbs) and antibody fusion proteins (Fc-fusion). The annotation of non-canonical translation events in these cellular factories remains incomplete, limiting our ability to study CHO cell biology and detect host cell protein (HCP) impurities in the final antibody drug product. We utilised ribosome footprint profiling (Ribo-seq) to identify novel open reading frames (ORFs) including N-terminal extensions and thousands of short ORFs (sORFs) predicted to encode microproteins. Mass spectrometry-based HCP analysis of eight commercial antibody drug products (7 mAbs and 1 Fc-fusion protein) using the extended protein sequence database revealed the presence of microprotein impurities. We present evidence that microprotein abundance varies with growth phase and can be affected by the cell culture environment. In addition, our work provides a vital resource to facilitate future studies of non-canonical translation and the regulation of protein synthesis in CHO cell lines.
Collapse
Affiliation(s)
- Ioanna Tzani
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
| | - Marina Castro-Rivadeneyra
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
- School of Chemical and Bioprocess Engineering, University College Dublin, Belfield, Dublin, Ireland
| | - Paul Kelly
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
| | - Lisa Strasser
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
| | - Lin Zhang
- Bioprocess R&D, Pfizer Inc. Andover, Massachusetts, USA
| | - Martin Clynes
- National Institute for Cellular Biotechnology, Dublin City University, Dublin, Ireland
| | - Barry L Karger
- Barnett Institute, Northeastern University, 360 Huntington Ave, Boston, MA, USA
| | - Niall Barron
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
- School of Chemical and Bioprocess Engineering, University College Dublin, Belfield, Dublin, Ireland
| | - Jonathan Bones
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland
- School of Chemical and Bioprocess Engineering, University College Dublin, Belfield, Dublin, Ireland
| | - Colin Clarke
- National Institute for Bioprocessing Research and Training, Fosters Avenue, Blackrock, Co, Dublin, Ireland.
- School of Chemical and Bioprocess Engineering, University College Dublin, Belfield, Dublin, Ireland.
| |
Collapse
|
10
|
Vieira de Souza E, L Bookout A, Barnes CA, Miller B, Machado P, Basso LA, Bizarro CV, Saghatelian A. Rp3: Ribosome profiling-assisted proteogenomics improves coverage and confidence during microprotein discovery. Nat Commun 2024; 15:6839. [PMID: 39122697 PMCID: PMC11316118 DOI: 10.1038/s41467-024-50301-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 07/08/2024] [Indexed: 08/12/2024] Open
Abstract
There has been a dramatic increase in the identification of non-canonical translation and a significant expansion of the protein-coding genome. Among the strategies used to identify unannotated small Open Reading Frames (smORFs) that encode microproteins, Ribosome profiling (Ribo-Seq) is the gold standard for the annotation of novel coding sequences by reporting on smORF translation. In Ribo-Seq, ribosome-protected footprints (RPFs) that map to multiple genomic sites are removed since they cannot be unambiguously assigned to a specific genomic location. Furthermore, RPFs necessarily result in short (25-34 nucleotides) reads, increasing the chance of multi-mapping alignments, such that smORFs residing in these regions cannot be identified by Ribo-Seq. Moreover, it has been challenging to identify protein evidence for Ribo-Seq. To solve this, we developed Rp3, a pipeline that integrates proteogenomics and Ribosome profiling to provide unambiguous evidence for a subset of microproteins missed by current Ribo-Seq pipelines. Here, we show that Rp3 maximizes proteomics detection and confidence of microprotein-encoding smORFs.
Collapse
Affiliation(s)
- Eduardo Vieira de Souza
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil
- Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, 90616-900, Porto Alegre, Rio Grande do Sul, Brazil
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | | | | | - Brendan Miller
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Pablo Machado
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil
- Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, 90616-900, Porto Alegre, Rio Grande do Sul, Brazil
| | - Luiz A Basso
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil
- Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, 90616-900, Porto Alegre, Rio Grande do Sul, Brazil
| | - Cristiano V Bizarro
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil.
- Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, 90616-900, Porto Alegre, Rio Grande do Sul, Brazil.
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA.
| |
Collapse
|
11
|
Fan X, Chang T, Chen C, Hafner M, Wang Z. Analysis of RNA translation with a deep learning architecture provides new insight into translation control. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.08.548206. [PMID: 39005319 PMCID: PMC11244891 DOI: 10.1101/2023.07.08.548206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Accurate annotation of coding regions in RNAs is essential for understanding gene translation. We developed a deep neural network to directly predict and analyze translation initiation and termination sites from RNA sequences. Trained with human transcripts, our model learned hidden rules of translation control and achieved a near perfect prediction of canonical translation sites across entire human transcriptome. Surprisingly, this model revealed a new role of codon usage in regulating translation termination, which was experimentally validated. We also identified thousands of new open reading frames in mRNAs or lncRNAs, some of which were confirmed experimentally. The model trained with human mRNAs achieved high prediction accuracy of canonical translation sites in all eukaryotes and good prediction in polycistronic transcripts from prokaryotes or RNA viruses, suggesting a high degree of conservation in translation control. Collectively, we present a general and efficient deep learning model for RNA translation, generating new insights into the complexity of translation regulation.
Collapse
Affiliation(s)
- Xiaojuan Fan
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health
- RNA Molecular Biology Laboratory, National Institute of Arthritis and Musculoskeletal and Skin Disease, Bethesda, MD, USA
| | - Tiangen Chang
- Laboratory of Cancer Data Science, National Cancer Institute, Bethesda, MD, USA
| | - Chuyun Chen
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health
- University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Markus Hafner
- RNA Molecular Biology Laboratory, National Institute of Arthritis and Musculoskeletal and Skin Disease, Bethesda, MD, USA
| | - Zefeng Wang
- Bio-med Big Data Center, CAS Key Laboratory of Computational Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Nutrition and Health
- University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| |
Collapse
|
12
|
Fernandez SG, Ferguson L, Ingolia NT. Ribosome rescue factor PELOTA modulates translation start site choice for C/EBPα protein isoforms. Life Sci Alliance 2024; 7:e202302501. [PMID: 38803235 PMCID: PMC11109482 DOI: 10.26508/lsa.202302501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 04/15/2024] [Accepted: 04/16/2024] [Indexed: 05/29/2024] Open
Abstract
Translation initiation at alternative start sites can dynamically control the synthesis of two or more functionally distinct protein isoforms from a single mRNA. Alternate isoforms of the developmental transcription factor CCAAT/enhancer-binding protein α (C/EBPα) produced from different start sites exert opposing effects during myeloid cell development. This choice between alternative start sites depends on sequence features of the CEBPA transcript, including a regulatory uORF, but the molecular basis is not fully understood. Here, we identify the factors that affect C/EBPα isoform choice using a sensitive and quantitative two-color fluorescent reporter coupled with CRISPRi screening. Our screen uncovered a role of the ribosome rescue factor PELOTA (PELO) in promoting the expression of the longer C/EBPα isoform by directly removing inhibitory unrecycled ribosomes and through indirect effects mediated by the mechanistic target of rapamycin kinase. Our work uncovers further links between ribosome recycling and translation reinitiation that regulate a key transcription factor, with implications for normal hematopoiesis and leukemogenesis.
Collapse
Affiliation(s)
- Samantha G Fernandez
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
| | - Lucas Ferguson
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
- Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
| | - Nicholas T Ingolia
- Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
- Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley, CA, USA
| |
Collapse
|
13
|
Hacisuleyman E, Hale CR, Noble N, Luo JD, Fak JJ, Saito M, Chen J, Weissman JS, Darnell RB. Neuronal activity rapidly reprograms dendritic translation via eIF4G2:uORF binding. Nat Neurosci 2024; 27:822-835. [PMID: 38589584 PMCID: PMC11088998 DOI: 10.1038/s41593-024-01615-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 03/05/2024] [Indexed: 04/10/2024]
Abstract
Learning and memory require activity-induced changes in dendritic translation, but which mRNAs are involved and how they are regulated are unclear. In this study, to monitor how depolarization impacts local dendritic biology, we employed a dendritically targeted proximity labeling approach followed by crosslinking immunoprecipitation, ribosome profiling and mass spectrometry. Depolarization of primary cortical neurons with KCl or the glutamate agonist DHPG caused rapid reprogramming of dendritic protein expression, where changes in dendritic mRNAs and proteins are weakly correlated. For a subset of pre-localized messages, depolarization increased the translation of upstream open reading frames (uORFs) and their downstream coding sequences, enabling localized production of proteins involved in long-term potentiation, cell signaling and energy metabolism. This activity-dependent translation was accompanied by the phosphorylation and recruitment of the non-canonical translation initiation factor eIF4G2, and the translated uORFs were sufficient to confer depolarization-induced, eIF4G2-dependent translational control. These studies uncovered an unanticipated mechanism by which activity-dependent uORF translational control by eIF4G2 couples activity to local dendritic remodeling.
Collapse
Affiliation(s)
- Ezgi Hacisuleyman
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA.
| | - Caryn R Hale
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Natalie Noble
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA
| | - Ji-Dung Luo
- Bioinformatics Resource Center, The Rockefeller University, New York, NY, USA
| | - John J Fak
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA
| | - Misa Saito
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA
| | - Jin Chen
- Department of Pharmacology and Cecil H. and Ida Green Center for Reproductive Biology Sciences, The University of Texas Southwestern Medical Center, Dallas, TX, USA
- Altos Labs, Bay Area Institute of Science, Redwood City, CA, USA
| | - Jonathan S Weissman
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA.
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Robert B Darnell
- Laboratory of Molecular Neuro-oncology, The Rockefeller University, New York, NY, USA.
- Howard Hughes Medical Institute, The Rockefeller University, New York, NY, USA.
| |
Collapse
|
14
|
Higdon AL, Won NH, Brar GA. Truncated protein isoforms generate diversity of protein localization and function in yeast. Cell Syst 2024; 15:388-408.e4. [PMID: 38636458 PMCID: PMC11075746 DOI: 10.1016/j.cels.2024.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 01/21/2024] [Accepted: 03/20/2024] [Indexed: 04/20/2024]
Abstract
Genome-wide measurement of ribosome occupancy on mRNAs has enabled empirical identification of translated regions, but high-confidence detection of coding regions that overlap annotated coding regions has remained challenging. Here, we report a sensitive and robust algorithm that revealed the translation of 388 N-terminally truncated proteins in budding yeast-more than 30-fold more than previously known. We extensively experimentally validated them and defined two classes. The first class lacks large portions of the annotated protein and tends to be produced from a truncated transcript. We show that two such cases, Yap5truncation and Pus1truncation, have condition-specific regulation and distinct functions from their respective annotated isoforms. The second class of truncated protein isoforms lacks only a small region of the annotated protein and is less likely to be produced from an alternative transcript isoform. Many display different subcellular localizations than their annotated counterpart, representing a common strategy for dual localization of otherwise functionally identical proteins. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Andrea L Higdon
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Nathan H Won
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Gloria A Brar
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
15
|
Fijalkowski I, Snauwaert V, Van Damme P. Proteins à la carte: riboproteogenomic exploration of bacterial N-terminal proteoform expression. mBio 2024; 15:e0033324. [PMID: 38511928 PMCID: PMC11005335 DOI: 10.1128/mbio.00333-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 02/28/2024] [Indexed: 03/22/2024] Open
Abstract
In recent years, it has become evident that the true complexity of bacterial proteomes remains underestimated. Gene annotation tools are known to propagate biases and overlook certain classes of truly expressed proteins, particularly proteoforms-protein isoforms arising from a single gene. Recent (re-)annotation efforts heavily rely on ribosome profiling by providing a direct readout of translation to fully describe bacterial proteomes. In this study, we employ a robust riboproteogenomic pipeline to conduct a systematic census of expressed N-terminal proteoform pairs, representing two isoforms encoded by a single gene raised by annotated and alternative translation initiation, in Salmonella. Intriguingly, conditional-dependent changes in relative utilization of annotated and alternative translation initiation sites (TIS) were observed in several cases. This suggests that TIS selection is subject to regulatory control, adding yet another layer of complexity to our understanding of bacterial proteomes. IMPORTANCE With the emerging theme of genes within genes comprising the existence of alternative open reading frames (ORFs) generated by translation initiation at in-frame start codons, mechanisms that control the relative utilization of annotated and alternative TIS need to be unraveled and our molecular understanding of resulting proteoforms broadened. Utilizing complementary ribosome profiling strategies to map ORF boundaries, we uncovered dual-encoding ORFs generated by in-frame TIS usage in Salmonella. Besides demonstrating that alternative TIS usage may generate proteoforms with different characteristics, such as differential localization and specialized function, quantitative aspects of conditional retapamulin-assisted ribosome profiling (Ribo-RET) translation initiation maps offer unprecedented insights into the relative utilization of annotated and alternative TIS, enabling the exploration of gene regulatory mechanisms that control TIS usage and, consequently, the translation of N-terminal proteoform pairs.
Collapse
Affiliation(s)
- Igor Fijalkowski
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Valdes Snauwaert
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Petra Van Damme
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| |
Collapse
|
16
|
Peng Z, Li J, Jiang X, Wan C. sOCP: a framework predicting smORF coding potential based on TIS and in-frame features and effectively applied in the human genome. Brief Bioinform 2024; 25:bbae147. [PMID: 38600664 PMCID: PMC11006793 DOI: 10.1093/bib/bbae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 02/25/2024] [Accepted: 03/19/2024] [Indexed: 04/12/2024] Open
Abstract
Small open reading frames (smORFs) have been acknowledged to play various roles on essential biological pathways and affect human beings from diabetes to tumorigenesis. Predicting smORFs in silico is quite a prerequisite for processing the omics data. Here, we proposed the smORF-coding-potential-predicting framework, sOCP, which provides functions to construct a model for predicting novel smORFs in some species. The sOCP model constructed in human was based on in-frame features and the nucleotide bias around the start codon, and the small feature subset was proved to be competent enough and avoid overfitting problems for complicated models. It showed more advanced prediction metrics than previous methods and could correlate closely with experimental evidence in a heterogeneous dataset. The model was applied to Rattus norvegicus and exhibited satisfactory performance. We then scanned smORFs with ATG and non-ATG start codons from the human genome and generated a database containing about a million novel smORFs with coding potential. Around 72 000 smORFs are located on the lncRNA regions of the genome. The smORF-encoded peptides may be involved in biological pathways rare for canonical proteins, including glucocorticoid catabolic process and the prokaryotic defense system. Our work provides a model and database for human smORF investigation and a convenient tool for further smORF prediction in other species.
Collapse
Affiliation(s)
- Zhao Peng
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Jiaqiang Li
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Xingpeng Jiang
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Cuihong Wan
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| |
Collapse
|
17
|
Wu TY, Li YR, Chang KJ, Fang JC, Urano D, Liu MJ. Modeling alternative translation initiation sites in plants reveals evolutionarily conserved cis-regulatory codes in eukaryotes. Genome Res 2024; 34:272-285. [PMID: 38479836 PMCID: PMC10984385 DOI: 10.1101/gr.278100.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 02/15/2024] [Indexed: 03/22/2024]
Abstract
mRNA translation relies on identifying translation initiation sites (TISs) in mRNAs. Alternative TISs are prevalent across plant transcriptomes, but the mechanisms for their recognition are unclear. Using ribosome profiling and machine learning, we developed models for predicting alternative TISs in the tomato (Solanum lycopersicum). Distinct feature sets were predictive of AUG and nonAUG TISs in 5' untranslated regions and coding sequences, including a novel CU-rich sequence that promoted plant TIS activity, a translational enhancer found across dicots and monocots, and humans and viruses. Our results elucidate the mechanistic and evolutionary basis of TIS recognition, whereby cis-regulatory RNA signatures affect start site selection. The TIS prediction model provides global estimates of TISs to discover neglected protein-coding genes across plant genomes. The prevalence of cis-regulatory signatures across plant species, humans, and viruses suggests their broad and critical roles in reprogramming the translational landscape.
Collapse
Affiliation(s)
- Ting-Ying Wu
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei 11529, Taiwan;
| | - Ya-Ru Li
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
| | - Kai-Jyun Chang
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | - Jhen-Cheng Fang
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan
| | - Daisuke Urano
- Temasek Life Sciences Laboratory, Singapore 117604, Singapore
- Department of Biological Sciences, National University of Singapore, Singapore 117558, Singapore
| | - Ming-Jung Liu
- Biotechnology Center in Southern Taiwan, Academia Sinica, Tainan 711, Taiwan;
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
- Agricultural Biotechnology Research Center, Academia Sinica, Taipei 115, Taiwan
| |
Collapse
|
18
|
Yang H, Li Q, Stroup EK, Wang S, Ji Z. Widespread stable noncanonical peptides identified by integrated analyses of ribosome profiling and ORF features. Nat Commun 2024; 15:1932. [PMID: 38431639 PMCID: PMC10908861 DOI: 10.1038/s41467-024-46240-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 02/18/2024] [Indexed: 03/05/2024] Open
Abstract
Studies have revealed dozens of functional peptides in putative 'noncoding' regions and raised the question of how many proteins are encoded by noncanonical open reading frames (ORFs). Here, we comprehensively annotate genome-wide translated ORFs across five eukaryotes (human, mouse, zebrafish, worm, and yeast) by analyzing ribosome profiling data. We develop a logistic regression model named PepScore based on ORF features (expected length, encoded domain, and conservation) to calculate the probability that the encoded peptide is stable in humans. Systematic ectopic expression validates PepScore and shows that stable complex-associating microproteins can be encoded in 5'/3' untranslated regions and overlapping coding regions of mRNAs besides annotated noncoding RNAs. Stable noncanonical proteins follow conventional rules and localize to different subcellular compartments. Inhibition of proteasomal/lysosomal degradation pathways can stabilize some peptides especially those with moderate PepScores, but cannot rescue the expression of short ones with low PepScores suggesting they are directly degraded by cellular proteases. The majority of human noncanonical peptides with high PepScores show longer lengths but low conservation across species/mammals, and hundreds contain trait-associated genetic variants. Our study presents a statistical framework to identify stable noncanonical peptides in the genome and provides a valuable resource for functional characterization of noncanonical translation during development and disease.
Collapse
Affiliation(s)
- Haiwang Yang
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Qianru Li
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Emily K Stroup
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA
| | - Sheng Wang
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA
| | - Zhe Ji
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, IL, 60628, USA.
| |
Collapse
|
19
|
Wang J, Li Y, Li M, Zhang W, Lu Y, Hua K, Ling X, Chen T, Guo D, Yang Y, Zheng Z, Liu Q, Zhang B. Translatome and Transcriptome Analyses Reveal the Mechanism that Underlies the Enhancement of Salt Stress by the Small Peptide Ospep5 in Plants. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:4277-4291. [PMID: 38288993 DOI: 10.1021/acs.jafc.3c08528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
Salt stress significantly impedes plant growth and the crop yield. This study utilized de novo transcriptome assembly and ribosome profiling to explore mRNA translation's role in rice salt tolerance. We identified unrecognized translated open reading frames (ORFs), including 42 upstream transcripts and 86 unannotated transcripts. A noteworthy discovery was the role of a small ORF, Ospep5, in conferring salt tolerance. Overexpression of Ospep5 in plants increased salt tolerance, while its absence led to heightened sensitivity. This hypothesis was corroborated by the findings that exogenous application of the synthetic small peptide Ospep5 bolstered salt tolerance in both rice and Arabidopsis. We found that the mechanism underpinning the Ospep5-mediated salt tolerance involves the maintenance of intracellular Na+/K+ homeostasis, facilitated by upregulation of high-affinity potassium transporters (HKT) and Na+/H+ exchangers (SOS1). Furthermore, a comprehensive multiomics approach, particularly ribosome profiling, is instrumental in uncovering unannotated ORFs and elucidating their functions in plant stress responses.
Collapse
Affiliation(s)
- Jinyan Wang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Yang Li
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Mingyue Li
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Wenting Zhang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Yaping Lu
- Experimental center of Life Sciences, College of Life Sciences, Nanjing Agricultural University, Nanjing 210095, China
| | - Kai Hua
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai 201602, China
| | - Xitie Ling
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Tianzi Chen
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Dongshu Guo
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Yuwen Yang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Zhongbing Zheng
- School of Breeding and Multiplication (Sanya Institute of Breeding and Multiplication), Hainan University, Sanya 572025, China
| | - Qing Liu
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
- College of Agriculture, Yangzhou University, Yangzhou 225009, China
| | - Baolong Zhang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
- School of Breeding and Multiplication (Sanya Institute of Breeding and Multiplication), Hainan University, Sanya 572025, China
| |
Collapse
|
20
|
Wu HYL, Ai Q, Teixeira RT, Nguyen PHT, Song G, Montes C, Elmore JM, Walley JW, Hsu PY. Improved super-resolution ribosome profiling reveals prevalent translation of upstream ORFs and small ORFs in Arabidopsis. THE PLANT CELL 2024; 36:510-539. [PMID: 38000896 PMCID: PMC10896292 DOI: 10.1093/plcell/koad290] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/13/2023] [Accepted: 10/17/2023] [Indexed: 11/26/2023]
Abstract
A crucial step in functional genomics is identifying actively translated ORFs and linking them to biological functions. The challenge lies in identifying short ORFs, as their identification is greatly influenced by data quality and depth. Here, we improved the coverage of super-resolution Ribo-seq in Arabidopsis (Arabidopsis thaliana), revealing uncharacterized translation events for nuclear, chloroplastic, and mitochondrial genes. Assisted by a transcriptome assembly, we identified 7,751 unconventional translation events, comprising 6,996 upstream ORFs (uORFs) and 209 downstream ORFs on annotated protein-coding genes, as well as 546 ORFs in presumed noncoding RNAs. Proteomic data confirmed the production of stable proteins from some of these unannotated translation events. We present evidence of active translation from primary transcripts of trans-acting small interfering RNAs (TAS1-4) and microRNAs (pri-MIR163 and pri-MIR169) and periodic ribosome stalling supporting cotranslational decay. Additionally, we developed a method for identifying extremely short uORFs, including 370 minimum uORFs (AUG-stop), and 2,921 tiny uORFs (2 to 10 amino acids) and 681 uORFs that overlap with each other. Remarkably, these short uORFs exhibit strong translational repression as do longer uORFs. We also systematically discovered 594 uORFs regulated by alternative splicing, suggesting widespread isoform-specific translational control. Finally, these prevalent uORFs are associated with numerous important pathways. In summary, our improved Arabidopsis translational landscape provides valuable resources to study gene expression regulation.
Collapse
Affiliation(s)
- Hsin-Yen Larry Wu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Qiaoyun Ai
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Rita Teresa Teixeira
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Phong H T Nguyen
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Gaoyuan Song
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Christian Montes
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - J Mitch Elmore
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Justin W Walley
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Polly Yingshan Hsu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
21
|
Valdivia-Francia F, Sendoel A. No country for old methods: New tools for studying microproteins. iScience 2024; 27:108972. [PMID: 38333695 PMCID: PMC10850755 DOI: 10.1016/j.isci.2024.108972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2024] Open
Abstract
Microproteins encoded by small open reading frames (sORFs) have emerged as a fascinating frontier in genomics. Traditionally overlooked due to their small size, recent technological advancements such as ribosome profiling, mass spectrometry-based strategies and advanced computational approaches have led to the annotation of more than 7000 sORFs in the human genome. Despite the vast progress, only a tiny portion of these microproteins have been characterized and an important challenge in the field lies in identifying functionally relevant microproteins and understanding their role in different cellular contexts. In this review, we explore the recent advancements in sORF research, focusing on the new methodologies and computational approaches that have facilitated their identification and functional characterization. Leveraging these new tools hold great promise for dissecting the diverse cellular roles of microproteins and will ultimately pave the way for understanding their role in the pathogenesis of diseases and identifying new therapeutic targets.
Collapse
Affiliation(s)
- Fabiola Valdivia-Francia
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
- Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ ETH Zurich, Schlieren-Zurich, Switzerland
| | - Ataman Sendoel
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
| |
Collapse
|
22
|
Mao Y, Qian SB. Making sense of mRNA translational "noise". Semin Cell Dev Biol 2024; 154:114-122. [PMID: 36925447 PMCID: PMC10500040 DOI: 10.1016/j.semcdb.2023.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 03/09/2023] [Accepted: 03/09/2023] [Indexed: 03/15/2023]
Abstract
The importance of translation fidelity has been apparent since the discovery of genetic code. It is commonly believed that translation deviating from the main coding region is to be avoided at all times inside cells. However, ribosome profiling and mass spectrometry have revealed pervasive noncanonical translation. Both the scope and origin of translational "noise" are just beginning to be appreciated. Although largely overlooked, those translational "noises" are associated with a wide range of cellular functions, such as producing unannotated protein products. Furthermore, the dynamic nature of translational "noise" is responsive to stress conditions, highlighting the beneficial effect of translational "noise" in stress adaptation. Mechanistic investigation of translational "noise" will provide better insight into the mechanisms of translational regulation. Ultimately, they are not "noise" at all but represent a signature of cellular activities under pathophysiological conditions. Deciphering translational "noise" holds the therapeutic and diagnostic potential in a wide spectrum of human diseases.
Collapse
Affiliation(s)
- Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
23
|
Lei T, Chang Y, Yao C, Zhang H. A systematic evaluation of computational methods for predicting translated non-canonical ORFs from ribosome profiling data. J Genet Genomics 2024; 51:105-108. [PMID: 37673174 DOI: 10.1016/j.jgg.2023.08.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 08/15/2023] [Accepted: 08/30/2023] [Indexed: 09/08/2023]
Affiliation(s)
- Tianyu Lei
- College of Ecology, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Yue Chang
- College of Ecology, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Chao Yao
- Cuiying Honors College, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Hong Zhang
- College of Ecology, Lanzhou University, Lanzhou, Gansu 730000, China.
| |
Collapse
|
24
|
Kore H, Datta KK, Nagaraj SH, Gowda H. Protein-coding potential of non-canonical open reading frames in human transcriptome. Biochem Biophys Res Commun 2023; 684:149040. [PMID: 37897910 DOI: 10.1016/j.bbrc.2023.09.068] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 09/09/2023] [Accepted: 09/23/2023] [Indexed: 10/30/2023]
Abstract
In recent years, proteogenomics and ribosome profiling studies have identified a large number of proteins encoded by noncoding regions in the human genome. They are encoded by small open reading frames (sORFs) in the untranslated regions (UTRs) of mRNAs and long non-coding RNAs (lncRNAs). These sORF encoded proteins (SEPs) are often <150AA and show poor evolutionary conservation. A subset of them have been functionally characterized and shown to play an important role in fundamental biological processes including cardiac and muscle function, DNA repair, embryonic development and various human diseases. How many novel protein-coding regions exist in the human genome and what fraction of them are functionally important remains a mystery. In this review, we discuss current progress in unraveling SEPs, approaches used for their identification, their limitations and reliability of these identifications. We also discuss functionally characterized SEPs and their involvement in various biological processes and diseases. Lastly, we provide insights into their distinctive features compared to canonical proteins and challenges associated with annotating these in protein reference databases.
Collapse
Affiliation(s)
- Hitesh Kore
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia.
| | - Keshava K Datta
- Proteomics and Metabolomics Platform, La Trobe University, Melbourne, VIC, 3083, Australia
| | - Shivashankar H Nagaraj
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia
| | - Harsha Gowda
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Cancer Precision Medicine Group, QIMR Berghofer Medical Research Institute, 300 Herston Road, Herston, Queensland, 4006, Australia; Faculty of Health, Queensland University of Technology, Brisbane, Queensland, 4059, Australia; Faculty of Medicine, The University of Queensland, Queensland, 4072, Australia.
| |
Collapse
|
25
|
Xu D, Tang L, Zhou J, Wang F, Cao H, Huang Y, Kapranov P. Evidence for widespread existence of functional novel and non-canonical human transcripts. BMC Biol 2023; 21:271. [PMID: 38001496 PMCID: PMC10675921 DOI: 10.1186/s12915-023-01753-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 10/31/2023] [Indexed: 11/26/2023] Open
Abstract
BACKGROUND Fraction of functional sequence in the human genome remains a key unresolved question in Biology and the subject of vigorous debate. While a plethora of studies have connected a significant fraction of human DNA to various biochemical processes, the classical definition of function requires evidence of effects on cellular or organismal fitness that such studies do not provide. Although multiple high-throughput reverse genetics screens have been developed to address this issue, they are limited to annotated genomic elements and suffer from non-specific effects, arguing for a strong need to develop additional functional genomics approaches. RESULTS In this work, we established a high-throughput lentivirus-based insertional mutagenesis strategy as a forward genetics screen tool in aneuploid cells. Application of this approach to human cell lines in multiple phenotypic screens suggested the presence of many yet uncharacterized functional elements in the human genome, represented at least in part by novel exons of known and novel genes. The novel transcripts containing these exons can be massively, up to thousands-fold, induced by specific stresses, and at least some can represent bi-cistronic protein-coding mRNAs. CONCLUSIONS Altogether, these results argue that many unannotated and non-canonical human transcripts, including those that appear as aberrant splice products, have biological relevance under specific biological conditions.
Collapse
Affiliation(s)
- Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Lu Tang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Junjun Zhou
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yu Huang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
26
|
de Souza EV, Bookout AL, Barnes CA, Miller B, Machado P, Basso LA, Bizarro CV, Saghatelian A. The Integration of Proteogenomics and Ribosome Profiling Circumvents Key Limitations to Increase the Coverage and Confidence of Novel Microproteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.27.559809. [PMID: 37808637 PMCID: PMC10557729 DOI: 10.1101/2023.09.27.559809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Abstract
There has been a dramatic increase in the identification of non-conical translation and a significant expansion of the protein-coding genome and proteome. Among the strategies used to identify novel small ORFs (smORFs), Ribosome profiling (Ribo-Seq) is the gold standard for the annotation of novel coding sequences by reporting on smORF translation. In Ribo-Seq, ribosome-protected footprints (RPFs) that map to multiple sites in the genome are computationally removed since they cannot unambiguously be assigned to a specific genomic location, or to a specific transcript in the case of multiple isoforms. Furthermore, RPFs necessarily result in short (25-34 nucleotides) reads, increasing the chance of ambiguous and multi-mapping alignments, such that smORFs that reside in these regions cannot be identified by Ribo-Seq. Here, we show that the inclusion of proteogenomics to create a Ribosome Profiling and Proteogenomics Pipeline (RP3) bypasses this limitation to identify a group of microprotein-encoding smORFs that are missed by current Ribo-Seq pipelines. Moreover, we show that the microproteins identified by RP3 have different sequence compositions from the ones identified by Ribo-Seq-only pipelines, which can affect proteomics identification. In aggregate, the development of RP3 maximizes the detection and confidence of protein-encoding smORFs and microproteins.
Collapse
|
27
|
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Moritz RL, Deutsch EW, van Heesch S. What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome? Mol Cell Proteomics 2023; 22:100631. [PMID: 37572790 PMCID: PMC10506109 DOI: 10.1016/j.mcpro.2023.100631] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 07/21/2023] [Accepted: 08/08/2023] [Indexed: 08/14/2023] Open
Abstract
Ribosome profiling (Ribo-Seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of noncanonical sites of ribosome translation outside the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7000 noncanonical ORFs are translated, which, at first glance, has the potential to expand the number of human protein CDSs by 30%, from ∼19,500 annotated CDSs to over 26,000 annotated CDSs. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of noncanonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome but searching for guidance on how to proceed. Here, we discuss the current state of noncanonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein coding."
Collapse
Affiliation(s)
- John R Prensner
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan, USA.
| | | | - Leron W Kok
- Princess Máxima Center for Pediatric Oncology, Utrecht, The Netherlands
| | - Karl R Clauser
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Jonathan M Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, Agora Center Bugnon 25A, University of Lausanne, Lausanne, Switzerland; Department of Oncology, Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland; Agora Cancer Research Centre, Lausanne, Switzerland
| | - Robert L Moritz
- Institute for Systems Biology (ISB), Seattle, Washington, USA
| | - Eric W Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington, USA
| | | |
Collapse
|
28
|
Zhang Y, Wang X, Zhang C, Yi H. The dysregulation of lncRNAs by epigenetic factors in human pathologies. Drug Discov Today 2023; 28:103664. [PMID: 37348827 DOI: 10.1016/j.drudis.2023.103664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/24/2023]
Abstract
Dysregulation of long noncoding RNAs (lncRNAs) contributes to numerous human diseases, including cancers and autoimmune diseases (ADs). Given the importance of lncRNAs in disease initiation and progression, a deeper understanding of their complex regulatory network is required to facilitate their use as therapeutic targets for ADs. In this review, we summarize how lncRNAs are dysregulated in pathological states by epigenetic factors, including RNA-binding proteins, chemical modifications (N6-methyladenosine, 5-methylcytosine, 7-methylguanosine, adenosine-to-inosine editing, microRNA, alternative splicing, DNA methylation, and histone modification). Moreover, the roles of lncRNA epigenetic regulators in immune response and ADs are discussed, providing new insights into the complicated epigenetic factor-lncRNA network, thus, laying a theoretical foundation for future research and clinical application of lncRNAs.
Collapse
Affiliation(s)
- Yanli Zhang
- Central Laboratory, The First Hospital of Jilin University, Changchun, Jilin, China; Key Laboratory of Organ Regeneration and Transplantation, Ministry of Education, Changchun, Jilin 130021, China; Department of Echocardiography, The First Hospital of Jilin University, Changchun, Jilin, China
| | - Xiaocong Wang
- Department of Echocardiography, The First Hospital of Jilin University, Changchun, Jilin, China
| | - Chen Zhang
- Colorectal and Anal Surgery, General Surgery Center, The First Hospital of Jilin University, Changchun, Jilin, China
| | - Huanfa Yi
- Central Laboratory, The First Hospital of Jilin University, Changchun, Jilin, China; Key Laboratory of Organ Regeneration and Transplantation, Ministry of Education, Changchun, Jilin 130021, China.
| |
Collapse
|
29
|
Feng YZ, Zhu QF, Xue J, Chen P, Yu Y. Shining in the dark: the big world of small peptides in plants. ABIOTECH 2023; 4:238-256. [PMID: 37970469 PMCID: PMC10638237 DOI: 10.1007/s42994-023-00100-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 02/24/2023] [Indexed: 11/17/2023]
Abstract
Small peptides represent a subset of dark matter in plant proteomes. Through differential expression patterns and modes of action, small peptides act as important regulators of plant growth and development. Over the past 20 years, many small peptides have been identified due to technical advances in genome sequencing, bioinformatics, and chemical biology. In this article, we summarize the classification of plant small peptides and experimental strategies used to identify them as well as their potential use in agronomic breeding. We review the biological functions and molecular mechanisms of small peptides in plants, discuss current problems in small peptide research and highlight future research directions in this field. Our review provides crucial insight into small peptides in plants and will contribute to a better understanding of their potential roles in biotechnology and agriculture.
Collapse
Affiliation(s)
- Yan-Zhao Feng
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Qing-Feng Zhu
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Jiao Xue
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Pei Chen
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Yang Yu
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| |
Collapse
|
30
|
Wang X, Zhang Z, Shi C, Wang Y, Zhou T, Lin A. Clinical prospects and research strategies of long non-coding RNA encoding micropeptides. Zhejiang Da Xue Xue Bao Yi Xue Ban 2023; 52:397-405. [PMID: 37643974 PMCID: PMC10495248 DOI: 10.3724/zdxbyxb-2023-0128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 07/20/2023] [Indexed: 08/12/2023]
Abstract
Long non-coding RNAs (lncRNAs) which are usually thought to have no protein coding ability, are widely involved in cell proliferation, signal transduction and other biological activities. However, recent studies have suggested that short open reading frames (sORFs) of some lncRNAs can encode small functional peptides (micropeptides). These micropeptides appear to play important roles in calcium homeostasis, embryonic development and tumorigenesis, suggesting their potential as therapeutic targets and diagnostic biomarkers. Currently, bioinformatic tools as well as experimental methods such as ribosome mapping and in vitro translation are applied to predict the coding potential of lncRNAs. Furthermore, mass spectrometry, specific antibodies and epitope tags are used for validating the expression of micropeptides. Here, we review the physiological and pathological functions of recently identified micropeptides as well as research strategies for predicting the coding potential of lncRNAs to facilitate the further research of lncRNA encoded micropeptides.
Collapse
Affiliation(s)
- Xinyi Wang
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China.
- Zhejiang University Cancer Center, Hangzhou 310058, China.
| | - Zhen Zhang
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China
- Zhejiang University Cancer Center, Hangzhou 310058, China
| | - Chengyu Shi
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China
- Zhejiang University Cancer Center, Hangzhou 310058, China
| | - Ying Wang
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China
- Zhejiang University Cancer Center, Hangzhou 310058, China
| | - Tianhua Zhou
- Zhejiang University Cancer Center, Hangzhou 310058, China.
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Center for RNA Medicine, International Institutes of Medicine, Zhejiang University, Jinhua 322000, Zhejiang Province, China.
- Department of Cell Biology, Zhejiang University School of Medicine, Hangzhou 310058, China.
| | - Aifu Lin
- College of Life Sciences, Zhejiang University, Hangzhou 310058, China.
- Zhejiang University Cancer Center, Hangzhou 310058, China.
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Center for RNA Medicine, International Institutes of Medicine, Zhejiang University, Jinhua 322000, Zhejiang Province, China.
| |
Collapse
|
31
|
Higdon AL, Won NH, Brar GA. Truncated protein isoforms generate diversity of protein localization and function in yeast. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548938. [PMID: 37503254 PMCID: PMC10369987 DOI: 10.1101/2023.07.13.548938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Genome-wide measurements of ribosome occupancy on mRNA transcripts have enabled global empirical identification of translated regions. These approaches have revealed an unexpected diversity of protein products, but high-confidence identification of new coding regions that entirely overlap annotated coding regions - including those that encode truncated protein isoforms - has remained challenging. Here, we develop a sensitive and robust algorithm focused on identifying N-terminally truncated proteins genome-wide, identifying 388 truncated protein isoforms, a more than 30-fold increase in the number known in budding yeast. We perform extensive experimental validation of these truncated proteins and define two general classes. The first set lack large portions of the annotated protein sequence and tend to be produced from a truncated transcript. We show two such cases, Yap5 truncation and Pus1 truncation , to have condition-specific regulation and functions that appear distinct from their respective annotated isoforms. The second set of N-terminally truncated proteins lack only a small region of the annotated protein and are less likely to be regulated by an alternative transcript isoform. Many localize to different subcellular compartments than their annotated counterpart, representing a common strategy for achieving dual localization of otherwise functionally identical proteins.
Collapse
|
32
|
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World. Int J Mol Sci 2023; 24:10562. [PMID: 37445739 DOI: 10.3390/ijms241310562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023] Open
Abstract
Small open reading frames (sORFs) are often overlooked features in genomes. In the past, they were labeled as noncoding or "transcriptional noise". However, accumulating evidence from recent years suggests that sORFs may be transcribed and translated to produce sORF-encoded polypeptides (SEPs) with less than 100 amino acids. The vigorous development of computational algorithms, ribosome profiling, and peptidome has facilitated the prediction and identification of many new SEPs. These SEPs were revealed to be involved in a wide range of basic biological processes, such as gene expression regulation, embryonic development, cellular metabolism, inflammation, and even carcinogenesis. To effectively understand the potential biological functions of SEPs, we discuss the history and development of the newly emerging research on sORFs and SEPs. In particular, we review a range of recently discovered bioinformatics tools for identifying, predicting, and validating SEPs as well as a variety of biochemical experiments for characterizing SEP functions. Lastly, this review underlines the challenges and future directions in identifying and validating sORFs and their encoded micropeptides, providing a significant reference for upcoming research on sORF-encoded peptides.
Collapse
Affiliation(s)
- Xiaoping Dong
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Kun Zhang
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Chengfeng Xun
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Tianqi Chu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Songping Liang
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Yong Zeng
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Zhonghua Liu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| |
Collapse
|
33
|
Yanaizu M, Adachi H, Araki M, Kontani K, Kino Y. Translational regulation and protein-coding capacity of the 5' untranslated region of human TREM2. Commun Biol 2023; 6:616. [PMID: 37291187 PMCID: PMC10250343 DOI: 10.1038/s42003-023-04998-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 05/30/2023] [Indexed: 06/10/2023] Open
Abstract
TREM2 is a transmembrane receptor expressed in microglia and macrophages. Elevated TREM2 levels in these cells are associated with age-related pathological conditions, including Alzheimer's disease. However, the regulatory mechanism underlying the protein expression of TREM2 remains unclear. In this study, we uncover the role of the 5' untranslated region (5'-UTR) of human TREM2 in translation. An upstream start codon (uAUG) in the 5'-UTR of TREM2 is specific to some primates, including humans. The expression of the conventional TREM2 protein, starting from the downstream AUG (dTREM2), is repressed by the 5'-UTR in a uAUG-mediated manner. We also detect a TREM2 protein isoform starting from uAUG (uTREM2) that is largely degraded by proteasomes. Finally, the 5'-UTR is essential for the downregulation of dTREM2 expression in response to amino acid starvation. Collectively, our study identifies a species-specific regulatory role of the 5'-UTR in TREM2 translation.
Collapse
Affiliation(s)
- Motoaki Yanaizu
- Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan
- Department of RNA Pathobiology and Therapeutics, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan
| | - Haruka Adachi
- Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan
| | - Makoto Araki
- Department of Biochemistry, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan
| | - Kenji Kontani
- Department of Biochemistry, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan
| | - Yoshihiro Kino
- Department of Bioinformatics and Molecular Neuropathology, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan.
- Department of RNA Pathobiology and Therapeutics, Meiji Pharmaceutical University, 2-522-1, Noshio, Kiyose-shi, Tokyo, 204-8588, Japan.
| |
Collapse
|
34
|
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, Bassani-Sternberg M, Deutsch EW, van Heesch S. What can Ribo-seq and proteomics tell us about the non-canonical proteome? BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.16.541049. [PMID: 37292611 PMCID: PMC10245706 DOI: 10.1101/2023.05.16.541049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Ribosome profiling (Ribo-seq) has proven transformative for our understanding of the human genome and proteome by illuminating thousands of non-canonical sites of ribosome translation outside of the currently annotated coding sequences (CDSs). A conservative estimate suggests that at least 7,000 non-canonical open reading frames (ORFs) are translated, which, at first glance, has the potential to expand the number of human protein-coding sequences by 30%, from ∼19,500 annotated CDSs to over 26,000. Yet, additional scrutiny of these ORFs has raised numerous questions about what fraction of them truly produce a protein product and what fraction of those can be understood as proteins according to conventional understanding of the term. Adding further complication is the fact that published estimates of non-canonical ORFs vary widely by around 30-fold, from several thousand to several hundred thousand. The summation of this research has left the genomics and proteomics communities both excited by the prospect of new coding regions in the human genome, but searching for guidance on how to proceed. Here, we discuss the current state of non-canonical ORF research, databases, and interpretation, focusing on how to assess whether a given ORF can be said to be "protein-coding". In brief The human genome encodes thousands of non-canonical open reading frames (ORFs) in addition to protein-coding genes. As a nascent field, many questions remain regarding non-canonical ORFs. How many exist? Do they encode proteins? What level of evidence is needed for their verification? Central to these debates has been the advent of ribosome profiling (Ribo-seq) as a method to discern genome-wide ribosome occupancy, and immunopeptidomics as a method to detect peptides that are processed and presented by MHC molecules and not observed in traditional proteomics experiments. This article provides a synthesis of the current state of non-canonical ORF research and proposes standards for their future investigation and reporting. Highlights Combined use of Ribo-seq and proteomics-based methods enables optimal confidence in detecting non-canonical ORFs and their protein products.Ribo-seq can provide more sensitive detection of non-canonical ORFs, but data quality and analytical pipelines will impact results.Non-canonical ORF catalogs are diverse and span both high-stringency and low-stringency ORF nominations.A framework for standardized non-canonical ORF evidence will advance the research field.
Collapse
Affiliation(s)
- John R. Prensner
- Department of Pediatrics, Division of Pediatric Hematology/Oncology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | | | - Leron W. Kok
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| | - Karl R. Clauser
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Jonathan M. Mudge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Michal Bassani-Sternberg
- Ludwig Institute for Cancer Research, University of Lausanne, Agora Center Bugnon 25A, 1005 Lausanne, Switzerland
- Department of Oncology, Centre hospitalier universitaire vaudois (CHUV), Rue du Bugnon 46, 1005 Lausanne, Switzerland
- Agora Cancer Research Centre, 1011 Lausanne, Switzerland
| | - Eric W. Deutsch
- Institute for Systems Biology (ISB), Seattle, Washington 98109, USA
| | - Sebastiaan van Heesch
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584 CS, Utrecht, the Netherlands
| |
Collapse
|
35
|
Tornini VA, Miao L, Lee HJ, Gerson T, Dube SE, Schmidt V, Kroll F, Tang Y, Du K, Kuchroo M, Vejnar CE, Bazzini AA, Krishnaswamy S, Rihel J, Giraldez AJ. linc-mipep and linc-wrb encode micropeptides that regulate chromatin accessibility in vertebrate-specific neural cells. eLife 2023; 12:e82249. [PMID: 37191016 PMCID: PMC10188112 DOI: 10.7554/elife.82249] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 04/14/2023] [Indexed: 05/17/2023] Open
Abstract
Thousands of long intergenic non-coding RNAs (lincRNAs) are transcribed throughout the vertebrate genome. A subset of lincRNAs enriched in developing brains have recently been found to contain cryptic open-reading frames and are speculated to encode micropeptides. However, systematic identification and functional assessment of these transcripts have been hindered by technical challenges caused by their small size. Here, we show that two putative lincRNAs (linc-mipep, also called lnc-rps25, and linc-wrb) encode micropeptides with homology to the vertebrate-specific chromatin architectural protein, Hmgn1, and demonstrate that they are required for development of vertebrate-specific brain cell types. Specifically, we show that NMDA receptor-mediated pathways are dysregulated in zebrafish lacking these micropeptides and that their loss preferentially alters the gene regulatory networks that establish cerebellar cells and oligodendrocytes - evolutionarily newer cell types that develop postnatally in humans. These findings reveal a key missing link in the evolution of vertebrate brain cell development and illustrate a genetic basis for how some neural cell types are more susceptible to chromatin disruptions, with implications for neurodevelopmental disorders and disease.
Collapse
Affiliation(s)
| | - Liyun Miao
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Ho-Joon Lee
- Department of Genetics, Yale UniversityNew HavenUnited States
- Yale Center for Genome Analysis, Yale UniversityNew HavenUnited States
| | - Timothy Gerson
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Sarah E Dube
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Valeria Schmidt
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - François Kroll
- Department of Cell and Developmental Biology, University College LondonLondonUnited Kingdom
| | - Yin Tang
- Department of Genetics, Yale UniversityNew HavenUnited States
| | - Katherine Du
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | - Manik Kuchroo
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | | | - Ariel Alejandro Bazzini
- Stowers Institute for Medical ResearchKansas CityUnited States
- Department of Molecular & Integrative Physiology, University of Kansas School of MedicineKansas CityUnited States
| | - Smita Krishnaswamy
- Department of Genetics, Yale UniversityNew HavenUnited States
- Department of Computer Science, Yale UniversityNew HavenUnited States
| | - Jason Rihel
- Department of Cell and Developmental Biology, University College LondonLondonUnited Kingdom
| | - Antonio J Giraldez
- Department of Genetics, Yale UniversityNew HavenUnited States
- Yale Stem Cell Center, Yale University School of MedicineNew HavenUnited States
- Yale Cancer Center, Yale University School of MedicineNew HavenUnited States
| |
Collapse
|
36
|
Li L, Shu XS, Geng H, Ying J, Guo L, Luo J, Xiang T, Wu L, Ma BBY, Chan ATC, Zhu X, Ambinder RF, Tao Q. A novel tumor suppressor encoded by a 1p36.3 lncRNA functions as a phosphoinositide-binding protein repressing AKT phosphorylation/activation and promoting autophagy. Cell Death Differ 2023; 30:1166-1183. [PMID: 36813924 PMCID: PMC10154315 DOI: 10.1038/s41418-023-01129-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 01/30/2023] [Accepted: 02/02/2023] [Indexed: 02/24/2023] Open
Abstract
Peptides/small proteins, encoded by noncanonical open reading frames (ORF) of previously claimed non-coding RNAs, have recently been recognized possessing important biological functions, but largely uncharacterized. 1p36 is an important tumor suppressor gene (TSG) locus frequently deleted in multiple cancers, with critical TSGs like TP73, PRDM16, and CHD5 already validated. Our CpG methylome analysis identified a silenced 1p36.3 gene KIAA0495, previously thought coding long non-coding RNA. We found that the open reading frame 2 of KIAA0495 is actually protein-coding and translating, encoding a small protein SP0495. KIAA0495 transcript is broadly expressed in multiple normal tissues, but frequently silenced by promoter CpG methylation in multiple tumor cell lines and primary tumors including colorectal, esophageal and breast cancers. Its downregulation/methylation is associated with poor survival of cancer patients. SP0495 induces tumor cell apoptosis, cell cycle arrest, senescence and autophagy, and inhibits tumor cell growth in vitro and in vivo. Mechanistically, SP0495 binds to phosphoinositides (PtdIns(3)P, PtdIns(3,5)P2) as a lipid-binding protein, inhibits AKT phosphorylation and its downstream signaling, and further represses oncogenic AKT/mTOR, NF-κB, and Wnt/β-catenin signaling. SP0495 also regulates the stability of autophagy regulators BECN1 and SQSTM1/p62 through modulating phosphoinositides turnover and autophagic/proteasomal degradation. Thus, we discovered and validated a 1p36.3 small protein SP0495, functioning as a novel tumor suppressor regulating AKT signaling activation and autophagy as a phosphoinositide-binding protein, being frequently inactivated by promoter methylation in multiple tumors as a potential biomarker.
Collapse
Affiliation(s)
- Lili Li
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
| | - Xing-Sheng Shu
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- School of Medicine, Shenzhen University Health Science Center, Shenzhen, China
| | - Hua Geng
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Jianming Ying
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
- Department of Pathology, Cancer Hospital, Peking Union Medical College & Chinese Academy of Medical Sciences, Beijing, China
| | - Lei Guo
- Department of Pathology, Cancer Hospital, Peking Union Medical College & Chinese Academy of Medical Sciences, Beijing, China
| | - Jie Luo
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Tingxiu Xiang
- The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
- Chongqing University Cancer Hospital, Chongqing, China
| | - Longtao Wu
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Brigette B Y Ma
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Anthony T C Chan
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Xiaofeng Zhu
- State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Guangzhou, China
| | - Richard F Ambinder
- Johns Hopkins Singapore and Sydney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Qian Tao
- Cancer Epigenetics Laboratory, Department of Clinical Oncology, State Key Laboratory of Translational Oncology, Sir YK Pao Center for Cancer and Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.
- Johns Hopkins Singapore and Sydney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
37
|
Sabikunnahar B, Caldwell S, Varnum S, Hogan T, Cooper A, Lahue KG, Bivona JJ, Cousens PM, Symeonides M, Ballif BA, Poynter ME, Krementsov DN. Long Noncoding RNA U90926 Is Induced in Activated Macrophages, Is Protective in Endotoxic Shock, and Encodes a Novel Secreted Protein. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2023; 210:807-819. [PMID: 36705532 PMCID: PMC9998366 DOI: 10.4049/jimmunol.2200215] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 01/03/2023] [Indexed: 01/28/2023]
Abstract
Thousands of long noncoding RNAs are encoded in mammalian genomes, yet most remain uncharacterized. In this study, we functionally characterized a mouse long noncoding RNA named U90926. Analysis of U90926 RNA levels revealed minimal expression across multiple tissues at steady state. However, the expression of this gene was highly induced in macrophages and dendritic cells by TLR activation, in a p38 MAPK- and MyD88-dependent manner. To study the function of U90926, we generated U90926-deficient (U9-KO) mice. Surprisingly, we found minimal effects of U90926 deficiency in cultured macrophages. Given the lack of macrophage-intrinsic effect, we investigated the subcellular localization of U90926 transcript and its protein-coding potential. We found that U90926 RNA localizes to the cytosol, associates with ribosomes, and contains an open reading frame that encodes a novel glycosylated protein (termed U9-ORF), which is secreted from the cell. An in vivo model of endotoxic shock revealed that, in comparison with wild type mice, U9-KO mice exhibited increased sickness responses and mortality. Mechanistically, serum levels of IL-6 were elevated in U9-KO mice, and IL-6 neutralization improved endotoxemia outcomes in U9-KO mice. Taken together, these results suggest that U90926 expression is protective during endotoxic shock, potentially mediated by the paracrine and/or endocrine actions of the novel U9-ORF protein secreted by activated myeloid cells.
Collapse
Affiliation(s)
- Bristy Sabikunnahar
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
- Cellular, Molecular, and Biomedical Sciences Doctoral Program, University of Vermont, Burlington, VT
| | - Sydney Caldwell
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| | - Stella Varnum
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| | - Tyler Hogan
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| | - Alexei Cooper
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| | - Karolyn G Lahue
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| | - Joseph J Bivona
- Cellular, Molecular, and Biomedical Sciences Doctoral Program, University of Vermont, Burlington, VT
- Department of Medicine, University of Vermont, Burlington, VT
| | | | - Menelaos Symeonides
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT
| | - Bryan A Ballif
- Department of Biology, University of Vermont, Burlington, VT
| | | | - Dimitry N Krementsov
- Department of Biomedical and Health Sciences, University of Vermont, Burlington, VT
| |
Collapse
|
38
|
Zhu XT, Zhou R, Che J, Zheng YY, Tahir Ul Qamar M, Feng JW, Zhang J, Gao J, Chen LL. Ribosome profiling reveals the translational landscape and allele-specific translational efficiency in rice. PLANT COMMUNICATIONS 2023; 4:100457. [PMID: 36199246 PMCID: PMC10030323 DOI: 10.1016/j.xplc.2022.100457] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 08/23/2022] [Accepted: 10/01/2022] [Indexed: 05/04/2023]
Abstract
Translational regulation is a critical step in the process of gene expression and governs the synthesis of proteins from mRNAs. Many studies have revealed translational regulation in plants in response to various environmental stimuli. However, there have been no studies documenting the comprehensive landscape of translational regulation and allele-specific translational efficiency in multiple plant tissues, especially those of rice, a main staple crop that feeds nearly half of the world's population. Here we used RNA sequencing and ribosome profiling data to analyze the transcriptome and translatome of an elite hybrid rice, Shanyou 63 (SY63), and its parental varieties Zhenshan 97 and Minghui 63. The results revealed that gene expression patterns varied more among tissues than among varieties at the transcriptional and translational levels. We identified 3392 upstream open reading frames (uORFs), and the uORF-containing genes were enriched in transcription factors. Only 668 of 13 492 long non-coding RNAs could be translated into peptides. Finally, we discovered numerous genes with allele-specific translational efficiency in SY63 and demonstrated that some cis-regulatory elements may contribute to allelic divergence in translational efficiency. Overall, these findings may improve our understanding of translational regulation in rice and provide information for molecular breeding research.
Collapse
Affiliation(s)
- Xi-Tong Zhu
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Run Zhou
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Jian Che
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Yu-Yu Zheng
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Muhammad Tahir Ul Qamar
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Jia-Wu Feng
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Jianwei Zhang
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Junxiang Gao
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Ling-Ling Chen
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China.
| |
Collapse
|
39
|
Shiny transcriptional junk: lncRNA-derived peptides in cancers and immune responses. Life Sci 2023; 316:121434. [PMID: 36706831 DOI: 10.1016/j.lfs.2023.121434] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Revised: 01/11/2023] [Accepted: 01/19/2023] [Indexed: 01/27/2023]
Abstract
By interacting with DNA, RNA, and proteins, long noncoding RNAs (lncRNAs) have been linked to several pathological states. LncRNA-derived peptides, as a novel modality of action of lncRNAs, have recently become a research hotspot. An increasing body of evidence has demonstrated the important role of these peptides in carcinogenesis and cancer progression and immune response. This review first describes lncRNA-derived peptides, the regulators that control their translation, and the roles of these peptides in multiple biological processes and disease states including cancers. In the following section, we comprehensively analyzed the significant role lncRNA-derived peptide played in the immune response. This review provides fresh perspectives on the biological role of lncRNAs and their relationship with diseases, particularly with cancers and the immune response, providing a theoretical basis for these lncRNA-derived peptides as therapeutic and diagnostic targets in cancers and inflammatory diseases.
Collapse
|
40
|
Clauwaert J, McVey Z, Gupta R, Menschaert G. TIS Transformer: remapping the human proteome using deep learning. NAR Genom Bioinform 2023; 5:lqad021. [PMID: 36879896 PMCID: PMC9985340 DOI: 10.1093/nargab/lqad021] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 01/20/2023] [Accepted: 02/14/2023] [Indexed: 03/07/2023] Open
Abstract
The correct mapping of the proteome is an important step towards advancing our understanding of biological systems and cellular mechanisms. Methods that provide better mappings can fuel important processes such as drug discovery and disease understanding. Currently, true determination of translation initiation sites is primarily achieved by in vivo experiments. Here, we propose TIS Transformer, a deep learning model for the determination of translation start sites solely utilizing the information embedded in the transcript nucleotide sequence. The method is built upon deep learning techniques first designed for natural language processing. We prove this approach to be best suited for learning the semantics of translation, outperforming previous approaches by a large margin. We demonstrate that limitations in the model performance are primarily due to the presence of low-quality annotations against which the model is evaluated against. Advantages of the method are its ability to detect key features of the translation process and multiple coding sequences on a transcript. These include micropeptides encoded by short Open Reading Frames, either alongside a canonical coding sequence or within long non-coding RNAs. To demonstrate the use of our methods, we applied TIS Transformer to remap the full human proteome.
Collapse
Affiliation(s)
- Jim Clauwaert
- Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Oost-Vlaanderen 9000, Belgium
| | - Zahra McVey
- Novo Nordisk Research Centre Oxford, Novo Nordisk Ltd., Crawley, South East England, RH6 0PA, UK
| | - Ramneek Gupta
- Novo Nordisk Research Centre Oxford, Novo Nordisk Ltd., Crawley, South East England, RH6 0PA, UK
| | - Gerben Menschaert
- Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Oost-Vlaanderen 9000, Belgium
| |
Collapse
|
41
|
Fernandez SG, Ferguson L, Ingolia NT. Ribosome rescue factor PELOTA modulates translation start site choice and protein isoform levels of transcription factor C/EBP α. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.16.524343. [PMID: 36711859 PMCID: PMC9882168 DOI: 10.1101/2023.01.16.524343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Translation initiation at alternative start sites can dynamically control the synthesis of two or more functionally distinct protein isoforms from a single mRNA. Alternate isoforms of the hematopoietic transcription factor CCAAT-enhancer binding protein α (C/EBPα) produced from different start sites exert opposing effects during myeloid cell development. This alternative initiation depends on sequence features of the CEBPA transcript, including a regulatory upstream open reading frame (uORF), but the molecular basis is not fully understood. Here we identify trans-acting factors that affect C/EBPα isoform choice using a sensitive and quantitative two-color fluorescence reporter coupled with CRISPRi screening. Our screen uncovered a role for the ribosome rescue factor PELOTA (PELO) in promoting expression of the longer C/EBPα isoform, by directly removing inhibitory unrecycled ribosomes and through indirect effects mediated by the mechanistic target of rapamycin (mTOR) kinase. Our work provides further mechanistic insights into coupling between ribosome recycling and translation reinitiation in regulation of a key transcription factor, with implications for normal hematopoiesis and leukemiagenesis.
Collapse
Affiliation(s)
| | - Lucas Ferguson
- Department of Molecular and Cell Biology, University of California, Berkeley
| | - Nicholas T. Ingolia
- Department of Molecular and Cell Biology, University of California, Berkeley
- Center for Computational Biology and California Institute for Quantitative Biosciences, University of California, Berkeley
| |
Collapse
|
42
|
Martinez TF, Lyons-Abbott S, Bookout AL, De Souza EV, Donaldson C, Vaughan JM, Lau C, Abramov A, Baquero AF, Baquero K, Friedrich D, Huard J, Davis R, Kim B, Koch T, Mercer AJ, Misquith A, Murray SA, Perry S, Pino LK, Sanford C, Simon A, Zhang Y, Zipp G, Bizarro CV, Shokhirev MN, Whittle AJ, Searle BC, MacCoss MJ, Saghatelian A, Barnes CA. Profiling mouse brown and white adipocytes to identify metabolically relevant small ORFs and functional microproteins. Cell Metab 2023; 35:166-183.e11. [PMID: 36599300 PMCID: PMC9889109 DOI: 10.1016/j.cmet.2022.12.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 09/19/2022] [Accepted: 12/06/2022] [Indexed: 01/05/2023]
Abstract
Microproteins (MPs) are a potentially rich source of uncharacterized metabolic regulators. Here, we use ribosome profiling (Ribo-seq) to curate 3,877 unannotated MP-encoding small ORFs (smORFs) in primary brown, white, and beige mouse adipocytes. Of these, we validated 85 MPs by proteomics, including 33 circulating MPs in mouse plasma. Analyses of MP-encoding mRNAs under different physiological conditions (high-fat diet) revealed that numerous MPs are regulated in adipose tissue in vivo and are co-expressed with established metabolic genes. Furthermore, Ribo-seq provided evidence for the translation of Gm8773, which encodes a secreted MP that is homologous to human and chicken FAM237B. Gm8773 is highly expressed in the arcuate nucleus of the hypothalamus, and intracerebroventricular administration of recombinant mFAM237B showed orexigenic activity in obese mice. Together, these data highlight the value of this adipocyte MP database in identifying MPs with roles in fundamental metabolic and physiological processes such as feeding.
Collapse
Affiliation(s)
- Thomas F Martinez
- Department of Pharmaceutical Sciences, Department of Biological Chemistry, Chao Family Comprehensive Cancer Center, University of California, Irvine, Irvine, CA, USA
| | | | - Angie L Bookout
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Eduardo V De Souza
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil; Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul 90616-900, Brazil; Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Cynthia Donaldson
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Joan M Vaughan
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Calvin Lau
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Ariel Abramov
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Arian F Baquero
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Karalee Baquero
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Dave Friedrich
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Justin Huard
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ray Davis
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Bong Kim
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ty Koch
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Aaron J Mercer
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Ayesha Misquith
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Sara A Murray
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Sakara Perry
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Lindsay K Pino
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | - Alex Simon
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Yu Zhang
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Garrett Zipp
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA
| | - Cristiano V Bizarro
- Centro de Pesquisas em Biologia Molecular e Funcional (CPBMF) and Instituto Nacional de Ciência e Tecnologia em Tuberculose (INCT-TB), Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, Brazil; Programa de Pós-Graduação em Biologia Celular e Molecular, Pontifícia Universidade Católica do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul 90616-900, Brazil
| | - Maxim N Shokhirev
- Razavi Newman Integrative Genomics and Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, CA, USA
| | | | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
| | - Michael J MacCoss
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Alan Saghatelian
- Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, CA, USA.
| | - Christopher A Barnes
- Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA; Velia Therapeutics, Inc., San Diego, CA, USA.
| |
Collapse
|
43
|
Chothani S, Ho L, Schafer S, Rackham O. Discovering microproteins: making the most of ribosome profiling data. RNA Biol 2023; 20:943-954. [PMID: 38013207 PMCID: PMC10730196 DOI: 10.1080/15476286.2023.2279845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/30/2023] [Indexed: 11/29/2023] Open
Abstract
Building a reference set of protein-coding open reading frames (ORFs) has revolutionized biological process discovery and understanding. Traditionally, gene models have been confirmed using cDNA sequencing and encoded translated regions inferred using sequence-based detection of start and stop combinations longer than 100 amino-acids to prevent false positives. This has led to small ORFs (smORFs) and their encoded proteins left un-annotated. Ribo-seq allows deciphering translated regions from untranslated irrespective of the length. In this review, we describe the power of Ribo-seq data in detection of smORFs while discussing the major challenge posed by data-quality, -depth and -sparseness in identifying the start and end of smORF translation. In particular, we outline smORF cataloguing efforts in humans and the large differences that have arisen due to variation in data, methods and assumptions. Although current versions of smORF reference sets can already be used as a powerful tool for hypothesis generation, we recommend that future editions should consider these data limitations and adopt unified processing for the community to establish a canonical catalogue of translated smORFs.
Collapse
Affiliation(s)
- Sonia Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Lena Ho
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
| | - Owen Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore
- School of Biological Sciences, University of Southampton, Southampton, UK
- The Alan Turing Institute, The British Library, London, UK
| |
Collapse
|
44
|
Yang J, Liu M, Fang X, Zhang H, Ren Q, Zheng Y, Wang Y, Zhou Y. Advances in peptides encoded by non-coding RNAs: A cargo in exosome. Front Oncol 2022; 12:1081997. [PMID: 36620552 PMCID: PMC9822543 DOI: 10.3389/fonc.2022.1081997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 11/28/2022] [Indexed: 12/24/2022] Open
Abstract
The metastasis of malignant tumors determines patient prognosis. This is the main reason for the poor prognosis of patients with cancer and the most challenging aspect of treating malignant tumors. Therefore, it is important to identify early tumor markers and molecules that can predict patient prognosis. However, there are currently no molecular markers with good clinical accuracy and specificity. Many non-coding RNA (ncRNAs)have been identified, which can regulate the process of tumor development at multiple levels. Interestingly, some ncRNAs are translated to produce functional peptides. Exosomes act as signal carriers, are encapsulated in nucleic acids and proteins, and play a messenger role in cell-to-cell communication. Recent studies have identified exosome peptides with potential diagnostic roles. This review aims to provide a theoretical basis for ncRNA-encoded peptides or proteins transported by exosomes and ultimately to provide ideas for further development of new diagnostic and prognostic cancer markers.
Collapse
Affiliation(s)
- Jing Yang
- The First Clinical Medical College, Lanzhou University, Lanzhou, China,Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Mengxiao Liu
- The First Clinical Medical College, Lanzhou University, Lanzhou, China,Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Xidong Fang
- The First Clinical Medical College, Lanzhou University, Lanzhou, China,Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Huiyun Zhang
- The First Clinical Medical College, Lanzhou University, Lanzhou, China,Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Qian Ren
- Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Ya Zheng
- Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China
| | - Yuping Wang
- Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China,*Correspondence: Yongning Zhou, ; Yuping Wang,
| | - Yongning Zhou
- Department of Gastroenterology, The First Hospital of Lanzhou University, Lanzhou, China,Key Laboratory for Gastrointestinal Diseases of Gansu Province, The First Hospital of Lanzhou University, Lanzhou, China,*Correspondence: Yongning Zhou, ; Yuping Wang,
| |
Collapse
|
45
|
Vakirlis N, Vance Z, Duggan KM, McLysaght A. De novo birth of functional microproteins in the human lineage. Cell Rep 2022; 41:111808. [PMID: 36543139 PMCID: PMC10073203 DOI: 10.1016/j.celrep.2022.111808] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 06/21/2022] [Accepted: 11/18/2022] [Indexed: 12/24/2022] Open
Abstract
Small open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences. Here we sought to identify such cases in the human lineage by reconstructing the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the formation of each ORF and its transcriptional activation, we show that novel microproteins with significant phenotypic effects have emerged de novo throughout animal evolution, including two after the human-chimpanzee split. Notably, traditional methods for assessing coding potential would miss most of these cases. This evidence demonstrates that the functional potential intrinsic to sORFs can be relatively rapidly and frequently realized through de novo gene emergence.
Collapse
Affiliation(s)
- Nikolaos Vakirlis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece.
| | - Zoe Vance
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
| | - Kate M Duggan
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland
| | - Aoife McLysaght
- Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin, Ireland.
| |
Collapse
|
46
|
Yang Y, Wang H, Zhang Y, Chen L, Chen G, Bao Z, Yang Y, Xie Z, Zhao Q. An Optimized Proteomics Approach Reveals Novel Alternative Proteins in Mouse Liver Development. Mol Cell Proteomics 2022; 22:100480. [PMID: 36494044 PMCID: PMC9823216 DOI: 10.1016/j.mcpro.2022.100480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/15/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open
Abstract
Alternative ORFs (AltORFs) are unannotated sequences in genome that encode novel peptides or proteins named alternative proteins (AltProts). Although ribosome profiling and bioinformatics predict a large number of AltProts, mass spectrometry as the only direct way of identification is hampered by the short lengths and relative low abundance of AltProts. There is an urgent need for improvement of mass spectrometry methodologies for AltProt identification. Here, we report an approach based on size-exclusion chromatography for simultaneous enrichment and fractionation of AltProts from complex proteome. This method greatly simplifies the variance of AltProts discovery by enriching small proteins smaller than 40 kDa. In a systematic comparison between 10 methods, the approach we reported enabled the discovery of more AltProts with overall higher intensities, with less cost of time and effort compared to other workflows. We applied this approach to identify 89 novel AltProts from mouse liver, 39 of which were differentially expressed between embryonic and adult mice. During embryonic development, the upregulated AltProts were mainly involved in biological pathways on RNA splicing and processing, whereas the AltProts involved in metabolisms were more active in adult livers. Our study not only provides an effective approach for identifying AltProts but also novel AltProts that are potentially important in developmental biology.
Collapse
Affiliation(s)
- Ying Yang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, SAR, China
| | - Hongwei Wang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yuanliang Zhang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, SAR, China
| | - Lei Chen
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, SAR, China
| | - Gennong Chen
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zhaoshi Bao
- Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical School, Beijing, China
| | - Yang Yang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, SAR, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Qian Zhao
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, SAR, China,For correspondence: Qian Zhao
| |
Collapse
|
47
|
Duffy EE, Finander B, Choi G, Carter AC, Pritisanac I, Alam A, Luria V, Karger A, Phu W, Sherman MA, Assad EG, Pajarillo N, Khitun A, Crouch EE, Ganesh S, Chen J, Berger B, Sestan N, O'Donnell-Luria A, Huang EJ, Griffith EC, Forman-Kay JD, Moses AM, Kalish BT, Greenberg ME. Developmental dynamics of RNA translation in the human brain. Nat Neurosci 2022; 25:1353-1365. [PMID: 36171426 PMCID: PMC10198132 DOI: 10.1038/s41593-022-01164-9] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 08/12/2022] [Indexed: 01/27/2023]
Abstract
The precise regulation of gene expression is fundamental to neurodevelopment, plasticity and cognitive function. Although several studies have profiled transcription in the developing human brain, there is a gap in understanding of accompanying translational regulation. In this study, we performed ribosome profiling on 73 human prenatal and adult cortex samples. We characterized the translational regulation of annotated open reading frames (ORFs) and identified thousands of previously unknown translation events, including small ORFs that give rise to human-specific and/or brain-specific microproteins, many of which we independently verified using proteomics. Ribosome profiling in stem-cell-derived human neuronal cultures corroborated these findings and revealed that several neuronal activity-induced non-coding RNAs encode previously undescribed microproteins. Physicochemical analysis of brain microproteins identified a class of proteins that contain arginine-glycine-glycine (RGG) repeats and, thus, may be regulators of RNA metabolism. This resource expands the known translational landscape of the human brain and illuminates previously unknown brain-specific protein products.
Collapse
Affiliation(s)
- Erin E Duffy
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA.
| | | | - GiHun Choi
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Ava C Carter
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Iva Pritisanac
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Biochemistry, University of Toronto, Toronto, ON, Canada
- Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Molecular Biology and Biochemistry, Medical University of Graz, Graz, Austria
| | - Aqsa Alam
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada
| | - Victor Luria
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Pediatrics, Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
| | - Amir Karger
- IT-Research Computing, Harvard Medical School, Boston, MA, USA
| | - William Phu
- Department of Pediatrics, Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Maxwell A Sherman
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Elena G Assad
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Naomi Pajarillo
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Alexandra Khitun
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA, USA
| | - Elizabeth E Crouch
- Department of Pediatrics, University of California, San Francisco, San Francisco, CA, USA
- The Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, CA, USA
| | - Sanika Ganesh
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Jin Chen
- Cecil H. and Ida Green Center for Reproductive Biology Sciences, UT Southwestern Medical Center, Dallas, TX, USA
| | - Bonnie Berger
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Nenad Sestan
- Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
| | - Anne O'Donnell-Luria
- Department of Pediatrics, Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Eric J Huang
- Department of Pathology, University of California, San Francisco, San Francisco, CA, USA
- Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA, USA
- Pathology Service 113B, San Francisco Veterans Affairs Healthcare System, San Francisco, CA, USA
| | - Eric C Griffith
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA
| | - Julie D Forman-Kay
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Biochemistry, University of Toronto, Toronto, ON, Canada
| | - Alan M Moses
- Department of Cell and Systems Biology, University of Toronto, Toronto, ON, Canada
| | - Brian T Kalish
- Department of Neurobiology, Harvard Medical School, Boston, MA, USA.
- Department of Paediatrics, Division of Neonatology, Hospital for Sick Children, Toronto, ON, Canada.
- Program in Neuroscience and Mental Health, SickKids Research Institute, Toronto, ON, Canada.
| | | |
Collapse
|
48
|
Bagheri A, Astafev A, Al-Hashimy T, Jiang P. Tracing Translational Footprint by Ribo-Seq: Principle, Workflow, and Applications to Understand the Mechanism of Human Diseases. Cells 2022; 11:cells11192966. [PMID: 36230928 PMCID: PMC9562884 DOI: 10.3390/cells11192966] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 09/02/2022] [Accepted: 09/19/2022] [Indexed: 11/30/2022] Open
Abstract
RNA-seq has been widely used as a high-throughput method to characterize transcript dynamic changes in a broad context, such as development and diseases. However, whether RNA-seq-estimated transcriptional dynamics can be translated into protein level changes is largely unknown. Ribo-seq (Ribosome profiling) is an emerging technology that allows for the investigation of the translational footprint via profiling ribosome-bounded mRNA fragments. Ribo-seq coupled with RNA-seq will allow us to understand the transcriptional and translational control of the fundamental biological process and human diseases. This review focuses on discussing the principle, workflow, and applications of Ribo-seq to study human diseases.
Collapse
Affiliation(s)
- Atefeh Bagheri
- Department of Biological, Geological and Environmental Sciences (BGES), Cleveland State University, Cleveland, OH 44115, USA
- Center for Gene Regulation in Health and Disease (GRHD), Cleveland State University, Cleveland, OH 44115, USA
| | - Artem Astafev
- Department of Biological, Geological and Environmental Sciences (BGES), Cleveland State University, Cleveland, OH 44115, USA
- Center for Gene Regulation in Health and Disease (GRHD), Cleveland State University, Cleveland, OH 44115, USA
| | - Tara Al-Hashimy
- Department of Biological, Geological and Environmental Sciences (BGES), Cleveland State University, Cleveland, OH 44115, USA
| | - Peng Jiang
- Department of Biological, Geological and Environmental Sciences (BGES), Cleveland State University, Cleveland, OH 44115, USA
- Center for Gene Regulation in Health and Disease (GRHD), Cleveland State University, Cleveland, OH 44115, USA
- Center for Applied Data Analysis and Modeling (ADAM), Cleveland State University, Cleveland, OH 44115, USA
- Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
- Correspondence: ; Tel.: +1-(216)-687-3917
| |
Collapse
|
49
|
Gleason AC, Ghadge G, Sonobe Y, Roos RP. Kozak Similarity Score Algorithm Identifies Alternative Translation Initiation Codons Implicated in Cancers. Int J Mol Sci 2022; 23:ijms231810564. [PMID: 36142475 PMCID: PMC9506484 DOI: 10.3390/ijms231810564] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 09/05/2022] [Accepted: 09/08/2022] [Indexed: 11/16/2022] Open
Abstract
Ribosome profiling and mass spectroscopy have identified canonical and noncanonical translation initiation codons (TICs) that are upstream of the main translation initiation site and used to translate oncogenic proteins. There have previously been conflicting reports about the patterns of nucleotides that surround noncanonical TICs. Here, we use a Kozak Similarity Score algorithm to find that nearly all of these TICs have flanking nucleotides closely matching the Kozak sequence. Remarkably, the nucleotides flanking alternative noncanonical TICs are frequently closer to the Kozak sequence than the nucleotides flanking TICs used to translate the gene’s main protein. Of note, the 5′ untranslated region (5‘UTR) of cancer-associated genes with an upstream TIC tend to be significantly longer than the same region in genes not associated with cancer. The presence of a longer-than-typical 5′UTR increases the likelihood of ribosome binding to upstream noncanonical TICs, and may be a distinguishing feature of a number of genes overexpressed in cancer. Noncanonical TICs that are located in the 5′UTR, although thought by some to be disadvantageous and suppressed by evolution, may translate oncogenic proteins because of their flanking nucleotides.
Collapse
|
50
|
Malekos E, Carpenter S. Short open reading frame genes in innate immunity: from discovery to characterization. Trends Immunol 2022; 43:741-756. [PMID: 35965152 PMCID: PMC10118063 DOI: 10.1016/j.it.2022.07.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 07/11/2022] [Accepted: 07/13/2022] [Indexed: 12/27/2022]
Abstract
Next-generation sequencing (NGS) technologies have greatly expanded the size of the known transcriptome. Many newly discovered transcripts are classified as long noncoding RNAs (lncRNAs) which are assumed to affect phenotype through sequence and structure and not via translated protein products despite the vast majority of them harboring short open reading frames (sORFs). Recent advances have demonstrated that the noncoding designation is incorrect in many cases and that sORF-encoded peptides (SEPs) translated from these transcripts are important contributors to diverse biological processes. Interest in SEPs is at an early stage and there is evidence for the existence of thousands of SEPs that are yet unstudied. We hope to pique interest in investigating this unexplored proteome by providing a discussion of SEP characterization generally and describing specific discoveries in innate immunity.
Collapse
Affiliation(s)
- Eric Malekos
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA; Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Susan Carpenter
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA; Department of Molecular Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, USA.
| |
Collapse
|