1
|
Sultan MF, Karim T, Hossain Shaon MS, Azim SM, Dehzangi I, Akter MS, Ibrahim SM, Ali MM, Ahmed K, Bui FM. DHUpredET: A comparative computational approach for identification of dihydrouridine modification sites in RNA sequence. Anal Biochem 2025; 702:115828. [PMID: 40057221 DOI: 10.1016/j.ab.2025.115828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Revised: 02/23/2025] [Accepted: 03/04/2025] [Indexed: 03/17/2025]
Abstract
Laboratory-based detection of D sites is laborious and expensive. In this study, we developed effective machine learning models employing efficient feature encoding methods to identify D sites. Initially, we explored various state-of-the-art feature encoding approaches and 30 machine learning techniques for each and selected the top eight models based on their independent testing and cross-validation outcomes. Finally, we developed DHUpredET using the extra tree classifier methods for predicting DHU sites. The DHUpredET model demonstrated balanced performance across all evaluation criteria, outperforming state-of-the-art models by 8 % and 14 % in terms of accuracy and sensitivity, respectively, on an independent test set. Further analysis revealed that the model achieved higher accuracy with position-specific two nucleotide (PS2) features, leading us to conclude that PS2 features are the best suited for the DHUpredET model. Therefore, our proposed model emerges as the most favorite choice for predicting D sites. In addition, we conducted an in-depth analysis of local features and identified a particularly significant attribute with a feature score of 0.035 for PS2_299 attributes. This tool holds immense promise as an advantageous instrument for accelerating the discovery of D modification sites, which contributes too many targeting therapeutic and understanding RNA structure.
Collapse
Affiliation(s)
- Md Fahim Sultan
- Department of Computer Science and Engineering, Oakland University, Rochester, MI, 48309, USA.
| | - Tasmin Karim
- Department of Computer Science and Engineering, Oakland University, Rochester, MI, 48309, USA.
| | | | - Sayed Mehedi Azim
- Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, 08102, USA.
| | - Iman Dehzangi
- Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, 08102, USA; Department of Computer Science, Rutgers University, Camden, NJ, 08102, USA.
| | - Mst Shapna Akter
- Department of Computer Science and Engineering, Oakland University, Rochester, MI, 48309, USA.
| | - Sobhy M Ibrahim
- Department of Biochemistry, College of Science, King Saud University, P.O. Box: 2455, Riyadh, 11451, Saudi Arabia.
| | - Md Mamun Ali
- Division of Biomedical Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada; Department of Software Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh.
| | - Kawsar Ahmed
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada; Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh.
| | - Francis M Bui
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada.
| |
Collapse
|
2
|
Porat J. Circuit logic: interdependent RNA modifications shape mRNA and noncoding RNA structure and function. RNA (NEW YORK, N.Y.) 2025; 31:613-622. [PMID: 40044218 PMCID: PMC12001972 DOI: 10.1261/rna.080421.125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2025] [Accepted: 02/26/2025] [Indexed: 03/28/2025]
Abstract
Continued advances in high-throughput detection of posttranscriptional RNA modifications have enabled large-scale, mechanistic studies into the importance of RNA modifications in regulating the structure, function, and stability of coding and noncoding RNAs. More recently, this has expanded beyond investigations of independent single modifications, revealing the breadth of modification complexities in single transcripts and the biogenesis pathways involved that lead to coordinately modified RNA species. This has resulted in the concept of modification circuits, where one modification can promote or inhibit the subsequent installation of other modifications, or when modifications are coordinated across different RNA species. These circuits play important roles in the biogenesis of multistepped posttranscriptional modifications, modulate ribonucleoprotein complex formation and conformational switches, and mediate codon-biased translation through the coordination of mRNA and tRNA modifications. Here, I review evidence of complex modification circuits in mRNA and noncoding RNA and highlight open questions concerning the molecular mechanisms giving rise to modification circuits and their importance in the context of RNA processing and maturation.
Collapse
MESH Headings
- RNA, Messenger/genetics
- RNA, Messenger/chemistry
- RNA, Messenger/metabolism
- RNA, Untranslated/genetics
- RNA, Untranslated/chemistry
- RNA, Untranslated/metabolism
- RNA Processing, Post-Transcriptional
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- RNA, Transfer/chemistry
- Nucleic Acid Conformation
- Humans
- Animals
Collapse
Affiliation(s)
- Jennifer Porat
- Stem Cell Program and Division of Hematology/Oncology, Boston Children's Hospital, Boston, Massachusetts 02215, USA
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| |
Collapse
|
3
|
Oberdoerffer S, Gilbert WV. All the sites we cannot see: Sources and mitigation of false negatives in RNA modification studies. Nat Rev Mol Cell Biol 2025; 26:237-248. [PMID: 39433914 DOI: 10.1038/s41580-024-00784-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/12/2024] [Indexed: 10/23/2024]
Abstract
RNA modifications are essential for human health - too much or too little of them leads to serious illnesses ranging from neurodevelopmental disorders to cancer. Technical advances in RNA modification sequencing are beginning to uncover the RNA targets of diverse RNA-modifying enzymes that are dysregulated in disease. However, the emerging transcriptome-wide maps of modified nucleosides installed by these enzymes should be considered as first drafts. In particular, a range of technical artefacts lead to false negatives - modified sites that are overlooked owing to technique-dependent, and often sequence-context-specific, 'blind spots'. In this Review, we discuss potential sources of false negatives in sequencing-based RNA modification maps, propose mitigation strategies and suggest guidelines for transparent reporting of sensitivity to detect modified sites in profiling studies. Important considerations for recognition and avoidance of false negatives include assessment and reporting of position-specific sequencing depth, identification of protocol-dependent RNA capture biases and applying controls for false negatives as well as for false positives. Despite their limitations, emerging maps of RNA modifications reveal exciting and largely uncharted potential for post-transcriptional control of all aspects of RNA function.
Collapse
Affiliation(s)
- Shalini Oberdoerffer
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD, USA.
| | - Wendy V Gilbert
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA.
| |
Collapse
|
4
|
Kilz LM, Zimmermann S, Marchand V, Bourguignon V, Sudol C, Brégeon D, Hamdane D, Motorin Y, Helm M. Differential redox sensitivity of tRNA dihydrouridylation. Nucleic Acids Res 2024; 52:12784-12797. [PMID: 39460624 PMCID: PMC11602153 DOI: 10.1093/nar/gkae964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Revised: 10/03/2024] [Accepted: 10/11/2024] [Indexed: 10/28/2024] Open
Abstract
Various transfer RNA (tRNA) modifications have recently been shown to regulate stress-dependent gene expression by modulating messenger RNA translation. Among these modifications, dihydrouridine stands out for its increase of tRNA structural flexibility. However, whether and how dihydrouridine synthesis reacts to environmental stimuli is largely unknown. In this study, we manipulated the intracellular redox state of Escherichia coli using paraquat, revealing differential sensitivities of the three tRNA-dihydrouridine synthases towards oxidative stress. Using liquid chromatography-mass spectrometry quantification of dihydrouridine in various knockout strains, we validated the use of a specific RNA sequencing method, namely AlkAnilineSeq, for the precise mapping of dihydrouridines throughout E. coli tRNAs. We found DusA showing high activity, followed by DusB and DusC, whose activity was decreased under paraquat treatment. The relative sensitivity is most plausibly explained by a paraquat-dependent drop of NADPH availability. These findings are substantiated by in vitro kinetics, revealing DusA as the most active enzyme, followed by DusB, while DusC showed little activity, likely related to the efficacy of the redox reaction of the flavin coenzyme with NADPH. Overall, our study underscores the intricate interplay between redox dynamics and tRNA modification processes, revealing a new facet of the regulatory mechanisms influencing cellular responses to oxidative stress.
Collapse
Affiliation(s)
- Lea-Marie Kilz
- Institute of Pharmaceutical and Biomedical Sciences, Staudingerweg 5, Johannes Gutenberg University Mainz, 55128 Mainz, Germany
| | - Simone Zimmermann
- Institute of Pharmaceutical and Biomedical Sciences, Staudingerweg 5, Johannes Gutenberg University Mainz, 55128 Mainz, Germany
| | - Virginie Marchand
- Université de Lorraine, CNRS, INSERM, UAR2008/US40 IBSLor, EpiRNA-Seq Core Facility, 9 Av. De la Forêt de Haye, 54500 Vandoeuvre-lès-Nancy, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, 9 Av. De la Forêtde Haye, 54500 Vandoeuvre-lès-Nancy, France
| | - Valérie Bourguignon
- Université de Lorraine, CNRS, INSERM, UAR2008/US40 IBSLor, EpiRNA-Seq Core Facility, 9 Av. De la Forêt de Haye, 54500 Vandoeuvre-lès-Nancy, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, 9 Av. De la Forêtde Haye, 54500 Vandoeuvre-lès-Nancy, France
| | - Claudia Sudol
- Sorbonne University, CNRS, Institute of Biology Paris Seine, Biology of Aging and Adaptation, 7 quai Saint Bernard, 75252 Paris, France
- Collège de France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques (LCPB), 11place Marcelin Berthelot, 75231 Paris France
| | - Damien Brégeon
- Sorbonne University, CNRS, Institute of Biology Paris Seine, Biology of Aging and Adaptation, 7 quai Saint Bernard, 75252 Paris, France
| | - Djemel Hamdane
- Collège de France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques (LCPB), 11place Marcelin Berthelot, 75231 Paris France
| | - Yuri Motorin
- Université de Lorraine, CNRS, INSERM, UAR2008/US40 IBSLor, EpiRNA-Seq Core Facility, 9 Av. De la Forêt de Haye, 54500 Vandoeuvre-lès-Nancy, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, 9 Av. De la Forêtde Haye, 54500 Vandoeuvre-lès-Nancy, France
| | - Mark Helm
- Institute of Pharmaceutical and Biomedical Sciences, Staudingerweg 5, Johannes Gutenberg University Mainz, 55128 Mainz, Germany
| |
Collapse
|
5
|
Matsuura J, Akichika S, Wei FY, Suzuki T, Yamamoto T, Watanabe Y, Valášek LS, Mukasa A, Tomizawa K, Chujo T. Human DUS1L catalyzes dihydrouridine modification at tRNA positions 16/17, and DUS1L overexpression perturbs translation. Commun Biol 2024; 7:1238. [PMID: 39354220 PMCID: PMC11445529 DOI: 10.1038/s42003-024-06942-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 09/23/2024] [Indexed: 10/03/2024] Open
Abstract
Human cytoplasmic tRNAs contain dihydrouridine modifications at positions 16 and 17 (D16/D17). The enzyme responsible for D16/D17 formation and its cellular roles remain elusive. Here, we identify DUS1L as the human tRNA D16/D17 writer. DUS1L knockout in the glioblastoma cell lines LNZ308 and U87 causes loss of D16/D17. D formation is reconstituted in vitro using recombinant DUS1L in the presence of NADPH or NADH. DUS1L knockout/overexpression in LNZ308 cells shows that DUS1L supports cell growth. Moreover, higher DUS1L expression in glioma patients is associated with poorer prognosis. Upon vector-mediated DUS1L overexpression in LNZ308 cells, 5' and 3' processing of precursor tRNATyr(GUA) is inhibited, resulting in a reduced mature tRNATyr(GUA) level, reduced translation of the tyrosine codons UAC and UAU, and reduced translational readthrough of the near-cognate stop codons UAA and UAG. Moreover, DUS1L overexpression increases the amounts of several D16/D17-containing tRNAs and total cellular translation. Our study identifies a human dihydrouridine writer, providing the foundation to study its roles in health and disease.
Collapse
Affiliation(s)
- Jin Matsuura
- Department of Molecular Physiology, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
- Department of Neurosurgery, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Shinichiro Akichika
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Tokyo, Japan
| | - Fan-Yan Wei
- Department of Modomics Biology and Medicine, Institute of Development, Aging and Cancer, Tohoku University, Sendai, Japan
| | - Tsutomu Suzuki
- Department of Chemistry and Biotechnology, Graduate School of Engineering, University of Tokyo, Tokyo, Japan
| | - Takahiro Yamamoto
- Department of Neurosurgery, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Yuka Watanabe
- Department of Cell Pathology, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Leoš Shivaya Valášek
- Laboratory of Regulation of Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Prague, Czech Republic
| | - Akitake Mukasa
- Department of Neurosurgery, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Kazuhito Tomizawa
- Department of Molecular Physiology, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan.
- Center for Metabolic Regulation of Healthy Aging, Faculty of Life Science, Kumamoto University, Kumamoto, Japan.
| | - Takeshi Chujo
- Department of Molecular Physiology, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan.
| |
Collapse
|
6
|
Schaening-Burgos C, LeBlanc H, Fagre C, Li GW, Gilbert WV. RluA is the major mRNA pseudouridine synthase in Escherichia coli. PLoS Genet 2024; 20:e1011100. [PMID: 39241085 PMCID: PMC11421799 DOI: 10.1371/journal.pgen.1011100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 09/24/2024] [Accepted: 07/14/2024] [Indexed: 09/08/2024] Open
Abstract
Pseudouridine (Ψ) is an ubiquitous RNA modification, present in the tRNAs and rRNAs of species across all domains of life. Conserved pseudouridine synthases modify the mRNAs of diverse eukaryotes, but the modification has yet to be identified in bacterial mRNAs. Here, we report the discovery of pseudouridines in mRNA from E. coli. By testing the mRNA modification capacity of all 11 known pseudouridine synthases, we identify RluA as the predominant mRNA-modifying enzyme. RluA, a known tRNA and 23S rRNA pseudouridine synthase, modifies at least 31 of the 44 high-confidence sites we identified in E. coli mRNAs. Using RNA structure probing data to inform secondary structures, we show that the target sites of RluA occur in a common sequence and structural motif comprised of a ΨURAA sequence located in the loop of a short hairpin. This recognition element is shared with previously identified target sites of RluA in tRNAs and rRNA. Overall, our work identifies pseudouridine in key mRNAs and suggests the capacity of Ψ to regulate the transcripts that contain it.
Collapse
Affiliation(s)
- Cassandra Schaening-Burgos
- Department of Biology, Massachusetts Institute of Technology; Cambridge, Massachusetts, United States of America
- Program in Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Hannah LeBlanc
- Department of Biology, Massachusetts Institute of Technology; Cambridge, Massachusetts, United States of America
| | - Christian Fagre
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| | - Gene-Wei Li
- Department of Biology, Massachusetts Institute of Technology; Cambridge, Massachusetts, United States of America
| | - Wendy V. Gilbert
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
7
|
Toubdji S, Thullier Q, Kilz LM, Marchand V, Yuan Y, Sudol C, Goyenvalle C, Jean-Jean O, Rose S, Douthwaite S, Hardy L, Baharoglu Z, de Crécy-Lagard V, Helm M, Motorin Y, Hamdane D, Brégeon D. Exploring a unique class of flavoenzymes: Identification and biochemical characterization of ribosomal RNA dihydrouridine synthase. Proc Natl Acad Sci U S A 2024; 121:e2401981121. [PMID: 39078675 PMCID: PMC11317573 DOI: 10.1073/pnas.2401981121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 06/20/2024] [Indexed: 07/31/2024] Open
Abstract
Dihydrouridine (D), a prevalent and evolutionarily conserved base in the transcriptome, primarily resides in tRNAs and, to a lesser extent, in mRNAs. Notably, this modification is found at position 2449 in the Escherichia coli 23S rRNA, strategically positioned near the ribosome's peptidyl transferase site. Despite the prior identification, in E. coli genome, of three dihydrouridine synthases (DUS), a set of NADPH and FMN-dependent enzymes known for introducing D in tRNAs and mRNAs, characterization of the enzyme responsible for D2449 deposition has remained elusive. This study introduces a rapid method for detecting D in rRNA, involving reverse transcriptase-blockage at the rhodamine-labeled D2449 site, followed by PCR amplification (RhoRT-PCR). Through analysis of rRNA from diverse E. coli strains, harboring chromosomal or single-gene deletions, we pinpoint the yhiN gene as the ribosomal dihydrouridine synthase, now designated as RdsA. Biochemical characterizations uncovered RdsA as a unique class of flavoenzymes, dependent on FAD and NADH, with a complex structural topology. In vitro assays demonstrated that RdsA dihydrouridylates a short rRNA transcript mimicking the local structure of the peptidyl transferase site. This suggests an early introduction of this modification before ribosome assembly. Phylogenetic studies unveiled the widespread distribution of the yhiN gene in the bacterial kingdom, emphasizing the conservation of rRNA dihydrouridylation. In a broader context, these findings underscore nature's preference for utilizing reduced flavin in the reduction of uridines and their derivatives.
Collapse
Affiliation(s)
- Sabrine Toubdji
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Institut de Biologie Paris-Seine, F-75252Paris Cedex 05, France
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, F-75231, Paris Cedex 05, France
| | - Quentin Thullier
- Université de Lorraine, CNRS, Institut National de la Santé et de la Recherche Médicale, Ingénierie-Biologie-Santé en Lorraine, Epitranscriptomique et Séquençage Core Facility, F-54000Nancy, France
- Université de Lorraine, CNRS, Ingénierie Moléculaire, Cellulaire et Physiopathologie, F-54000Nancy, France
| | - Lea-Marie Kilz
- Institut für Pharmazeutische und Biomedizinische Wissenschaften, Johannes Gutenberg-Universität, MainzD-55128, Germany
| | - Virginie Marchand
- Université de Lorraine, CNRS, Institut National de la Santé et de la Recherche Médicale, Ingénierie-Biologie-Santé en Lorraine, Epitranscriptomique et Séquençage Core Facility, F-54000Nancy, France
- Université de Lorraine, CNRS, Ingénierie Moléculaire, Cellulaire et Physiopathologie, F-54000Nancy, France
| | - Yifeng Yuan
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL32611
| | - Claudia Sudol
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Institut de Biologie Paris-Seine, F-75252Paris Cedex 05, France
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, F-75231, Paris Cedex 05, France
| | - Catherine Goyenvalle
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Institut de Biologie Paris-Seine, F-75252Paris Cedex 05, France
| | - Olivier Jean-Jean
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Institut de Biologie Paris-Seine, F-75252Paris Cedex 05, France
| | - Simon Rose
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230Odense M, Denmark
| | - Stephen Douthwaite
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, DK-5230Odense M, Denmark
| | - Léo Hardy
- Institut Pasteur, Université Paris Cité, CNRS UMR3525, Unité Plasticité du Génome Bactérien, F-75015 Paris, France
| | - Zeynep Baharoglu
- Institut Pasteur, Université Paris Cité, CNRS UMR3525, Unité Plasticité du Génome Bactérien, F-75015 Paris, France
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL32611
- Genetics Institute, University of Florida, Gainesville, FL32610
| | - Mark Helm
- Institut für Pharmazeutische und Biomedizinische Wissenschaften, Johannes Gutenberg-Universität, MainzD-55128, Germany
| | - Yuri Motorin
- Université de Lorraine, CNRS, Institut National de la Santé et de la Recherche Médicale, Ingénierie-Biologie-Santé en Lorraine, Epitranscriptomique et Séquençage Core Facility, F-54000Nancy, France
- Université de Lorraine, CNRS, Ingénierie Moléculaire, Cellulaire et Physiopathologie, F-54000Nancy, France
| | - Djemel Hamdane
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, F-75231, Paris Cedex 05, France
| | - Damien Brégeon
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Institut de Biologie Paris-Seine, F-75252Paris Cedex 05, France
| |
Collapse
|
8
|
XIONG J, FENG T, YUAN BF. [Advances in mapping analysis of ribonucleic acid modifications through sequencing]. Se Pu 2024; 42:632-645. [PMID: 38966972 PMCID: PMC11224946 DOI: 10.3724/sp.j.1123.2023.12025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Indexed: 07/06/2024] Open
Abstract
Over 170 chemical modifications have been discovered in various types of ribonucleic acids (RNAs), including messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), and small nuclear RNA (snRNA). These RNA modifications play crucial roles in a wide range of biological processes such as gene expression regulation, RNA stability maintenance, and protein translation. RNA modifications represent a new dimension of gene expression regulation known as the "epitranscriptome". The discovery of RNA modifications and the relevant writers, erasers, and readers provides an important basis for studies on the dynamic regulation and physiological functions of RNA modifications. Owing to the development of detection technologies for RNA modifications, studies on RNA epitranscriptomes have progressed to the single-base resolution, multilayer, and full-coverage stage. Transcriptome-wide methods help discover new RNA modification sites and are of great importance for elucidating the molecular regulatory mechanisms of epitranscriptomics, exploring the disease associations of RNA modifications, and understanding their clinical applications. The existing RNA modification sequencing technologies can be categorized according to the pretreatment approach and sequencing principle as direct high-throughput sequencing, antibody-enrichment sequencing, enzyme-assisted sequencing, chemical labeling-assisted sequencing, metabolic labeling sequencing, and nanopore sequencing technologies. These methods, as well as studies on the functions of RNA modifications, have greatly expanded our understanding of epitranscriptomics. In this review, we summarize the recent progress in RNA modification detection technologies, focusing on the basic principles, advantages, and limitations of different methods. Direct high-throughput sequencing methods do not require complex RNA pretreatment and allow for the mapping of RNA modifications using conventional RNA sequencing methods. However, only a few RNA modifications can be analyzed by high-throughput sequencing. Antibody enrichment followed by high-throughput sequencing has emerged as a crucial approach for mapping RNA modifications, significantly advancing the understanding of RNA modifications and their regulatory functions in different species. However, the resolution of antibody-enrichment sequencing is limited to approximately 100-200 bp. Although chemical crosslinking techniques can achieve single-base resolution, these methods are often complex, and the specificity of the antibodies used in these methods has raised concerns. In particular, the issue of off-target binding by the antibodies requires urgent attention. Enzyme-assisted sequencing has improved the accuracy of the localization analysis of RNA modifications and enables stoichiometric detection with single-base resolution. However, the enzymes used in this technique show poor reactivity, specificity, and sequence preference. Chemical labeling sequencing has become a widely used approach for profiling RNA modifications, particularly by altering reverse transcription (RT) signatures such as RT stops, misincorporations, and deletions. Chemical-assisted sequencing provides a sequence-independent RNA modification detection strategy that enables the localization of multiple RNA modifications. Additionally, when combined with the biotin-streptavidin affinity method, low-abundance RNA modifications can be enriched and detected. Nevertheless, the specificity of many chemical reactions remains problematic, and the development of specific reaction probes for particular modifications should continue in the future to achieve the precise localization of RNA modifications. As an indirect localization method, metabolic labeling sequencing specifically localizes the sites at which modifying enzymes act, which is of great significance in the study of RNA modification functions. However, this method is limited by the intracellular labeling of RNA and cannot be applied to biological samples such as clinical tissues and blood samples. Nanopore sequencing is a direct RNA-sequencing method that does not require RT or the polymerase chain reaction (PCR). However, challenges in analyzing the data obtained from nanopore sequencing, such as the high rate of false positives, must be resolved. Discussing sequencing analysis methods for various types of RNA modifications is instructive for the future development of novel RNA modification mapping technologies, and will aid studies on the functions of RNA modifications across the entire transcriptome.
Collapse
|
9
|
Relier S, Schiffers S, Beiki H, Oberdoerffer S. Enhanced ac4C detection in RNA via chemical reduction and cDNA synthesis with modified dNTPs. RNA (NEW YORK, N.Y.) 2024; 30:938-953. [PMID: 38697668 PMCID: PMC11182010 DOI: 10.1261/rna.079863.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 04/04/2024] [Indexed: 05/05/2024]
Abstract
The functional analysis of epitranscriptomic modifications in RNA is constrained by a lack of methods that accurately capture their locations and levels. We previously demonstrated that the RNA modification N4-acetylcytidine (ac4C) can be mapped at base resolution through sodium borohydride reduction to tetrahydroacetylcytidine (tetrahydro-ac4C), followed by cDNA synthesis to misincorporate adenosine opposite reduced ac4C sites, culminating in C:T mismatches at acetylated cytidines (RedaC:T). However, this process is relatively inefficient, resulting in <20% C:T mismatches at a fully modified ac4C site in 18S rRNA. Considering that ac4C locations in other substrates including mRNA are unlikely to reach full penetrance, this method is not ideal for comprehensive mapping. Here, we introduce "RetraC:T" (reduction to tetrahydro-ac4C and reverse transcription with amino-dATP to induce C:T mismatches) as a method with enhanced ability to detect ac4C in cellular RNA. In brief, RNA is reduced through NaBH4 or the closely related reagent sodium cyanoborohydride (NaCNBH3) followed by cDNA synthesis in the presence of a modified DNA nucleotide, 2-amino-dATP, that preferentially binds to tetrahydro-ac4C. Incorporation of the modified dNTP substantially improved C:T mismatch rates, reaching stoichiometric detection of ac4C in 18S rRNA. Importantly, 2-amino-dATP did not result in truncated cDNA products nor increase mismatches at other locations. Thus, modified dNTPs are introduced as a new addition to the toolbox for detecting ac4C at base resolution.
Collapse
Affiliation(s)
- Sebastien Relier
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Sarah Schiffers
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Hamid Beiki
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Shalini Oberdoerffer
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
10
|
Sudol C, Kilz LM, Marchand V, Thullier Q, Guérineau V, Goyenvalle C, Faivre B, Toubdji S, Lombard M, Jean-Jean O, de Crécy-Lagard V, Helm M, Motorin Y, Brégeon D, Hamdane D. Functional redundancy in tRNA dihydrouridylation. Nucleic Acids Res 2024; 52:5880-5894. [PMID: 38682613 PMCID: PMC11162810 DOI: 10.1093/nar/gkae325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 03/26/2024] [Accepted: 04/24/2024] [Indexed: 05/01/2024] Open
Abstract
Dihydrouridine (D) is a common modified base found predominantly in transfer RNA (tRNA). Despite its prevalence, the mechanisms underlying dihydrouridine biosynthesis, particularly in prokaryotes, have remained elusive. Here, we conducted a comprehensive investigation into D biosynthesis in Bacillus subtilis through a combination of genetic, biochemical, and epitranscriptomic approaches. Our findings reveal that B. subtilis relies on two FMN-dependent Dus-like flavoprotein homologs, namely DusB1 and DusB2, to introduce all D residues into its tRNAs. Notably, DusB1 exhibits multisite enzyme activity, enabling D formation at positions 17, 20, 20a and 47, while DusB2 specifically catalyzes D biosynthesis at positions 20 and 20a, showcasing a functional redundancy among modification enzymes. Extensive tRNA-wide D-mapping demonstrates that this functional redundancy impacts the majority of tRNAs, with DusB2 displaying a higher dihydrouridylation efficiency compared to DusB1. Interestingly, we found that BsDusB2 can function like a BsDusB1 when overexpressed in vivo and under increasing enzyme concentration in vitro. Furthermore, we establish the importance of the D modification for B. subtilis growth at suboptimal temperatures. Our study expands the understanding of D modifications in prokaryotes, highlighting the significance of functional redundancy in this process and its impact on bacterial growth and adaptation.
Collapse
Affiliation(s)
- Claudia Sudol
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Paris 75252, France
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, 11 place Marcelin Berthelot, 75231 Paris Cedex 05, France
| | - Lea-Marie Kilz
- Institut für pharmazeutische und biomedizinische Wissenschaften (IPBW), Johannes Gutenberg-Universität, Mainz 55128, Germany
| | - Virginie Marchand
- Université de Lorraine, CNRS, INSERM, UMS2008/US40 IBSLor, EpiRNA-Seq Core Facility, Nancy F-54000, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, Nancy F-54000, France
| | - Quentin Thullier
- Université de Lorraine, CNRS, INSERM, UMS2008/US40 IBSLor, EpiRNA-Seq Core Facility, Nancy F-54000, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, Nancy F-54000, France
| | - Vincent Guérineau
- Université Paris-Saclay, CNRS, Institut de Chimie des Substances Naturelles, UPR 2301, 91198, Gif-sur-Yvette, France
| | - Catherine Goyenvalle
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Paris 75252, France
| | - Bruno Faivre
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, 11 place Marcelin Berthelot, 75231 Paris Cedex 05, France
| | - Sabrine Toubdji
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Paris 75252, France
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, 11 place Marcelin Berthelot, 75231 Paris Cedex 05, France
| | - Murielle Lombard
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, 11 place Marcelin Berthelot, 75231 Paris Cedex 05, France
| | - Olivier Jean-Jean
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Paris 75252, France
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
- University of Florida, Genetics Institute, Gainesville, FL 32610, USA
| | - Mark Helm
- Institut für pharmazeutische und biomedizinische Wissenschaften (IPBW), Johannes Gutenberg-Universität, Mainz 55128, Germany
| | - Yuri Motorin
- Université de Lorraine, CNRS, INSERM, UMS2008/US40 IBSLor, EpiRNA-Seq Core Facility, Nancy F-54000, France
- Université de Lorraine, CNRS, UMR7365 IMoPA, Nancy F-54000, France
| | - Damien Brégeon
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biology of Aging and Adaptation, Paris 75252, France
| | - Djemel Hamdane
- Collège De France, Sorbonne Université, CNRS, Laboratoire de Chimie des Processus Biologiques, 11 place Marcelin Berthelot, 75231 Paris Cedex 05, France
| |
Collapse
|
11
|
Ji J, Yu NJ, Kleiner RE. Sequence- and Structure-Specific tRNA Dihydrouridylation by hDUS2. ACS CENTRAL SCIENCE 2024; 10:803-812. [PMID: 38680565 PMCID: PMC11046453 DOI: 10.1021/acscentsci.3c01382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 02/22/2024] [Accepted: 02/26/2024] [Indexed: 05/01/2024]
Abstract
The post-transcriptional reduction of uridine to dihydrouridine (D) by dihydrouridine synthase (DUS) enzymes is among the most ubiquitous transformations in RNA biology. D is found at multiple sites in tRNAs, and studies in yeast have proposed that each of the four eukaryotic DUS enzymes modifies a different site; however, the molecular basis for this exquisite selectivity is unknown, and human DUS enzymes have remained largely uncharacterized. Here we investigate the substrate specificity of human dihydrouridine synthase 2 (hDUS2) using mechanism-based cross-linking with 5-bromouridine (5-BrUrd)-modified oligonucleotide probes and in vitro dihydrouridylation assays. We find that hDUS2 exclusively modifies U20 across diverse tRNA substrates and identify a minimal GU sequence within the tRNA D loop that underlies selective substrate modification. Further, we use our mechanism-based platform to screen small molecule inhibitors of hDUS2, a potential anticancer target. Our work elucidates the principles of substrate modification by a conserved DUS and provides a general platform for studying RNA modifying enzymes with sequence-defined activity-based probes.
Collapse
Affiliation(s)
- Jingwei Ji
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Nathan J. Yu
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Ralph E. Kleiner
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
12
|
Beiki H, Sturgill D, Arango D, Relier S, Schiffers S, Oberdoerffer S. Detection of ac4C in human mRNA is preserved upon data reassessment. Mol Cell 2024; 84:1611-1625.e3. [PMID: 38640896 PMCID: PMC11353019 DOI: 10.1016/j.molcel.2024.03.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 07/19/2023] [Accepted: 03/18/2024] [Indexed: 04/21/2024]
Abstract
We recently reported the distribution of N4-acetylcytidine (ac4C) in HeLa mRNA at base resolution through chemical reduction and the induction of C:T mismatches in sequencing (RedaC:T-seq). Our results contradicted an earlier report from Schwartz and colleagues utilizing a similar method termed ac4C-seq. Here, we revisit both datasets and reaffirm our findings. Through RedaC:T-seq reanalysis, we establish a low basal error rate at unmodified nucleotides that is not skewed to any specific mismatch type and a prominent increase in C:T substitutions as the dominant mismatch type in both treated wild-type replicates, with a high degree of reproducibility across replicates. In contrast, through ac4C-seq reanalysis, we uncover significant data quality issues including insufficient depth, with one wild-type replicate yielding 2.7 million reads, inconsistencies in reduction efficiencies between replicates, and an overall increase in mismatches involving thymine that could obscure ac4C detection. These analyses bolster the detection of ac4C in HeLa mRNA through RedaC:T-seq.
Collapse
Affiliation(s)
- Hamid Beiki
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - David Sturgill
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - Daniel Arango
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Sebastien Relier
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - Sarah Schiffers
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892, USA
| | - Shalini Oberdoerffer
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, MD 20892, USA.
| |
Collapse
|
13
|
Schiffers S, Oberdoerffer S. ac4C: a fragile modification with stabilizing functions in RNA metabolism. RNA (NEW YORK, N.Y.) 2024; 30:583-594. [PMID: 38531654 PMCID: PMC11019744 DOI: 10.1261/rna.079948.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 02/09/2024] [Indexed: 03/28/2024]
Abstract
In recent years, concerted efforts to map and understand epitranscriptomic modifications in mRNA have unveiled new complexities in the regulation of gene expression. These studies cumulatively point to diverse functions in mRNA metabolism, spanning pre-mRNA processing, mRNA degradation, and translation. However, this emerging landscape is not without its intricacies and sources of discrepancies. Disparities in detection methodologies, divergent interpretations of functional outcomes, and the complex nature of biological systems across different cell types pose significant challenges. With a focus of N4-acetylcytidine (ac4C), this review endeavors to unravel conflicting narratives by examining the technological, biological, and methodological factors that have contributed to discrepancies and thwarted research progress. Our goal is to mitigate detection inconsistencies and establish a unified model to elucidate the contribution of ac4C to mRNA metabolism and cellular equilibrium.
Collapse
Affiliation(s)
- Sarah Schiffers
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| | - Shalini Oberdoerffer
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland 20892, USA
| |
Collapse
|
14
|
Gilbert WV. Recent developments, opportunities, and challenges in the study of mRNA pseudouridylation. RNA (NEW YORK, N.Y.) 2024; 30:530-536. [PMID: 38531650 PMCID: PMC11019745 DOI: 10.1261/rna.079975.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 02/09/2024] [Indexed: 03/28/2024]
Abstract
Pseudouridine is an abundant mRNA modification found in diverse organisms ranging from bacteria and viruses to multicellular plants and humans. New developments in pseudouridine profiling provide quantitative tools to map mRNA pseudouridylation sites. Sparse biochemical studies establish the potential for mRNA pseudouridylation to affect most stages of the mRNA life cycle from birth to death. This recent progress sets the stage for deeper investigations into the molecular and cellular functions of specific mRNA pseudouridines, including in disease.
Collapse
Affiliation(s)
- Wendy V Gilbert
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| |
Collapse
|
15
|
Thalalla Gamage S, Howpay Manage SA, Chu TT, Meier JL. Cytidine Acetylation Across the Tree of Life. Acc Chem Res 2024; 57:338-348. [PMID: 38226431 PMCID: PMC11578069 DOI: 10.1021/acs.accounts.3c00673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2024]
Abstract
Acetylation plays a critical role in regulating eukaryotic transcription via the modification of histones. Beyond this well-documented function, a less explored biological frontier is the potential for acetylation to modify and regulate the function of RNA molecules themselves. N4-Acetylcytdine (ac4C) is a minor RNA nucleobase conserved across all three domains of life (archaea, bacteria, and eukarya), a conservation that suggests a fundamental role in biological processes. Unlike many RNA modifications that are controlled by large enzyme families, almost all organisms catalyze ac4C using a homologue of human Nat10, an essential disease-associated acetyltransferase enzyme.A critical step in defining the fundamental functions of RNA modifications has been the development of methods for their sensitive and specific detection. This Account describes recent progress enabling the use of chemical sequencing reactions to map and quantify ac4C with single-nucleotide resolution in RNA. To orient readers, we first provide historical background of the discovery of ac4C and the enzymes that catalyze its formation. Next, we describe mechanistic experiments that led to the development of first- and second-generation sequencing reactions able to determine ac4C's position in a polynucleotide by exploiting the nucleobase's selective susceptibility to reduction by hydride donors. A notable feature of this chemistry, which may serve as a prototype for nucleotide resolution RNA modification sequencing reactions more broadly, is its ability to drive a penetrant and detectable gain of signal specifically at ac4C sites. Emphasizing practical applications, we present how this optimized chemistry can be integrated into experimental workflows capable of sensitive, transcriptome-wide analysis. Such readouts can be applied to quantitatively define the ac4C landscape across the tree of life. For example, in human cell lines and yeast, this method has uncovered that ac4C is highly selective, predominantly occupying dominant sites within rRNA (rRNA) and tRNA (tRNA). By contrast, when we extend these analyses to thermophilic archaea they identify the potential for much more prevalent patterns of cytidine acetylation, leading to the discovery of a role for this modification in adaptation to environmental stress. Nucleotide resolution analyses of ac4C have also allowed for the determination of structure-activity relationships required for short nucleolar RNA (snoRNA)-catalyzed ac4C deposition and the discovery of organisms with unexpectedly divergent tRNA and rRNA acetylation signatures. Finally, we share how these studies have shaped our approach to evaluating novel ac4C sites reported in the literature and highlight unanswered questions and new directions that set the stage for future research in the field.
Collapse
Affiliation(s)
- Supuni Thalalla Gamage
- Chemical Biology Laboratory, National Cancer Institute, National Institutes of Health, Frederick, Maryland 21702, United States
| | - Shereen A Howpay Manage
- Chemical Biology Laboratory, National Cancer Institute, National Institutes of Health, Frederick, Maryland 21702, United States
| | - T Thu Chu
- Chemical Biology Laboratory, National Cancer Institute, National Institutes of Health, Frederick, Maryland 21702, United States
| | - Jordan L Meier
- Chemical Biology Laboratory, National Cancer Institute, National Institutes of Health, Frederick, Maryland 21702, United States
| |
Collapse
|
16
|
Harun-Or-Roshid M, Maeda K, Phan LT, Manavalan B, Kurata H. Stack-DHUpred: Advancing the accuracy of dihydrouridine modification sites detection via stacking approach. Comput Biol Med 2024; 169:107848. [PMID: 38145601 DOI: 10.1016/j.compbiomed.2023.107848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 11/14/2023] [Accepted: 12/11/2023] [Indexed: 12/27/2023]
Abstract
Dihydrouridine (DHU, D) is one of the most abundant post-transcriptional uridine modifications found in tRNA, mRNA, and snoRNA, closely associated with disease pathogenesis and various biological processes in eukaryotes. Identifying D sites is important for understanding the modification mechanisms and/or epigenetic regulation. However, biological experiments for detecting D sites are time-consuming and expensive. Given these challenges, computational methods have been developed for accurately identifying the D sites in genome-wide datasets. However, existing methods have some limitations, and their prediction performance needs to be improved. In this work, we have developed a new computational predictor for accurately identifying D sites called Stack-DHUpred. Briefly, we trained 66 baseline models or single-feature models by connecting six machine learning classifiers with eleven different feature encoding methods and stacked different baseline models to build stacked ensemble learning models. Subsequently, the optimal combination of the baseline models was identified for the construction of the final stacked model. Remarkably, the Stack-DHUpred outperformed the existing predictors on our new independent dataset, indicating that the stacking approach significantly improved the prediction performance. We have made Stack-DHUpred available to the public through a web server (http://kurata35.bio.kyutech.ac.jp/Stack-DHUpred) and a standalone program (https://github.com/kuratahiroyuki/Stack-DHUpred). We believe that Stack-DHUpred will be a valuable tool for accelerating the discovery of D modifications and understanding their role in post-transcriptional regulation.
Collapse
Affiliation(s)
- Md Harun-Or-Roshid
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan
| | - Kazuhiro Maeda
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan
| | - Le Thi Phan
- Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, 16419, Republic of Korea
| | - Balachandran Manavalan
- Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
| | - Hiroyuki Kurata
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan.
| |
Collapse
|
17
|
Rodell R, Robalin N, Martinez NM. Why U matters: detection and functions of pseudouridine modifications in mRNAs. Trends Biochem Sci 2024; 49:12-27. [PMID: 38097411 PMCID: PMC10976346 DOI: 10.1016/j.tibs.2023.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 10/24/2023] [Accepted: 10/25/2023] [Indexed: 01/07/2024]
Abstract
The uridine modifications pseudouridine (Ψ), dihydrouridine, and 5-methyluridine are present in eukaryotic mRNAs. Many uridine-modifying enzymes are associated with human disease, underscoring the importance of uncovering the functions of uridine modifications in mRNAs. These modified uridines have chemical properties distinct from those of canonical uridines, which impact RNA structure and RNA-protein interactions. Ψ, the most abundant of these uridine modifications, is present across (pre-)mRNAs. Recent work has shown that many Ψs are present at intermediate to high stoichiometries that are likely conducive to function and at locations that are poised to influence pre-/mRNA processing. Technological innovations and mechanistic investigations are unveiling the functions of uridine modifications in pre-mRNA splicing, translation, and mRNA stability, which are discussed in this review.
Collapse
Affiliation(s)
- Rebecca Rodell
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA
| | - Nicolas Robalin
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Nicole M Martinez
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA 94305, USA; Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA; Sarafan ChEM-H Institute, Stanford University, Stanford, CA 94305, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|
18
|
Ren J, Chen X, Zhang Z, Shi H, Wu S. DPred_3S: identifying dihydrouridine (D) modification on three species epitranscriptome based on multiple sequence-derived features. Front Genet 2023; 14:1334132. [PMID: 38169665 PMCID: PMC10758487 DOI: 10.3389/fgene.2023.1334132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 11/29/2023] [Indexed: 01/05/2024] Open
Abstract
Introduction: Dihydrouridine (D) is a conserved modification of tRNA among all three life domains. D modification enhances the flexibility of a single nucleotide base in the spatial structure and is disease- and evolution-associated. Recent studies have also suggested the presence of dihydrouridine on mRNA. Methods: To identify D in epitranscriptome, we provided a prediction framework named "DPred_3S" based on the machine learning approach for three species D epitranscriptome, which used epitranscriptome sequencing data as training data for the first time. Results: The optimal features were evaluated by the F-score and integration of different features; our model achieved area under the receiver operating characteristic curve (AUROC) scores 0.955, 0.946, and 0.905 for Saccharomyces cerevisiae, Escherichia coli, and Schizosaccharomyces pombe, respectively. The performances of different machine learning algorithms were also compared in this study. Discussion: The high performances of our model suggest the D sites can be distinguished based on their surrounding sequence, but the lower performance of cross-species prediction may be limited by technique preferences.
Collapse
Affiliation(s)
- Jinjin Ren
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian, China
- Fujian Key Laboratory of Tumor Microbiology, Department of Medical Microbiology, Fujian Medical University, Fuzhou, Fujian, China
| | - Xiaozhen Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian, China
| | - Zhengqian Zhang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian, China
| | - Haoran Shi
- Institute of Applied Microbiology, Research Center for BioSystems, Land Use, and Nutrition (IFZ), Justus-Liebig-University Giessen, Giessen, Germany
| | - Shuxiang Wu
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, Fujian, China
- Fujian Key Laboratory of Tumor Microbiology, Department of Medical Microbiology, Fujian Medical University, Fuzhou, Fujian, China
| |
Collapse
|
19
|
Yu NJ, Dai W, Li A, He M, Kleiner RE. Cell type-specific translational regulation by human DUS enzymes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.03.565399. [PMID: 37965204 PMCID: PMC10635104 DOI: 10.1101/2023.11.03.565399] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Dihydrouridine is an abundant and conserved modified nucleoside present on tRNA, but characterization and functional studies of modification sites and associated DUS writer enzymes in mammals is lacking. Here we use a chemical probing strategy, RNABPP-PS, to identify 5-chlorouridine as an activity-based probe for human DUS enzymes. We map D modifications using RNA-protein crosslinking and chemical transformation and mutational profiling to reveal D modification sites on human tRNAs. Further, we knock out individual DUS genes in two human cell lines to investigate regulation of tRNA expression levels and codon-specific translation. We show that whereas D modifications are present across most tRNA species, loss of D only perturbs the translational function of a subset of tRNAs in a cell type-specific manner. Our work provides powerful chemical strategies for investigating D and DUS enzymes in diverse biological systems and provides insight into the role of a ubiquitous tRNA modification in translational regulation.
Collapse
|
20
|
Ji J, Yu NJ, Kleiner RE. A minimal sequence motif drives selective tRNA dihydrouridylation by hDUS2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.04.565616. [PMID: 37961591 PMCID: PMC10635142 DOI: 10.1101/2023.11.04.565616] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
The post-transcriptional reduction of uridine to dihydrouridine (D) by dihydrouridine synthase (DUS) enzymes is among the most ubiquitous transformations in RNA biology. D is found at multiple sites in tRNAs and studies in yeast have proposed that each of the four eukaryotic DUS enzymes modifies a different site, however the molecular basis for this exquisite selectivity is unknown and human DUS enzymes have remained largely uncharacterized. Here we investigate the substrate specificity of human dihydrouridine synthase 2 (hDUS2) using mechanism-based crosslinking with 5-bromouridine (5-BrUrd)-modified oligonucleotide probes and in vitro dihydrouridylation assays. We find that hDUS2 modifies U20 in the D loop of diverse tRNA substrates and identify a minimal GU motif within the tRNA tertiary fold required for directing its activity. Further, we use our mechanism-based platform to screen small molecule inhibitors of hDUS2, a potential anti-cancer target. Our work elucidates the principles of substrate modification by a conserved DUS and provides a general platform to studying RNA modifying enzymes with sequence-defined activity-based probes.
Collapse
|
21
|
Dai W, Yu NJ, Kleiner RE. Chemoproteomic Approaches to Studying RNA Modification-Associated Proteins. Acc Chem Res 2023; 56:2726-2739. [PMID: 37733063 PMCID: PMC11025531 DOI: 10.1021/acs.accounts.3c00450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
The function of cellular RNA is modulated by a host of post-transcriptional chemical modifications installed by dedicated RNA-modifying enzymes. RNA modifications are widespread in biology, occurring in all kingdoms of life and in all classes of RNA molecules. They regulate RNA structure, folding, and protein-RNA interactions, and have important roles in fundamental gene expression processes involving mRNA, tRNA, rRNA, and other types of RNA species. Our understanding of RNA modifications has advanced considerably; however, there are still many outstanding questions regarding the distribution of modifications across all RNA transcripts and their biological function. One of the major challenges in the study of RNA modifications is the lack of sequencing methods for the transcriptome-wide mapping of different RNA-modification structures. Furthermore, we lack general strategies to characterize RNA-modifying enzymes and RNA-modification reader proteins. Therefore, there is a need for new approaches to enable integrated studies of RNA-modification chemistry and biology.In this Account, we describe our development and application of chemoproteomic strategies for the study of RNA-modification-associated proteins. We present two orthogonal methods based on nucleoside and oligonucleotide chemical probes: 1) RNA-mediated activity-based protein profiling (RNABPP), a metabolic labeling strategy based on reactive modified nucleoside probes to profile RNA-modifying enzymes in cells and 2) photo-cross-linkable diazirine-containing synthetic oligonucleotide probes for identifying RNA-modification reader proteins.We use RNABPP with C5-modified cytidine and uridine nucleosides to capture diverse RNA-pyrimidine-modifying enzymes including methyltransferases, dihydrouridine synthases, and RNA dioxygenase enzymes. Metabolic labeling facilitates the mechanism-based cross-linking of RNA-modifying enzymes with their native RNA substrates in cells. Covalent RNA-protein complexes are then isolated by denaturing oligo(dT) pulldown, and cross-linked proteins are identified by quantitative proteomics. Once suitable modified nucleosides have been identified as mechanism-based proteomic probes, they can be further deployed in transcriptome-wide sequencing experiments to profile the substrates of RNA-modifying enzymes at nucleotide resolution. Using 5-fluorouridine-mediated RNA-protein cross-linking and sequencing, we analyzed the substrates of human dihydrouridine synthase DUS3L. 5-Ethynylcytidine-mediated cross-linking enabled the investigation of ALKBH1 substrates. We also characterized the functions of these RNA-modifying enzymes in human cells by using genetic knockouts and protein translation reporters.We profiled RNA readers for N6-methyladenosine (m6A) and N1-methyladenosine (m1A) using a comparative proteomic workflow based on diazirine-containing modified oligonucleotide probes. Our approach enables quantitative proteome-wide analysis of the preference of RNA-binding proteins for modified nucleotides across a range of affinities. Interestingly, we found that YTH-domain proteins YTHDF1/2 can bind to both m6A and m1A to mediate transcript destabilization. Furthermore, m6A also inhibits stress granule proteins from binding to RNA.Taken together, we demonstrate the application of chemical probing strategies, together with proteomic and transcriptomic workflows, to reveal new insights into the biological roles of RNA modifications and their associated proteins.
Collapse
Affiliation(s)
| | | | - Ralph E. Kleiner
- Department of Chemistry, Princeton University, Princeton, NJ, USA 08544
| |
Collapse
|
22
|
Tomasi FG, Kimura S, Rubin EJ, Waldor MK. A tRNA modification in Mycobacterium tuberculosis facilitates optimal intracellular growth. eLife 2023; 12:RP87146. [PMID: 37755167 PMCID: PMC10531406 DOI: 10.7554/elife.87146] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2023] Open
Abstract
Diverse chemical modifications fine-tune the function and metabolism of tRNA. Although tRNA modification is universal in all kingdoms of life, profiles of modifications, their functions, and physiological roles have not been elucidated in most organisms including the human pathogen, Mycobacterium tuberculosis (Mtb), the causative agent of tuberculosis. To identify physiologically important modifications, we surveyed the tRNA of Mtb, using tRNA sequencing (tRNA-seq) and genome-mining. Homology searches identified 23 candidate tRNA modifying enzymes that are predicted to create 16 tRNA modifications across all tRNA species. Reverse transcription-derived error signatures in tRNA-seq predicted the sites and presence of nine modifications. Several chemical treatments prior to tRNA-seq expanded the number of predictable modifications. Deletion of Mtb genes encoding two modifying enzymes, TruB and MnmA, eliminated their respective tRNA modifications, validating the presence of modified sites in tRNA species. Furthermore, the absence of mnmA attenuated Mtb growth in macrophages, suggesting that MnmA-dependent tRNA uridine sulfation contributes to Mtb intracellular growth. Our results lay the foundation for unveiling the roles of tRNA modifications in Mtb pathogenesis and developing new therapeutics against tuberculosis.
Collapse
Affiliation(s)
- Francesca G Tomasi
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public HealthBostonUnited States
| | - Satoshi Kimura
- Division of Infectious Diseases, Brigham and Women's HospitalBostonUnited States
- Department of Microbiology, Harvard Medical SchoolBostonUnited States
- Howard Hughes Medical InstituteBostonUnited States
| | - Eric J Rubin
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public HealthBostonUnited States
| | - Matthew K Waldor
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public HealthBostonUnited States
- Division of Infectious Diseases, Brigham and Women's HospitalBostonUnited States
- Department of Microbiology, Harvard Medical SchoolBostonUnited States
- Howard Hughes Medical InstituteBostonUnited States
| |
Collapse
|
23
|
Draycott AS, Schaening-Burgos C, Rojas-Duran MF, Gilbert WV. D-Seq: Genome-wide detection of dihydrouridine modifications in RNA. Methods Enzymol 2023; 692:3-22. [PMID: 37925185 DOI: 10.1016/bs.mie.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2023]
Abstract
In addition to A, C, G and U, RNA contains over 100 additional chemically distinct residues. An abundant modified base frequently found in tRNAs, dihydrouridine (D) has recently been mapped to over 100 positions in mRNAs in yeast and human cells. Multiple highly conserved dihydrouridine synthases associate with and modify mRNA, suggesting there are many D sites yet to be found. Because D alters RNA structure, installation of D in mRNA is likely to effect multiple steps in mRNA metabolism including processing, trafficking, translation, and degradation. Here, we introduce D-seq, a method to chart the D landscape at single nucleotide resolution. The included protocols start with RNA isolation and carry through D-seq library preparation and data analysis. While the protocols below are tailored to map Ds in mRNA, the D-seq method is generalizable to any RNA type of interest, including non-coding RNAs, which have also recently been identified as dihydrouridine synthase targets.
Collapse
Affiliation(s)
- Austin S Draycott
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New, Haven, CT, United States
| | | | - Maria F Rojas-Duran
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New, Haven, CT, United States
| | - Wendy V Gilbert
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New, Haven, CT, United States.
| |
Collapse
|
24
|
Patrasso EA, Raikundalia S, Arango D. Regulation of the epigenome through RNA modifications. Chromosoma 2023; 132:231-246. [PMID: 37138119 PMCID: PMC10524150 DOI: 10.1007/s00412-023-00794-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/10/2023] [Accepted: 04/12/2023] [Indexed: 05/05/2023]
Abstract
Chemical modifications of nucleotides expand the complexity and functional properties of genomes and transcriptomes. A handful of modifications in DNA bases are part of the epigenome, wherein DNA methylation regulates chromatin structure, transcription, and co-transcriptional RNA processing. In contrast, more than 150 chemical modifications of RNA constitute the epitranscriptome. Ribonucleoside modifications comprise a diverse repertoire of chemical groups, including methylation, acetylation, deamination, isomerization, and oxidation. Such RNA modifications regulate all steps of RNA metabolism, including folding, processing, stability, transport, translation, and RNA's intermolecular interactions. Initially thought to influence all aspects of the post-transcriptional regulation of gene expression exclusively, recent findings uncovered a crosstalk between the epitranscriptome and the epigenome. In other words, RNA modifications feedback to the epigenome to transcriptionally regulate gene expression. The epitranscriptome achieves this feat by directly or indirectly affecting chromatin structure and nuclear organization. This review highlights how chemical modifications in chromatin-associated RNAs (caRNAs) and messenger RNAs (mRNAs) encoding factors involved in transcription, chromatin structure, histone modifications, and nuclear organization affect gene expression transcriptionally.
Collapse
Affiliation(s)
- Emmely A Patrasso
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
- Medical and Pharmaceutical Biotechnology Program, IMC University of Applied Sciences, Krems, Austria
| | - Sweta Raikundalia
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Daniel Arango
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA.
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University, Chicago, IL, USA.
| |
Collapse
|
25
|
Kleiner RE. Chemical Approaches To Investigate Post-transcriptional RNA Regulation. ACS Chem Biol 2023; 18:1684-1697. [PMID: 37540831 PMCID: PMC11031734 DOI: 10.1021/acschembio.3c00406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/06/2023]
Abstract
RNA plays a central role in biological processes, and its activity is regulated by a host of diverse chemical and biochemical mechanisms including post-transcriptional modification and interactions with RNA-binding proteins. Here, we describe our efforts to illuminate RNA biology through the application of chemical tools, focusing on post-transcriptional regulatory mechanisms. We describe the development of an activity-based protein profiling approach for discovery and characterization of RNA-modifying enzymes. Next, we highlight novel approaches for RNA imaging based upon metabolic labeling with modified nucleosides and engineering of the nucleotide salvage pathway. Finally, we discuss profiling RNA-protein interactions using small molecule-dependent RNA editing and synthetic photo-cross-linkable oligonucleotide probes. Our work provides enabling technologies for deciphering the complexity of RNA and its diverse functions in biology.
Collapse
Affiliation(s)
- Ralph E. Kleiner
- Department of Chemistry, Princeton University, Princeton, NJ, USA 08544
| |
Collapse
|
26
|
Abstract
Chemical modifications on mRNA represent a critical layer of gene expression regulation. Research in this area has continued to accelerate over the last decade, as more modifications are being characterized with increasing depth and breadth. mRNA modifications have been demonstrated to influence nearly every step from the early phases of transcript synthesis in the nucleus through to their decay in the cytoplasm, but in many cases, the molecular mechanisms involved in these processes remain mysterious. Here, we highlight recent work that has elucidated the roles of mRNA modifications throughout the mRNA life cycle, describe gaps in our understanding and remaining open questions, and offer some forward-looking perspective on future directions in the field.
Collapse
Affiliation(s)
- Wendy V Gilbert
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, Connecticut, USA;
| | - Sigrid Nachtergaele
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut, USA;
| |
Collapse
|
27
|
Tomasi FG, Kimura S, Rubin EJ, Waldor MK. A tRNA modification in Mycobacterium tuberculosis facilitates optimal intracellular growth. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.20.529267. [PMID: 36865327 PMCID: PMC9979996 DOI: 10.1101/2023.02.20.529267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Diverse chemical modifications fine-tune the function and metabolism of tRNA. Although tRNA modification is universal in all kingdoms of life, profiles of modifications, their functions, and physiological roles have not been elucidated in most organisms including the human pathogen, Mycobacterium tuberculosis ( Mtb ), the causative agent of tuberculosis. To identify physiologically important modifications, we surveyed the tRNA of Mtb , using tRNA sequencing (tRNA-seq) and genome-mining. Homology searches identified 23 candidate tRNA modifying enzymes that are predicted to create 16 tRNA modifications across all tRNA species. Reverse transcription-derived error signatures in tRNA-seq predicted the sites and presence of 9 modifications. Several chemical treatments prior to tRNA-seq expanded the number of predictable modifications. Deletion of Mtb genes encoding two modifying enzymes, TruB and MnmA, eliminated their respective tRNA modifications, validating the presence of modified sites in tRNA species. Furthermore, the absence of mnmA attenuated Mtb growth in macrophages, suggesting that MnmA-dependent tRNA uridine sulfation contributes to Mtb intracellular growth. Our results lay the foundation for unveiling the roles of tRNA modifications in Mtb pathogenesis and developing new therapeutics against tuberculosis.
Collapse
Affiliation(s)
- Francesca G. Tomasi
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public Health, Boston, MA USA
| | - Satoshi Kimura
- Division of Infectious Diseases, Brigham and Women’s Hospital, Boston, MA, USA
- Department of Microbiology, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Boston, MA, USA
| | - Eric J. Rubin
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public Health, Boston, MA USA
| | - Matthew K. Waldor
- Department of Immunology and Infectious Diseases Harvard T. H. Chan School of Public Health, Boston, MA USA
- Division of Infectious Diseases, Brigham and Women’s Hospital, Boston, MA, USA
- Department of Microbiology, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Boston, MA, USA
| |
Collapse
|
28
|
Wang Y, Wang X, Cui X, Meng J, Rong R. Self-attention enabled deep learning of dihydrouridine (D) modification on mRNAs unveiled a distinct sequence signature from tRNAs. MOLECULAR THERAPY. NUCLEIC ACIDS 2023; 31:411-420. [PMID: 36845339 PMCID: PMC9945750 DOI: 10.1016/j.omtn.2023.01.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 01/23/2023] [Indexed: 01/28/2023]
Abstract
Dihydrouridine (D) is a modified pyrimidine nucleotide universally found in viral, prokaryotic, and eukaryotic species. It serves as a metabolic modulator for various pathological conditions, and its elevated levels in tumors are associated with a series of cancers. Precise identification of D sites on RNA is vital for understanding its biological function. A number of computational approaches have been developed for predicting D sites on tRNAs; however, none have considered mRNAs. We present here DPred, the first computational tool for predicting D on mRNAs in yeast from the primary RNA sequences. Built on a local self-attention layer and a convolutional neural network (CNN) layer, the proposed deep learning model outperformed classic machine learning approaches (random forest, support vector machines, etc.) and achieved reasonable accuracy and reliability with areas under the curve of 0.9166 and 0.9027 in jackknife cross-validation and on an independent testing dataset, respectively. Importantly, we showed that distinct sequence signatures are associated with the D sites on mRNAs and tRNAs, implying potentially different formation mechanisms and putative divergent functionality of this modification on the two types of RNA. DPred is available as a user-friendly Web server.
Collapse
Affiliation(s)
- Yue Wang
- Department of Mathematical Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China,Department of Computer Science, University of Liverpool, L69 7ZB Liverpool, UK
| | - Xuan Wang
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Xiaodong Cui
- School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an, Shaanxi 710072, China
| | - Jia Meng
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China,AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China,Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB Liverpool, UK
| | - Rong Rong
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China,Corresponding author: Rong Rong, Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China.
| |
Collapse
|
29
|
Tavakoli S, Nabizadeh M, Makhamreh A, Gamper H, McCormick CA, Rezapour NK, Hou YM, Wanunu M, Rouhanifard SH. Semi-quantitative detection of pseudouridine modifications and type I/II hypermodifications in human mRNAs using direct long-read sequencing. Nat Commun 2023; 14:334. [PMID: 36658122 PMCID: PMC9852470 DOI: 10.1038/s41467-023-35858-w] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 01/05/2023] [Indexed: 01/21/2023] Open
Abstract
Here, we develop and apply a semi-quantitative method for the high-confidence identification of pseudouridylated sites on mammalian mRNAs via direct long-read nanopore sequencing. A comparative analysis of a modification-free transcriptome reveals that the depth of coverage and specific k-mer sequences are critical parameters for accurate basecalling. By adjusting these parameters for high-confidence U-to-C basecalling errors, we identify many known sites of pseudouridylation and uncover previously unreported uridine-modified sites, many of which fall in k-mers that are known targets of pseudouridine synthases. Identified sites are validated using 1000-mer synthetic RNA controls bearing a single pseudouridine in the center position, demonstrating systematic under-calling using our approach. We identify mRNAs with up to 7 unique modification sites. Our workflow allows direct detection of low-, medium-, and high-occupancy pseudouridine modifications on native RNA molecules from nanopore sequencing data and multiple modifications on the same strand.
Collapse
Affiliation(s)
- Sepideh Tavakoli
- Department of Bioengineering, Northeastern University, Boston, MA, USA
| | - Mohammad Nabizadeh
- Department of Mechanical Engineering, Northeastern University, Boston, MA, USA
| | - Amr Makhamreh
- Department of Bioengineering, Northeastern University, Boston, MA, USA
| | - Howard Gamper
- Department of Biochemistry and Molecular Biology, Thomas Jefferson University, Philadelphia, PA, USA
| | | | - Neda K Rezapour
- Department of Physics, Northeastern University, Boston, MA, USA
| | - Ya-Ming Hou
- Department of Biochemistry and Molecular Biology, Thomas Jefferson University, Philadelphia, PA, USA
| | - Meni Wanunu
- Department of Bioengineering, Northeastern University, Boston, MA, USA
- Department of Physics, Northeastern University, Boston, MA, USA
| | | |
Collapse
|
30
|
Pichot F, Hogg MC, Marchand V, Bourguignon V, Jirström E, Farrell C, Gibriel HA, Prehn JH, Motorin Y, Helm M. Quantification of substoichiometric modification reveals global tsRNA hypomodification, preferences for angiogenin-mediated tRNA cleavage, and idiosyncratic epitranscriptomes of human neuronal cell-lines. Comput Struct Biotechnol J 2022; 21:401-417. [PMID: 36618980 PMCID: PMC9798144 DOI: 10.1016/j.csbj.2022.12.020] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 12/13/2022] [Accepted: 12/13/2022] [Indexed: 12/23/2022] Open
Abstract
Modification of tRNA is an integral part of the epitranscriptome with a particularly pronounced potential to generate diversity in RNA expression. Eukaryotic tRNA contains modifications in up to 20% of their nucleotides, but not all sites are always fully modified. Combinations and permutations of partially modified sites in tRNAs can generate a plethora of tRNA isoforms, termed modivariants. Here, we investigate the stoichiometry of incompletely modified sites in tRNAs from human cell lines for their information content. Using a panel of RNA modification mapping methods, we assess the stoichiometry of sites that contain the modifications 5-methylcytidine (m5C), 2'-O-ribose methylation (Nm), 3-methylcytidine (m3C), 7-methylguanosine (m7G), and Dihydrouridine (D). We discovered that up to 75% of sites can be incompletely modified and that the differential modification status of a cellular tRNA population holds information that allows to discriminate e.g. different cell lines. As a further aspect, we investigated potential causal connectivity between tRNA modification and its processing into tRNA fragments (tiRNAs and tRFs). Upon exposure of cultured living cells to cell-penetrating angiogenin, the modification patterns of the corresponding RNA populations was changed. Importantly, we also found that tsRNAs were significantly less modified than their parent tRNAs at numerous sites, suggesting that tsRNAs might derive chiefly from hypomodified tRNAs.
Collapse
Affiliation(s)
- Florian Pichot
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg University Mainz, Staudingerweg 5, 55128 Mainz, Germany
- Université de Lorraine, CNRS, INSERM, IBSLor (UAR2008/US40), Epitranscriptomics and RNA Sequencing Core Facility, F54000 Nancy, France
| | - Marion C. Hogg
- Department of Physiology and Medical Physics and SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, St. Stephen's Green, Dublin, D02 YN77, Ireland
| | - Virginie Marchand
- Université de Lorraine, CNRS, INSERM, IBSLor (UAR2008/US40), Epitranscriptomics and RNA Sequencing Core Facility, F54000 Nancy, France
| | - Valérie Bourguignon
- Université de Lorraine, CNRS, INSERM, IBSLor (UAR2008/US40), Epitranscriptomics and RNA Sequencing Core Facility, F54000 Nancy, France
- Université de Lorraine, CNRS, IMoPA (UMR7365), F54000 Nancy, France
| | - Elisabeth Jirström
- Department of Physiology and Medical Physics and SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, St. Stephen's Green, Dublin, D02 YN77, Ireland
| | - Cliona Farrell
- Department of Physiology and Medical Physics and SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, St. Stephen's Green, Dublin, D02 YN77, Ireland
| | - Hesham A. Gibriel
- Department of Physiology and Medical Physics and SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, St. Stephen's Green, Dublin, D02 YN77, Ireland
| | - Jochen H.M. Prehn
- Department of Physiology and Medical Physics and SFI FutureNeuro Research Centre, Royal College of Surgeons in Ireland, St. Stephen's Green, Dublin, D02 YN77, Ireland
| | - Yuri Motorin
- Université de Lorraine, CNRS, INSERM, IBSLor (UAR2008/US40), Epitranscriptomics and RNA Sequencing Core Facility, F54000 Nancy, France
- Université de Lorraine, CNRS, IMoPA (UMR7365), F54000 Nancy, France
| | - Mark Helm
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg University Mainz, Staudingerweg 5, 55128 Mainz, Germany
| |
Collapse
|
31
|
Lombard M, Reed CJ, Pecqueur L, Faivre B, Toubdji S, Sudol C, Brégeon D, de Crécy-Lagard V, Hamdane D. Evolutionary Diversity of Dus2 Enzymes Reveals Novel Structural and Functional Features among Members of the RNA Dihydrouridine Synthases Family. Biomolecules 2022; 12:1760. [PMID: 36551188 PMCID: PMC9775027 DOI: 10.3390/biom12121760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 11/23/2022] [Accepted: 11/24/2022] [Indexed: 11/29/2022] Open
Abstract
Dihydrouridine (D) is an abundant modified base found in the tRNAs of most living organisms and was recently detected in eukaryotic mRNAs. This base confers significant conformational plasticity to RNA molecules. The dihydrouridine biosynthetic reaction is catalyzed by a large family of flavoenzymes, the dihydrouridine synthases (Dus). So far, only bacterial Dus enzymes and their complexes with tRNAs have been structurally characterized. Understanding the structure-function relationships of eukaryotic Dus proteins has been hampered by the paucity of structural data. Here, we combined extensive phylogenetic analysis with high-precision 3D molecular modeling of more than 30 Dus2 enzymes selected along the tree of life to determine the evolutionary molecular basis of D biosynthesis by these enzymes. Dus2 is the eukaryotic enzyme responsible for the synthesis of D20 in tRNAs and is involved in some human cancers and in the detoxification of β-amyloid peptides in Alzheimer's disease. In addition to the domains forming the canonical structure of all Dus, i.e., the catalytic TIM-barrel domain and the helical domain, both participating in RNA recognition in the bacterial Dus, a majority of Dus2 proteins harbor extensions at both ends. While these are mainly unstructured extensions on the N-terminal side, the C-terminal side extensions can adopt well-defined structures such as helices and beta-sheets or even form additional domains such as zinc finger domains. 3D models of Dus2/tRNA complexes were also generated. This study suggests that eukaryotic Dus2 proteins may have an advantage in tRNA recognition over their bacterial counterparts due to their modularity.
Collapse
Affiliation(s)
- Murielle Lombard
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
| | - Colbie J. Reed
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
| | - Ludovic Pecqueur
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
| | - Bruno Faivre
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
| | - Sabrine Toubdji
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
- IBPS, Biology of Aging and Adaptation, Sorbonne Université 7 quai Saint Bernard, CEDEX 05, 75252 Paris, France
| | - Claudia Sudol
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
- IBPS, Biology of Aging and Adaptation, Sorbonne Université 7 quai Saint Bernard, CEDEX 05, 75252 Paris, France
| | - Damien Brégeon
- IBPS, Biology of Aging and Adaptation, Sorbonne Université 7 quai Saint Bernard, CEDEX 05, 75252 Paris, France
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL 32611, USA
- Genetics Institute, University of Florida, Gainesville, FL 32610, USA
| | - Djemel Hamdane
- Laboratoire de Chimie des Processus Biologiques, CNRS-UMR 8229, Collège de France, Université Pierre et Marie Curie, 11 Place Marcelin Berthelot, CEDEX 05, 75231 Paris, France
| |
Collapse
|
32
|
Arzumanian VA, Dolgalev GV, Kurbatov IY, Kiseleva OI, Poverennaya EV. Epitranscriptome: Review of Top 25 Most-Studied RNA Modifications. Int J Mol Sci 2022; 23:13851. [PMID: 36430347 PMCID: PMC9695239 DOI: 10.3390/ijms232213851] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 11/07/2022] [Accepted: 11/08/2022] [Indexed: 11/12/2022] Open
Abstract
The alphabet of building blocks for RNA molecules is much larger than the standard four nucleotides. The diversity is achieved by the post-transcriptional biochemical modification of these nucleotides into distinct chemical entities that are structurally and functionally different from their unmodified counterparts. Some of these modifications are constituent and critical for RNA functions, while others serve as dynamic markings to regulate the fate of specific RNA molecules. Together, these modifications form the epitranscriptome, an essential layer of cellular biochemistry. As of the time of writing this review, more than 300 distinct RNA modifications from all three life domains have been identified. However, only a few of the most well-established modifications are included in most reviews on this topic. To provide a complete overview of the current state of research on the epitranscriptome, we analyzed the extent of the available information for all known RNA modifications. We selected 25 modifications to describe in detail. Summarizing our findings, we describe the current status of research on most RNA modifications and identify further developments in this field.
Collapse
Affiliation(s)
- Viktoriia A. Arzumanian
- Correspondence: (V.A.A.); (G.V.D.); Tel.: +7-960-889-7117 (V.A.A.); +7-967-236-36-79 (G.V.D.)
| | - Georgii V. Dolgalev
- Correspondence: (V.A.A.); (G.V.D.); Tel.: +7-960-889-7117 (V.A.A.); +7-967-236-36-79 (G.V.D.)
| | | | | | | |
Collapse
|
33
|
Abstract
Nucleotide modifications can markedly influence mRNA processing and metabolism. This Primer explores two new studies, one in PLOS Biology, showing that ~130 yeast mRNAs contain dihydrouridine, a derivative of uridine. Functional studies show that dihydrouridine, in some cases, can affect mRNA splicing.
Collapse
Affiliation(s)
- Sameer Dixit
- Department of Pharmacology, Weill-Cornell Medical College, Cornell University, New York, New York, United States of America
| | - Samie R. Jaffrey
- Department of Pharmacology, Weill-Cornell Medical College, Cornell University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|