51
|
Predicting RNA 5-Methylcytosine Sites by Using Essential Sequence Features and Distributions. BIOMED RESEARCH INTERNATIONAL 2022; 2022:4035462. [PMID: 35071593 PMCID: PMC8776474 DOI: 10.1155/2022/4035462] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 12/07/2021] [Accepted: 12/22/2021] [Indexed: 12/15/2022]
Abstract
Methylation is one of the most common and considerable modifications in biological systems mediated by multiple enzymes. Recent studies have shown that methylation has been widely identified in different RNA molecules. RNA methylation modifications have various kinds, such as 5-methylcytosine (m5C). However, for individual methylation sites, their functions still remain to be elucidated. Testing of all methylation sites relies heavily on high-throughput sequencing technology, which is expensive and labor consuming. Thus, computational prediction approaches could serve as a substitute. In this study, multiple machine learning models were used to predict possible RNA m5C sites on the basis of mRNA sequences in human and mouse. Each site was represented by several features derived from
-mers of an RNA subsequence containing such site as center. The powerful max-relevance and min-redundancy (mRMR) feature selection method was employed to analyse these features. The outcome feature list was fed into incremental feature selection method, incorporating four classification algorithms, to build efficient models. Furthermore, the sites related to features used in the models were also investigated.
Collapse
|
52
|
He S, Gu C, Su T, Qiu Q. Research Progress of circRNAs in Inflammatory Mechanisms of Diabetic Retinopathy: An Emerging Star with Potential Therapeutic Targets. Curr Eye Res 2021; 47:165-178. [PMID: 34963381 DOI: 10.1080/02713683.2021.1995002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
PURPOSE We summarized the existing studies to elaborate the biogenesis and function of circRNAs, the effect of aberrant circRNAs expression in the mechanism of inflammation and diabetic retinopathy (DR) respectively and further explored the vital roles of circRNAs in inflammation involved in DR. Methods: We conducted a systematical literature search of abundant electronic databases (PubMed, GeneMedical and MEDLINE) up to August 2021. Results: In this review, we exhibited the biogenesis and function of circRNAs and highlighted the components of inflammatory mediators implicated in DR. Numerous circRNAs, such as circHIPK3, circZNF609, circRNA_0084043, circ_0002570, circ_0041795, circEhmt1 and circ-ITCH were discovered to play vital roles in inflammation involved in DR, which provided new ideas for diagnosis and treatment of DR. Moreover, we proposed not only the epigenetic functions of circRNAs but also novel forms of the inflammatory response, including pyroptosis, to inspire further exploration and creative research in this field. Conclusion: CircRNAs were implicated in the progression and development of inflammation in DR via aberrant expression and modulation of gene expression, serving as an emerging star with potential therapeutic targets.
Collapse
Affiliation(s)
- Shuai He
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, PR China.,Department of Ophthalmology, National Clinical Research Center for Eye Diseases; Shanghai Key Laboratory of Ocular Fundus Diseases; Shanghai Engineering Center for Visual Science and Photomedicine; Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Department of Ophthalmology, Shanghai, PR China
| | - Chufeng Gu
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, PR China.,Department of Ophthalmology, National Clinical Research Center for Eye Diseases; Shanghai Key Laboratory of Ocular Fundus Diseases; Shanghai Engineering Center for Visual Science and Photomedicine; Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Department of Ophthalmology, Shanghai, PR China
| | - Tong Su
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, PR China.,Department of Ophthalmology, National Clinical Research Center for Eye Diseases; Shanghai Key Laboratory of Ocular Fundus Diseases; Shanghai Engineering Center for Visual Science and Photomedicine; Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Department of Ophthalmology, Shanghai, PR China
| | - Qinghua Qiu
- Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, PR China.,Department of Ophthalmology, National Clinical Research Center for Eye Diseases; Shanghai Key Laboratory of Ocular Fundus Diseases; Shanghai Engineering Center for Visual Science and Photomedicine; Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Department of Ophthalmology, Shanghai, PR China.,Department of Ophthalmology, Shigatse People's Hospital, Shigatse, Xizang, PR China
| |
Collapse
|
53
|
Dou L, Zhou W, Zhang L, Xu L, Han K. Accurate identification of RNA D modification using multiple features. RNA Biol 2021; 18:2236-2246. [PMID: 33729104 PMCID: PMC8632091 DOI: 10.1080/15476286.2021.1898160] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 02/13/2021] [Accepted: 02/23/2021] [Indexed: 10/21/2022] Open
Abstract
As one of the common post-transcriptional modifications in tRNAs, dihydrouridine (D) has prominent effects on regulating the flexibility of tRNA as well as cancerous diseases. Facing with the expensive and time-consuming sequencing techniques to detect D modification, precise computational tools can largely promote the progress of molecular mechanisms and medical developments. We proposed a novel predictor, called iRNAD_XGBoost, to identify potential D sites using multiple RNA sequence representations. In this method, by considering the imbalance problem using hybrid sampling method SMOTEEEN, the XGBoost-selected top 30 features are applied to construct model. The optimized model showed high Sn and Sp values of 97.13% and 97.38% over jackknife test, respectively. For the independent experiment, these two metrics separately achieved 91.67% and 94.74%. Compared with iRNAD method, this model illustrated high generalizability and consistent prediction efficiencies for positive and negative samples, which yielded satisfactory MCC scores of 0.94 and 0.86, respectively. It is inferred that the chemical property and nucleotide density features (CPND), electron-ion interaction pseudopotential (EIIP and PseEIIP) as well as dinucleotide composition (DNC) are crucial to the recognition of D modification. The proposed predictor is a promising tool to help experimental biologists investigate molecular functions.
Collapse
Affiliation(s)
- Lijun Dou
- School of Automotive and Transportation Engineering, Shenzhen Polytechnic, Shenzhen, GuangdongChina
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, SichuanChina
| | - Wenyang Zhou
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, HeilongjiangChina
| | - Lichao Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, Guangdong, China
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, GuangdongChina
| | - Ke Han
- School of Computer and Information Engineering, Harbin University of Commerce, Harbin, HeilongjiangChina
| |
Collapse
|
54
|
Issah MA, Wu D, Zhang F, Zheng W, Liu Y, Fu H, Zhou H, Chen R, Shen J. Epigenetic modifications in acute myeloid leukemia: The emerging role of circular RNAs (Review). Int J Oncol 2021; 59:107. [PMID: 34792180 PMCID: PMC8651224 DOI: 10.3892/ijo.2021.5287] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 09/13/2021] [Indexed: 11/06/2022] Open
Abstract
Canonical epigenetic modifications, which include histone modification, chromatin remodeling and DNA methylation, play key roles in numerous cellular processes. Epigenetics underlies how cells that posses DNA with similar sequences develop into different cell types with different functions in an organism. Earlier epigenetic research has primarily been focused at the chromatin level. However, the number of studies on epigenetic modifications of RNA, such as N1‑methyladenosine, 2'‑O‑ribosemethylation, inosine, 5‑methylcytidine, N6‑methyladenosine (m6A) and pseudouridine, has seen an increase. Circular RNAs (circRNAs), a type of RNA species that lacks a 5' cap or 3' poly(A) tail, are abundantly expressed in acute myeloid leukemia (AML) and may regulate disease progression. circRNAs possess various functions, including microRNA sponging, gene transcription regulation and RNA‑binding protein interaction. Furthermore, circRNAs are m6A methylated in other types of cancer, such as colorectal and hypopharyngeal squamous cell cancers. Therefore, the critical roles of circRNA epigenetic modifications, particularly m6A, and their possible involvement in AML are discussed in the present review. Epigenetic modification of circRNAs may become a diagnostic and therapeutic target for AML in the future.
Collapse
Affiliation(s)
- Mohammed Awal Issah
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Dansen Wu
- Medical Intensive Care Unit, Fujian Provincial Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Feng Zhang
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Weili Zheng
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Yanquan Liu
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Haiying Fu
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Huarong Zhou
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Rong Chen
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| | - Jianzhen Shen
- Fujian Institute of Hematology, Fujian Medical Center of Hematology, Clinical Research Center for Hematological Malignancies of Fujian Province, Fuzhou, Fujian 350001, P.R. China
- Fujian Provincial Key Laboratory on Hematology, Fujian Medical University Union Hospital, Fuzhou, Fujian 350001, P.R. China
| |
Collapse
|
55
|
Cai L, Xuan J, Lin Q, Wang J, Liu S, Xie F, Zheng L, Li B, Qu L, Yang J. Pol3Base: a resource for decoding the interactome, expression, evolution, epitranscriptome and disease variations of Pol III-transcribed ncRNAs. Nucleic Acids Res 2021; 50:D279-D286. [PMID: 34747466 PMCID: PMC8728242 DOI: 10.1093/nar/gkab1033] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 09/30/2021] [Accepted: 10/15/2021] [Indexed: 12/20/2022] Open
Abstract
RNA polymerase III (Pol III) transcribes hundreds of non-coding RNA genes (ncRNAs), which involve in a variety of cellular processes. However, the expression, functions, regulatory networks and evolution of these Pol III-transcribed ncRNAs are still largely unknown. In this study, we developed a novel resource, Pol3Base (http://rna.sysu.edu.cn/pol3base/), to decode the interactome, expression, evolution, epitranscriptome and disease variations of Pol III-transcribed ncRNAs. The current release of Pol3Base includes thousands of regulatory relationships between ∼79 000 ncRNAs and transcription factors by mining 56 ChIP-seq datasets. By integrating CLIP-seq datasets, we deciphered the interactions of these ncRNAs with >240 RNA binding proteins. Moreover, Pol3Base contains ∼9700 RNA modifications located within thousands of Pol III-transcribed ncRNAs. Importantly, we characterized expression profiles of ncRNAs in >70 tissues and 28 different tumor types. In addition, by comparing these ncRNAs from human and mouse, we revealed about 4000 evolutionary conserved ncRNAs. We also identified ∼11 403 tRNA-derived small RNAs (tsRNAs) in 32 different tumor types. Finally, by analyzing somatic mutation data, we investigated the mutation map of these ncRNAs to help uncover their potential roles in diverse diseases. This resource will help expand our understanding of potential functions and regulatory networks of Pol III-transcribed ncRNAs.
Collapse
Affiliation(s)
- Li Cai
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Jiajia Xuan
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Qiao Lin
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Junhao Wang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Shurong Liu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Fangzhou Xie
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Lingling Zheng
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Bin Li
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Lianghu Qu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| | - Jianhua Yang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, The Fifth Affiliated Hospital, Sun Yat-sen University, Guangdong, Guangzhou 510275, P.R. China
| |
Collapse
|
56
|
Guo Y, Dong X, Jin J, He Y. The Expression Patterns and Prognostic Value of the Proteasome Activator Subunit Gene Family in Gastric Cancer Based on Integrated Analysis. Front Cell Dev Biol 2021; 9:663001. [PMID: 34650966 PMCID: PMC8505534 DOI: 10.3389/fcell.2021.663001] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 08/31/2021] [Indexed: 12/12/2022] Open
Abstract
Increasing evidence supports that proteasome activator subunit (PSME) genes play an indispensable role in multiple tumors. The diverse expression patterns, prognostic value, underlying mechanism, and the role in the immunotherapy of PSME genes in gastric cancer (GC) have yet to be fully elucidated. We systematically demonstrated the functions of these genes in GC using various large databases, unbiased in silico approaches, and experimental validation. We found that the median expression levels of all PSME genes were significantly higher in GC tissues than in normal tissues. Our findings showed that up-regulated PSME1 and PSME2 expression significantly correlated with favorable overall survival, post-progression survival, and first progression survival in GC patients. The expression of PSME1 and PSME2 was positively correlated with the infiltration of most immune cells and the activation of anti-cancer immunity cycle steps. Moreover, GC patients with high PSME1 and PSME2 expression have higher immunophenoscore and tumor mutational burden. In addition, a receiver operating characteristic analysis suggested that PSME3 and PSME4 had high diagnostic performance for distinguishing GC patients from healthy individuals. Moreover, our further analysis indicated that PSME genes exert an essential role in GC, and the present study indicated that PSME1 and PSME2 may be potential prognostic markers for enhancing survival and prognostic accuracy in GC patients and may even act as potential biomarkers for GC patients indicating a response to immunotherapy. PSME3 may serve as an oncogene in tumorigenesis and may be a promising therapeutic target for GC. PSME4 had excellent diagnostic performance and could serve as a good diagnostic indicator for GC.
Collapse
Affiliation(s)
- Yongdong Guo
- Cancer Institute, Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Xiaoping Dong
- Cancer Institute, Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Jing Jin
- Cancer Institute, Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| | - Yutong He
- Cancer Institute, Fourth Hospital of Hebei Medical University, Shijiazhuang, China
| |
Collapse
|
57
|
He X, Zhang S, Zhang Y, Lei Z, Jiang T, Zeng J. Characterizing RNA Pseudouridylation by Convolutional Neural Networks. GENOMICS, PROTEOMICS & BIOINFORMATICS 2021; 19:815-833. [PMID: 33631424 PMCID: PMC9170758 DOI: 10.1016/j.gpb.2019.11.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Revised: 09/15/2019] [Accepted: 11/13/2019] [Indexed: 12/12/2022]
Abstract
Pseudouridine (Ψ) is the most prevalent post-transcriptional RNA modification and is widespread in small cellular RNAs and mRNAs. However, the functions, mechanisms, and precise distribution of Ψs (especially in mRNAs) still remain largely unclear. The landscape of Ψs across the transcriptome has not yet been fully delineated. Here, we present a highly effective model based on a convolutional neural network (CNN), called PseudoUridyLation Site Estimator (PULSE), to analyze large-scale profiling data of Ψ sites and characterize the contextual sequence features of pseudouridylation. PULSE, consisting of two alternatively-stacked convolution and pooling layers followed by a fully-connected neural network, can automatically learn the hidden patterns of pseudouridylation from the local sequence information. Extensive validation tests demonstrated that PULSE can outperform other state-of-the-art prediction methods and achieve high prediction accuracy, thus enabling us to further characterize the transcriptome-wide landscape of Ψ sites. We further showed that the prediction results derived from PULSE can provide novel insights into understanding the functional roles of pseudouridylation, such as the regulations of RNA secondary structure, codon usage, translation, and RNA stability, and the connection to single nucleotide variants. The source code and final model for PULSE are available at https://github.com/mlcb-thu/PULSE.
Collapse
Affiliation(s)
- Xuan He
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Sai Zhang
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Yanqing Zhang
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
| | - Zhixin Lei
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China; Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Tao Jiang
- Department of Computer Science and Engineering, University of California, Riverside, CA 92521, USA; MOE Key Lab of Bioinformatics and Bioinformatics Division, BNRIST/Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; Institute of Integrative Genome Biology, University of California, Riverside, CA 92521, USA
| | - Jianyang Zeng
- Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
58
|
Wei Q. Bioinformatical identification of key genes regulated by IGF2BP2-mediated RNA N6-methyladenosine and prediction of prognosis in hepatocellular carcinoma. J Gastrointest Oncol 2021; 12:1773-1785. [PMID: 34532127 DOI: 10.21037/jgo-21-306] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 08/17/2021] [Indexed: 12/22/2022] Open
Abstract
Background The treatment of hepatocellular carcinoma (HCC), a malignant cancer with global spread, remains unsatisfactory, and novel prognostic biomarkers need to be identified. N6-methyladenosine (m6A) has been found to regulate tumor initiation and progression through different mechanisms. As a dynamic and reversible messenger RNA (mRNA) modification, m6A can be read by insulin-like growth factor 2 mRNA-binding protein 2 (IGF2BP2). IGF2BP2 targets thousands of mRNA transcripts, which may be involved in HCC progression. Methods In this study, we integrated 4 classes of datasets including The Cancer Genome Atlas (TCGA)-LICH, m6A-sequencing data of HepG2 cells, and RNA-sequencing data of IGF2BP2-knockdown HepG2 cells to explore the key genes regulated by IGF2BP2-mediated m6A in HCC. The expression and m6A modification of candidates were validation in independent microarray expression profile of HCC tissue and annotated m6A database RMBase. The relationship of immune cell infiltration and the genes expression was estimated by CIBERSORT and TIMER. Results A total of 89 candidate genes were filtered. Next, cluster analysis was performed base on functions and pathways to identify the enrichment pathways. By constructing a protein-protein interaction (PPI) network, we found 54 nodes. Ten significant genes were filtered from the PPI. These genes were validated in data of an independent microarray and an m6A database. We found that the upregulation of these 10 genes was associated with poor prognosis. In addition, we showed the expression of these 10 genes was associated with the infiltration of variety of immune cell and tumor purity. Conclusions These identified genes may provide novel insights and facilitate the development of potential biomarkers for HCC diagnosis, as well as provide clues for IGF2BP2 inhibition therapy in HCC.
Collapse
Affiliation(s)
- Qiang Wei
- Hepatological Surgery Department, Bethune International Peace Hospital of PLA, Shijiazhuang, China
| |
Collapse
|
59
|
Qin S, Mao Y, Chen X, Xiao J, Qin Y, Zhao L. The functional roles, cross-talk and clinical implications of m6A modification and circRNA in hepatocellular carcinoma. Int J Biol Sci 2021; 17:3059-3079. [PMID: 34421350 PMCID: PMC8375232 DOI: 10.7150/ijbs.62767] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Accepted: 07/06/2021] [Indexed: 12/13/2022] Open
Abstract
Hepatocellular carcinoma (HCC) is one of the leading causes of cancer-related deaths worldwide. HCC has high rates of death and recurrence, as well as very low survival rates. N6-methyladenosine (m6A) is the most abundant modification in eukaryotic RNAs, and circRNAs are a class of circular noncoding RNAs that are generated by back-splicing and they modulate multiple functions in a variety of cellular processes. Although the carcinogenesis of HCC is complex, emerging evidence has indicated that m6A modification and circRNA play vital roles in HCC development and progression. However, the underlying mechanisms governing HCC, their cross-talk, and clinical implications have not been fully elucidated. Therefore, in this paper, we elucidated the biological functions and molecular mechanisms of m6A modification in the carcinogenesis of HCC by illustrating three different regulatory factors ("writer", "eraser", and "reader") of the m6A modification process. Additionally, we dissected the functional roles of circRNAs in various malignant behaviors of HCC, thereby contributing to HCC initiation, progression and relapse. Furthermore, we demonstrated the cross-talk and interplay between m6A modification and circRNA by revealing the effects of the collaboration of circRNA and m6A modification on HCC progression. Finally, we proposed the clinical potential and implications of m6A modifiers and circRNAs as diagnostic biomarkers and therapeutic targets for HCC diagnosis, treatment and prognosis evaluation.
Collapse
Affiliation(s)
- Sha Qin
- Department of Pathology, Xiangya Hospital, Central South University, Changsha, Hunan, China; and Department of Pathology, School of Basic Medical Science, Xiangya School of Medicine, Central South University, Changsha, Hunan, China
| | - Yitao Mao
- Department of Radiology, Xiangya Hospital, Central South University, Changsha, Hunan, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Xue Chen
- Early Clinical Trial Center, Hunan Cancer Hospital and The Affiliated Cancer Hospital of Xiangya School of Medicine, Central South University, Changsha, Hunan, China
| | - Juxiong Xiao
- Department of Radiology, Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Yan Qin
- Department of Radiology, Xiangya Hospital, Central South University, Changsha, Hunan, China
| | - Luqing Zhao
- Department of Pathology, Xiangya Hospital, Central South University, Changsha, Hunan, China; and Department of Pathology, School of Basic Medical Science, Xiangya School of Medicine, Central South University, Changsha, Hunan, China.,National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan, China
| |
Collapse
|
60
|
Cadoni E, De Paepe L, Manicardi A, Madder A. Beyond small molecules: targeting G-quadruplex structures with oligonucleotides and their analogues. Nucleic Acids Res 2021; 49:6638-6659. [PMID: 33978760 PMCID: PMC8266634 DOI: 10.1093/nar/gkab334] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 04/15/2021] [Accepted: 04/29/2021] [Indexed: 12/20/2022] Open
Abstract
G-Quadruplexes (G4s) are widely studied secondary DNA/RNA structures, naturally occurring when G-rich sequences are present. The strategic localization of G4s in genome areas of crucial importance, such as proto-oncogenes and telomeres, entails fundamental implications in terms of gene expression regulation and other important biological processes. Although thousands of small molecules capable to induce G4 stabilization have been reported over the past 20 years, approaches based on the hybridization of a synthetic probe, allowing sequence-specific G4-recognition and targeting are still rather limited. In this review, after introducing important general notions about G4s, we aim to list, explain and critically analyse in more detail the principal approaches available to target G4s by using oligonucleotides and synthetic analogues such as Locked Nucleic Acids (LNAs) and Peptide Nucleic Acids (PNAs), reporting on the most relevant examples described in literature to date.
Collapse
Affiliation(s)
- Enrico Cadoni
- Organic and Biomimetic Chemistry Research Group, Ghent University, Krijgslaan 281 S4, B-9000 Ghent, Belgium
| | - Lessandro De Paepe
- Organic and Biomimetic Chemistry Research Group, Ghent University, Krijgslaan 281 S4, B-9000 Ghent, Belgium
| | - Alex Manicardi
- Organic and Biomimetic Chemistry Research Group, Ghent University, Krijgslaan 281 S4, B-9000 Ghent, Belgium
| | - Annemieke Madder
- Organic and Biomimetic Chemistry Research Group, Ghent University, Krijgslaan 281 S4, B-9000 Ghent, Belgium
| |
Collapse
|
61
|
Li F, Guo X, Jin P, Chen J, Xiang D, Song J, Coin LJM. Porpoise: a new approach for accurate prediction of RNA pseudouridine sites. Brief Bioinform 2021; 22:6314697. [PMID: 34226915 DOI: 10.1093/bib/bbab245] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 05/19/2021] [Accepted: 06/08/2021] [Indexed: 12/14/2022] Open
Abstract
Pseudouridine is a ubiquitous RNA modification type present in eukaryotes and prokaryotes, which plays a vital role in various biological processes. Almost all kinds of RNAs are subject to this modification. However, it remains a great challenge to identify pseudouridine sites via experimental approaches, requiring expensive and time-consuming experimental research. Therefore, computational approaches that can be used to perform accurate in silico identification of pseudouridine sites from the large amount of RNA sequence data are highly desirable and can aid in the functional elucidation of this critical modification. Here, we propose a new computational approach, termed Porpoise, to accurately identify pseudouridine sites from RNA sequence data. Porpoise builds upon a comprehensive evaluation of 18 frequently used feature encoding schemes based on the selection of four types of features, including binary features, pseudo k-tuple composition, nucleotide chemical property and position-specific trinucleotide propensity based on single-strand (PSTNPss). The selected features are fed into the stacked ensemble learning framework to enable the construction of an effective stacked model. Both cross-validation tests on the benchmark dataset and independent tests show that Porpoise achieves superior predictive performance than several state-of-the-art approaches. The application of model interpretation tools demonstrates the importance of PSTNPs for the performance of the trained models. This new method is anticipated to facilitate community-wide efforts to identify putative pseudouridine sites and formulate novel testable biological hypothesis.
Collapse
Affiliation(s)
- Fuyi Li
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia
| | | | - Peipei Jin
- Department of Clinical Laboratory of Ruijin Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | | | - Dongxu Xiang
- Faculty of Engineering and Information Technology, The University of Melbourne, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology at the University of Melbourne, Australia
| |
Collapse
|
62
|
Schauerte M, Pozhydaieva N, Höfer K. Shaping the Bacterial Epitranscriptome-5'-Terminal and Internal RNA Modifications. Adv Biol (Weinh) 2021; 5:e2100834. [PMID: 34121369 DOI: 10.1002/adbi.202100834] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 05/07/2021] [Indexed: 11/11/2022]
Abstract
All domains of life utilize a diverse set of modified ribonucleotides that can impact the sequence, structure, function, stability, and the fate of RNAs, as well as their interactions with other molecules. Today, more than 160 different RNA modifications are known that decorate the RNA at the 5'-terminus or internal RNA positions. The boost of next-generation sequencing technologies sets the foundation to identify and study the functional role of RNA modifications. The recent advances in the field of RNA modifications reveal a novel regulatory layer between RNA modifications and proteins, which is central to developing a novel concept called "epitranscriptomics." The majority of RNA modifications studies focus on the eukaryotic epitranscriptome. In contrast, RNA modifications in prokaryotes are poorly characterized. This review outlines the current knowledge of the prokaryotic epitranscriptome focusing on mRNA modifications. Here, it is described that several internal and 5'-terminal RNA modifications either present or likely present in prokaryotic mRNA. Thereby, the individual techniques to identify these epitranscriptomic modifications, their writers, readers and erasers, and their proposed functions are explored. Besides that, still unanswered questions in the field of prokaryotic epitranscriptomics are pointed out, and its future perspectives in the dawn of next-generation sequencing technologies are outlined.
Collapse
Affiliation(s)
- Maik Schauerte
- Max-Planck-Institute for terrestrial Microbiology, Marburg, Hessen, 35043, Germany
| | - Nadiia Pozhydaieva
- Max-Planck-Institute for terrestrial Microbiology, Marburg, Hessen, 35043, Germany
| | - Katharina Höfer
- Max-Planck-Institute for terrestrial Microbiology, Marburg, Hessen, 35043, Germany
| |
Collapse
|
63
|
Ao C, Zou Q, Yu L. RFhy-m2G: Identification of RNA N2-methylguanosine modification sites based on random forest and hybrid features. Methods 2021; 203:32-39. [PMID: 34033879 DOI: 10.1016/j.ymeth.2021.05.016] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Revised: 05/04/2021] [Accepted: 05/20/2021] [Indexed: 12/31/2022] Open
Abstract
N2-methylguanosine is a post-transcriptional modification of RNA that is found in eukaryotes and archaea. The biological function of m2G modification discovered so far is to control and stabilize the three-dimensional structure of tRNA and the dynamic barrier of reverse transcription. To discover additional biological functions of m2G, it is necessary to develop time-saving and labor-saving calculation tools to identify m2G. In this paper, based on hybrid features and a random forest, a novel predictor, RFhy-m2G, was developed to identify the m2G modification sites for three species. The hybrid feature used by the predictor is used to fuse the three features of ENAC, PseDNC, and NPPS. These three features include primary sequence derivation properties, physicochemical properties, and position-specific properties. Since there are redundant features in hybrid features, MRMD2.0 is used for optimal feature selection. Through feature analysis, it is found that the optimal hybrid features obtained still contain three kinds of properties, and the hybrid features can more accurately identify m2G modification sites and improve prediction performance. Based on five-fold cross-validation and independent testing to evaluate the prediction model, the accuracies obtained were 0.9982 and 0.9417, respectively. The robustness of the predictor is demonstrated by comparisons with other predictors.
Collapse
Affiliation(s)
- Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China.
| |
Collapse
|
64
|
Garikipati VNS, Uchida S. Elucidating the Functions of Non-Coding RNAs from the Perspective of RNA Modifications. Noncoding RNA 2021; 7:ncrna7020031. [PMID: 34065036 PMCID: PMC8163165 DOI: 10.3390/ncrna7020031] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 04/29/2021] [Accepted: 05/05/2021] [Indexed: 12/11/2022] Open
Abstract
It is now commonly accepted that most of the mammalian genome is transcribed as RNA, yet less than 2% of such RNA encode for proteins. A majority of transcribed RNA exists as non-protein-coding RNAs (ncRNAs) with various functions. Because of the lack of sequence homologies among most ncRNAs species, it is difficult to infer the potential functions of ncRNAs by examining sequence patterns, such as catalytic domains, as in the case of proteins. Added to the existing complexity of predicting the functions of the ever-growing number of ncRNAs, increasing evidence suggests that various enzymes modify ncRNAs (e.g., ADARs, METTL3, and METTL14), which has opened up a new field of study called epitranscriptomics. Here, we examine the current status of ncRNA research from the perspective of epitranscriptomics.
Collapse
Affiliation(s)
- Venkata Naga Srikanth Garikipati
- Department of Emergency Medicine, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA;
- Dorothy M. Davis Heart Lung and Research Institute, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Shizuka Uchida
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, Frederikskaj 10B, 2. (building C), DK-2450 Copenhagen SV, Denmark
- Correspondence: or
| |
Collapse
|
65
|
Cai Y, Feng R, Lu T, Chen X, Zhou X, Wang X. Novel insights into the m 6A-RNA methyltransferase METTL3 in cancer. Biomark Res 2021; 9:27. [PMID: 33879256 PMCID: PMC8056546 DOI: 10.1186/s40364-021-00278-9] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 03/25/2021] [Indexed: 12/11/2022] Open
Abstract
N6-methyladenosine (m6A) is a prevalent internal RNA modification in higher eukaryotic cells. As the pivotal m6A regulator, RNA methyltransferase-like 3 (METTL3) is responsible for methyl group transfer in the progression of m6A modification. This epigenetic regulation contributes to the structure and functional regulation of RNA and further promotes tumorigenesis and tumor progression. Accumulating evidence has illustrated the pivotal roles of METTL3 in a variety of human cancers. Here, we systemically summarize the interaction between METTL3 and RNAs, and illustrate the multiple functions of METTL3 in human cancer. METLL3 is aberrantly expressed in a variety of tumors. Elevation of METTL3 is usually associated with rapid progression and poor prognosis of tumors. On the other hand, METTL3 may also function as a tumor suppressor in several cancers. Based on the tumor-promoting effect of METTL3, the possibility of applying METTL3 inhibitors is further discussed, which is expected to provide novel insights into antitumor therapy.
Collapse
Affiliation(s)
- Yiqing Cai
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No.324, Jingwu Road, Jinan, 250021, Shandong, China
| | - Rui Feng
- Department of Hematology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China
| | - Tiange Lu
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No.324, Jingwu Road, Jinan, 250021, Shandong, China
| | - Xiaomin Chen
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No.324, Jingwu Road, Jinan, 250021, Shandong, China
| | - Xiangxiang Zhou
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No.324, Jingwu Road, Jinan, 250021, Shandong, China. .,Department of Hematology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China. .,School of Medicine, Shandong University, Jinan, 250012, Shandong, China. .,Shandong Provincial Engineering Research Center of Lymphoma, Jinan, 250021, Shandong, China. .,Branch of National Clinical Research Center for Hematologic Diseases, Jinan, 250021, Shandong, China. .,National Clinical Research Center for Hematologic Diseases, The First Affiliated Hospital of Soochow University, Suzhou, 251006, China.
| | - Xin Wang
- Department of Hematology, Shandong Provincial Hospital, Cheeloo College of Medicine, Shandong University, No.324, Jingwu Road, Jinan, 250021, Shandong, China. .,Department of Hematology, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, Shandong, China. .,School of Medicine, Shandong University, Jinan, 250012, Shandong, China. .,Shandong Provincial Engineering Research Center of Lymphoma, Jinan, 250021, Shandong, China. .,Branch of National Clinical Research Center for Hematologic Diseases, Jinan, 250021, Shandong, China. .,National Clinical Research Center for Hematologic Diseases, The First Affiliated Hospital of Soochow University, Suzhou, 251006, China.
| |
Collapse
|
66
|
Ma J, Zhang L, Chen S, Liu H. A brief review of RNA modification related database resources. Methods 2021; 203:342-353. [PMID: 33705860 DOI: 10.1016/j.ymeth.2021.03.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 02/19/2021] [Accepted: 03/04/2021] [Indexed: 01/28/2023] Open
Abstract
To date, over 150 different RNA modifications have been identified, playing crucial roles in biological processes and disease pathogenesis. Thanks to the advancement of high-throughput sequencing technologies employed for transcriptome-wide mapping, a bunch of RNA modification databases have emerged as an exciting area, which promotes further investigation of the mechanisms and functions of these modified ribonucleotides. This article introduces the high-throughput sequencing technique developed for transcriptome-wide mapping of RNA modifications, as well as the procedures and main techniques of building these databases from the developers' perspective. It also reviews existing RNA modification databases in terms of their main functions, species, the number of sites they collected, the annotations, and the tools they provided. From the view of users, we further analyze and compare these databases in terms of their functions. For instance, these databases can be applied to record chemical structures and biosynthetic pathways, or unravel the epi-transcriptome comprehensively, or only investigate specific features of RNA modifications. Additionally, the limitations of the existing approaches are discussed, and some future suggestions are offered.
Collapse
Affiliation(s)
- Jiani Ma
- Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, China University of Mining and Technology, Xuzhou 221116, China; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Lin Zhang
- Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, China University of Mining and Technology, Xuzhou 221116, China; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Shutao Chen
- Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, China University of Mining and Technology, Xuzhou 221116, China; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
| | - Hui Liu
- Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, China University of Mining and Technology, Xuzhou 221116, China; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China.
| |
Collapse
|
67
|
Aziz AZB, Hasan MAM, Shin J. Identification of RNA pseudouridine sites using deep learning approaches. PLoS One 2021; 16:e0247511. [PMID: 33621235 PMCID: PMC7901771 DOI: 10.1371/journal.pone.0247511] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 02/08/2021] [Indexed: 01/05/2023] Open
Abstract
Pseudouridine(Ψ) is widely popular among various RNA modifications which have been confirmed to occur in rRNA, mRNA, tRNA, and nuclear/nucleolar RNA. Hence, identifying them has vital significance in academic research, drug development and gene therapies. Several laboratory techniques for Ψ identification have been introduced over the years. Although these techniques produce satisfactory results, they are costly, time-consuming and requires skilled experience. As the lengths of RNA sequences are getting longer day by day, an efficient method for identifying pseudouridine sites using computational approaches is very important. In this paper, we proposed a multi-channel convolution neural network using binary encoding. We employed k-fold cross-validation and grid search to tune the hyperparameters. We evaluated its performance in the independent datasets and found promising results. The results proved that our method can be used to identify pseudouridine sites for associated purposes. We have also implemented an easily accessible web server at http://103.99.176.239/ipseumulticnn/.
Collapse
Affiliation(s)
- Abu Zahid Bin Aziz
- Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh
- * E-mail:
| | - Md. Al Mehedi Hasan
- Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh
| | - Jungpil Shin
- School of Computer Science and Engineering, University of Aizu, Aizuwakamatsu, Japan
| |
Collapse
|
68
|
Xu Y, Zhang W, Shen F, Yang X, Liu H, Dai S, Sun X, Huang J, Guo Q. YTH Domain Proteins: A Family of m 6A Readers in Cancer Progression. Front Oncol 2021; 11:629560. [PMID: 33692959 PMCID: PMC7937903 DOI: 10.3389/fonc.2021.629560] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 01/07/2021] [Indexed: 02/05/2023] Open
Abstract
N6-methyladenosine (m6A) is the most abundant internal modification in eukaryotic messenger RNAs (mRNAs). m6A RNA methylation is involved in all stages of RNA life cycle, from RNA processing, nuclear output, translation regulation to RNA degradation, indicating that m6A has various functions affecting RNA metabolism positively or negatively. Reading proteins are vital in regulating the translation and stability of m6A mRNAs positively or negatively. Recent studies have enhanced the understanding of the molecular mechanism of the YT521-B homology (YTH) domain family and the modification of m6A. This study aimed to review the specific mechanisms, functions, and interactions of the YTH domain protein family. It also discussed future research directions, thus providing new ideas for the clinical diagnosis and targeted therapy of cancer.
Collapse
Affiliation(s)
- Yirong Xu
- Department of Oncology, Taizhou People's Hospital, Taizhou, China.,Graduate school, Dalian Medical University, Dalian, China
| | - Wei Zhang
- Department of Oncology, Taizhou People's Hospital, Taizhou, China
| | - Feng Shen
- Department of Neurosurgery, Taizhou People's Hospital, Taizhou, China
| | - Xi Yang
- Department of Medical Oncology, Cancer Center, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu, China
| | - Huilan Liu
- Department of Oncology, Taizhou People's Hospital, Taizhou, China
| | - Shengbin Dai
- Department of Oncology, Taizhou People's Hospital, Taizhou, China
| | - Xinchen Sun
- Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| | - Junxing Huang
- Department of Oncology, Taizhou People's Hospital, Taizhou, China
| | - Qing Guo
- Department of Oncology, Taizhou People's Hospital, Taizhou, China
| |
Collapse
|
69
|
N 6-methyladenosine modification of MALAT1 promotes metastasis via reshaping nuclear speckles. Dev Cell 2021; 56:702-715.e8. [PMID: 33609462 DOI: 10.1016/j.devcel.2021.01.015] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 11/25/2020] [Accepted: 01/24/2021] [Indexed: 01/05/2023]
Abstract
N6-methyladenosine (m6A), one of the most prevalent RNA post-transcriptional modifications, is involved in numerous biological processes. In previous studies, the functions of m6A were typically identified by perturbing the activity of the methyltransferase complex. Here, we dissect the contribution of m6A to an individual-long noncoding RNA-metastasis-associated lung adenocarcinoma transcript 1 (MALAT1). The mutant MALAT1 lacking m6A-motifs significantly suppressed the metastatic potential of cancer cells both in vitro and in vivo in mouse. Super-resolution imaging showed that the concatenated m6A residues on MALAT1 acted as a scaffold for recruiting YTH-domain-containing protein 1 (YTHDC1) to nuclear speckles. We further reveal that the recognition of MALAT1-m6A by YTHDC1 played a critical role in maintaining the composition and genomic binding sites of nuclear speckles, which regulate the expression of several key oncogenes. Furthermore, artificially tethering YTHDC1 onto m6A-deficient MALAT1 largely rescues the metastatic potential of cancer cells.
Collapse
|
70
|
Seelam Prabhakar P, Takyi NA, Wetmore SD. Posttranscriptional modifications at the 37th position in the anticodon stem-loop of tRNA: structural insights from MD simulations. RNA (NEW YORK, N.Y.) 2021; 27:202-220. [PMID: 33214333 PMCID: PMC7812866 DOI: 10.1261/rna.078097.120] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 11/16/2020] [Indexed: 06/11/2023]
Abstract
Transfer RNA (tRNA) is the most diversely modified RNA. Although the strictly conserved purine position 37 in the anticodon stem-loop undergoes modifications that are phylogenetically distributed, we do not yet fully understand the roles of these modifications. Therefore, molecular dynamics simulations are used to provide molecular-level details for how such modifications impact the structure and function of tRNA. A focus is placed on three hypermodified base families that include the parent i6A, t6A, and yW modifications, as well as derivatives. Our data reveal that the hypermodifications exhibit significant conformational flexibility in tRNA, which can be modulated by additional chemical functionalization. Although the overall structure of the tRNA anticodon stem remains intact regardless of the modification considered, the anticodon loop must rearrange to accommodate the bulky, dynamic hypermodifications, which includes changes in the nucleotide glycosidic and backbone conformations, and enhanced or completely new nucleobase-nucleobase interactions compared to unmodified tRNA or tRNA containing smaller (m1G) modifications at the 37th position. Importantly, the extent of the changes in the anticodon loop is influenced by the addition of small functional groups to parent modifications, implying each substituent can further fine-tune tRNA structure. Although the dominant conformation of the ASL is achieved in different ways for each modification, the molecular features of all modified tRNA drive the ASL domain to adopt the functional open-loop conformation. Importantly, the impact of the hypermodifications is preserved in different sequence contexts. These findings highlight the likely role of regulating mRNA structure and translation.
Collapse
MESH Headings
- Adenosine/analogs & derivatives
- Adenosine/metabolism
- Anticodon/chemistry
- Anticodon/genetics
- Anticodon/metabolism
- Base Pairing
- Base Sequence
- Escherichia coli/genetics
- Escherichia coli/metabolism
- Isopentenyladenosine/chemistry
- Isopentenyladenosine/metabolism
- Molecular Dynamics Simulation
- Nucleic Acid Conformation
- Nucleosides/chemistry
- Nucleosides/metabolism
- RNA Processing, Post-Transcriptional
- RNA, Transfer, Lys/chemistry
- RNA, Transfer, Lys/genetics
- RNA, Transfer, Lys/metabolism
- RNA, Transfer, Phe/chemistry
- RNA, Transfer, Phe/genetics
- RNA, Transfer, Phe/metabolism
Collapse
Affiliation(s)
- Preethi Seelam Prabhakar
- Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Alberta T1K 3M4, Canada
| | - Nathania A Takyi
- Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Alberta T1K 3M4, Canada
| | - Stacey D Wetmore
- Department of Chemistry and Biochemistry, University of Lethbridge, Lethbridge, Alberta T1K 3M4, Canada
| |
Collapse
|
71
|
Zhuang J, Liu D, Lin M, Qiu W, Liu J, Chen S. PseUdeep: RNA Pseudouridine Site Identification with Deep Learning Algorithm. Front Genet 2021; 12:773882. [PMID: 34868261 PMCID: PMC8637112 DOI: 10.3389/fgene.2021.773882] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/04/2021] [Indexed: 11/16/2022] Open
Abstract
Background: Pseudouridine (Ψ) is a common ribonucleotide modification that plays a significant role in many biological processes. The identification of Ψ modification sites is of great significance for disease mechanism and biological processes research in which machine learning algorithms are desirable as the lab exploratory techniques are expensive and time-consuming. Results: In this work, we propose a deep learning framework, called PseUdeep, to identify Ψ sites of three species: H. sapiens, S. cerevisiae, and M. musculus. In this method, three encoding methods are used to extract the features of RNA sequences, that is, one-hot encoding, K-tuple nucleotide frequency pattern, and position-specific nucleotide composition. The three feature matrices are convoluted twice and fed into the capsule neural network and bidirectional gated recurrent unit network with a self-attention mechanism for classification. Conclusion: Compared with other state-of-the-art methods, our model gets the highest accuracy of the prediction on the independent testing data set S-200; the accuracy improves 12.38%, and on the independent testing data set H-200, the accuracy improves 0.68%. Moreover, the dimensions of the features we derive from the RNA sequences are only 109,109, and 119 in H. sapiens, M. musculus, and S. cerevisiae, which is much smaller than those used in the traditional algorithms. On evaluation via tenfold cross-validation and two independent testing data sets, PseUdeep outperforms the best traditional machine learning model available. PseUdeep source code and data sets are available at https://github.com/dan111262/PseUdeep.
Collapse
Affiliation(s)
- Jujuan Zhuang
- College of Science, Dalian Maritime University, Dalian, China
| | - Danyang Liu
- College of Science, Dalian Maritime University, Dalian, China
| | - Meng Lin
- College of Science, Dalian Maritime University, Dalian, China
| | - Wenjing Qiu
- Electrical and Information Engineering, Anhui University of Technology, Anhui, China
- Geneis (Beijing) Co., Ltd., Beijing, China
| | | | - Size Chen
- Department of Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- Guangdong Provincial Engineering Research Center for Esophageal Cancer Precise Therapy, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- Central Laboratory, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- *Correspondence: Size Chen,
| |
Collapse
|
72
|
Zheng HX, Zhang XS, Sui N. Advances in the profiling of N 6-methyladenosine (m 6A) modifications. Biotechnol Adv 2020; 45:107656. [PMID: 33181242 DOI: 10.1016/j.biotechadv.2020.107656] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2020] [Revised: 11/03/2020] [Accepted: 11/03/2020] [Indexed: 12/26/2022]
Abstract
Over 160 RNA modifications have been identified, including N7-methylguanine (m7G), N6-methyladenosine (m6A), and 5-methylcytosine (m5C). These modifications play key roles in regulating the fate of RNA. In eukaryotes, m6A is the most abundant mRNA modification, accounting for over 80% of all RNA methylation modifications. Highly dynamic m6A modification may exert important effects on organismal reproduction and development. Significant advances in understanding the mechanism of m6A modification have been made using immunoprecipitation, chemical labeling, and site-directed mutagenesis, combined with next-generation sequencing. Single-molecule real-time and nanopore direct RNA sequencing (DRS) approaches provide additional ways to study RNA modifications at the cellular level. In this review, we explore the technical history of identifying m6A RNA modifications, emphasizing technological advances in detecting m6A modification. In particular, we discuss the challenge of generating accurate dynamic single-base resolution m6A maps and also strategies for improving detection specificity. Finally, we outline a roadmap for future research in this area, focusing on the application of RNA epigenetic modification, represented by m6A modification.
Collapse
Affiliation(s)
- Hong-Xiang Zheng
- Shandong Provincial Key Laboratory of Plant Stress, College of life Sciences, Shandong Normal University, Jinan, Shandong 250014, China
| | - Xian-Sheng Zhang
- State Key Laboratory of Crop Biology, College of Life Sciences, Shandong Agricultural University, Taian, Shandong 271018, China.
| | - Na Sui
- Shandong Provincial Key Laboratory of Plant Stress, College of life Sciences, Shandong Normal University, Jinan, Shandong 250014, China.
| |
Collapse
|
73
|
The Distinct Function and Localization of METTL3/METTL14 and METTL16 Enzymes in Cardiomyocytes. Int J Mol Sci 2020; 21:ijms21218139. [PMID: 33143367 PMCID: PMC7663386 DOI: 10.3390/ijms21218139] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 10/21/2020] [Accepted: 10/23/2020] [Indexed: 12/15/2022] Open
Abstract
It has become evident that epitranscriptome events, mediated by specific enzymes, regulate gene expression and, subsequently, cell differentiation processes. We show that methyltransferase-like proteins METTL3/METTL14 and N6-adenosine methylation (m6A) in RNAs are homogeneously distributed in embryonic hearts, and histone deacetylase (HDAC) inhibitors valproic acid and Trichostatin A (TSA) up-regulate METTL3/METTL14 proteins. The levels of METTL3 in mouse adult hearts, isolated from male and female animals, were lower in the aorta and pulmonary trunks when compared with atria, but METT14 was up-regulated in the aorta and pulmonary trunk, in comparison with ventriculi. Aging caused METTL3 down-regulation in aorta and atria in male animals. Western blot analysis in differentiated mouse embryonic stem cells (mESCs), containing 10-30 percent of cardiomyocytes, showed METTL3/METTL14 down-regulation, while the differentiation-induced increased level of METTL16 was observed in both wild type (wt) and HDAC1 depleted (dn) cells. In parallel, experimental differentiation in especially HDAC1 wild type cells was accompanied by depletion of m6A in RNA. Immunofluorescence analysis of individual cells revealed the highest density of METTL3/METTL14 in α-actinin positive cardiomyocytes when compared with the other cells in the culture undergoing differentiation. In both wt and HDAC1 dn cells, the amount of METTL16 was also up-regulated in cardiomyocytes when compared to co-cultivated cells. Together, we showed that distinct anatomical regions of the mouse adult hearts are characterized by different levels of METTL3 and METTL14 proteins, which are changed during aging. Experimental cell differentiation was also accompanied by changes in METTL-like proteins and m6A in RNA; in particular, levels and distribution patterns of METTL3/METTL14 proteins were different from the same parameters studied in the case of the METTL16 protein.
Collapse
|
74
|
REW-ISA: unveiling local functional blocks in epi-transcriptome profiling data via an RNA expression-weighted iterative signature algorithm. BMC Bioinformatics 2020; 21:447. [PMID: 33036550 PMCID: PMC7547494 DOI: 10.1186/s12859-020-03787-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Accepted: 09/29/2020] [Indexed: 02/07/2023] Open
Abstract
Background Recent studies have shown that N6-methyladenosine (m6A) plays a critical role in numbers of biological processes and complex human diseases. However, the regulatory mechanisms of most methylation sites remain uncharted. Thus, in-depth study of the epi-transcriptomic patterns of m6A may provide insights into its complex functional and regulatory mechanisms. Results Due to the high economic and time cost of wet experimental methods, revealing methylation patterns through computational models has become a more preferable way, and drawn more and more attention. Considering the theoretical basics and applications of conventional clustering methods, an RNA Expression Weighted Iterative Signature Algorithm (REW-ISA) is proposed to find potential local functional blocks (LFBs) based on MeRIP-Seq data, where sites are hyper-methylated or hypo-methylated simultaneously across the specific conditions. REW-ISA adopts RNA expression levels of each site as weights to make sites of lower expression level less significant. It starts from random sets of sites, then follows iterative search strategies by thresholds of rows and columns to find the LFBs in m6A methylation profile. Its application on MeRIP-Seq data of 69,446 methylation sites under 32 experimental conditions unveiled 6 LFBs, which achieve higher enrichment scores than ISA. Pathway analysis and enzyme specificity test showed that sites remained in LFBs are highly relevant to the m6A methyltransferase, such as METTL3, METTL14, WTAP and KIAA1429. Further detailed analyses for each LFB even showed that some LFBs are condition-specific, indicating that methylation profiles of some specific sites may be condition relevant. Conclusions REW-ISA finds potential local functional patterns presented in m6A profiles, where sites are co-methylated under specific conditions.
Collapse
|
75
|
Zhang H, Shi X, Huang T, Zhao X, Chen W, Gu N, Zhang R. Dynamic landscape and evolution of m6A methylation in human. Nucleic Acids Res 2020; 48:6251-6264. [PMID: 32406913 PMCID: PMC7293016 DOI: 10.1093/nar/gkaa347] [Citation(s) in RCA: 232] [Impact Index Per Article: 46.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 04/23/2020] [Accepted: 04/24/2020] [Indexed: 01/03/2023] Open
Abstract
m6A is a prevalent internal modification in mRNAs and has been linked to the diverse effects on mRNA fate. To explore the landscape and evolution of human m6A, we generated 27 m6A methylomes across major adult tissues. These data reveal dynamic m6A methylation across tissue types, uncover both broadly or tissue-specifically methylated sites, and identify an unexpected enrichment of m6A methylation at non-canonical cleavage sites. A comparison of fetal and adult m6A methylomes reveals that m6A preferentially occupies CDS regions in fetal tissues. Moreover, the m6A sub-motifs vary between fetal and adult tissues or across tissue types. From the evolutionary perspective, we uncover that the selection pressure on m6A sites varies and depends on their genic locations. Unexpectedly, we found that ∼40% of the 3′UTR m6A sites are under negative selection, which is higher than the evolutionary constraint on miRNA binding sites, and much higher than that on A-to-I RNA modification. Moreover, the recently gained m6A sites in human populations are clearly under positive selection and associated with traits or diseases. Our work provides a resource of human m6A profile for future studies of m6A functions, and suggests a role of m6A modification in human evolutionary adaptation and disease susceptibility.
Collapse
Affiliation(s)
- Hui Zhang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Xinrui Shi
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Tao Huang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Xueni Zhao
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Wanying Chen
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Nannan Gu
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China
| | - Rui Zhang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, PR China.,RNA Biomedical Institute, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, PR China
| |
Collapse
|
76
|
Prognostic Value of an m6A RNA Methylation Regulator-Based Signature in Patients with Hepatocellular Carcinoma. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2053902. [PMID: 32733931 PMCID: PMC7378627 DOI: 10.1155/2020/2053902] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Accepted: 06/30/2020] [Indexed: 12/24/2022]
Abstract
Purposes Hepatocellular carcinoma (HCC) is one of the most common malignant tumors in the world. Recent researches have demonstrated that m6A methylation regulators play a key role in various cancers, such as gastric cancer and colon adenocarcinoma. Several m6A methylation regulators are reported to predict the prognosis of HCC. Therefore, there is a need to further identify the predictive value of m6A methylation regulators in HCC. Methods We utilized The Cancer Genome Atlas (TCGA) database to obtain the gene expression profile of m6A RNA methylation regulators and clinical information for patients with HCC. Besides, we identified two clusters of HCC with various clinical factors by consensus clustering analysis. Then the least absolute shrinkage and selection operator (LASSO) and the Cox regression analysis were applied to construct a prognostic signature. Results Except for ZC3H13 and METTL14, a majority of the thirteen m6A RNA methylation regulators were significantly overexpressed in HCC specimens. HCC patients were classified into two groups (cluster 1 and cluster 2). The cluster 1 was with a significantly worse prognosis than cluster 2, and most of the 13 known m6A RNA methylation regulators were upregulated in cluster 1. Besides, we developed a prognostic signature consisting of YTHDF2, YTHDF1, METTL3, KIAA1429, and ZC3H13, which could successfully differentiate high-risk patients. More importantly, univariate and multivariate Cox regression analysis indicated that the signature-based risk score was an independent prognostic factor for patients with HCC. Conclusions Our study showed these five m6A RNA methylation regulators can be used as practical and reliable prognostic tools of HCC, which might have potential value for therapeutic strategies.
Collapse
|
77
|
Liu L, Song B, Ma J, Song Y, Zhang SY, Tang Y, Wu X, Wei Z, Chen K, Su J, Rong R, Lu Z, de Magalhães JP, Rigden DJ, Zhang L, Zhang SW, Huang Y, Lei X, Liu H, Meng J. Bioinformatics approaches for deciphering the epitranscriptome: Recent progress and emerging topics. Comput Struct Biotechnol J 2020; 18:1587-1604. [PMID: 32670500 PMCID: PMC7334300 DOI: 10.1016/j.csbj.2020.06.010] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Revised: 06/02/2020] [Accepted: 06/07/2020] [Indexed: 12/13/2022] Open
Abstract
Post-transcriptional RNA modification occurs on all types of RNA and plays a vital role in regulating every aspect of RNA function. Thanks to the development of high-throughput sequencing technologies, transcriptome-wide profiling of RNA modifications has been made possible. With the accumulation of a large number of high-throughput datasets, bioinformatics approaches have become increasing critical for unraveling the epitranscriptome. We review here the recent progress in bioinformatics approaches for deciphering the epitranscriptomes, including epitranscriptome data analysis techniques, RNA modification databases, disease-association inference, general functional annotation, and studies on RNA modification site prediction. We also discuss the limitations of existing approaches and offer some future perspectives.
Collapse
Affiliation(s)
- Lian Liu
- School of Computer Sciences, Shannxi Normal University, Xi’an, Shaanxi 710119, China
| | - Bowen Song
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Jiani Ma
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
| | - Yi Song
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Song-Yao Zhang
- Key Laboratory of Information Fusion Technology of Ministry of Education, School of Automation, Northwestern Polytechnical University, Xi’an, Shaanxi 710072, China
| | - Yujiao Tang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Xiangyu Wu
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Zhen Wei
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Kunqi Chen
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Jionglong Su
- Department of Mathematical Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
| | - Rong Rong
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Zhiliang Lu
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - João Pedro de Magalhães
- Institute of Ageing & Chronic Disease, University of Liverpool, L7 8TX, Liverpool, United Kingdom
| | - Daniel J. Rigden
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| | - Lin Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
| | - Shao-Wu Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
| | - Yufei Huang
- Department of Electrical and Computer Engineering, University of Texas at San Antonio, San Antonio, TX, 78249, USA
- Department of Epidemiology and Biostatistics, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| | - Xiujuan Lei
- School of Computer Sciences, Shannxi Normal University, Xi’an, Shaanxi 710119, China
| | - Hui Liu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, 215123, China
- AI University Research Centre, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom
| |
Collapse
|
78
|
Dou L, Li X, Ding H, Xu L, Xiang H. Prediction of m5C Modifications in RNA Sequences by Combining Multiple Sequence Features. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 21:332-342. [PMID: 32645685 PMCID: PMC7340967 DOI: 10.1016/j.omtn.2020.06.004] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 06/03/2020] [Accepted: 06/04/2020] [Indexed: 12/14/2022]
Abstract
5-Methylcytosine (m5C) is a well-known post-transcriptional modification that plays significant roles in biological processes, such as RNA metabolism, tRNA recognition, and stress responses. Traditional high-throughput techniques on identification of m5C sites are usually time consuming and expensive. In addition, the number of RNA sequences shows explosive growth in the post-genomic era. Thus, machine-learning-based methods are urgently requested to quickly predict RNA m5C modifications with high accuracy. Here, we propose a noval support-vector-machine (SVM)-based tool, called iRNA-m5C_SVM, by combining multiple sequence features to identify m5C sites in Arabidopsis thaliana. Eight kinds of popular feature-extraction methods were first investigated systematically. Then, four well-performing features were incorporated to construct a comprehensive model, including position-specific propensity (PSP) (PSNP, PSDP, and PSTP, associated with frequencies of nucleotides, dinucleotides, and trinucleotides, respectively), nucleotide composition (nucleic acid, di-nucleotide, and tri-nucleotide compositions; NAC, DNC, and TNC, respectively), electron-ion interaction pseudopotentials of trinucleotide (PseEIIPs), and general parallel correlation pseudo-dinucleotide composition (PC-PseDNC-general). Evaluated accuracies over 10-fold cross-validation and independent tests achieved 73.06% and 80.15%, respectively, which showed the best predictive performances in A. thaliana among existing models. It is believed that the proposed model in this work can be a promising alternative for further research on m5C modification sites in plant.
Collapse
Affiliation(s)
- Lijun Dou
- School of Automotive and Transportation Engineering, Shenzhen Polytechnic, Shenzhen, China; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiaoling Li
- Department of Oncology, Heilongjiang Province Land Reclamation Headquarters General Hospital, Harbin, China
| | - Hui Ding
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China.
| | - Huaikun Xiang
- School of Automotive and Transportation Engineering, Shenzhen Polytechnic, Shenzhen, China.
| |
Collapse
|
79
|
Liu S, Li B, Liang Q, Liu A, Qu L, Yang J. Classification and function of RNA-protein interactions. WILEY INTERDISCIPLINARY REVIEWS-RNA 2020; 11:e1601. [PMID: 32488992 DOI: 10.1002/wrna.1601] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 04/15/2020] [Accepted: 04/29/2020] [Indexed: 12/11/2022]
Abstract
Almost all RNAs need to interact with proteins to fully exert their functions, and proteins also bind to RNAs to act as regulators. It has now become clear that RNA-protein interactions play important roles in many biological processes among organisms. Despite the great progress that has been made in the field, there is still no precise classification system for RNA-protein interactions, which makes it challenging to further decipher the functions and mechanisms of these interactions. In this review, we propose four different categories of RNA-protein interactions according to their basic characteristics: RNA motif-dependent RNA-protein interactions, RNA structure-dependent RNA-protein interactions, RNA modification-dependent RNA-protein interactions, and RNA guide-based RNA-protein interactions. Moreover, the integration of different types of RNA-protein interactions and the regulatory factors implicated in these interactions are discussed. Furthermore, we emphasize the functional diversity of these four types of interactions in biological processes and disease development and assess emerging trends in this exciting research field. This article is categorized under: RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications RNA Interactions with Proteins and Other Molecules > Protein-RNA Recognition RNA Processing > RNA Editing and Modification.
Collapse
Affiliation(s)
- Shurong Liu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Bin Li
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Qiaoxia Liang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Anrui Liu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Lianghu Qu
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Jianhua Yang
- MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory for Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China.,Department of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, China
| |
Collapse
|
80
|
Salekin S, Mostavi M, Chiu YC, Chen Y, Zhang J(M, Huang Y. Predicting sites of epitranscriptome modifications using unsupervised representation learning based on generative adversarial networks. FRONTIERS IN PHYSICS 2020; 8:196. [PMID: 33274189 PMCID: PMC7710330 DOI: 10.3389/fphy.2020.00196] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Epitranscriptome is an exciting area that studies different types of modifications in transcripts and the prediction of such modification sites from the transcript sequence is of significant interest. However, the scarcity of positive sites for most modifications imposes critical challenges for training robust algorithms. To circumvent this problem, we propose MR-GAN, a generative adversarial network (GAN) based model, which is trained in an unsupervised fashion on the entire pre-mRNA sequences to learn a low dimensional embedding of transcriptomic sequences. MR-GAN was then applied to extract embeddings of the sequences in a training dataset we created for eight epitranscriptome modifications, including m6A, m1A, m1G, m2G, m5C, m5U, 2'-O-Me, Pseudouridine (Ψ) and Dihydrouridine (D), of which the positive samples are very limited. Prediction models were trained based on the embeddings extracted by MR-GAN. We compared the prediction performance with the one-hot encoding of the training sequences and SRAMP, a state-of-the-art m6A site prediction algorithm and demonstrated that the learned embeddings outperform one-hot encoding by a significant margin for up to 15% improvement. Using MR-GAN, we also investigated the sequence motifs for each modification type and uncovered known motifs as well as new motifs not possible with sequences directly. The results demonstrated that transcriptome features extracted using unsupervised learning could lead to high precision for predicting multiple types of epitranscriptome modifications, even when the data size is small and extremely imbalanced.
Collapse
Affiliation(s)
- Sirajul Salekin
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78207, USA
| | - Milad Mostavi
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78207, USA
| | - Yu-Chiao Chiu
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA
| | - Yidong Chen
- Greehey Children’s Cancer Research Institute, University of Texas Health San Antonio, San Antonio, TX, 78229, USA
- Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX, 78229, USA
| | - Jianqiu (Michelle) Zhang
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78207, USA
| | - Yufei Huang
- Department of Electrical and Computer Engineering, the University of Texas at San Antonio, San Antonio, TX, 78207, USA
- Department of Population Health Sciences, University of Texas Health San Antonio, San Antonio, TX, 78229, USA
| |
Collapse
|
81
|
Li J, Huang Y, Zhou Y. A Mini-review of the Computational Methods Used in Identifying RNA 5-Methylcytosine Sites. Curr Genomics 2020; 21:3-10. [PMID: 32655293 PMCID: PMC7324889 DOI: 10.2174/2213346107666200219124951] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 01/17/2020] [Accepted: 01/31/2020] [Indexed: 01/10/2023] Open
Abstract
RNA 5-methylcytosine (m5C) is one of the pillars of post-transcriptional modification (PTCM). A growing body of evidence suggests that m5C plays a vital role in RNA metabolism. Accurate localization of RNA m5C sites in tissue cells is the premise and basis for the in-depth understanding of the functions of m5C. However, the main experimental methods of detecting m5C sites are limited to varying degrees. Establishing a computational model to predict modification sites is an excellent complement to wet experiments for identifying m5C sites. In this review, we summarized some available m5C predictors and discussed the characteristics of these methods.
Collapse
Affiliation(s)
- Jianwei Li
- 1Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China; 2Department of Biomedical Informatics, School of Basic Medical Sciences, Center for Noncoding RNA Medicine, Peking University, Beijing, China
| | - Yan Huang
- 1Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China; 2Department of Biomedical Informatics, School of Basic Medical Sciences, Center for Noncoding RNA Medicine, Peking University, Beijing, China
| | - Yuan Zhou
- 1Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China; 2Department of Biomedical Informatics, School of Basic Medical Sciences, Center for Noncoding RNA Medicine, Peking University, Beijing, China
| |
Collapse
|
82
|
iPseU-Layer: Identifying RNA Pseudouridine Sites Using Layered Ensemble Model. Interdiscip Sci 2020; 12:193-203. [PMID: 32170573 DOI: 10.1007/s12539-020-00362-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Revised: 02/16/2020] [Accepted: 02/19/2020] [Indexed: 01/28/2023]
Abstract
Pseudouridine represents one of the most prevalent post-transcriptional RNA modifications. The identification of pseudouridine sites is an essential step toward understanding RNA functions, RNA structure stabilization, translation process, and RNA stability; however, high-throughput experimental techniques remain expensive and time-consuming in lab explorations and biochemical processes. Thus, how to develop an efficient pseudouridine site identification method based on machine learning is very important both in academic research and drug development. Motived by this, we present an effective layered ensemble model designated as iPseU-Layer for identification of RNA pseudouridine sites. The proposed iPseU-Layer approach is essentially based on three different machine learning layers including: feature selection layer, feature extraction and fusion layer, and prediction layer. The feature selection layer reduces the dimensionality, which can be regarded as a data pre-processing stage. The feature extraction and fusion layer utilizes an ensemble method which is implemented through various machine learning algorithms to generate some outputs. The prediction layer applies classic random forest to identify the final results. Furthermore, we systematically conduct the validation experiments using cross-validation tests and independent test with the current state-of-the-art models. The proposed iPseU-Layer provides a promising predictive performance in terms of sensitivity, specificity, accuracy and Matthews correlation coefficient. Collectively, these findings indicate that the framework of iPseU-Layer is a feasible and effective strategy for the prediction of RNA pseudouridine sites.
Collapse
|
83
|
Dou L, Li X, Ding H, Xu L, Xiang H. Is There Any Sequence Feature in the RNA Pseudouridine Modification Prediction Problem? MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 19:293-303. [PMID: 31865116 PMCID: PMC6931122 DOI: 10.1016/j.omtn.2019.11.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 10/29/2019] [Accepted: 11/11/2019] [Indexed: 01/01/2023]
Abstract
Pseudouridine (Ψ) is the most abundant RNA modification and has been found in many kinds of RNAs, including snRNA, rRNA, tRNA, mRNA, and snoRNA. Thus, Ψ sites play a significant role in basic research and drug development. Although some experimental techniques have been developed to identify Ψ sites, they are expensive and time consuming, especially in the post-genomic era with the explosive growth of known RNA sequences. Thus, highly accurate computational methods are urgently required to quickly detect the Ψ sites on uncharacterized RNA sequences. Several predictors have been proposed using multifarious features, but their evaluated performances are still unsatisfactory. In this study, we first identified Ψ sites for H. sapiens, S. cerevisiae, and M. musculus using the sequence features from the bi-profile Bayes (BPB) method based on the random forest (RF) and support vector machine (SVM) algorithms, where the performances were evaluated using 5-fold cross-validation and independent tests. It was found that the SVM-based accuracies were 3.55% and 5.09% lower than the iPseU-CUU predictor for the H_990 and S_628 datasets, respectively. Almost the same-level results were obtained for M_994 and an independent H_200 dataset, even showing a 5.0% improvement for S_200. Then, three different kinds of features, including basic Kmer, general parallel correlation pseudo-dinucleotide composition (PC-PseDNC-General), and nucleotide chemical property (NCP) and nucleotide density (ND) from the iRNA-PseU method, were combined with BPB to show their comprehensive performances, where the effective features are selected by the max-relevance-max-distance (MRMD) method. The best evaluated accuracies of the combined features for the S_628 and M_994 datasets were achieved at 70.54% and 72.45%, which were 2.39% and 0.65% higher than iPseU-CUU. For the S_200 dataset, it was also improved 8% from 69% to 77%. However, there was no obvious improvement for H. sapiens, which was evaluated as approximately 63.23% and 72.0% for the H_990 and H_200 datasets, respectively. The overall performances for Ψ identification using BPB features as well as the combined features were not obviously improved. Although some kinds of feature extraction methods based on the RNA sequence information have been applied to construct the predictors in previous studies, the corresponding accuracies are generally in the range of 60%-70%. Thus, researchers need to reconsider whether there is any sequence feature in the RNA Ψ modification prediction problem.
Collapse
Affiliation(s)
- Lijun Dou
- School of Automotive and Transportation Engineering, Shenzhen Polytechnic, Shenzhen, China; Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiaoling Li
- Department of Oncology, Heilongjiang Province Land Reclamation Headquarters General Hospital, Harbin, China
| | - Hui Ding
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China.
| | - Huaikun Xiang
- School of Automotive and Transportation Engineering, Shenzhen Polytechnic, Shenzhen, China.
| |
Collapse
|
84
|
Small ID, Schallenberg-Rüdinger M, Takenaka M, Mireau H, Ostersetzer-Biran O. Plant organellar RNA editing: what 30 years of research has revealed. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:1040-1056. [PMID: 31630458 DOI: 10.1111/tpj.14578] [Citation(s) in RCA: 213] [Impact Index Per Article: 42.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 09/25/2019] [Accepted: 10/08/2019] [Indexed: 05/21/2023]
Abstract
The central dogma in biology defines the flow of genetic information from DNA to RNA to protein. Accordingly, RNA molecules generally accurately follow the sequences of the genes from which they are transcribed. This rule is transgressed by RNA editing, which creates RNA products that differ from their DNA templates. Analyses of the RNA landscapes of terrestrial plants have indicated that RNA editing (in the form of C-U base transitions) is highly prevalent within organelles (that is, mitochondria and chloroplasts). Numerous C→U conversions (and in some plants also U→C) alter the coding sequences of many of the organellar transcripts and can also produce translatable mRNAs by creating AUG start sites or eliminating premature stop codons, or affect the RNA structure, influence splicing and alter the stability of RNAs. RNA-binding proteins are at the heart of post-transcriptional RNA expression. The C-to-U RNA editing process in plant mitochondria involves numerous nuclear-encoded factors, many of which have been identified as pentatricopeptide repeat (PPR) proteins that target editing sites in a sequence-specific manner. In this review we report on major discoveries on RNA editing in plant organelles, since it was first documented 30 years ago.
Collapse
Affiliation(s)
- Ian D Small
- ARC Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, WA, 6009, Australia
| | - Mareike Schallenberg-Rüdinger
- IZMB - Institut für Zelluläre und Molekulare Botanik, Abt. Molekulare Evolution, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
| | - Mizuki Takenaka
- Department of Botany, Graduate School of Science, Kyoto University, Oiwake-cho, Sakyo-ku, Kyoto, 606-8502, Japan
| | - Hakim Mireau
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, RD10, 78026, Versailles Cedex, France
| | - Oren Ostersetzer-Biran
- Department of Plant and Environmental Sciences, Institute of Life Sciences, The Hebrew University of Jerusalem, Edmond J. Safra Campus - Givat Ram, Jerusalem, 9190401, Israel
| |
Collapse
|
85
|
Abstract
Background Pseudouridine modification is most commonly found among various kinds of RNA modification occurred in both prokaryotes and eukaryotes. This biochemical event has been proved to occur in multiple types of RNAs, including rRNA, mRNA, tRNA, and nuclear/nucleolar RNA. Hence, gaining a holistic understanding of pseudouridine modification can contribute to the development of drug discovery and gene therapies. Although some laboratory techniques have come up with moderately good outcomes in pseudouridine identification, they are costly and required skilled work experience. We propose iPseU-NCP – an efficient computational framework to predict pseudouridine sites using the Random Forest (RF) algorithm combined with nucleotide chemical properties (NCP) generated from RNA sequences. The benchmark dataset collected from Chen et al. (2016) was used to develop iPseU-NCP and fairly compare its performances with other methods. Results Under the same experimental settings, comparing with three state-of-the-art methods including iPseU-CNN, PseUI, and iRNA-PseU, the Matthew’s correlation coefficient (MCC) of our model increased by about 20.0%, 55.0%, and 109.0% when tested on the H. sapiens (H_200) dataset and by about 6.5%, 35.0%, and 150.0% when tested on the S. cerevisiae (S_200) dataset, respectively. This significant growth in MCC is very important since it ensures the stability and performance of our model. With those two independent test datasets, our model also presented higher accuracy with a success rate boosted by 7.0%, 13.0%, and 20.0% and 2.0%, 9.5%, and 25.0% when compared to iPseU-CNN, PseUI, and iRNA-PseU, respectively. For majority of other evaluation metrics, iPseU-NCP demonstrated superior performance as well. Conclusions iPseU-NCP combining the RF and NPC-encoded features showed better performances than other existing state-of-the-art methods in the identification of pseudouridine sites. This also shows an optimistic view in addressing biological issues related to human diseases.
Collapse
|
86
|
Zhang C, Wang Y, Peng Y, Xu H, Zhou X. METTL3 regulates inflammatory pain by modulating m 6A-dependent pri-miR-365-3p processing. FASEB J 2019; 34:122-132. [PMID: 31914601 DOI: 10.1096/fj.201901555r] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2019] [Revised: 09/15/2019] [Accepted: 10/15/2019] [Indexed: 01/24/2023]
Abstract
N6-methyladenosine (m6A) modification in RNA has been implicated in diverse biological processes. However, very little is currently known about its role in nociceptive modulation. Here, we found that the level of spinal m6A modification was significantly increased in a mouse model of Complete Freund's Adjuvant (CFA)-induced chronic inflammatory pain, which was accompanied with the augmentation of methyltransferase-like 3 (METTL3) expression in the spinal cord. Knockdown of spinal METTL3 prevented and reversed CFA-induced pain behaviors and spinal neuronal sensitization. In contrast, overexpression of spinal METTL3 produced pain behaviors and neuronal sensitization in naive mice. Moreover, we found that METTL3 positively modulated the pri-miR-65-3p processing in a microprocessor protein DiGeorge critical region 8-dependent manner. Collectively, our findings reveal an important role of METTL3-mediated m6A modification in nociceptive sensitization and provide a novel perspective on m6A modification in the development of pathological pain.
Collapse
Affiliation(s)
- Chenjing Zhang
- Department of Gastroenterology, Zhejiang Province People's Hospital, Hangzhou, China
| | - Yin Wang
- Department of Anesthesiology, Taizhou People's Hospital, Taizhou, China
| | - Yunan Peng
- Department of Anesthesiology, Affiliated Drum Tower Hospital of Nanjing University Medical School, Nanjing, China
| | - Hongjiao Xu
- Department of Anesthesiology, the First People's Hospital of Shanghai Transportation University, Shanghai, China
| | - Xuelong Zhou
- Department of Anesthesiology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, China
| |
Collapse
|
87
|
Chen Z, Zhao P, Li F, Wang Y, Smith AI, Webb GI, Akutsu T, Baggag A, Bensmail H, Song J. Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences. Brief Bioinform 2019; 21:1676-1696. [DOI: 10.1093/bib/bbz112] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 07/31/2019] [Accepted: 08/07/2019] [Indexed: 12/14/2022] Open
Abstract
Abstract
RNA post-transcriptional modifications play a crucial role in a myriad of biological processes and cellular functions. To date, more than 160 RNA modifications have been discovered; therefore, accurate identification of RNA-modification sites is fundamental for a better understanding of RNA-mediated biological functions and mechanisms. However, due to limitations in experimental methods, systematic identification of different types of RNA-modification sites remains a major challenge. Recently, more than 20 computational methods have been developed to identify RNA-modification sites in tandem with high-throughput experimental methods, with most of these capable of predicting only single types of RNA-modification sites. These methods show high diversity in their dataset size, data quality, core algorithms, features extracted and feature selection techniques and evaluation strategies. Therefore, there is an urgent need to revisit these methods and summarize their methodologies, in order to improve and further develop computational techniques to identify and characterize RNA-modification sites from the large amounts of sequence data. With this goal in mind, first, we provide a comprehensive survey on a large collection of 27 state-of-the-art approaches for predicting N1-methyladenosine and N6-methyladenosine sites. We cover a variety of important aspects that are crucial for the development of successful predictors, including the dataset quality, operating algorithms, sequence and genomic features, feature selection, model performance evaluation and software utility. In addition, we also provide our thoughts on potential strategies to improve the model performance. Second, we propose a computational approach called DeepPromise based on deep learning techniques for simultaneous prediction of N1-methyladenosine and N6-methyladenosine. To extract the sequence context surrounding the modification sites, three feature encodings, including enhanced nucleic acid composition, one-hot encoding, and RNA embedding, were used as the input to seven consecutive layers of convolutional neural networks (CNNs), respectively. Moreover, DeepPromise further combined the prediction score of the CNN-based models and achieved around 43% higher area under receiver-operating curve (AUROC) for m1A site prediction and 2–6% higher AUROC for m6A site prediction, respectively, when compared with several existing state-of-the-art approaches on the independent test. In-depth analyses of characteristic sequence motifs identified from the convolution-layer filters indicated that nucleotide presentation at proximal positions surrounding the modification sites contributed most to the classification, whereas those at distal positions also affected classification but to different extents. To maximize user convenience, a web server was developed as an implementation of DeepPromise and made publicly available at http://DeepPromise.erc.monash.edu/, with the server accepting both RNA sequences and genomic sequences to allow prediction of two types of putative RNA-modification sites.
Collapse
Affiliation(s)
- Zhen Chen
- School of BasicMedical Science, Qingdao University, China
| | - Pei Zhao
- State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences, Anyang 455000, Henan, China
| | - Fuyi Li
- Northwest A&F University, China
| | | | - A Ian Smith
- Prince Henrys Institute Melbourne and Monash University, Australia
| | | | | | - Abdelkader Baggag
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha 34110, Qatar
| | - Halima Bensmail
- Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha 34110, Qatar
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Victoria 3800, Australia
| |
Collapse
|
88
|
Baquero-Perez B, Antanaviciute A, Yonchev ID, Carr IM, Wilson SA, Whitehouse A. The Tudor SND1 protein is an m 6A RNA reader essential for replication of Kaposi's sarcoma-associated herpesvirus. eLife 2019; 8:e47261. [PMID: 31647415 PMCID: PMC6812964 DOI: 10.7554/elife.47261] [Citation(s) in RCA: 98] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 09/19/2019] [Indexed: 02/06/2023] Open
Abstract
N6-methyladenosine (m6A) is the most abundant internal RNA modification of cellular mRNAs. m6A is recognised by YTH domain-containing proteins, which selectively bind to m6A-decorated RNAs regulating their turnover and translation. Using an m6A-modified hairpin present in the Kaposi's sarcoma associated herpesvirus (KSHV) ORF50 RNA, we identified seven members from the 'Royal family' as putative m6A readers, including SND1. RIP-seq and eCLIP analysis characterised the SND1 binding profile transcriptome-wide, revealing SND1 as an m6A reader. We further demonstrate that the m6A modification of the ORF50 RNA is critical for SND1 binding, which in turn stabilises the ORF50 transcript. Importantly, SND1 depletion leads to inhibition of KSHV early gene expression showing that SND1 is essential for KSHV lytic replication. This work demonstrates that members of the 'Royal family' have m6A-reading ability, greatly increasing their epigenetic functions beyond protein methylation.
Collapse
Affiliation(s)
- Belinda Baquero-Perez
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, Astbury Centre of Structural Molecular BiologyUniversity of LeedsLeedsUnited Kingdom
- Astbury Centre of Structural Molecular BiologyUniversity of LeedsLeedsUnited Kingdom
| | - Agne Antanaviciute
- Leeds Institute of Medical Research, School of MedicineUniversity of Leeds, St James's University HospitalLeedsUnited Kingdom
| | - Ivaylo D Yonchev
- Department of Molecular Biology and BiotechnologyUniversity of SheffieldSheffieldUnited Kingdom
- Sheffield Institute For Nucleic AcidsUniversity of SheffieldSheffieldUnited Kingdom
| | - Ian M Carr
- Leeds Institute of Medical Research, School of MedicineUniversity of Leeds, St James's University HospitalLeedsUnited Kingdom
| | - Stuart A Wilson
- Department of Molecular Biology and BiotechnologyUniversity of SheffieldSheffieldUnited Kingdom
- Sheffield Institute For Nucleic AcidsUniversity of SheffieldSheffieldUnited Kingdom
| | - Adrian Whitehouse
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, Astbury Centre of Structural Molecular BiologyUniversity of LeedsLeedsUnited Kingdom
- Astbury Centre of Structural Molecular BiologyUniversity of LeedsLeedsUnited Kingdom
- Department of Biochemistry and MicrobiologyRhodes UniversityGrahamstownSouth Africa
| |
Collapse
|
89
|
Fang T, Zhang Z, Sun R, Zhu L, He J, Huang B, Xiong Y, Zhu X. RNAm5CPred: Prediction of RNA 5-Methylcytosine Sites Based on Three Different Kinds of Nucleotide Composition. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 18:739-747. [PMID: 31726390 PMCID: PMC6859278 DOI: 10.1016/j.omtn.2019.10.008] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Revised: 10/11/2019] [Accepted: 10/11/2019] [Indexed: 12/11/2022]
Abstract
5-methylcytosine (m5C) is one of the most common and abundant post-transcriptional modifications (PTCMs) in RNA. Recent studies showed that m5C plays important roles in many biological functions such as RNA metabolism and cell fate decision. Because most experimental methods that determine m5C sites across the transcriptome are time-consuming and expensive, it is urgent to develop accurate computational methods to identify m5C sites effectively. A benchmark dataset is important for developing and evaluating computational methods. In this work, we constructed four different datasets according to the data redundancy and imbalance. Based on these datasets, we generated three different kinds of features, i.e., KNFs (K-nucleotide frequencies), KSNPFs (K-spaced nucleotide pair frequencies), and pseDNC (pseudo-dinucleotide composition), and then used a support vector machine (SVM) to build our models. Based on the imbalanced and nonredundant dataset, Met935, we extensively studied the three kinds of features and determined an optimal combination of the features. Based on the feature combination, we built models on the three different datasets and compared them with state-of-the-art models. According to the predictive results of the stringent jackknife test, the models based on the three features, 4NF, 1SNPF, and pseDNC, are superior or comparable to other methods. To determine the best model between the models based on the imbalanced dataset Met935 and the balanced dataset Met240, we further evaluated the two models on an independent test set Test1157. Our results demonstrate that the model based on the balanced dataset Met240 achieved the highest recall (68.79%) and the highest Matthews correlation coefficient (MCC) (0.154). In addition, the model is also superior to other state-of-the-art methods according to the integrated parameter MCC on the independent test set. Thus, we selected the model based on Met240 as our final model, which was named RNAm5CPred. In addition, a web server for RNAm5CPred (http://zhulab.ahu.edu.cn/RNAm5CPred/) has been provided to facilitate experimental research.
Collapse
Affiliation(s)
- Ting Fang
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China; School of Life Sciences, Anhui University, Hefei, Anhui 230601, China
| | - Zizheng Zhang
- School of Life Sciences, Anhui University, Hefei, Anhui 230601, China
| | - Rui Sun
- Beijing Baidu Netcom Sciences and Technology Co., Ltd., Beijing, China
| | - Lin Zhu
- School of Computer Science and Technology, Anhui University, Hefei, Anhui 230601, China
| | - Jingjing He
- School of Life Sciences, Anhui University, Hefei, Anhui 230601, China
| | - Bei Huang
- School of Life Sciences, Anhui University, Hefei, Anhui 230601, China.
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Xiaolei Zhu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China; School of Life Sciences, Anhui University, Hefei, Anhui 230601, China.
| |
Collapse
|
90
|
Ashraf S, Huang L, Lilley DMJ. Effect of methylation of adenine N 6 on kink turn structure depends on location. RNA Biol 2019; 16:1377-1385. [PMID: 31234702 PMCID: PMC6779385 DOI: 10.1080/15476286.2019.1630797] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 05/30/2019] [Accepted: 06/03/2019] [Indexed: 12/22/2022] Open
Abstract
N6-methyladenine is the most common covalent modification in cellular RNA species, with demonstrated functional consequences. At the molecular level this methylation could alter local RNA structure, and/or modulate the binding of specific proteins. We have previously shown that trans-Hoogsteen-sugar (sheared) A:G base pairs can be completely disrupted by methylation, and that this occurs in a sub-set ofD/D k-turn structures. In this work we have investigated to what extent sequence context affects the severity with which inclusion of N6-methyladenine into different A:G base pairs of a standard k-turn affects RNA folding and L7Ae protein binding. We find that local sequence has a major influence, ranging from complete absence of folding and protein binding to a relatively mild effect. We have determined the crystal structure of one of these species both free and protein-bound, showing the environment of the methyl group and the way the modification is accommodated into the k-turn structure.
Collapse
Affiliation(s)
- Saira Ashraf
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dundee, U.K
| | - Lin Huang
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dundee, U.K
| | - David M. J. Lilley
- Cancer Research UK Nucleic Acid Structure Research Group, MSI/WTB Complex, The University of Dundee, Dundee, U.K
| |
Collapse
|
91
|
Efficient Epidermal Growth Factor Receptor Targeting Oligonucleotide as a Potential Molecule for Targeted Cancer Therapy. Int J Mol Sci 2019; 20:ijms20194700. [PMID: 31546749 PMCID: PMC6801465 DOI: 10.3390/ijms20194700] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Revised: 09/18/2019] [Accepted: 09/20/2019] [Indexed: 01/19/2023] Open
Abstract
Epidermal growth factor receptor (EGFR) is associated with the progression of a wide range of cancers including breast, glioma, lung, and liver cancer. The observation that EGFR inhibition can limit the growth of EGFR positive cancers has led to the development of various EGFR inhibitors including monoclonal antibodies and small-molecule inhibitors. However, the reported toxicity and drug resistance greatly compromised the clinical outcome of such inhibitors. As a type of chemical antibodies, nucleic acid aptamer provides an opportunity to overcome the obstacles faced by current EGFR inhibitors. In this study, we have developed and investigated the therapeutic potential of a 27mer aptamer CL-4RNV616 containing 2′-O-Methyl RNA and DNA nucleotides. Our results showed that CL-4RNV616 not only displayed enhanced stability in human serum, but also effectively recognized and inhibited the proliferation of EGFR positive Huh-7 liver cancer, MDA-MB-231 breast cancer, and U87MG glioblastoma cells, with an IC50 value of 258.9 nM, 413.7 nM, and 567.9 nM, respectively. Furthermore, TUNEL apoptosis assay revealed that CL-4RNV616 efficiently induced apoptosis of cancer cells. In addition, clinical breast cancer biopsy-based immunostaining assay demonstrated that CL-4RNV616 had a comparable detection efficacy for EGFR positive breast cancer with commonly used commercial antibodies. Based on the results, we firmly believe that CL-4RNV616 could be useful in the development of targeted cancer therapeutics and diagnostics.
Collapse
|
92
|
Chen W, Song X, Lv H, Lin H. iRNA-m2G: Identifying N 2-methylguanosine Sites Based on Sequence-Derived Information. MOLECULAR THERAPY-NUCLEIC ACIDS 2019; 18:253-258. [PMID: 31581049 PMCID: PMC6796771 DOI: 10.1016/j.omtn.2019.08.023] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 08/06/2019] [Accepted: 08/19/2019] [Indexed: 12/11/2022]
Abstract
RNA N2-methylguanosine (m2G) is one kind of posttranscriptional modification and plays crucial roles in the control and stabilization of tRNA. However, our knowledge about the biological functions of m2G is still limited. The key step of revealing its new function is to recognize the m2G sites in the transcriptome. Since there is no effective method for detecting m2G sites, it is desirable to develop new methods to identify m2G sites. In this study, a computational predictor called iRNA-m2G was proposed to identify m2G sites in eukaryotic transcriptomes. In iRNA-m2G, the RNA sequences were encoded by using nucleotide chemical property and accumulated nucleotide frequency. iRNA-m2G was not only validated by the rigorous jackknife test on the benchmark dataset but also examined by performing cross-species validations. In addition, iRNA-m2G was also tested on an independent dataset. It was found that the accuracies obtained by iRNA-m2G were all quite promising in these tests, indicating that the proposed method could become a powerful tool for identifying m2G sites. Finally, a user-friendly web server for iRNA-m2G is freely accessible at http://lin-group.cn/server/iRNA-m2G.php.
Collapse
Affiliation(s)
- Wei Chen
- Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611730, China; Center for Genomics and Computational Biology, School of Life Sciences, North China University of Science and Technology, Tangshan 063000, China.
| | - Xiaoming Song
- Center for Genomics and Computational Biology, School of Life Sciences, North China University of Science and Technology, Tangshan 063000, China
| | - Hao Lv
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hao Lin
- Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.
| |
Collapse
|
93
|
Zheng Y, Nie P, Peng D, He Z, Liu M, Xie Y, Miao Y, Zuo Z, Ren J. m6AVar: a database of functional variants involved in m6A modification. Nucleic Acids Res 2019; 46:D139-D145. [PMID: 29036329 PMCID: PMC5753261 DOI: 10.1093/nar/gkx895] [Citation(s) in RCA: 167] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 09/23/2017] [Indexed: 01/10/2023] Open
Abstract
Identifying disease-causing variants among a large number of single nucleotide variants (SNVs) is still a major challenge. Recently, N6-methyladenosine (m6A) has become a research hotspot because of its critical roles in many fundamental biological processes and a variety of diseases. Therefore, it is important to evaluate the effect of variants on m6A modification, in order to gain a better understanding of them. Here, we report m6AVar (http://m6avar.renlab.org), a comprehensive database of m6A-associated variants that potentially influence m6A modification, which will help to interpret variants by m6A function. The m6A-associated variants were derived from three different m6A sources including miCLIP/PA-m6A-seq experiments (high confidence), MeRIP-Seq experiments (medium confidence) and transcriptome-wide predictions (low confidence). Currently, m6AVar contains 16 132 high, 71 321 medium and 326 915 low confidence level m6A-associated variants. We also integrated the RBP-binding regions, miRNA-targets and splicing sites associated with variants to help users investigate the effect of m6A-associated variants on post-transcriptional regulation. Because it integrates the data from genome-wide association studies (GWAS) and ClinVar, m6AVar is also a useful resource for investigating the relationship between the m6A-associated variants and disease. Overall, m6AVar will serve as a useful resource for annotating variants and identifying disease-causing variants.
Collapse
Affiliation(s)
- Yueyuan Zheng
- Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University, Guangzhou 510060, China.,State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Peng Nie
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Di Peng
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Zhihao He
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Mengni Liu
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Yubin Xie
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Yanyan Miao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China
| | - Zhixiang Zuo
- Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University, Guangzhou 510060, China
| | - Jian Ren
- Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University, Guangzhou 510060, China.,State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275, China.,Collaborative Innovation Center of High Performance Computing, National University of Defense Technology, Changsha 410073, China
| |
Collapse
|
94
|
Zheng LL, Zhou KR, Liu S, Zhang DY, Wang ZL, Chen ZR, Yang JH, Qu LH. dreamBase: DNA modification, RNA regulation and protein binding of expressed pseudogenes in human health and disease. Nucleic Acids Res 2019; 46:D85-D91. [PMID: 29059382 PMCID: PMC5753186 DOI: 10.1093/nar/gkx972] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 10/12/2017] [Indexed: 12/28/2022] Open
Abstract
Although thousands of pseudogenes have been annotated in the human genome, their transcriptional regulation, expression profiles and functional mechanisms are largely unknown. In this study, we developed dreamBase (http://rna.sysu.edu.cn/dreamBase) to facilitate the investigation of DNA modification, RNA regulation and protein binding of potential expressed pseudogenes from multidimensional high-throughput sequencing data. Based on ∼5500 ChIP-seq and DNase-seq datasets, we identified genome-wide binding profiles of various transcription-associated factors around pseudogene loci. By integrating ∼18 000 RNA-seq data, we analysed the expression profiles of pseudogenes and explored their co-expression patterns with their parent genes in 32 cancers and 31 normal tissues. By combining microRNA binding sites, we demonstrated complex post-transcriptional regulation networks involving 275 microRNAs and 1201 pseudogenes. We generated ceRNA networks to illustrate the crosstalk between pseudogenes and their parent genes through competitive binding of microRNAs. In addition, we studied transcriptome-wide interactions between RNA binding proteins (RBPs) and pseudogenes based on 458 CLIP-seq datasets. In conjunction with epitranscriptome sequencing data, we also mapped 1039 RNA modification sites onto 635 pseudogenes. This database will provide insights into the transcriptional regulation, expression, functions and mechanisms of pseudogenes as well as their roles in biological processes and diseases.
Collapse
Affiliation(s)
- Ling-Ling Zheng
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Ke-Ren Zhou
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Shun Liu
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Ding-Yao Zhang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Ze-Lin Wang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Zhi-Rong Chen
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Jian-Hua Yang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Liang-Hu Qu
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| |
Collapse
|
95
|
Boccaletto P, Machnicka MA, Purta E, Piatkowski P, Baginski B, Wirecki TK, de Crécy-Lagard V, Ross R, Limbach PA, Kotter A, Helm M, Bujnicki JM. MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res 2019; 46:D303-D307. [PMID: 29106616 PMCID: PMC5753262 DOI: 10.1093/nar/gkx1030] [Citation(s) in RCA: 1356] [Impact Index Per Article: 226.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 10/18/2017] [Indexed: 12/13/2022] Open
Abstract
MODOMICS is a database of RNA modifications that provides comprehensive information concerning the chemical structures of modified ribonucleosides, their biosynthetic pathways, the location of modified residues in RNA sequences, and RNA-modifying enzymes. In the current database version, we included the following new features and data: extended mass spectrometry and liquid chromatography data for modified nucleosides; links between human tRNA sequences and MINTbase - a framework for the interactive exploration of mitochondrial and nuclear tRNA fragments; new, machine-friendly system of unified abbreviations for modified nucleoside names; sets of modified tRNA sequences for two bacterial species, updated collection of mammalian tRNA modifications, 19 newly identified modified ribonucleosides and 66 functionally characterized proteins involved in RNA modification. Data from MODOMICS have been linked to the RNAcentral database of RNA sequences. MODOMICS is available at http://modomics.genesilico.pl.
Collapse
Affiliation(s)
- Pietro Boccaletto
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Magdalena A Machnicka
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.,Institute of Informatics, University of Warsaw, Banacha 2, PL-02-097 Warsaw, Poland
| | - Elzbieta Purta
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Pawel Piatkowski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Blazej Baginski
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Tomasz K Wirecki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | | | - Robert Ross
- Department of Chemistry, Rieveschl Laboratories for Mass Spectrometry, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Patrick A Limbach
- Department of Chemistry, Rieveschl Laboratories for Mass Spectrometry, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Annika Kotter
- Institut für Pharmazie und Biochemie, Johannes Gutenberg-Universität, Staudinger Weg 5, D-55128 Mainz, Germany
| | - Mark Helm
- Institut für Pharmazie und Biochemie, Johannes Gutenberg-Universität, Staudinger Weg 5, D-55128 Mainz, Germany
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.,Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, PL-61-614 Poznan, Poland
| |
Collapse
|
96
|
Xuan JJ, Sun WJ, Lin PH, Zhou KR, Liu S, Zheng LL, Qu LH, Yang JH. RMBase v2.0: deciphering the map of RNA modifications from epitranscriptome sequencing data. Nucleic Acids Res 2019; 46:D327-D334. [PMID: 29040692 PMCID: PMC5753293 DOI: 10.1093/nar/gkx934] [Citation(s) in RCA: 306] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 10/02/2017] [Indexed: 12/26/2022] Open
Abstract
More than 100 distinct chemical modifications to RNA have been characterized so far. However, the prevalence, mechanisms and functions of various RNA modifications remain largely unknown. To provide transcriptome-wide landscapes of RNA modifications, we developed the RMBase v2.0 (http://rna.sysu.edu.cn/rmbase/), which is a comprehensive database that integrates epitranscriptome sequencing data for the exploration of post-transcriptional modifications of RNAs and their relationships with miRNA binding events, disease-related single-nucleotide polymorphisms (SNPs) and RNA-binding proteins (RBPs). RMBase v2.0 was expanded with ∼600 datasets and ∼1 397 000 modification sites from 47 studies among 13 species, which represents an approximately 10-fold expansion when compared with the previous release. It contains ∼1 373 000 N6-methyladenosines (m6A), ∼5400 N1-methyladenosines (m1A), ∼9600 pseudouridine (Ψ) modifications, ∼1000 5-methylcytosine (m5C) modifications, ∼5100 2′-O-methylations (2′-O-Me), and ∼2800 modifications of other modification types. Moreover, we built a new module called ‘Motif’ that provides the visualized logos and position weight matrices (PWMs) of the modification motifs. We also constructed a novel module termed ‘modRBP’ to study the relationships between RNA modifications and RBPs. Additionally, we developed a novel web-based tool named ‘modMetagene’ to plot the metagenes of RNA modification along a transcript model. This database will help researchers investigate the potential functions and mechanisms of RNA modifications.
Collapse
Affiliation(s)
- Jia-Jia Xuan
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Wen-Ju Sun
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Peng-Hui Lin
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Ke-Ren Zhou
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Shun Liu
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Ling-Ling Zheng
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Liang-Hu Qu
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | - Jian-Hua Yang
- Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-sen University, Guangzhou 510275, PR China.,State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| |
Collapse
|
97
|
Meng S, Zhou H, Feng Z, Xu Z, Tang Y, Wu M. Epigenetics in Neurodevelopment: Emerging Role of Circular RNA. Front Cell Neurosci 2019; 13:327. [PMID: 31379511 PMCID: PMC6658887 DOI: 10.3389/fncel.2019.00327] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 07/03/2019] [Indexed: 01/16/2023] Open
Abstract
Canonical epigenetic modifications, including DNA methylation, histone modification and chromatin remodeling, play a role in numerous life processes, particularly neurodevelopment. Epigenetics explains the development of cells in an organism with the same DNA sequence into different cell types with various functions. However, previous studies on epigenetics have only focused on the chromatin level. Recently, epigenetic modifications of RNA, which mainly include 6-methyladenosine (m6A), pseudouridine, 5-methylcytidine (m5C), inosine (I), 2′-O-ribosemethylation, and 1-methyladenosine (m1A), have gained increasing attention. Circular RNAs (circRNAs), which are a type of non-coding RNA without a 5′ cap or 3′ poly (A) tail, are abundantly found in the brain and might respond to and regulate synaptic function. Also, circRNAs have various functions, such as microRNA sponge, regulation of gene transcription and interaction with RNA binding protein. In addition, circRNAs are methylated by N6-methyladenosine (m6A). In this review, we discuss the crucial roles of epigenetic modifications of circRNAs, such as m6A, in the genesis and development of neurons and in synaptic function and plasticity. Thus, this type of changes in circRNAs might be a therapeutic target in central nervous system (CNS) disorders and could aid the diagnosis and treatment of these disorders.
Collapse
Affiliation(s)
- Shujuan Meng
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| | - Hecheng Zhou
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| | - Ziyang Feng
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| | - Zihao Xu
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| | - Ying Tang
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| | - Minghua Wu
- Hunan Provincial Tumor Hospital, The Affiliated Tumor Hospital of Xiangya Medical School, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China.,Key Laboratory of Carcinogenesis, Ministry of Health, Cancer Research Institute, School of Basic Medical Science, Central South University, Changsha, China
| |
Collapse
|
98
|
Akila Parvathy Dharshini S, Taguchi YH, Michael Gromiha M. Exploring the selective vulnerability in Alzheimer disease using tissue specific variant analysis. Genomics 2019; 111:936-949. [PMID: 29879491 DOI: 10.1016/j.ygeno.2018.05.024] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 05/03/2018] [Accepted: 05/30/2018] [Indexed: 02/08/2023]
Abstract
The selective vulnerability of distinct regions of the brain is a critical factor in neurodegenerative disorders. In Alzheimer's disease (AD), neurons in hippocampus situated in medial temporal lobe are immensely damaged. Identifying tissue-specific variants is essential in order to perceive the selective vulnerability in AD. In current work, we aligned mRNA-seq data with HG19/HG38 genomic assembly and identified specific variations present in temporal, frontal and other lobes of the AD using sequence alignment map tools. We compared the results with the genome-wide association and gene expression quantitative trait loci studies of the various neurological disorders. We also distinguished variants and epitranscriptomic modifications through the RNA-modification database and evaluated the variant effect in the coding/UTR regions. In addition, we developed genetic and functional interaction networks to understand the relationship between predicted vulnerable variations and differentially expressed genes. We found that genes involved in gliogenesis, intermediate filament organization are altered in the temporal lobe. Oxidative phosphorylation, and calcium ion homeostasis are modified in the frontal lobe, and protein degradation, apoptotic signaling are altered in other lobes. From this study, we propose that disruption of glial cell structural integrity, defective gliogenesis, and failure in glia-neuron communication are the primary factors for selective vulnerability.
Collapse
Affiliation(s)
- S Akila Parvathy Dharshini
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamilnadu, India
| | - Y-H Taguchi
- Department of Physics, Chuo University, Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamilnadu, India; Advanced Computational Drug Discovery Unit (ACDD), Institute of Innovative Research, Tokyo Institute of Technology, 4259 Nagatsutacho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan.
| |
Collapse
|
99
|
Lv H, Zhang ZM, Li SH, Tan JX, Chen W, Lin H. Evaluation of different computational methods on 5-methylcytosine sites identification. Brief Bioinform 2019; 21:982-995. [DOI: 10.1093/bib/bbz048] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 03/25/2019] [Accepted: 04/01/2019] [Indexed: 11/13/2022] Open
Abstract
Abstract
5-Methylcytosine (m5C) plays an extremely important role in the basic biochemical process. With the great increase of identified m5C sites in a wide variety of organisms, their epigenetic roles become largely unknown. Hence, accurate identification of m5C site is a key step in understanding its biological functions. Over the past several years, more attentions have been paid on the identification of m5C sites in multiple species. In this work, we firstly summarized the current progresses in computational prediction of m5C sites and then constructed a more powerful and reliable model for identifying m5C sites. To train the model, we collected experimentally confirmed m5C data from Homo sapiens, Mus musculus, Saccharomyces cerevisiae and Arabidopsis thaliana, and compared the performances of different feature extraction methods and classification algorithms for optimizing prediction model. Based on the optimal model, a novel predictor called iRNA-m5C was developed for the recognition of m5C sites. Finally, we critically evaluated the performance of iRNA-m5C and compared it with existing methods. The result showed that iRNA-m5C could produce the best prediction performance. We hope that this paper could provide a guide on the computational identification of m5C site and also anticipate that the proposed iRNA-m5C will become a powerful tool for large scale identification of m5C sites.
Collapse
Affiliation(s)
- Hao Lv
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zi-Mei Zhang
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Shi-Hao Li
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Jiu-Xin Tan
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Wei Chen
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
- Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Hao Lin
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
100
|
Egervari G, Kozlenkov A, Dracheva S, Hurd YL. Molecular windows into the human brain for psychiatric disorders. Mol Psychiatry 2019; 24:653-673. [PMID: 29955163 PMCID: PMC6310674 DOI: 10.1038/s41380-018-0125-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Revised: 05/14/2018] [Accepted: 06/05/2018] [Indexed: 12/20/2022]
Abstract
Delineating the pathophysiology of psychiatric disorders has been extremely challenging but technological advances in recent decades have facilitated a deeper interrogation of molecular processes in the human brain. Initial candidate gene expression studies of the postmortem brain have evolved into genome wide profiling of the transcriptome and the epigenome, a critical regulator of gene expression. Here, we review the potential and challenges of direct molecular characterization of the postmortem human brain, and provide a brief overview of recent transcriptional and epigenetic studies with respect to neuropsychiatric disorders. Such information can now be leveraged and integrated with the growing number of genome-wide association databases to provide a functional context of trait-associated genetic variants linked to psychiatric illnesses and related phenotypes. While it is clear that the field is still developing and challenges remain to be surmounted, these recent advances nevertheless hold tremendous promise for delineating the neurobiological underpinnings of mental diseases and accelerating the development of novel medication strategies.
Collapse
Affiliation(s)
- Gabor Egervari
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Addiction Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, School of Medicine at Mount Sinai, New York, NY, USA
- Epigenetics Institute and Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Alexey Kozlenkov
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, School of Medicine at Mount Sinai, New York, NY, USA
- James J. Peters VA Medical Center, Bronx, NY, USA
| | - Stella Dracheva
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, School of Medicine at Mount Sinai, New York, NY, USA
- James J. Peters VA Medical Center, Bronx, NY, USA
| | - Yasmin L Hurd
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Addiction Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Friedman Brain Institute, School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|