1
|
Nath P, Bhuyan K, Bhattacharyya DK, Barah P. ETENLNC: An end to end lncRNA identification and analysis framework to facilitate construction of known and novel lncRNA regulatory networks. Comput Biol Chem 2024; 112:108140. [PMID: 38996755 DOI: 10.1016/j.compbiolchem.2024.108140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 04/22/2024] [Accepted: 06/26/2024] [Indexed: 07/14/2024]
Abstract
Long non-coding RNAs (lncRNAs) play crucial roles in the regulation of gene expression and maintenance of genomic integrity through various interactions with DNA, RNA, and proteins. The availability of large-scale sequence data from various high-throughput platforms has opened possibilities to identify, predict, and functionally annotate lncRNAs. As a result, there is a growing demand for an integrative computational framework capable of identifying known lncRNAs, predicting novel lncRNAs, and inferring the downstream regulatory interactions of lncRNAs at the genome-scale. We present ETENLNC (End-To-End-Novel-Long-NonCoding), a user-friendly, integrative, open-source, scalable, and modular computational framework for identifying and analyzing lncRNAs from raw RNA-Seq data. ETENLNC employs six stringent filtration steps to identify novel lncRNAs, performs differential expression analysis of mRNA and lncRNA transcripts, and predicts regulatory interactions between lncRNAs, mRNAs, miRNAs, and proteins. We benchmarked ETENLNC against six existing tools and optimized it for desktop workstations and high-performance computing environments using data from three different species. ETENLNC is freely available on GitHub: https://github.com/EvolOMICS-TU/ETENLNC.
Collapse
Affiliation(s)
- Prangan Nath
- Department of Molecular Biology and Biotechnology, Tezpur University, Assam 784028, India
| | - Kaveri Bhuyan
- Department of Computer Science and Engineering, Tezpur University, Assam 784028, India; Department of Electrical Engineering, Tezpur University, Assam 784028, India
| | | | - Pankaj Barah
- Department of Molecular Biology and Biotechnology, Tezpur University, Assam 784028, India.
| |
Collapse
|
2
|
Jastrzebski JP, Pascarella S, Lipka A, Dorocki S. IncRna: The R Package for Optimizing lncRNA Identification Processes. J Comput Biol 2023; 30:1322-1326. [PMID: 37878344 DOI: 10.1089/cmb.2023.0091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2023] Open
Abstract
In silico identification of long noncoding RNAs (lncRNAs) is a multistage process including filtering of transcripts according to their physical characteristics (e.g., length, exon-intron structure) and determination of the coding potential of the sequence. A common issue within this process is the choice of the most suitable method of coding potential analysis for the conducted research. Selection of tools on the sole basis of their single performance may not provide the most effective choice for a specific problem. To overcome these limitations, we developed the R library lncRna, which provides functions to easily carry out the entire lncRNA identification process. For example, the package prepares the data files for coding potential analysis to perform error analysis. Moreover, the package gives the opportunity to analyze the effectiveness of various combinations of the lncRNA prediction methods to select the optimal configuration of the entire process.
Collapse
Affiliation(s)
- Jan Pawel Jastrzebski
- Faculty of Biology and Biotechnology, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
| | - Stefano Pascarella
- Department of Biochemical Sciences "A. Rossi Fanelli" Sapienza University of Rome, Rome, Italy
| | - Aleksandra Lipka
- Institute of Oral Biology, Faculty of Dentistry University of Oslo, Oslo, Norway
| | | |
Collapse
|
3
|
Chen XG, Yang X, Li C, Lin X, Zhang W. Non-coding RNA identification with pseudo RNA sequences and feature representation learning. Comput Biol Med 2023; 165:107355. [PMID: 37639767 DOI: 10.1016/j.compbiomed.2023.107355] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 07/16/2023] [Accepted: 08/12/2023] [Indexed: 08/31/2023]
Abstract
Distinguishing non-coding RNAs (ncRNAs) from coding RNAs is very important in bioinformatics. Although many methods have been proposed for solving this task, it remains highly challenging to further improve the accuracy of ncRNA identification. In this paper, we propose a coding potential predictor using feature representation learning based on pseudo RNA sequences named CPPFLPS. In this method, we use the pseudo RNA sequences generated by simulating RNA sequence mutations as new samples for data augmentation, and six string operations simulating RNA sequence mutations are considered: base replacement, base insertion, base deletion, subsequence reversion, subsequence repetition and subsequence deletion. In the feature representation learning framework, different types of pseudo RNA sequences are added to the training set to form new training sets that can be used to train baseline classifiers, thus obtaining baseline models. The resulting labels of these baseline models are used as feature vectors to represent RNA sequences, and the resulting feature vectors acquired after feature selection are used to train a predictive model for distinguishing ncRNAs from coding RNAs. Our method achieves better performance compared with that of existing state-of-the-art methods. The implementation of the proposed method is available at https://github.com/chenxgscuec/CPPFLPS.
Collapse
Affiliation(s)
- Xian-Gan Chen
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, China; Hubei Key Laboratory of Medical Information Analysis and Tumor Diagnosis & Treatment, South-Central Minzu University, Wuhan, 430074, China; Key Laboratory of Cognitive Science(South-Central Minzu University), State Ethnic Affairs Commission, Wuhan, 430074, China.
| | - Xiaofei Yang
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, China; Hubei Key Laboratory of Medical Information Analysis and Tumor Diagnosis & Treatment, South-Central Minzu University, Wuhan, 430074, China; Key Laboratory of Cognitive Science(South-Central Minzu University), State Ethnic Affairs Commission, Wuhan, 430074, China.
| | - Chenhong Li
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, China; Hubei Key Laboratory of Medical Information Analysis and Tumor Diagnosis & Treatment, South-Central Minzu University, Wuhan, 430074, China; Key Laboratory of Cognitive Science(South-Central Minzu University), State Ethnic Affairs Commission, Wuhan, 430074, China.
| | - Xianguang Lin
- School of Biomedical Engineering, South-Central Minzu University, Wuhan, 430074, China; Hubei Key Laboratory of Medical Information Analysis and Tumor Diagnosis & Treatment, South-Central Minzu University, Wuhan, 430074, China; Key Laboratory of Cognitive Science(South-Central Minzu University), State Ethnic Affairs Commission, Wuhan, 430074, China.
| | - Wen Zhang
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
| |
Collapse
|
4
|
Bitar M, Rivera I, Almeida I, Shi W, Ferguson K, Beesley J, Lakhani S, Edwards S, French J. Redefining normal breast cell populations using long noncoding RNAs. Nucleic Acids Res 2023; 51:6389-6410. [PMID: 37144467 PMCID: PMC10325898 DOI: 10.1093/nar/gkad339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 04/12/2023] [Accepted: 04/21/2023] [Indexed: 05/06/2023] Open
Abstract
Single-cell RNAseq has allowed unprecedented insight into gene expression across different cell populations in normal tissue and disease states. However, almost all studies rely on annotated gene sets to capture gene expression levels and sequencing reads that do not align to known genes are discarded. Here, we discover thousands of long noncoding RNAs (lncRNAs) expressed in human mammary epithelial cells and analyze their expression in individual cells of the normal breast. We show that lncRNA expression alone can discriminate between luminal and basal cell types and define subpopulations of both compartments. Clustering cells based on lncRNA expression identified additional basal subpopulations, compared to clustering based on annotated gene expression, suggesting that lncRNAs can provide an additional layer of information to better distinguish breast cell subpopulations. In contrast, these breast-specific lncRNAs poorly distinguish brain cell populations, highlighting the need to annotate tissue-specific lncRNAs prior to expression analyses. We also identified a panel of 100 breast lncRNAs that could discern breast cancer subtypes better than protein-coding markers. Overall, our results suggest that lncRNAs are an unexplored resource for new biomarker and therapeutic target discovery in the normal breast and breast cancer subtypes.
Collapse
Affiliation(s)
- Mainá Bitar
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
- Faculty of Medicine, The University of Queensland, Brisbane 4006, Australia
| | - Isela Sarahi Rivera
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
- School of Biomedical Science and Institute of Health and Biomedical Innovation, Faculty of Health, Queensland University of Technology, Brisbane 4001, Australia
| | - Isabela Almeida
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
- Faculty of Medicine, The University of Queensland, Brisbane 4006, Australia
| | - Wei Shi
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
| | - Kaltin Ferguson
- UQ Centre for Clinical Research, The University of Queensland, Brisbane 4006, Australia
| | - Jonathan Beesley
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
| | - Sunil R Lakhani
- UQ Centre for Clinical Research, The University of Queensland, Brisbane 4006, Australia
- Pathology Queensland, The Royal Brisbane & Women's Hospital, Brisbane 4006, Australia
| | - Stacey L Edwards
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
- Faculty of Medicine, The University of Queensland, Brisbane 4006, Australia
| | - Juliet D French
- Cancer Program, QIMR Berghofer Medical Research Institute, Brisbane 4006, Australia
- Faculty of Medicine, The University of Queensland, Brisbane 4006, Australia
| |
Collapse
|
5
|
Yu D, Wu Y, Zhu L, Wang Y, Sheng D, Zhao X, Liang G, Gan L. The landscape of the long non-coding RNAs in developing mouse retinas. BMC Genomics 2023; 24:252. [PMID: 37165305 PMCID: PMC10173636 DOI: 10.1186/s12864-023-09354-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Accepted: 05/03/2023] [Indexed: 05/12/2023] Open
Abstract
BACKGROUND The long non-coding RNAs (lncRNAs) are critical regulators of diverse biological processes. Nevertheless, a global view of its expression and function in the mouse retina, a crucial model for neurogenesis study, still needs to be made available. RESULTS Herein, by integrating the established gene models and the result from ab initio prediction using short- and long-read sequencing, we characterized 4,523 lncRNA genes (MRLGs) in developing mouse retinas (from the embryonic day of 12.5 to the neonatal day of P28), which was so far the most comprehensive collection of retinal lncRNAs. Next, derived from transcriptomics analyses of different tissues and developing retinas, we found that the MRLGs were highly spatiotemporal specific in expression and played essential roles in regulating the genesis and function of mouse retinas. In addition, we investigated the expression of MRLGs in some mouse mutants and revealed that 97 intergenic MRLGs might be involved in regulating differentiation and development of retinal neurons through Math5, Isl1, Brn3b, NRL, Onecut1, or Onecut2 mediated pathways. CONCLUSIONS In summary, this work significantly enhanced our knowledge of lncRNA genes in mouse retina development and provided valuable clues for future exploration of their biological roles.
Collapse
Affiliation(s)
- Dongliang Yu
- College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, Zhejiang, 310018, China.
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China.
| | - Yuqing Wu
- College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, Zhejiang, 310018, China
| | - Leilei Zhu
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China
| | - Yuying Wang
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China
| | - Donglai Sheng
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China
| | - Xiaofeng Zhao
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China
| | - Guoqing Liang
- Institute of Life Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, 310036, China.
| | - Lin Gan
- Department of Neuroscience & Regenerative Medicine, Medical College of Georgia, Augusta University, Augusta, GA, 30912, USA.
| |
Collapse
|
6
|
Ao C, Jiao S, Wang Y, Yu L, Zou Q. Biological Sequence Classification: A Review on Data and General Methods. RESEARCH (WASHINGTON, D.C.) 2022; 2022:0011. [PMID: 39285948 PMCID: PMC11404319 DOI: 10.34133/research.0011] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 10/25/2022] [Indexed: 09/19/2024]
Abstract
With the rapid development of biotechnology, the number of biological sequences has grown exponentially. The continuous expansion of biological sequence data promotes the application of machine learning in biological sequences to construct predictive models for mining biological sequence information. There are many branches of biological sequence classification research. In this review, we mainly focus on the function and modification classification of biological sequences based on machine learning. Sequence-based prediction and analysis are the basic tasks to understand the biological functions of DNA, RNA, proteins, and peptides. However, there are hundreds of classification models developed for biological sequences, and the quite varied specific methods seem dizzying at first glance. Here, we aim to establish a long-term support website (http://lab.malab.cn/~acy/BioseqData/home.html), which provides readers with detailed information on the classification method and download links to relevant datasets. We briefly introduce the steps to build an effective model framework for biological sequence data. In addition, a brief introduction to single-cell sequencing data analysis methods and applications in biology is also included. Finally, we discuss the current challenges and future perspectives of biological sequence classification research.
Collapse
Affiliation(s)
- Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Shihu Jiao
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Yansu Wang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Quan Zou
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
7
|
Singh D, Roy J. A large-scale benchmark study of tools for the classification of protein-coding and non-coding RNAs. Nucleic Acids Res 2022; 50:12094-12111. [PMID: 36420898 PMCID: PMC9757047 DOI: 10.1093/nar/gkac1092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 10/22/2022] [Accepted: 10/28/2022] [Indexed: 11/27/2022] Open
Abstract
Identification of protein-coding and non-coding transcripts is paramount for understanding their biological roles. Computational approaches have been addressing this task for over a decade; however, generalized and high-performance models are still unreliable. This benchmark study assessed the performance of 24 tools producing >55 models on the datasets covering a wide range of species. We have collected 135 small and large transcriptomic datasets from existing studies for comparison and identified the potential bottlenecks hampering the performance of current tools. The key insights of this study include lack of standardized training sets, reliance on homogeneous training data, gradual changes in annotated data, lack of augmentation with homology searches, the presence of false positives and negatives in datasets and the lower performance of end-to-end deep learning models. We also derived a new dataset, RNAChallenge, from the benchmark considering hard instances that may include potential false alarms. The best and least well performing models under- and overfit the dataset, respectively, thereby serving a dual purpose. For computational approaches, it will be valuable to develop accurate and unbiased models. The identification of false alarms will be of interest for genome annotators, and experimental study of hard RNAs will help to untangle the complexity of the RNA world.
Collapse
Affiliation(s)
- Dalwinder Singh
- To whom correspondence should be addressed. Tel: +91 172 5221206;
| | - Joy Roy
- Correspondence may also be addressed to Joy Roy.
| |
Collapse
|
8
|
Baruah C, Nath P, Barah P. LncRNAs in neuropsychiatric disorders and computational insights for their prediction. Mol Biol Rep 2022; 49:11515-11534. [PMID: 36097122 DOI: 10.1007/s11033-022-07819-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 07/20/2022] [Accepted: 07/24/2022] [Indexed: 12/06/2022]
Abstract
Long non-coding RNAs (lncRNAs) are 200 nucleotide extended transcripts that do not encode proteins or possess limited coding ability. LncRNAs epigenetically control several biological functions such as gene regulation, transcription, mRNA splicing, protein interaction, and genomic imprinting. Over the years, drastic progress in understanding the role of lncRNAs in diverse biological processes has been made. LncRNAs are reported to show tissue-specific expression patterns suggesting their potential as novel candidate biomarkers for diseases. Among all other non-coding RNAs, lncRNAs are highly expressed within the brain-enriched or brain-specific regions of the neural tissues. They are abundantly expressed in the neocortex and pre-mature frontal regions of the brain. LncRNAs are co-expressed with the protein-coding genes and have a significant role in the evolution of functions of the brain. Any deregulation in the lncRNAs contributes to disruptions in normal brain functions resulting in multiple neurological disorders. Neuropsychiatric disorders such as schizophrenia, bipolar disease, autism spectrum disorders, and anxiety are associated with the abnormal expression and regulation of lncRNAs. This review aims to highlight the understanding of lncRNAs concerning normal brain functions and their deregulation associated with neuropsychiatric disorders. We have also provided a survey on the available computational tools for the prediction of lncRNAs, their protein coding potentials, and sub-cellular locations, along with a section on existing online databases with known lncRNAs, and their interactions with other molecules.
Collapse
Affiliation(s)
- Cinmoyee Baruah
- Department of Molecular Biology and Biotechnology, Tezpur University, 784028, Napaam, Sonitpur, Assam, India
| | - Prangan Nath
- Department of Molecular Biology and Biotechnology, Tezpur University, 784028, Napaam, Sonitpur, Assam, India
| | - Pankaj Barah
- Department of Molecular Biology and Biotechnology, Tezpur University, 784028, Napaam, Sonitpur, Assam, India.
| |
Collapse
|
9
|
Ammunét T, Wang N, Khan S, Elo LL. Deep learning tools are top performers in long non-coding RNA prediction. Brief Funct Genomics 2022; 21:230-241. [PMID: 35136929 PMCID: PMC9123429 DOI: 10.1093/bfgp/elab045] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/08/2021] [Accepted: 12/02/2021] [Indexed: 11/23/2022] Open
Abstract
The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing deep learning architectures have been developed for this task, with the potential of discovering sequence features and their interactions still not surfaced in current knowledge. We compared the performance of deep learning tools with other predictive tools that are currently used in lncRNA coding potential prediction. A total of 15 tools representing the variety of available methods were investigated. In addition to known annotated transcripts, we also evaluated the use of the tools in actual studies with real-life data. The robustness and scalability of the tools' performance was tested with varying sized test sets and test sets with different proportions of lncRNAs and mRNAs. In addition, the ease-of-use for each tested tool was scored. Deep learning tools were top performers in most metrics and labelled transcripts similarly with each other in the real-life dataset. However, the proportion of lncRNAs and mRNAs in the test sets affected the performance of all tools. Computational resources were utilized differently between the top-ranking tools, thus the nature of the study may affect the decision of choosing one well-performing tool over another. Nonetheless, the results suggest favouring the novel deep learning tools over other tools currently in broad use.
Collapse
Affiliation(s)
- Tea Ammunét
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
| | - Ning Wang
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
| | - Sofia Khan
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
| | - Laura L Elo
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, Turku, Finland
- Institute of Biomedicine, University of Turku, Turku, Finland
| |
Collapse
|
10
|
Solayman M, Litfin T, Singh J, Paliwal K, Zhou Y, Zhan J. Probing RNA structures and functions by solvent accessibility: an overview from experimental and computational perspectives. Brief Bioinform 2022; 23:bbac112. [PMID: 35348613 PMCID: PMC9116373 DOI: 10.1093/bib/bbac112] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/03/2022] [Accepted: 03/04/2022] [Indexed: 12/30/2022] Open
Abstract
Characterizing RNA structures and functions have mostly been focused on 2D, secondary and 3D, tertiary structures. Recent advances in experimental and computational techniques for probing or predicting RNA solvent accessibility make this 1D representation of tertiary structures an increasingly attractive feature to explore. Here, we provide a survey of these recent developments, which indicate the emergence of solvent accessibility as a simple 1D property, adding to secondary and tertiary structures for investigating complex structure-function relations of RNAs.
Collapse
Affiliation(s)
- Md Solayman
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Thomas Litfin
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Jaswinder Singh
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Yaoqi Zhou
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
- Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China
- Peking University Shenzhen Graduate School, Shenzhen 518055, China
| | - Jian Zhan
- Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China
| |
Collapse
|
11
|
Glushkevich A, Spechenkova N, Fesenko I, Knyazev A, Samarskaya V, Kalinina NO, Taliansky M, Love AJ. Transcriptomic Reprogramming, Alternative Splicing and RNA Methylation in Potato ( Solanum tuberosum L.) Plants in Response to Potato Virus Y Infection. PLANTS (BASEL, SWITZERLAND) 2022; 11:635. [PMID: 35270104 PMCID: PMC8912425 DOI: 10.3390/plants11050635] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 02/09/2022] [Accepted: 02/22/2022] [Indexed: 05/05/2023]
Abstract
Plant-virus interactions are greatly influenced by environmental factors such as temperatures. In virus-infected plants, enhanced temperature is frequently associated with more severe symptoms and higher virus content. However, the mechanisms involved in controlling the temperature regulation of plant-virus interactions are poorly characterised. To elucidate these further, we analysed the responses of potato plants cv Chicago to infection by potato virus Y (PVY) at normal (22 °C) and elevated temperature (28 °C), the latter of which is known to significantly increase plant susceptibility to PVY. Using RNAseq analysis, we showed that single and combined PVY and heat-stress treatments caused dramatic changes in gene expression, affecting the transcription of both protein-coding and non-coding RNAs. Among the newly identified genes responsive to PVY infection, we found genes encoding enzymes involved in the catalysis of polyamine formation and poly ADP-ribosylation. We also identified a range of novel non-coding RNAs which were differentially produced in response to single or combined PVY and heat stress, that consisted of antisense RNAs and RNAs with miRNA binding sites. Finally, to gain more insights into the potential role of alternative splicing and epitranscriptomic RNA methylation during combined stress conditions, direct RNA nanopore sequencing was performed. Our findings offer insights for future studies of functional links between virus infections and transcriptome reprogramming, RNA methylation and alternative splicing.
Collapse
Affiliation(s)
- Anna Glushkevich
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
| | - Nadezhda Spechenkova
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
| | - Igor Fesenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
| | - Andrey Knyazev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
| | - Viktoriya Samarskaya
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
| | - Natalia O. Kalinina
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia;
| | - Michael Taliansky
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (A.G.); (N.S.); (I.F.); (A.K.); (V.S.)
- The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Andrew J. Love
- The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| |
Collapse
|
12
|
Klapproth C, Sen R, Stadler PF, Findeiß S, Fallmann J. Common Features in lncRNA Annotation and Classification: A Survey. Noncoding RNA 2021; 7:77. [PMID: 34940758 PMCID: PMC8708962 DOI: 10.3390/ncrna7040077] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 12/03/2021] [Accepted: 12/06/2021] [Indexed: 12/29/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are widely recognized as important regulators of gene expression. Their molecular functions range from miRNA sponging to chromatin-associated mechanisms, leading to effects in disease progression and establishing them as diagnostic and therapeutic targets. Still, only a few representatives of this diverse class of RNAs are well studied, while the vast majority is poorly described beyond the existence of their transcripts. In this review we survey common in silico approaches for lncRNA annotation. We focus on the well-established sets of features used for classification and discuss their specific advantages and weaknesses. While the available tools perform very well for the task of distinguishing coding sequence from other RNAs, we find that current methods are not well suited to distinguish lncRNAs or parts thereof from other non-protein-coding input sequences. We conclude that the distinction of lncRNAs from intronic sequences and untranslated regions of coding mRNAs remains a pressing research gap.
Collapse
Affiliation(s)
- Christopher Klapproth
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany; (C.K.); (P.F.S.); (S.F.)
| | - Rituparno Sen
- Helmholtz Institute for RNA-Based Infection Research (HIRI), Helmholtz-Center for Infection Research (HZI), D-97080 Würzburg, Germany;
| | - Peter F. Stadler
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany; (C.K.); (P.F.S.); (S.F.)
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, University Leipzig, D-04103 Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, D-04103 Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090 Vienna, Austria
- Facultad de Ciencias, Universidad National de Colombia, Bogotá CO-111321, Colombia
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501, USA
| | - Sven Findeiß
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany; (C.K.); (P.F.S.); (S.F.)
| | - Jörg Fallmann
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107 Leipzig, Germany; (C.K.); (P.F.S.); (S.F.)
| |
Collapse
|