1
|
Anteghini M, Gualdi F, Oliva B. How did we get there? AI applications to biological networks and sequences. Comput Biol Med 2025; 190:110064. [PMID: 40184941 DOI: 10.1016/j.compbiomed.2025.110064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 03/18/2025] [Accepted: 03/20/2025] [Indexed: 04/07/2025]
Abstract
The rapidly advancing field of artificial intelligence (AI) has transformed numerous scientific domains, including biology, where a vast and complex volume of data is available for analysis. This paper provides a comprehensive overview of the current state of AI-driven methodologies in genomics, proteomics, and systems biology. We discuss how machine learning algorithms, particularly deep learning models, have enhanced the accuracy and efficiency of embedding sequences, motif discovery, and the prediction of gene expression and protein structure. Additionally, we explore the integration of AI in the embedding and analysis of biological networks, including protein-protein interaction networks and multi-layered networks. By leveraging large-scale biological data, AI techniques have enabled unprecedented insights into complex biological processes and disease mechanisms. This work underlines the potential of applying AI to complex biological data, highlighting current applications and suggesting directions for future research to further explore AI in this rapidly evolving field.
Collapse
Affiliation(s)
- Marco Anteghini
- BioFolD Unit, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy; Visual and Data-Centric Computing, Zuse Institut Berlin, Berlin, Germany.
| | - Francesco Gualdi
- Structural Bioinformatics Lab, Universitat Pompeu Fabra, Barcelona, Spain; Istituto dalle Molle di Studi sull'Intelligenza Artificiale, USI/SUPSI (Università Svizzera Italiana/Scuola Universitaria Professionale Svizzera Italiana) Lugano, Switzerland.
| | - Baldo Oliva
- Structural Bioinformatics Lab, Universitat Pompeu Fabra, Barcelona, Spain.
| |
Collapse
|
2
|
Guo Y, Lei X, Li S. An Integrated TCN-CrossMHA Model for Predicting circRNA-RBP Binding Sites. Interdiscip Sci 2025; 17:86-100. [PMID: 39503827 DOI: 10.1007/s12539-024-00660-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 09/14/2024] [Accepted: 09/17/2024] [Indexed: 02/19/2025]
Abstract
Circular RNA (circRNA) has the capacity to bind with RNA binding protein (RBP), thereby exerting a substantial impact on diseases. Predicting binding sites aids in comprehending the interaction mechanism, thereby offering insights for disease treatment strategies. Here, we propose a novel approach based on temporal convolutional network (TCN) and cross multi-head attention mechanism to predict circRNA-RBP binding sites (circTCA). First, we employ two distinct encoding methodologies to obtain two raw matrices of circRNA sequences. Then, two parallel TCN blocks extract shallow and abstract features of the two matrices separately. The fusion of the two is achieved through cross multi-head attention mechanism and after this, global expectation pooling assigns weights to the concatenated feature. Finally, the task of classifying the input sequence is entrusted to a fully connected (FC) layer. We compare circTCA with other five methods and conduct ablation experiments to demonstrate its effectiveness. We also conduct feature visualization and assess the motifs extracted by circTCA with existing motifs. All in all, circTCA is effective for binding sites prediction of circRNA and RBP.
Collapse
Affiliation(s)
- Yajing Guo
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China.
| | - Shuyu Li
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China
| |
Collapse
|
3
|
Asim MN, Ibrahim MA, Asif T, Dengel A. RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models. Heliyon 2025; 11:e41488. [PMID: 39897847 PMCID: PMC11783440 DOI: 10.1016/j.heliyon.2024.e41488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 12/23/2024] [Accepted: 12/24/2024] [Indexed: 02/04/2025] Open
Abstract
Deciphering information of RNA sequences reveals their diverse roles in living organisms, including gene regulation and protein synthesis. Aberrations in RNA sequence such as dysregulation and mutations can drive a diverse spectrum of diseases including cancers, genetic disorders, and neurodegenerative conditions. Furthermore, researchers are harnessing RNA's therapeutic potential for transforming traditional treatment paradigms into personalized therapies through the development of RNA-based drugs and gene therapies. To gain insights of biological functions and to detect diseases at early stages and develop potent therapeutics, researchers are performing diverse types RNA sequence analysis tasks. RNA sequence analysis through conventional wet-lab methods is expensive, time-consuming and error prone. To enable large-scale RNA sequence analysis, empowerment of wet-lab experimental methods with Artificial Intelligence (AI) applications necessitates scientists to have a comprehensive knowledge of both DNA and AI fields. While molecular biologists encounter challenges in understanding AI methods, computer scientists often lack basic foundations of RNA sequence analysis tasks. Considering the absence of a comprehensive literature that bridges this research gap and promotes the development of AI-driven RNA sequence analysis applications, the contributions of this manuscript are manifold: It equips AI researchers with biological foundations of 47 distinct RNA sequence analysis tasks. It sets a stage for development of benchmark datasets related to 47 distinct RNA sequence analysis tasks by facilitating cruxes of 64 different biological databases. It presents word embeddings and language models applications across 47 distinct RNA sequence analysis tasks. It streamlines the development of new predictors by providing a comprehensive survey of 58 word embeddings and 70 language models based predictive pipelines performance values as well as top performing traditional sequence encoding based predictors and their performances across 47 RNA sequence analysis tasks.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Muhammad Ali Ibrahim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Tayyaba Asif
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Andreas Dengel
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| |
Collapse
|
4
|
Liu L, Wei Y, Tan Z, Zhang Q, Sun J, Zhao Q. Predicting circRNA-RBP Binding Sites Using a Hybrid Deep Neural Network. Interdiscip Sci 2024; 16:635-648. [PMID: 38381315 DOI: 10.1007/s12539-024-00616-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/26/2024] [Accepted: 01/29/2024] [Indexed: 02/22/2024]
Abstract
Circular RNAs (circRNAs) are non-coding RNAs generated by reverse splicing. They are involved in biological process and human diseases by interacting with specific RNA-binding proteins (RBPs). Due to traditional biological experiments being costly, computational methods have been proposed to predict the circRNA-RBP interaction. However, these methods have problems of single feature extraction. Therefore, we propose a novel model called circ-FHN, which utilizes only circRNA sequences to predict circRNA-RBP interactions. The circ-FHN approach involves feature coding and a hybrid deep learning model. Feature coding takes into account the physicochemical properties of circRNA sequences and employs four coding methods to extract sequence features. The hybrid deep structure comprises a convolutional neural network (CNN) and a bidirectional gated recurrent unit (BiGRU). The CNN learns high-level abstract features, while the BiGRU captures long-term dependencies in the sequence. To assess the effectiveness of circ-FHN, we compared it to other computational methods on 16 datasets and conducted ablation experiments. Additionally, we conducted motif analysis. The results demonstrate that circ-FHN exhibits exceptional performance and surpasses other methods. circ-FHN is freely available at https://github.com/zhaoqi106/circ-FHN .
Collapse
Affiliation(s)
- Liwei Liu
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
- Key Laboratory of Computational Science and Application of Hainan Province, Hainan Normal University, Haikou, 571158, China
| | - Yixin Wei
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
| | - Zhebin Tan
- College of Software, Dalian Jiaotong University, Dalian, 116028, China
| | - Qi Zhang
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
| | - Jianqiang Sun
- School of Information Science and Engineering, Linyi University, Linyi, 276000, China.
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
| |
Collapse
|
5
|
Zhu H, Yang Y, Wang Y, Wang F, Huang Y, Chang Y, Wong KC, Li X. Dynamic characterization and interpretation for protein-RNA interactions across diverse cellular conditions using HDRNet. Nat Commun 2023; 14:6824. [PMID: 37884495 PMCID: PMC10603054 DOI: 10.1038/s41467-023-42547-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 10/13/2023] [Indexed: 10/28/2023] Open
Abstract
RNA-binding proteins play crucial roles in the regulation of gene expression, and understanding the interactions between RNAs and RBPs in distinct cellular conditions forms the basis for comprehending the underlying RNA function. However, current computational methods pose challenges to the cross-prediction of RNA-protein binding events across diverse cell lines and tissue contexts. Here, we develop HDRNet, an end-to-end deep learning-based framework to precisely predict dynamic RBP binding events under diverse cellular conditions. Our results demonstrate that HDRNet can accurately and efficiently identify binding sites, particularly for dynamic prediction, outperforming other state-of-the-art models on 261 linear RNA datasets from both eCLIP and CLIP-seq, supplemented with additional tissue data. Moreover, we conduct motif and interpretation analyses to provide fresh insights into the pathological mechanisms underlying RNA-RBP interactions from various perspectives. Our functional genomic analysis further explores the gene-human disease associations, uncovering previously uncharacterized observations for a broad range of genetic disorders.
Collapse
Affiliation(s)
- Haoran Zhu
- School of Artificial Intelligence, Jilin University, 130012, Changchun, China
| | - Yuning Yang
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Yunhe Wang
- School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Fuzhou Wang
- Department of Computer Science, City University of Hong Kong, Hong Kong, Hong Kong SAR
| | - Yujian Huang
- College of Computer Science and Cyber Security, Chengdu University of Technology, 610059, Chengdu, China
| | - Yi Chang
- School of Artificial Intelligence, Jilin University, 130012, Changchun, China
| | - Ka-Chun Wong
- Department of Computer Science, City University of Hong Kong, Hong Kong, Hong Kong SAR.
| | - Xiangtao Li
- School of Artificial Intelligence, Jilin University, 130012, Changchun, China.
| |
Collapse
|
6
|
Liu N, Zhang Z, Wu Y, Wang Y, Liang Y. CRBSP:Prediction of CircRNA-RBP Binding Sites Based on Multimodal Intermediate Fusion. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2898-2906. [PMID: 37130249 DOI: 10.1109/tcbb.2023.3272400] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Circular RNA (CircRNA) is widely expressed and has physiological and pathological significance, regulating post-transcriptional processes via its protein-binding activity. However, whereas much work has been done on linear RNA and RNA binding protein (RBP), little is known about the binding sites of CircRNA. The current report is on the development of a medium-term multimodal data fusion strategy, CRBSP, to predict CircRNA-RBP binding sites. CRBSP represents the CircRNA trinucleotide semantic, location, composition and frequency information as the corresponding coding methods of Word to vector (Word2vec), Position-specific trinucleotide propensity (PSTNP), Pseudo trinucleotide composition (PseTNC) and Trinucleotide nucleotide composition (TNC), respectively. CNN (Convolution Neural Networks) was used to extract global information and BiLSTM (bidirectional Long- and Short-Term Memory network) encoder and LSTM (Long- and Short-Term Memory network) decoder for local sequence information. Enhancement of the contributions of key features by the self-attention mechanism was followed by mid-term fusion of the four enhanced features. Logistic Regression (LR) classifier showed that CRBSP gives a mean AUC value of 0.9362 through 5-fold Cross Validation of all 37 datasets, a performance which is superior to five current state-of-the-art models. Similar evaluation of linear RNA-RBP binding sites gave an AUC value of 0.7615 which is also higher than other prediction methods, demonstrating the robustness of CRBSP.
Collapse
|
7
|
Wang XF, Yu CQ, You ZH, Qiao Y, Li ZW, Huang WZ, Zhou JR, Jin HY. KS-CMI: A circRNA-miRNA interaction prediction method based on the signed graph neural network and denoising autoencoder. iScience 2023; 26:107478. [PMID: 37583550 PMCID: PMC10424127 DOI: 10.1016/j.isci.2023.107478] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 06/16/2023] [Accepted: 07/21/2023] [Indexed: 08/17/2023] Open
Abstract
Circular RNA (circRNA) plays an important role in the diagnosis, treatment, and prognosis of human diseases. The discovery of potential circRNA-miRNA interactions (CMI) is of guiding significance for subsequent biological experiments. Limited by the small amount of experimentally supported data and high randomness, existing models are difficult to accomplish the CMI prediction task based on real cases. In this paper, we propose KS-CMI, a novel method for effectively accomplishing CMI prediction in real cases. KS-CMI enriches the 'behavior relationships' of molecules by constructing circRNA-miRNA-cancer (CMCI) networks and extracts the behavior relationship attribute of molecules based on balance theory. Next, the denoising autoencoder (DAE) is used to enhance the feature representation of molecules. Finally, the CatBoost classifier was used for prediction. KS-CMI achieved the most reliable prediction results in real cases and achieved competitive performance in all datasets in the CMI prediction.
Collapse
Affiliation(s)
- Xin-Fei Wang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Chang-Qing Yu
- School of Information Engineering, Xijing University, Xi’an, China
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi’an, China
| | - Yan Qiao
- College of Agriculture and Forestry, Longdong University, Qingyang, China
| | - Zheng-Wei Li
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
| | - Wen-Zhun Huang
- School of Information Engineering, Xijing University, Xi’an, China
| | - Ji-Ren Zhou
- School of Computer Science, Northwestern Polytechnical University, Xi’an, China
| | - Hai-Yan Jin
- School of Computer Science and Engineering, Xi’an University of Technology, Xi’an, China
| |
Collapse
|
8
|
Zhang Y, Luo J, Yang W, Ye WC. CircRNAs in colorectal cancer: potential biomarkers and therapeutic targets. Cell Death Dis 2023; 14:353. [PMID: 37296107 PMCID: PMC10250185 DOI: 10.1038/s41419-023-05881-2] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 05/09/2023] [Accepted: 05/31/2023] [Indexed: 06/12/2023]
Abstract
Globally, colorectal cancer (CRC) is the third most prevalent cancer and the second leading cause of cancer-related deaths. Circular RNAs (circRNAs) are single-stranded RNA with covalently closed-loop structures and are highly stable, conserved, and abundantly expressed in various organs and tissues. Recent research found abnormal circRNA expression in CRC patients' blood/serum, cells, CRC tissues, and exosomes. Furthermore, mounting data demonstrated that circRNAs are crucial to the development of CRC. CircRNAs have been shown to exert biological functions by acting as microRNA sponges, RNA-binding protein sponges, regulators of gene splicing and transcription, and protein/peptide translators. These characteristics make circRNAs potential markers for CRC diagnosis and prognosis, potential therapeutic targets, and circRNA-based therapies. However, further studies are still necessary to improve the understanding of the roles and biological mechanisms of circRNAs in the development of CRC. In this review, up-to-date research on the role of circRNAs in CRC was examined, focusing on their potential application in CRC diagnosis and targeted therapy, which would advance the knowledge of the functions of circRNAs in the development and progression of CRC.
Collapse
Affiliation(s)
- Yuying Zhang
- Central Laboratory, Shenzhen Longhua Maternity and Child Healthcare Hospital, Shenzhen, 518109, China
| | - Jingyan Luo
- Forevergen Biosciences Centre, Guangzhou International Biotech Island, Guangzhou, 510300, China
| | - Weikang Yang
- Department of Prevention and Healthcare, Shenzhen Longhua Maternity and Child Healthcare Hospital, Shenzhen, 518109, China
| | - Wen-Chu Ye
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China.
| |
Collapse
|
9
|
Wang LS, Sun ZL. iDHS-FFLG: Identifying DNase I Hypersensitive Sites by Feature Fusion and Local-Global Feature Extraction Network. Interdiscip Sci 2023; 15:155-170. [PMID: 36166165 DOI: 10.1007/s12539-022-00538-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 09/12/2022] [Accepted: 09/12/2022] [Indexed: 05/01/2023]
Abstract
The DNase I hypersensitive sites (DHSs) are active regions on chromatin that have been found to be highly sensitive to DNase I. These regions contain various cis-regulatory elements, including promoters, enhancers and silencers. Accurate identification of DHSs helps researchers better understand the transcriptional machinery of DNA and deepen the knowledge of functional DNA elements in non-coding sequences. Researchers have developed many methods based on traditional experiments and machine learning to identify DHSs. However, low prediction accuracy and robustness limit their application in genetics research. In this paper, a novel computational approach based on deep learning is proposed by feature fusion and local-global feature extraction network to identify DHSs in mouse, named iDHS-FFLG. First of all, multiple binary features of nucleotides are fused to better express sequence information. Then, a network consisting of the convolutional neural network (CNN), bidirectional long short-term memory (BiLSTM) and self-attention mechanism is designed to extract local features and global contextual associations. In the end, the prediction module is applied to distinguish between DHSs and non-DHSs. The results of several experiments demonstrate the superior performances of iDHS-FFLG compared to the latest methods.
Collapse
Affiliation(s)
- Lei-Shan Wang
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, 230601, Anhui, China
- School of Electrical Engineering and Automation, Anhui University, Hefei, 230601, Anhui, China
| | - Zhan-Li Sun
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, 230601, Anhui, China.
- School of Electrical Engineering and Automation, Anhui University, Hefei, 230601, Anhui, China.
| |
Collapse
|
10
|
Ma Z, Sun ZL, Liu M. CRBP-HFEF: Prediction of RBP-Binding Sites on circRNAs Based on Hierarchical Feature Expansion and Fusion. Interdiscip Sci 2023:10.1007/s12539-023-00572-0. [PMID: 37233959 DOI: 10.1007/s12539-023-00572-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 04/20/2023] [Accepted: 04/21/2023] [Indexed: 05/27/2023]
Abstract
Circular RNAs (circRNAs) participate in the regulation of biological processes by binding to specific proteins and thus influence transcriptional processes. In recent years, circRNAs have become an emerging hotspot in RNA research. Due to powerful learning ability, the various deep learning frameworks have been used to predict the binding sites of RNA-binding protein (RPB) on circRNAs. These methods usually perform only single-level feature extraction of sequence information. However, the feature acquisition may be inadequate for single-level extraction. Generally, the features of deep and shallow layers of neural network can complement each other and are both important for binding site prediction tasks. Based on this concept, we propose a method that combines deep and shallow features, namely CRBP-HFEF. Specifically, features are first extracted and expanded for different levels of network. Then, the expanded deep and shallow features are fused and fed into the classification network, which finally determines whether they are binding sites. Compared to several existing methods, the experimental results on multiple datasets show that the proposed method achieves significant improvements in a number of metrics (with an average AUC of 0.9855). Moreover, much sufficient ablation experiments are also performed to verify the effectiveness of the hierarchical feature expansion strategy.
Collapse
Affiliation(s)
- Zheng Ma
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, and School of Electrical Engineering and Automation Anhui University, Hefei, 230601, Anhui, China
| | - Zhan-Li Sun
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, and School of Electrical Engineering and Automation Anhui University, Hefei, 230601, Anhui, China.
| | - Mengya Liu
- School of Computer Science and Technology, Anhui University, Hefei, 230601, Anhui, China
| |
Collapse
|
11
|
Murdaugh RL, Anastas JN. Applying single cell multi-omic analyses to understand treatment resistance in pediatric high grade glioma. Front Pharmacol 2023; 14:1002296. [PMID: 37205910 PMCID: PMC10191214 DOI: 10.3389/fphar.2023.1002296] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 04/20/2023] [Indexed: 05/21/2023] Open
Abstract
Despite improvements in cancer patient outcomes seen in the past decade, tumor resistance to therapy remains a major impediment to achieving durable clinical responses. Intratumoral heterogeneity related to genetic, epigenetic, transcriptomic, proteomic, and metabolic differences between individual cancer cells has emerged as a driver of therapeutic resistance. This cell to cell heterogeneity can be assessed using single cell profiling technologies that enable the identification of tumor cell clones that exhibit similar defining features like specific mutations or patterns of DNA methylation. Single cell profiling of tumors before and after treatment can generate new insights into the cancer cell characteristics that confer therapeutic resistance by identifying intrinsically resistant sub-populations that survive treatment and by describing new cellular features that emerge post-treatment due to tumor cell evolution. Integrative, single cell analytical approaches have already proven advantageous in studies characterizing treatment-resistant clones in cancers where pre- and post-treatment patient samples are readily available, such as leukemia. In contrast, little is known about other cancer subtypes like pediatric high grade glioma, a class of heterogeneous, malignant brain tumors in children that rapidly develop resistance to multiple therapeutic modalities, including chemotherapy, immunotherapy, and radiation. Leveraging single cell multi-omic technologies to analyze naïve and therapy-resistant glioma may lead to the discovery of novel strategies to overcome treatment resistance in brain tumors with dismal clinical outcomes. In this review, we explore the potential for single cell multi-omic analyses to reveal mechanisms of glioma resistance to therapy and discuss opportunities to apply these approaches to improve long-term therapeutic response in pediatric high grade glioma and other brain tumors with limited treatment options.
Collapse
Affiliation(s)
- Rebecca L. Murdaugh
- Department of Neurosurgery, Baylor College of Medicine, Houston, TX, United States
- Program in Cell and Gene Therapy, Baylor College of Medicine, Houston, TX, United States
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States
| | - Jamie N. Anastas
- Department of Neurosurgery, Baylor College of Medicine, Houston, TX, United States
- Program in Cell and Gene Therapy, Baylor College of Medicine, Houston, TX, United States
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States
| |
Collapse
|
12
|
Ao C, Ye X, Sakurai T, Zou Q, Yu L. m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation. BMC Biol 2023; 21:93. [PMID: 37095510 PMCID: PMC10127088 DOI: 10.1186/s12915-023-01596-0] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 04/12/2023] [Indexed: 04/26/2023] Open
Abstract
BACKGROUND RNA 5-methyluridine (m5U) modifications are obtained by methylation at the C5 position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations. RESULTS In this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool. CONCLUSIONS m5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions.
Collapse
Affiliation(s)
- Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China
- Department of Computer Science, University of Tsukuba, Tsukuba, Japan
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba, Japan
| | - Tetsuya Sakurai
- Department of Computer Science, University of Tsukuba, Tsukuba, Japan
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.
| | - Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China.
| |
Collapse
|
13
|
Yue Y, Tao J, An D, Shi L. Exploring the role of tumor stemness and the potential of stemness-related risk model in the prognosis of intrahepatic cholangiocarcinoma. Front Genet 2023; 13:1089405. [PMID: 36712866 PMCID: PMC9877308 DOI: 10.3389/fgene.2022.1089405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/27/2022] [Indexed: 01/14/2023] Open
Abstract
Background: Tumor stem cells (TSCs) have been widely reported to play a critical role in tumor progression and metastasis. We explored the role of tumor stemness in intrahepatic cholangiocarcinoma (iCCA) and established a prognostic risk model related to tumor stemness for prognosis prediction and clinical treatment guidance in iCCA patients. Materials and Methods: The expression profiles of iCCA samples (E-MTAB-6389 and GSE107943 cohorts) were used in the study. One-class logistic regression algorithm calculated the mRNA stemness index (mRNAsi). The mRNAsi-related genes were used as a basis for the identification of mRNAsi-related molecular subtypes through consensus clustering. The immune characteristics and biological pathways of different subtypes were assessed. The mRNAsi-related risk model was constructed with differentially expressed genes (DEGs) between subtypes. Results: The patients with high mRNAsi had longer overall survival than that with low mRNAsi. Two subtypes were identified with that C2 had higher mRNAsi and better prognosis than C1. Tumor-related pathways such as TGF-β and epithelial-mesenchymal transition (EMT) were activated in C1. C1 had higher enrichment of cancer-associated fibroblasts and tumor-associated macrophages, as well as higher immune response and angiogenesis score than C2. We screened a total 98 prognostic DEGs between C1 and C2. Based on the prognostic DEGs, we constructed a risk model containing three genes (ANO1, CD109, and CTNND2) that could divide iCCA samples into high- and low-risk groups. The two groups had distinct prognosis and immune characteristics. Notably, the risk score was negatively associated with mRNAsi (R = -0.53). High-risk group had higher enrichment score of T cell inflamed GEP, INF-γ, and cytolytic activity, and lower score of estimated IC50 of 5-fluorouracil and cisplatin than low-risk group. Conclusions: This study clarified the important role of tumor stemness in iCCA and developed an mRNAsi-related risk model for predicting the prognosis and supporting the clinical treatment in iCCA patients. The three genes (ANO1, CD109, and CTNND2) may serve as potential targets for iCCA treatment.
Collapse
Affiliation(s)
- Yuan Yue
- Department of Pharmacy, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Jie Tao
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Dan An
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| | - Lei Shi
- Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, China
| |
Collapse
|
14
|
Ruan H, Wang PC, Han L. Characterization of circular RNAs with advanced sequencing technologies in human complex diseases. WILEY INTERDISCIPLINARY REVIEWS. RNA 2023; 14:e1759. [PMID: 36164985 DOI: 10.1002/wrna.1759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 07/09/2022] [Accepted: 08/02/2022] [Indexed: 01/31/2023]
Abstract
Circular RNAs (circRNAs) are one category of non-coding RNAs that do not possess 5' caps and 3' free ends. Instead, they are derived in closed circle forms from pre-mRNAs by a non-canonical splicing mechanism named "back-splicing." CircRNAs were discovered four decades ago, initially called "scrambled exons." Compared to linear RNAs, the expression levels of circRNAs are considerably lower, and it is challenging to identify circRNAs specifically. Thus, the biological relevance of circRNAs has been underappreciated until the advancement of next generation sequencing (NGS) technology. The biological insights of circRNAs, such as their tissue-specific expression patterns, biogenesis factors, and functional effects in complex diseases, namely human cancers, have been extensively explored in the last decade. With the invention of the third generation sequencing (TGS) with longer sequencing reads and newly designed strategies to characterize full-length circRNAs, the panorama of circRNAs in human complex diseases could be further unveiled. In this review, we first introduce the history of circular RNA detection. Next, we describe widely adopted NGS-based methods and the recently established TGS-based approaches capable of characterizing circRNAs in full-length. We then summarize data resources and representative circRNA functional studies related to human complex diseases. In the last section, we reviewed computational tools and discuss the potential advantages of utilizing advanced sequencing approaches to a functional interpretation of full-length circRNAs in complex diseases. This article is categorized under: RNA Evolution and Genomics > Computational Analyses of RNA RNA in Disease and Development > RNA in Disease.
Collapse
Affiliation(s)
- Hang Ruan
- Institutes of Biology and Medical Sciences, Soochow University, Suzhou, China
| | - Peng-Cheng Wang
- Institutes of Biology and Medical Sciences, Soochow University, Suzhou, China
| | - Leng Han
- Center for Epigenetics and Disease Prevention, Institute of Biosciences and Technology, Texas A&M University, Houston, Texas, USA.,Department of Translational Medical Sciences, College of Medicine, Texas A&M University, Houston, Texas, USA
| |
Collapse
|
15
|
Shao M, Hao S, Jiang L, Cai Y, Zhao X, Chen Q, Gao X, Xu J. CRIT: Identifying RNA-binding protein regulator in circRNA life cycle via non-negative matrix factorization. MOLECULAR THERAPY. NUCLEIC ACIDS 2022; 30:398-406. [PMID: 36420213 PMCID: PMC9664520 DOI: 10.1016/j.omtn.2022.10.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 10/25/2022] [Indexed: 11/06/2022]
Abstract
Circular RNAs (circRNAs) are endogenous non-coding RNAs that regulate gene expression and participate in carcinogenesis. However, the RNA-binding proteins (RBPs) involved in circRNAs biogenesis and modulation remain largely unclear. We developed the circRNA regulator identification tool (CRIT), a non-negative matrix-factorization-based pipeline to identify regulating RBPs in cancers. CRIT uncovered 73 novel regulators across thousands of samples by effectively leveraging genomics data and functional annotations. We demonstrated that known RBPs involved in circRNA control are significantly enriched in these predictions. Analysis of circRNA-RBP interactions using two large cross-linking immunoprecipitation (CLIP) databases, we validated the consistency between CRIT prediction and the CLIP experiments. Furthermore, newly discovered RBPs are functionally connected with authentic circRNA regulators by various biological associations, such as physical interaction, similar binding motifs, common transcription factor modulation, and co-expression. When analyzing RNA sequencing (RNA-seq) datasets after short hairpin RNA (shRNA)/small interfering RNA (siRNA) knockdown, we found several novel RBPs that can affect global circRNA expression, which strengthens their role in the circRNA life cycle. The above evidence provided independent confirmation that CRIT is a useful tool to capture RBPs in circRNA processing. Finally, we show that authentic regulators are more likely the core splicing proteins and peripheral factors and usually harbor more alterations in the vast majority of cancers.
Collapse
Affiliation(s)
- Mengting Shao
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
| | - Shijia Hao
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
| | - Leiming Jiang
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
| | - Yujie Cai
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
- Guangdong Key Laboratory of Age-Related Cardiac and Cerebral Diseases, Affiliated Hospital of Guangdong Medical University, 524000 Zhanjiang, China
| | - Xing Zhao
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
- Department of Pathology and Medical Biology, University of Groningen, University Medical Center, Groningen, 9700 RB Groningen, the Netherlands
| | - Qiuyang Chen
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
| | - Xuefei Gao
- Department of Physiology, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
- Guangdong Provincial Key Laboratory of Infectious Diseases and Molecular Immunopathology, 515041 Shantou, China
| | - Jianzhen Xu
- Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
- Guangdong Provincial Key Laboratory of Infectious Diseases and Molecular Immunopathology, 515041 Shantou, China
- Corresponding author Jianzhen Xu, Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College (SUMC), 515041 Shantou, China
| |
Collapse
|
16
|
Ma L, Chu H, Wang M, Zhang Z. Biological functions and potential implications of circular RNAs. J Biomed Res 2022; 37:89-99. [PMID: 36814375 PMCID: PMC10018409 DOI: 10.7555/jbr.36.20220095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023] Open
Abstract
Circular RNAs (circRNAs) are characterized by a covalent closed-loop structure with an absence of both 5' cap structure and 3' polyadenylated tail. Numerous studies have found that circRNAs play an important role in various diseases and have a variety of biological regulatory mechanisms, including acting as microRNA sponges, interacting with proteins, modulating the expression of related genes and translating into peptides or proteins. CircRNAs have also been used as biomarkers for a number of diseases, which could improve clinical practice. This review summarizes the most recent advances in biogenesis and knowledge of the biological functions of circRNAs as well as the related bioinformatics databases. We specifically describe developments in understanding of circRNA functions in the field of environmental exposure-induced diseases. Finally, we focus on potential clinical implications of circRNAs to facilitate their clinical transformation into disease treatment.
Collapse
Affiliation(s)
- Lan Ma
- Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China.,Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Haiyan Chu
- Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China.,Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Meilin Wang
- Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China.,Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| | - Zhengdong Zhang
- Department of Genetic Toxicology, The Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China.,Department of Environmental Genomics, Jiangsu Key Laboratory of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu 211166, China
| |
Collapse
|
17
|
Li F, Song QZ, Zhang YF, Wang XR, Cao LM, Li N, Zhao LX, Zhang SX, Zhuang XF. Identifying the EMT-related signature to stratify prognosis and evaluate the tumor microenvironment in lung adenocarcinoma. Front Genet 2022; 13:1008416. [PMID: 36186418 PMCID: PMC9523218 DOI: 10.3389/fgene.2022.1008416] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Accepted: 08/31/2022] [Indexed: 12/03/2022] Open
Abstract
Background: Epithelial-mesenchymal transition (EMT) is a critical process in tumor invasion and metastasis. EMT has been shown to significantly influence the invasion, metastasis, and poor prognosis in lung adenocarcinoma (LUAD). This study aimed to develop a novel EMT-related prognostic model capable of predicting overall survival (OS) in patients with LUAD. Methods: A total of 283 LUAD patients from TCGA RNA-seq dataset were assigned to a training cohort for model building, and 310 LUAD patients from GEO RNA-seq dataset were assigned to a validation cohort. EMT genes were acquired from MsigDB database and then prognosis-related EMT genes were identified by univariate Cox regression. Lasso regression was then performed to determine the genes and the corresponding variables to construct a prognosis risk model from the training cohort. Furthermore, characteristics of the tumor microenvironment (TME), mutation status and chemotherapy responses were analyzed to assess the differences between the two risk groups based on the prognostic model. In addition, RT-qPCR was employed to validate the expression patterns of the 6 genes derived from the risk model. Results: A six-gene EMT signature (PMEPA1, LOXL2, PLOD2, MMP14, SPOCK1 and DCN) was successfully constructed and validated. The signature assigned the LUAD patients into high-risk and low-risk groups. In comparison with the low-risk group, patients in the high-risk group had a significantly lower survival rate. ROC curves and calibration curves for the risk model demonstrated reliable stratification and predictive ability. The risk model was robustly correlated with multiple TME characteristics. Besides, the data showed that patients in the low-risk group had more immune activities, higher stemness scores and cytolytic activity scores and higher TMB. In addition, RT-qPCR results revealed that PMEPA1, LOXL2, PLOD2, MMP14, and SPOCK1 were notably upregulated in LUAD tissues, while DCN was downregulated. Conclusion: Our study successfully developed a novel EMT-related signature to predict prognosis of LUAD patients and guide treatment strategies. The six genes derived from the prediction signature might play a potential role in antitumor immunity and serve as promising therapeutic targets in LUAD.
Collapse
Affiliation(s)
- Feng Li
- Department of Cell Biology, Shanxi Province Cancer Hospital, Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences, Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, China
| | - Qing-Zhen Song
- Department of Special Geriatrics, Shanxi Province Cancer Hospital, Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences, Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, China
| | - Yi-Fan Zhang
- The First Clinical Medical College, Shanxi Medical University, Taiyuan, China
- Key Laboratory of Cellular Physiology at Shanxi Medical University, Ministry of Education, Taiyuan, China
| | - Xing-Ru Wang
- The Second Clinical Medical College, Shanxi Medical University, Taiyuan, China
| | - Li-Min Cao
- The First Clinical Medical College, Shanxi Medical University, Taiyuan, China
| | - Nan Li
- The School of Basic Medicine of Shanxi Medical University, Taiyuan, Shanxi, China
| | - Ling-Xia Zhao
- Department of Endocrinology and Metabolism, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences, Tongji Shanxi Hospital, Third Hospital of Shanxi Medical University, Taiyuan, China
- *Correspondence: Ling-Xia Zhao, ; Sheng-Xiao Zhang, ; Xiao-Fei Zhuang,
| | - Sheng-Xiao Zhang
- Key Laboratory of Cellular Physiology at Shanxi Medical University, Ministry of Education, Taiyuan, China
- Department of Rheumatology, The Second Hospital of Shanxi Medical University, Taiyuan, Shanxi, China
- *Correspondence: Ling-Xia Zhao, ; Sheng-Xiao Zhang, ; Xiao-Fei Zhuang,
| | - Xiao-Fei Zhuang
- Department of Thoracic Surgery, Shanxi Province Cancer Hospital, Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences, Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, China
- *Correspondence: Ling-Xia Zhao, ; Sheng-Xiao Zhang, ; Xiao-Fei Zhuang,
| |
Collapse
|
18
|
A pseudo-Siamese framework for circRNA-RBP binding sites prediction integrating BiLSTM and soft attention mechanism. Methods 2022; 207:57-64. [PMID: 36113743 DOI: 10.1016/j.ymeth.2022.09.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 08/24/2022] [Accepted: 09/09/2022] [Indexed: 11/20/2022] Open
Abstract
Circular RNAs (circRNAs) are widely expressed in tissues and play a key role in diseases through interacting with RNA binding proteins (RBPs). Since the high cost of traditional technology, computational methods are developed to identify the binding sites between circRNAs and RBPs. Unfortunately, these methods suffer from the insufficient learning of features and the single classification of output. To address these limitations, we propose a novel method named circ-pSBLA which constructs a pseudo-Siamese framework integrating Bi-directional long short-term memory (BiLSTM) network and soft attention mechanism for circRNA-RBP binding sites prediction. Softmax function and CatBoost are adopted to classify, respectively, and then a pseudo-Siamese framework is constructed. circ-pSBLA combines them to get final output. To validate the effectiveness of circ-pSBLA, we compare it with other state-of-the-art methods and carry out an ablation experiment on 17 sub-datasets. Moreover, we do motif analysis on 3 sub-datasets. The results show that circ-pSBLA achieves superior performance and outperforms other methods. All supporting source codes can be downloaded from https://github.com/gyj9811/circ-pSBLA.
Collapse
|
19
|
Wang Z, Lei X. A web server for identifying circRNA-RBP variable-length binding sites based on stacked generalization ensemble deep learning network. Methods 2022; 205:179-190. [PMID: 35810958 DOI: 10.1016/j.ymeth.2022.06.014] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 06/23/2022] [Accepted: 06/28/2022] [Indexed: 11/28/2022] Open
Abstract
Circular RNA (circRNA) can exert biological functions by interacting with RNA-binding protein (RBP), and some deep learning-based methods have been developed to predict RBP binding sites on circRNA. However, most of these methods identify circRNA-RBP binding sites are only based on single data resource and cannot provide exact binding sites, only providing the probability value of a sequence fragment. To solve these problems, we propose a binding sites localization algorithm that fuses binding sites from multiple databases, and further design a stacked generalization ensemble deep learning model named CirRBP to identify RBP binding sites on circRNA. The CirRBP is trained by combining the binding sites from multiple databases and makes predictions by weighted aggregating the predictions of each sub-model. The results show that the CirRBP outperforms any sub-model and existing online prediction model. For better access to our research results, we develop an open-source web application called CRWS (CircRNA-RBP Web Server). Its back-end learning model of the CRWS is a stacked generalization ensemble learning model CirRBP based on different deep learning frameworks. Given a full-length circRNA or fragment sequence and a target RBP, the CRWS can analyze and provide the exact potential binding sites of the target RBP on the given sequence through the binding sites localization algorithm, and visualize it. In addition, the CRWS can discover the most widely distributed motif in each RBP dataset. Up to now, CRWS is the first significant online tool that uses multi-source data to train models and predict exact binding sites. CRWS is now publicly and freely available without login requirement at: http://www.bioinformatics.team.
Collapse
Affiliation(s)
- Zhengfeng Wang
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China; College of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China; Guangxi Key Laboratory of Embedded Technology and Intelligent System, Guilin University of Technology, Guilin 541004, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China.
| |
Collapse
|
20
|
Liu P, Wang Y, Zhang N, Zhao X, Li R, Wang Y, Chen C, Wang D, Zhang X, Chen L, Zhao D. Comprehensive identification of RNA transcripts and construction of RNA network in chronic obstructive pulmonary disease. Respir Res 2022; 23:154. [PMID: 35690768 PMCID: PMC9188256 DOI: 10.1186/s12931-022-02069-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 05/24/2022] [Indexed: 12/15/2022] Open
Abstract
Background Chronic obstructive pulmonary disease (COPD) is one of the world’s leading causes of death and a major chronic disease, highly prevalent in the aging population exposed to tobacco smoke and airborne pollutants, which calls for early and useful biomolecular predictors. Roles of noncoding RNAs in COPD have been proposed, however, not many studies have systematically investigated the crosstalk among various transcripts in this context. The construction of RNA functional networks such as lncRNA-mRNA, and circRNA-miRNA-mRNA interaction networks could therefore facilitate our understanding of RNA interactions in COPD. Here, we identified the expression of RNA transcripts in RNA sequencing from COPD patients, and the potential RNA networks were further constructed. Methods All fresh peripheral blood samples of three patients with COPD and three non-COPD patients were collected and examined for mRNA, miRNA, lncRNA, and circRNA expression followed by qRT-PCR validation. We also examined mRNA expression to enrich relevant biological pathways. lncRNA-mRNA coexpression network and circRNA-miRNA-mRNA network in COPD were constructed. Results In this study, we have comprehensively identified and analyzed the differentially expressed mRNAs, lncRNAs, miRNAs, and circRNAs in peripheral blood of COPD patients with high-throughput RNA sequencing. 282 mRNAs, 146 lncRNAs, 85 miRNAs, and 81 circRNAs were differentially expressed. GSEA analysis showed that these differentially expressed RNAs correlate with several critical biological processes such as “ncRNA metabolic process”, “ncRNA processing”, “ribosome biogenesis”, “rRNAs metabolic process”, “tRNA metabolic process” and “tRNA processing”, which might be participating in the progression of COPD. RT-qPCR with more clinical COPD samples was used for the validation of some differentially expressed RNAs, and the results were in high accordance with the RNA sequencing. Given the putative regulatory function of lncRNAs and circRNAs, we have constructed the co-expression network between lncRNA and mRNA. To demonstrate the potential interactions between circRNAs and miRNAs, we have also constructed a competing endogenous RNA (ceRNA) network of differential expression circRNA-miRNA-mRNA in COPD. Conclusions In this study, we have identified and analyzed the differentially expressed mRNAs, lncRNAs, miRNAs, and circRNAs, providing a systematic view of the differentially expressed RNA in the context of COPD. We have also constructed the lncRNA-mRNA co-expression network, and for the first time constructed the circRNA-miRNA-mRNA in COPD. This study reveals the RNA involvement and potential regulatory roles in COPD, and further uncovers the interactions among those RNAs, which will assist the pathological investigations of COPD and shed light on therapeutic targets exploration for COPD. Supplementary Information The online version contains supplementary material available at 10.1186/s12931-022-02069-8.
Collapse
Affiliation(s)
- Pengcheng Liu
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Yucong Wang
- Department of Clinical Laboratory, The First Affiliated Hospital of USTC, The CAS Key Laboratory of Innate Immunity and Chronic Disease, School of Basic Medical Sciences, Division of Life Science and Medicine, University of Science and Technology of China, Hefei, 230027, China
| | - Ningning Zhang
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Xiaomin Zhao
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Renming Li
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Yu Wang
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Chen Chen
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Dandan Wang
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China
| | - Xiaoming Zhang
- School of Basic Medicine, Anhui Medical University, Hefei, 230601, China
| | - Liang Chen
- Department of Clinical Laboratory, The First Affiliated Hospital of USTC, The CAS Key Laboratory of Innate Immunity and Chronic Disease, School of Basic Medical Sciences, Division of Life Science and Medicine, University of Science and Technology of China, Hefei, 230027, China.
| | - Dahai Zhao
- Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital, Anhui Medical University, 678 Furong Road, Hefei, 230601, Anhui Province, China.
| |
Collapse
|
21
|
Guo X, Piao H. Research Progress of circRNAs in Glioblastoma. Front Cell Dev Biol 2021; 9:791892. [PMID: 34881248 PMCID: PMC8645988 DOI: 10.3389/fcell.2021.791892] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Accepted: 11/02/2021] [Indexed: 01/10/2023] Open
Abstract
Circular RNAs (circRNAs) are a class of single-stranded covalently closed non-coding RNAs without a 5' cap structure or 3' terminal poly (A) tail, which are expressed in a variety of tissues and cells with conserved, stable and specific characteristics. Glioblastoma (GBM) is the most aggressive and lethal tumor in the central nervous system, characterized by high recurrence and mortality rates. The specific expression of circRNAs in GBM has demonstrated their potential to become new biomarkers for the development of GBM. The specific expression of circRNAs in GBM has shown their potential as new biomarkers for GBM cell proliferation, apoptosis, migration and invasion, which provides new ideas for GBM treatment. In this paper, we will review the biological properties and functions of circRNAs and their biological roles and clinical applications in GBM.
Collapse
Affiliation(s)
- Xu Guo
- Department of Neurosurgery, Liaoning Cancer Hospital and Institute, Cancer Hospital of China Medical University, Shenyang, China
| | - Haozhe Piao
- Department of Neurosurgery, Liaoning Cancer Hospital and Institute, Cancer Hospital of China Medical University, Shenyang, China
| |
Collapse
|