1
|
Guan Z, Jin X, Zhang X. MFF-nDA: A Computational Model for ncRNA-Disease Association Prediction Based on Multimodule Fusion. J Chem Inf Model 2025; 65:3324-3342. [PMID: 40129032 DOI: 10.1021/acs.jcim.5c00174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2025]
Abstract
Noncoding RNAs(ncRNAs), including piwi-interacting RNA(piRNA), long noncoding RNA(lncRNA), microRNA(miRNA), small nucleolar RNA(snoRNA), and circular RNA(circRNA), contribute significantly to gene expression regulation and serve as key factors in disease association studies and health-related exploration. Accurate prediction of ncRNA-disease associations is crucial for elucidating disease mechanisms and advancing therapeutic development. Recently, computational models based on a graph neural network have extensively emerged for identifying associations among various ncRNAs and diseases. However, existing computational models have not fully utilized integrative information on ncRNs and diseases, and reliance on GNN-based models alone may be limited in performance due to oversmoothing issues. On the other hand, existing models are mainly targeted at a specific type of ncRNA and may not be applicable to most ncRNAs. Therefore, to overcome these limitations, we propound a computational model MFF-nDA based on multimodule fusion. Specifically, we first introduce five types of similarity network information, including three types of ncRNA and two types of disease similarity information, in order to fully explore and optimize the multisource feature information on these entities. Subsequently, we establish three modules: heterogeneous network representation module based on Transformer, association network representation module based on graph convolutional network (GCN), and topological structure representation module based on graph attention network (GAT), which capture diverse features of nodes in heterogeneous networks and topological structure information reflected in association networks. The complementary effects of the three modules also help relieve the oversmoothing issue to some extent. By leveraging the multimodule fusion learning to comprehensively capture the diverse features of these entities, our model outperforms the available state-of-the-art methods, achieving an AUC greater than 0.9000 for each dataset. This demonstrates the highest predictive performance, making it a valuable tool for identifying potential ncRNA associated with diseases. The code of MFF-nDA can be accessed at https://github.com/Jack-Cxy/MFF-nDA.
Collapse
Affiliation(s)
- Zhihao Guan
- College of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agricultural University, Hefei 230036, China
| | - Xiu Jin
- College of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agricultural University, Hefei 230036, China
| | - Xiaodan Zhang
- College of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, China
- Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Anhui Agricultural University, Hefei 230036, China
| |
Collapse
|
2
|
Gondal MN, Farooqi HMU. Single-Cell Transcriptomic Approaches for Decoding Non-Coding RNA Mechanisms in Colorectal Cancer. Noncoding RNA 2025; 11:24. [PMID: 40126348 PMCID: PMC11932299 DOI: 10.3390/ncrna11020024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 01/27/2025] [Accepted: 03/03/2025] [Indexed: 03/25/2025] Open
Abstract
Non-coding RNAs (ncRNAs) play crucial roles in colorectal cancer (CRC) development and progression. Recent developments in single-cell transcriptome profiling methods have revealed surprising levels of expression variability among seemingly homogeneous cells, suggesting the existence of many more cell types than previously estimated. This review synthesizes recent advances in ncRNA research in CRC, emphasizing single-cell bioinformatics approaches for their analysis. We explore computational methods and tools used for ncRNA identification, characterization, and functional prediction in CRC, with a focus on single-cell RNA sequencing (scRNA-seq) data. The review highlights key bioinformatics strategies, including sequence-based and structure-based approaches, machine learning applications, and multi-omics data integration. We discuss how these computational techniques can be applied to analyze differential expression, perform functional enrichment, and construct regulatory networks involving ncRNAs in CRC. Additionally, we examine the role of bioinformatics in leveraging ncRNAs as diagnostic and prognostic biomarkers for CRC. We also discuss recent scRNA-seq studies revealing ncRNA heterogeneity in CRC. This review aims to provide a comprehensive overview of the current state of single-cell bioinformatics in ncRNA CRC research and outline future directions in this rapidly evolving field, emphasizing the integration of computational approaches with experimental validation to advance our understanding of ncRNA biology in CRC.
Collapse
Affiliation(s)
- Mahnoor Naseer Gondal
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA;
- Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Hafiz Muhammad Umer Farooqi
- Laboratory of Energy Metabolism, Division of Metabolic Disorders, Children’s Hospital of Orange County, Orange, CA 92868, USA
| |
Collapse
|
3
|
Pagoni M, Zogopoulos VL, Kontogiannis S, Tsolakou A, Zoumpourlis V, Tsangaris GT, Fokaefs E, Michalopoulos I, Tsatsakis AM, Drakoulis N. Integrated Pharmacogenetic Signature for the Prediction of Prostatic Neoplasms in Men With Metabolic Disorders. Cancer Genomics Proteomics 2025; 22:285-305. [PMID: 39993800 PMCID: PMC11880924 DOI: 10.21873/cgp.20502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 01/14/2025] [Accepted: 01/16/2025] [Indexed: 02/26/2025] Open
Abstract
BACKGROUND/AIM Oncogenic processes are delineated by metabolic dysregulation. Drug likeness is pharmacokinetically tested through the CYP450 enzymatic system, whose genetic aberrations under epigenetic stress could shift male organisms into prostate cancer pathways. Our objective was to predict the susceptibility to prostate neoplasia, focused on benign prostatic hyperplasia (BPH) and prostate cancer (PCa), based on the pharmacoepigenetic and the metabolic profile of Caucasians. MATERIALS AND METHODS Two independent cohorts of 47,389 individuals in total were assessed to find risk associations of CYP450 genes with prostatic neoplasia. The metabolic profile of the first cohort was statistically evaluated and frequencies of absorption-distribution-metabolism-excretion-toxicity (ADMET) properties were calculated. Prediction of miRNA pharmacoepigenetic targeting was performed. RESULTS We found that prostate cancer and benign prostatic hyperplasia patients of the first cohort shared common cardiometabolic trends. Drug classes C08CA, C09AA, C09CA, C10AA, C10AX of the cardiovascular, and G04CA, G04CB of the genitourinary systems, were associated with increased prostate cancer risk, while C03CA and N06AB of the cardiovascular and nervous systems were associated with low-risk for PCa. CYP3A4*1B was the most related pharmacogenetic polymorphism associated with prostate cancer susceptibility. miRNA-200c-3p and miRNA-27b-3p seem to be associated with CYP3A4 targeting and prostate cancer predisposition. Metabolomic analysis indicated that 11β-OHT, 2β-OHT, 15β-OHT, 2α-OHT and 6β-OHT had a high risk, and 16α-OHT, and 16β-OHT had an intermediate disease-risk. CONCLUSION These findings constitute a novel integrated signature for prostate cancer susceptibility. Further studies are required to assess their predictive value more fully.
Collapse
Affiliation(s)
- Maria Pagoni
- Research Group of Clinical Pharmacology and Pharmacogenomics, Faculty of Pharmacy, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece;
| | - Vasileios L Zogopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens, Greece
| | | | - Annia Tsolakou
- Research Group of Clinical Pharmacology and Pharmacogenomics, Faculty of Pharmacy, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece
| | | | - George Th Tsangaris
- Proteomics Research Unit, Biomedical Research Foundation, Academy of Athens, Athens, Greece;
| | | | - Ioannis Michalopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens, Greece
| | - Aristidis M Tsatsakis
- Department of Forensic Sciences and Toxicology, Faculty of Medicine, University of Crete, Heraklion, Greece
| | - Nikolaos Drakoulis
- Research Group of Clinical Pharmacology and Pharmacogenomics, Faculty of Pharmacy, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece
| |
Collapse
|
4
|
Chhichholiya Y, Kumar V, Hlawndo L, Sangave K, Singh S, Munshi A. BreVamiR3': A comprehensive database for breast cancer-associated genetic variations in miRNA and 3' UTR of their target genes. Comput Biol Med 2025; 186:109639. [PMID: 39740512 DOI: 10.1016/j.compbiomed.2024.109639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/25/2024] [Accepted: 12/26/2024] [Indexed: 01/02/2025]
Abstract
Breast cancer (BC) is a multifactorial disease where microRNA (miRNA)-mediated dysregulated gene expression plays a pivotal role in tumorigenesis, progression, and clinical outcomes. Genetic variation, particularly SNPs in miRNA sequences and the 3' untranslated regions (3'UTRs) of their target genes, can disrupt miRNA-mRNA interactions, leading to altered gene expression. Despite several existing databases providing insights into various aspects of miRNAs and their target genes in association with the development of the disease. Still, there remains a critical gap in a unified, BC-specific repository that integrates SNPs in miRNAs, their target genes, and associated molecular mechanisms. To address this, we developed BreVamiR3' (Breast cancer-associated genetic Variations in miRNA and 3'UTR of their target genes), a novel, freely accessible database (https://brevamir3.web.app/index.html). BreVamiR3' features comprehensive and curated data on 500 miRNAs and 828 target genes associated with BC, including experimentally validated SNPs, chromosomal loci, expression profiles, BC subtypes, signaling pathways, and classifications of genes i.e. oncogene, tumor suppressor, or both. Unique features of BreVamiR3' include detailed annotations of SNPs disrupting miRNA-target interactions, subtype-specific miRNA expression, visualization of miRNA-chromosomal distributions and their experimental validation. Chromosome 19 harbours the highest number of BC-associated miRNAs, while target genes such as CCND1, KRAS, and ERBB2 demonstrate extensive miRNA regulation. By integrating genetic variations with functional and clinical relevance, BreVamiR3' provides a single platform to explore miRNA-mediated regulatory networks, their genetic variation, and downstream signaling pathways in BC. This database serves as a powerful resource for researchers, enabling exploration of miRNA-gene interactions, genetic variation, and their role in pathogenesis of BC and diverse clinical outcomes.
Collapse
Affiliation(s)
- Yogita Chhichholiya
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Vicky Kumar
- Department of Computer Science Engineering, Guru Nanak Dev DSEU Rohini Campus, Delhi Skill and Entrepreneurship University, Dwarka, Delhi, India
| | - Lalnunmawia Hlawndo
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Kshitij Sangave
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Sandeep Singh
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India
| | - Anjana Munshi
- Department of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, Punjab, India.
| |
Collapse
|
5
|
Márton É, Varga A, Domoszlai D, Buglyó G, Balázs A, Penyige A, Balogh I, Nagy B, Szilágyi M. Non-Coding RNAs in Cancer: Structure, Function, and Clinical Application. Cancers (Basel) 2025; 17:579. [PMID: 40002172 PMCID: PMC11853212 DOI: 10.3390/cancers17040579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2024] [Revised: 02/04/2025] [Accepted: 02/06/2025] [Indexed: 02/27/2025] Open
Abstract
We are on the brink of a paradigm shift in both theoretical and clinical oncology. Genomic and transcriptomic profiling, alongside personalized approaches that account for individual patient variability, are increasingly shaping discourse. Discussions on the future of personalized cancer medicine are mainly dominated by the potential of non-coding RNAs (ncRNAs), which play a prominent role in cancer progression and metastasis formation by regulating the expression of oncogenic or tumor suppressor proteins at transcriptional and post-transcriptional levels; furthermore, their cell-free counterparts might be involved in intercellular communication. Non-coding RNAs are considered to be promising biomarker candidates for early diagnosis of cancer as well as potential therapeutic agents. This review aims to provide clarity amidst the vast body of literature by focusing on diverse species of ncRNAs, exploring the structure, origin, function, and potential clinical applications of miRNAs, siRNAs, lncRNAs, circRNAs, snRNAs, snoRNAs, eRNAs, paRNAs, YRNAs, vtRNAs, and piRNAs. We discuss molecular methods used for their detection or functional studies both in vitro and in vivo. We also address the challenges that must be overcome to enter a new era of cancer diagnosis and therapy that will reshape the future of oncology.
Collapse
Affiliation(s)
- Éva Márton
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - Alexandra Varga
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - Dóra Domoszlai
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - Gergely Buglyó
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - Anita Balázs
- Department of Integrative Health Sciences, Institute of Health Sciences, Faculty of Health Sciences, University of Debrecen, H-4032 Debrecen, Hungary;
| | - András Penyige
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - István Balogh
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
- Division of Clinical Genetics, Department of Laboratory Medicine, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary
| | - Bálint Nagy
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| | - Melinda Szilágyi
- Department of Human Genetics, Faculty of Medicine, University of Debrecen, H-4032 Debrecen, Hungary; (É.M.); (A.V.); (D.D.); (G.B.); (A.P.); (I.B.); (B.N.)
| |
Collapse
|
6
|
Asim MN, Ibrahim MA, Asif T, Dengel A. RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models. Heliyon 2025; 11:e41488. [PMID: 39897847 PMCID: PMC11783440 DOI: 10.1016/j.heliyon.2024.e41488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 12/23/2024] [Accepted: 12/24/2024] [Indexed: 02/04/2025] Open
Abstract
Deciphering information of RNA sequences reveals their diverse roles in living organisms, including gene regulation and protein synthesis. Aberrations in RNA sequence such as dysregulation and mutations can drive a diverse spectrum of diseases including cancers, genetic disorders, and neurodegenerative conditions. Furthermore, researchers are harnessing RNA's therapeutic potential for transforming traditional treatment paradigms into personalized therapies through the development of RNA-based drugs and gene therapies. To gain insights of biological functions and to detect diseases at early stages and develop potent therapeutics, researchers are performing diverse types RNA sequence analysis tasks. RNA sequence analysis through conventional wet-lab methods is expensive, time-consuming and error prone. To enable large-scale RNA sequence analysis, empowerment of wet-lab experimental methods with Artificial Intelligence (AI) applications necessitates scientists to have a comprehensive knowledge of both DNA and AI fields. While molecular biologists encounter challenges in understanding AI methods, computer scientists often lack basic foundations of RNA sequence analysis tasks. Considering the absence of a comprehensive literature that bridges this research gap and promotes the development of AI-driven RNA sequence analysis applications, the contributions of this manuscript are manifold: It equips AI researchers with biological foundations of 47 distinct RNA sequence analysis tasks. It sets a stage for development of benchmark datasets related to 47 distinct RNA sequence analysis tasks by facilitating cruxes of 64 different biological databases. It presents word embeddings and language models applications across 47 distinct RNA sequence analysis tasks. It streamlines the development of new predictors by providing a comprehensive survey of 58 word embeddings and 70 language models based predictive pipelines performance values as well as top performing traditional sequence encoding based predictors and their performances across 47 RNA sequence analysis tasks.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Muhammad Ali Ibrahim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Tayyaba Asif
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Andreas Dengel
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| |
Collapse
|
7
|
Wakamatsu K, Maruyama A, Okazumi S. Evaluation of Plasma microRNA-222 as a Biomarker for Gastric Cancer. J Clin Med 2024; 14:98. [PMID: 39797181 PMCID: PMC11721468 DOI: 10.3390/jcm14010098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 12/20/2024] [Accepted: 12/24/2024] [Indexed: 01/13/2025] Open
Abstract
Background: The dysregulation of microRNAs (miRNAs) has been detected in patients with gastric cancer (GC), which inspired the use of miRNAs as a novel biomarker for GC. In this study, we investigated the previously reported miRNA dysfunction in cancer tissues as a potential plasma biomarker for GC using quantitative reverse transcriptase polymerase chain reaction (RT-PCR). Methods: The published miRNA abnormalities were searched in the microRNA Cancer Association Database. Plasma samples were collected from patients with GC (n = 26) and controls (n = 17). The sensitivity and specificity of polyadenylation RT-PCR (PA-RT) and stem-loop RT-PCR (SL-RT) were compared. Statistical comparisons between patients with GC and controls were performed to identify miRNA biomarkers, and correlation analyses between the threshold cycle (Ct) values of miRNAs and various blood biochemical parameters were performed to elucidate the confounding factors. Results: mir-17, mir-21, mir-31, mir-99b, mir-222, and U6 were selected. PA-RT showed greater sensitivity and lower specificity than SL-RT (PA-RT vs. SL-RT, mean Ct: 19.6 vs. 29.2; coefficient of variation: 0.42 vs. 0.10). Adopting SL-RT owing to its higher specificity, only mir-222 was significantly upregulated in patients with GC (GC vs. control, miRNA expression: 15.4 vs. 5.27, p = 0.0098). Regarding the correlation between blood biochemical parameters and cells with miRNA expression, mir-31 and mir-99b were correlated with blood urea nitrogen, mir-17, mir-21, and mir-99b were negatively correlated with platelets, and mir-21 was correlated with neutrophils. No obvious correlations were noted between mir-222 expression and blood parameters. Receiver operating characteristic (ROC) curve analysis indicated that mir-222 identified GC patients with a maximum area under the curve (0.73, 95% confidence interval 0.57-0.89). Conclusions: Plasma mir-222 was confirmed to be dysregulated in patients with GC, irrespective of blood biochemical parameters.
Collapse
Affiliation(s)
- Kotaro Wakamatsu
- Department of Surgery, Toho University Sakura Medical Center, 564-1 Shimoshizu, Sakura 285-8741, Chiba, Japan;
| | - Atsushi Maruyama
- Department of Life Science and Technology, Institute of Science Tokyo, 4259 B-57 Nagatsuta-cho, Midori, Yokohama 226-8501, Kanagawa, Japan;
| | - Shinichi Okazumi
- Department of Surgery, Toho University Sakura Medical Center, 564-1 Shimoshizu, Sakura 285-8741, Chiba, Japan;
| |
Collapse
|
8
|
Loreni F, Nenna A, Nappi F, Ferrisi C, Chello C, Lusini M, Vincenzi B, Tonini G, Chello M. miRNAs in the diagnosis and therapy of cardiac and mediastinal tumors: a new dawn for cardio-oncology? Future Cardiol 2024; 20:795-806. [PMID: 39513219 PMCID: PMC11622773 DOI: 10.1080/14796678.2024.2419225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 10/17/2024] [Indexed: 11/15/2024] Open
Abstract
Dysfunctions in miRNA production have been recently investigated as predictors of neoplasms and their therapeutic strategies. In this review, we summarize the available knowledge on miRNAs and cardiac tumors (such as myxoma) and mediastinal tumors (such as thymoma) and propose new avenues for future research. MiRNAs are crucial for cardiac development through the expression of cardiac transcription factors (miR-335-5p), hinder the cell cycle by modulating the activity of transcription factors (miR-126-3p, miR-320a), modulate the production of inflammatory factors such as interleukins (miR-217), and interfere with cell proliferation or apoptosis (miR-218, miR-634 and miR-122). Current and future research on miRNAs is essential, as a deep understanding could lead to a revolution in the field of diagnostics and prevention of neoplastic diseases.
Collapse
Affiliation(s)
- Francesco Loreni
- Cardiac Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Antonio Nenna
- Cardiac Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Francesco Nappi
- Cardiac Surgery, Centre Cardiologique du Nord, Saint Denis, 93200, France
| | - Chiara Ferrisi
- Cardiac Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Camilla Chello
- PhD Course of Integrated Biomedical Sciences, Università Campus Bio-Medico di Roma, Rome, 00128, Italy
| | - Mario Lusini
- Cardiac Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Bruno Vincenzi
- Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Giuseppe Tonini
- Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| | - Massimo Chello
- Cardiac Surgery, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, 00128, Italy
| |
Collapse
|
9
|
Yang Y, Sun Y, Li F, Guan B, Liu JX, Shang J. MGCNRF: Prediction of Disease-Related miRNAs Based on Multiple Graph Convolutional Networks and Random Forest. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:15701-15709. [PMID: 37459265 DOI: 10.1109/tnnls.2023.3289182] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/01/2024]
Abstract
Increasing microRNAs (miRNAs) have been confirmed to be inextricably linked to various diseases, and the discovery of their associations has become a routine way of treating diseases. To overcome the time-consuming and laborious shortcoming of traditional experiments in verifying the associations of miRNAs and diseases (MDAs), a variety of computational methods have emerged. However, these methods still have many shortcomings in terms of predictive performance and accuracy. In this study, a model based on multiple graph convolutional networks and random forest (MGCNRF) was proposed for the prediction MDAs. Specifically, MGCNRF first mapped miRNA functional similarity and sequence similarity, disease semantic similarity and target similarity, and the known MDAs into four different two-layer heterogeneous networks. Second, MGCNRF applied four heterogeneous networks into four different layered attention graph convolutional networks (GCNs), respectively, to extract MDA embeddings. Finally, MGCNRF integrated the embeddings of every MDA into the features of the miRNA-disease pair and predicted potential MDAs through the random forest (RF). Fivefold cross-validation was applied to verify the prediction performance of MGCNRF, which outperforms the other seven state-of-the-art methods by area under curve. Furthermore, the accuracy and the case studies of different diseases further demonstrate the scientific rationale of MGCNRF. In conclusion, MGCNRF can serve as a scientific tool for predicting potential MDAs.
Collapse
|
10
|
Feng H, Ke C, Zou Q, Zhu Z, Liu T. Prediction of Potential miRNA-Disease Associations Based on a Masked Graph Autoencoder. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1874-1885. [PMID: 38954583 DOI: 10.1109/tcbb.2024.3421924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
Biomedical evidence has demonstrated the relevance of microRNA (miRNA) dysregulation in complex human diseases, and determining the relationship between miRNAs and diseases can aid in the early detection and prevention of diseases. Traditional biological experimental methods have the disadvantages of high cost and low efficiency, which are well compensated by computational methods. However, many computational methods have the challenge of excessively focusing on the neighbor relationship, ignoring the structural information of the graph, and belittling the redundant information of the graph structure. This study proposed a computational model based on a graph-masking autoencoder named MGAEMDA. MGAEMDA is an asymmetric framework in which the encoder maps partially observed graphs into latent representations. The decoder reconstructs the masked structural information based on the edge and node levels and combines it with linear matrices to obtain the result. The empirical results on the two datasets reveal that the MGAEMDA model performs better than its counterparts. We also demonstrated the predictive performance of MGAEMDA using a case study of four diseases, and all the top 30 predicted miRNAs were validated in the database, providing further evidence of the excellent performance of the model.
Collapse
|
11
|
Wang Y, Hong J, Lu Y, Sheng N, Fu Y, Yang L, Meng L, Huang L, Wang H. A Controllability Reinforcement Learning Method for Pancreatic Cancer Biomarker Identification. IEEE Trans Nanobioscience 2024; 23:556-563. [PMID: 39133596 DOI: 10.1109/tnb.2024.3441689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2024]
Abstract
Pancreatic cancer is one of the most malignant cancers with rapid progression and poor prognosis. The use of transcriptional data can be effective in finding new biomarkers for pancreatic cancer. Many network-based methods used to identify cancer biomarkers are proposed, among which the combination of network controllability appears. However, most of the existing methods do not study RNA, rely on priori and mutations information, or can only achieve classification tasks. In this study, we propose a method combined Relational Graph Convolutional Network and Deep Q-Network called RDDriver to identify pancreatic cancer biomarkers based on multi-layer heterogeneous transcriptional regulation network. Firstly, we construct a regulation network containing long non-coding RNA, microRNA, and messenger RNA. Secondly, Relational Graph Convolutional Network is used to learn the node representation. Finally, we use the idea of Deep Q-Network to build a model, which score and prioritize each RNA with the Popov-Belevitch-Hautus criterion. We train RDDriver on three small simulated networks, and calculate the average score after applying the model parameters to the regulation networks separately. To demonstrate the effectiveness of the method, we perform experiments for comparison between RDDriver and other eight methods based on the approximate benchmark of three types cancer drivers RNAs.
Collapse
|
12
|
Goel H, Goel A. MicroRNA and Rare Human Diseases. Genes (Basel) 2024; 15:1243. [PMID: 39457367 PMCID: PMC11507005 DOI: 10.3390/genes15101243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 09/19/2024] [Accepted: 09/24/2024] [Indexed: 10/28/2024] Open
Abstract
BACKGROUND The role of microRNAs (miRNAs) in the pathogenesis of rare genetic disorders has been gradually discovered. MiRNAs, a class of small non-coding RNAs, regulate gene expression by silencing target messenger RNAs (mRNAs). Their biogenesis involves transcription into primary miRNA (pri-miRNA), processing by the DROSHA-DGCR8 (DiGeorge syndrome critical region 8) complex, exportation to the cytoplasm, and further processing by DICER to generate mature miRNAs. These mature miRNAs are incorporated into the RNA-induced silencing complex (RISC), where they modulate gene expression. METHODS/RESULTS The dysregulation of miRNAs is implicated in various Mendelian disorders and familial diseases, including DICER1 syndrome, neurodevelopmental disorders (NDDs), and conditions linked to mutations in miRNA-binding sites. We summarized a few mechanisms how miRNA processing and regulation abnormalities lead to rare genetic disorders. Examples of such genetic diseases include hearing loss associated with MIR96 mutations, eye disorders linked to MIR184 mutations, and skeletal dysplasia involving MIR140 mutations. CONCLUSIONS Understanding these molecular mechanisms is crucial, as miRNA dysregulation is a key factor in the pathogenesis of these conditions, offering significant potential for the diagnosis and potential therapeutic intervention.
Collapse
Affiliation(s)
- Himanshu Goel
- Hunter Genetics, Waratah, NSW 2298, Australia
- School of Medicine and Public Health, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW 2308, Australia
| | - Amy Goel
- Billy Blue College of Design, Torrens University Australia, Adelaide, SA 5000, Australia;
| |
Collapse
|
13
|
Kamble P, Nagar PR, Bhakhar KA, Garg P, Sobhia ME, Naidu S, Bharatam PV. Cancer pharmacoinformatics: Databases and analytical tools. Funct Integr Genomics 2024; 24:166. [PMID: 39294509 DOI: 10.1007/s10142-024-01445-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 08/26/2024] [Accepted: 09/03/2024] [Indexed: 09/20/2024]
Abstract
Cancer is a subject of extensive investigation, and the utilization of omics technology has resulted in the generation of substantial volumes of big data in cancer research. Numerous databases are being developed to manage and organize this data effectively. These databases encompass various domains such as genomics, transcriptomics, proteomics, metabolomics, immunology, and drug discovery. The application of computational tools into various core components of pharmaceutical sciences constitutes "Pharmacoinformatics", an emerging paradigm in rational drug discovery. The three major features of pharmacoinformatics include (i) Structure modelling of putative drugs and targets, (ii) Compilation of databases and analysis using statistical approaches, and (iii) Employing artificial intelligence/machine learning algorithms for the discovery of novel therapeutic molecules. The development, updating, and analysis of databases using statistical approaches play a pivotal role in pharmacoinformatics. Multiple software tools are associated with oncoinformatics research. This review catalogs the databases and computational tools related to cancer drug discovery and highlights their potential implications in the pharmacoinformatics of cancer.
Collapse
Affiliation(s)
- Pradnya Kamble
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India
| | - Prinsa R Nagar
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India
| | - Kaushikkumar A Bhakhar
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India
| | - Prabha Garg
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India
| | - M Elizabeth Sobhia
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India
| | - Srivatsava Naidu
- Center of Biomedical Engineering, Indian Institute of Technology Ropar, Rupnagar, Punjab, India
| | - Prasad V Bharatam
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India.
- Department of Medicinal Chemistry, National Institute of Pharmaceutical Education and Research, S.A.S. Nagar, Punjab, India.
| |
Collapse
|
14
|
Cavalleri E, Cabri A, Soto-Gomez M, Bonfitto S, Perlasca P, Gliozzo J, Callahan TJ, Reese J, Robinson PN, Casiraghi E, Valentini G, Mesiti M. An ontology-based knowledge graph for representing interactions involving RNA molecules. Sci Data 2024; 11:906. [PMID: 39174566 PMCID: PMC11341713 DOI: 10.1038/s41597-024-03673-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 07/23/2024] [Indexed: 08/24/2024] Open
Abstract
The "RNA world" represents a novel frontier for the study of fundamental biological processes and human diseases and is paving the way for the development of new drugs tailored to each patient's biomolecular characteristics. Although scientific data about coding and non-coding RNA molecules are constantly produced and available from public repositories, they are scattered across different databases and a centralized, uniform, and semantically consistent representation of the "RNA world" is still lacking. We propose RNA-KG, a knowledge graph (KG) encompassing biological knowledge about RNAs gathered from more than 60 public databases, integrating functional relationships with genes, proteins, and chemicals and ontologically grounded biomedical concepts. To develop RNA-KG, we first identified, pre-processed, and characterized each data source; next, we built a meta-graph that provides an ontological description of the KG by representing all the bio-molecular entities and medical concepts of interest in this domain, as well as the types of interactions connecting them. Finally, we leveraged an instance-based semantically abstracted knowledge model to specify the ontological alignment according to which RNA-KG was generated. RNA-KG can be downloaded in different formats and also queried by a SPARQL endpoint. A thorough topological analysis of the resulting heterogeneous graph provides further insights into the characteristics of the "RNA world". RNA-KG can be both directly explored and visualized, and/or analyzed by applying computational methods to infer bio-medical knowledge from its heterogeneous nodes and edges. The resource can be easily updated with new experimental data, and specific views of the overall KG can be extracted according to the bio-medical problem to be studied.
Collapse
Affiliation(s)
- Emanuele Cavalleri
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Alberto Cabri
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Mauricio Soto-Gomez
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Sara Bonfitto
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Paolo Perlasca
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Jessica Gliozzo
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Justin Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Peter N Robinson
- Berlin Institute of Health - Charité, Universitätsmedizin, Berlin, 13353, Germany
- ELLIS, European Laboratory for Learning and Intelligent Systems, Munich, Germany
| | - Elena Casiraghi
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
- ELLIS, European Laboratory for Learning and Intelligent Systems, Munich, Germany
| | - Giorgio Valentini
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy
- ELLIS, European Laboratory for Learning and Intelligent Systems, Munich, Germany
| | - Marco Mesiti
- AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy.
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
15
|
Si Y, Huang Z, Fang Z, Yuan Z, Huang Z, Li Y, Wei Y, Wu F, Yao YF. Global-local aware Heterogeneous Graph Contrastive Learning for multifaceted association prediction in miRNA-gene-disease networks. Brief Bioinform 2024; 25:bbae443. [PMID: 39256197 PMCID: PMC11387071 DOI: 10.1093/bib/bbae443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Revised: 08/11/2024] [Accepted: 08/30/2024] [Indexed: 09/12/2024] Open
Abstract
Unraveling the intricate network of associations among microRNAs (miRNAs), genes, and diseases is pivotal for deciphering molecular mechanisms, refining disease diagnosis, and crafting targeted therapies. Computational strategies, leveraging link prediction within biological graphs, present a cost-efficient alternative to high-cost empirical assays. However, while plenty of methods excel at predicting specific associations, such as miRNA-disease associations (MDAs), miRNA-target interactions (MTIs), and disease-gene associations (DGAs), a holistic approach harnessing diverse data sources for multifaceted association prediction remains largely unexplored. The limited availability of high-quality data, as vitro experiments to comprehensively confirm associations are often expensive and time-consuming, results in a sparse and noisy heterogeneous graph, hindering an accurate prediction of these complex associations. To address this challenge, we propose a novel framework called Global-local aware Heterogeneous Graph Contrastive Learning (GlaHGCL). GlaHGCL combines global and local contrastive learning to improve node embeddings in the heterogeneous graph. In particular, global contrastive learning enhances the robustness of node embeddings against noise by aligning global representations of the original graph and its augmented counterpart. Local contrastive learning enforces representation consistency between functionally similar or connected nodes across diverse data sources, effectively leveraging data heterogeneity and mitigating the issue of data scarcity. The refined node representations are applied to downstream tasks, such as MDA, MTI, and DGA prediction. Experiments show GlaHGCL outperforming state-of-the-art methods, and case studies further demonstrate its ability to accurately uncover new associations among miRNAs, genes, and diseases. We have made the datasets and source code publicly available at https://github.com/Sue-syx/GlaHGCL.
Collapse
Affiliation(s)
- Yuxuan Si
- Department of Ophthalmology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, East Qingchun Road, 310016 Zhejiang, China
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Zihan Huang
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Zhengqing Fang
- Department of Ophthalmology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, East Qingchun Road, 310016 Zhejiang, China
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Zhouhang Yuan
- Department of Ophthalmology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, East Qingchun Road, 310016 Zhejiang, China
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Zhengxing Huang
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Yingming Li
- College of Information Science and Electronic Engineering, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Ying Wei
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Fei Wu
- College of Computer Science and Technology, Zhejiang University, 38 Zheda Road, 310027 Zhejiang, China
| | - Yu-Feng Yao
- Department of Ophthalmology, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, East Qingchun Road, 310016 Zhejiang, China
- Department of Ophthalmology, The Fourth Affiliated Hospital of Soochow University, 215000 Suzhou, China
| |
Collapse
|
16
|
Chaudhary U, Banerjee S. Decoding the Non-coding: Tools and Databases Unveiling the Hidden World of "Junk" RNAs for Innovative Therapeutic Exploration. ACS Pharmacol Transl Sci 2024; 7:1901-1915. [PMID: 39022352 PMCID: PMC11249652 DOI: 10.1021/acsptsci.3c00388] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 05/15/2024] [Accepted: 05/27/2024] [Indexed: 07/20/2024]
Abstract
Non-coding RNAs are pivotal regulators of gene and protein expression, exerting crucial influences on diverse biological processes. Their dysregulation is frequently implicated in the onset and progression of diseases, notably cancer. A profound comprehension of the intricate mechanisms governing ncRNAs is imperative for devising innovative therapeutic interventions against these debilitating conditions. Significantly, nearly 80% of our genome comprises ncRNAs, underscoring their centrality in cellular processes. The elucidation of ncRNA functions is pivotal for grasping the complexities of gene regulation and its implications for human health. Modern genome sequencing techniques yield vast datasets, stored in specialized databases. To harness this wealth of information and to understand the crosstalk of non-coding RNAs, knowledge of available databases is required, and many new sophisticated computational tools have emerged. These tools play a pivotal role in the identification, prediction, and annotation of ncRNAs, thereby facilitating their experimental validation. This Review succinctly outlines the current understanding of ncRNAs, emphasizing their involvement in disease development. It also highlights the databases and tools instrumental in classifying, annotating, and evaluating ncRNAs. By extracting meaningful biological insights from seemingly "junk" data, these tools empower scientists to unravel the intricate roles of ncRNAs in shaping human health.
Collapse
Affiliation(s)
- Uma Chaudhary
- Department of Biotechnology,
School of Biosciences and Technology, Vellore
Institute of Technology (VIT), Vellore, Tamil Nadu 632014, India
| | - Satarupa Banerjee
- Department of Biotechnology,
School of Biosciences and Technology, Vellore
Institute of Technology (VIT), Vellore, Tamil Nadu 632014, India
| |
Collapse
|
17
|
Xuan P, Wang X, Cui H, Meng X, Nakaguchi T, Zhang T. Meta-Path Semantic and Global-Local Representation Learning Enhanced Graph Convolutional Model for Disease-Related miRNA Prediction. IEEE J Biomed Health Inform 2024; 28:4306-4316. [PMID: 38709611 DOI: 10.1109/jbhi.2024.3397003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Dysregulation of miRNAs is closely related to the progression of various diseases, so identifying disease-related miRNAs is crucial. Most recently proposed methods are based on graph reasoning, while they did not completely exploit the topological structure composed of the higher-order neighbor nodes and the global and local features of miRNA and disease nodes. We proposed a prediction method, MDAP, to learn semantic features of miRNA and disease nodes based on various meta-paths, as well as node features from the entire heterogeneous network perspective, and node pair attributes. Firstly, for both the miRNA and disease nodes, node category-wise meta-paths were constructed to integrate the similarity and association connection relationships. Each target node has its specific neighbor nodes for each meta-path, and the neighbors of longer meta-paths constitute its higher-order neighbor topological structure. Secondly, we constructed a meta-path specific graph convolutional network module to integrate the features of higher-order neighbors and their topology, and then learned the semantic representations of nodes. Thirdly, for the entire miRNA-disease heterogeneous network, a global-aware graph convolutional autoencoder was built to learn the network-view feature representations of nodes. We also designed semantic-level and representation-level attentions to obtain informative semantic features and node representations. Finally, the strategy based on the parallel convolutional-deconvolutional neural networks were designed to enhance the local feature learning for a pair of miRNA and disease nodes. The experiment results showed that MDAP outperformed other state-of-the-art methods, and the ablation experiments demonstrated the effectiveness of MDAP's major innovations. MDAP's ability in discovering potential disease-related miRNAs was further analyzed by the case studies over three diseases.
Collapse
|
18
|
Aravind VA, Kouznetsova VL, Kesari S, Tsigelny IF. Using Machine Learning and miRNA for the Diagnosis of Esophageal Cancer. J Appl Lab Med 2024; 9:684-695. [PMID: 38721901 DOI: 10.1093/jalm/jfae037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 02/20/2024] [Indexed: 07/02/2024]
Abstract
BACKGROUND Esophageal cancer (EC) remains a global health challenge, often diagnosed at advanced stages, leading to high mortality rates. Current diagnostic tools for EC are limited in their efficacy. This study aims to harness the potential of microRNAs (miRNAs) as novel, noninvasive diagnostic biomarkers for EC. Our objective was to determine the diagnostic accuracy of miRNAs, particularly in distinguishing miRNAs associated with EC from control miRNAs. METHODS We applied machine learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) and TensorFlow Keras to a dataset of miRNA sequences and gene targets, assessing the predictive power of several classifiers: naïve Bayes, multilayer perceptron, Hoeffding tree, random forest, and random tree. The data were further subjected to InfoGain feature selection to identify the most informative miRNA sequence and gene target descriptors. The ML models' abilities to distinguish between miRNA implicated in EC and control group miRNA was then tested. RESULTS Of the tested WEKA classifiers, the top 3 performing ones were random forest, Hoeffding tree, and naïve Bayes. The TensorFlow Keras neural network model was subsequently trained and tested, the model's predictive power was further validated using an independent dataset. The TensorFlow Keras gave an accuracy 0.91. The WEKA best algorithm (naïve Bayes) model yielded an accuracy of 0.94. CONCLUSIONS The results demonstrate the potential of ML-based miRNA classifiers in diagnosing EC. However, further studies are necessary to validate these findings and explore the full clinical potential of this approach.
Collapse
Affiliation(s)
- Vishnu A Aravind
- REHS program, San Diego Supercomputer Center, UC San Diego, San Diego, CA, United States
| | - Valentina L Kouznetsova
- San Diego Supercomputer Center, UC San Diego, San Diego, CA, United States
- BiAna, La Jolla, CA, United States
- CureScience Institute, San Diego, CA, United States
| | - Santosh Kesari
- Pacific Neuroscience Institute, Department of Translational Neurosciences, Santa Monica, United States
| | - Igor F Tsigelny
- San Diego Supercomputer Center, UC San Diego, San Diego, CA, United States
- BiAna, La Jolla, CA, United States
- CureScience Institute, San Diego, CA, United States
- Department of Neurosciences, UC San Diego, San Diego, CA, United States
| |
Collapse
|
19
|
Qu J, Liu S, Li H, Zhou J, Bian Z, Song Z, Jiang Z. Three-layer heterogeneous network based on the integration of CircRNA information for MiRNA-disease association prediction. PeerJ Comput Sci 2024; 10:e2070. [PMID: 38983241 PMCID: PMC11232581 DOI: 10.7717/peerj-cs.2070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 04/29/2024] [Indexed: 07/11/2024]
Abstract
Increasing research has shown that the abnormal expression of microRNA (miRNA) is associated with many complex diseases. However, biological experiments have many limitations in identifying the potential disease-miRNA associations. Therefore, we developed a computational model of Three-Layer Heterogeneous Network based on the Integration of CircRNA information for MiRNA-Disease Association prediction (TLHNICMDA). In the model, a disease-miRNA-circRNA heterogeneous network is built by known disease-miRNA associations, known miRNA-circRNA interactions, disease similarity, miRNA similarity, and circRNA similarity. Then, the potential disease-miRNA associations are identified by an update algorithm based on the global network. Finally, based on global and local leave-one-out cross validation (LOOCV), the values of AUCs in TLHNICMDA are 0.8795 and 0.7774. Moreover, the mean and standard deviation of AUC in 5-fold cross-validations is 0.8777+/-0.0010. Especially, the two types of case studies illustrated the usefulness of TLHNICMDA in predicting disease-miRNA interactions.
Collapse
Affiliation(s)
- Jia Qu
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Shuting Liu
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Han Li
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Jie Zhou
- Shaoxing University, School of Computer Science and Engineering, Shaoxing, Zhejiang, China
| | - Zekang Bian
- Jiangnan University, School of AI & Computer Science, Wuxi, Jiangsu, China
| | - Zihao Song
- Changzhou University, School of Computer Science and Artificial Intelligence, Changzhou, Jiangsu, China
| | - Zhibin Jiang
- Shaoxing University, School of Computer Science and Engineering, Shaoxing, Zhejiang, China
| |
Collapse
|
20
|
Qin C, Zhang J, Ma L. EMCMDA: predicting miRNA-disease associations via efficient matrix completion. Sci Rep 2024; 14:12761. [PMID: 38834687 DOI: 10.1038/s41598-024-63582-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 05/30/2024] [Indexed: 06/06/2024] Open
Abstract
Abundant researches have consistently illustrated the crucial role of microRNAs (miRNAs) in a wide array of essential biological processes. Furthermore, miRNAs have been validated as promising therapeutic targets for addressing complex diseases. Given the costly and time-consuming nature of traditional biological experimental validation methods, it is imperative to develop computational methods. In the work, we developed a novel approach named efficient matrix completion (EMCMDA) for predicting miRNA-disease associations. First, we calculated the similarities across multiple sources for miRNA/disease pairs and combined this information to create a holistic miRNA/disease similarity measure. Second, we utilized this biological information to create a heterogeneous network and established a target matrix derived from this network. Lastly, we framed the miRNA-disease association prediction issue as a low-rank matrix-complete issue that was addressed via minimizing matrix truncated schatten p-norm. Notably, we improved the conventional singular value contraction algorithm through using a weighted singular value contraction technique. This technique dynamically adjusts the degree of contraction based on the significance of each singular value, ensuring that the physical meaning of these singular values is fully considered. We evaluated the performance of EMCMDA by applying two distinct cross-validation experiments on two diverse databases, and the outcomes were statistically significant. In addition, we executed comprehensive case studies on two prevalent human diseases, namely lung cancer and breast cancer. Following prediction and multiple validations, it was evident that EMCMDA proficiently forecasts previously undisclosed disease-related miRNAs. These results underscore the robustness and efficacy of EMCMDA in miRNA-disease association prediction.
Collapse
Affiliation(s)
- Chao Qin
- School of Information Science and Engineering, Qilu Normal University, Jinan, 250200, China.
| | - Jiancheng Zhang
- School of Information Science and Engineering, Qilu Normal University, Jinan, 250200, China
| | - Lingyu Ma
- School of Control Science and Engineering, Harbin Institute of Technology, Weihai, 250200, China
| |
Collapse
|
21
|
An X, Wu W, Wang P, Mahmut A, Guo J, Dong J, Gong W, Liu B, Yang L, Ma Y, Xu X, Chen J, Cao W, Jiang Q. Long noncoding RNA TUG1 promotes malignant progression of osteosarcoma by enhancing ZBTB7C expression. Biomed J 2024; 47:100651. [PMID: 37562773 PMCID: PMC11225834 DOI: 10.1016/j.bj.2023.100651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 05/21/2023] [Accepted: 08/05/2023] [Indexed: 08/12/2023] Open
Abstract
BACKGROUND Dysregulation of long non-coding RNAs (lncRNAs) is an important component of tumorigenesis. Aberrant expression of lncRNA taurine upregulated gene 1 (lncTUG1) has been reported in various tumors; however, its precise role and key targets critically involved in osteosarcoma (OS) progression remain unclear. METHODS The expression profiles of lncRNAs and their regulated miRNAs related to OS progression were assessed by bioinformatics analysis and confirmed by qRT-PCR of OS cells. The miRNA targets were identified by transcriptome sequencing and verified by luciferase reporter and RNA pull-down assays. Several in vivo and in vitro approaches, including CCK8 assay, western blot, qRT-PCR, lentiviral transduction and OS cell xenograft mouse model were established to validate the effects of lncTUG1 regulation of miRNA and the downstream target genes on OS cell growth, apoptosis and progression. RESULTS We found that lncTUG1 and miR-26a-5p were inversely up or down-regulated in OS cells, and siRNA-mediated lncTUG1 knockdown reversed the miR-26a-5p down-regulation and suppressed proliferation and enhanced apoptosis of OS cells. Further, we identified that an oncoprotein ZBTB7C was also upregulated in OS cells that were subjected to lncTUG1/miR-26a-5p regulation. More importantly, ZBTB7C knockdown reduced the ZBTB7C upregulation and ZBTB7C overexpression diminished the anti-OS effects of lncTUG1 knockdown in the OS xenograft model. CONCLUSIONS Our data suggest that lncTUG1 acts as a miR-26a-5p sponge and promotes OS progression via up-regulating ZBTB7C, and targeting lncTUG1 might be an effective strategy to treat OS.
Collapse
Affiliation(s)
- Xueying An
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Wenshu Wu
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China
| | - Pu Wang
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Abdurahman Mahmut
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Junxia Guo
- Department of Sports Medicine and Adult Reconstructive Surgery, Nanjing Drum Tower Hospital, Clinical College of Nanjing Medical University, Nanjing, China
| | - Jian Dong
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Wang Gong
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Bin Liu
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Lin Yang
- Department of Sports Medicine and Adult Reconstructive Surgery, Nanjing Drum Tower Hospital, Clinical College of Nanjing Medical University, Nanjing, China
| | - Yuze Ma
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China
| | - Xingquan Xu
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China.
| | - Jianmei Chen
- Institute of Translational Medicine, Medical College of Yangzhou University, Yangzhou, China.
| | - Wangsen Cao
- Nanjing University Medical School, Jiangsu Key Lab of Molecular Medicine. Nanjing, China; Department of Central Laboratory, Yancheng First Hospital, Affiliated Hospital of Nanjing University Medical School, The First People's Hospital of Yancheng, Yancheng, China.
| | - Qing Jiang
- State Key Laboratory of Pharmaceutical Biotechnology, Division of Sports Medicine and Adult Reconstructive Surgery, Department of Orthopedic Surgery, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, China; Branch of National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Nanjing, China.
| |
Collapse
|
22
|
Lee H, Ma T, Ke H, Ye Z, Chen S. dCCA: detecting differential covariation patterns between two types of high-throughput omics data. Brief Bioinform 2024; 25:bbae288. [PMID: 38888456 PMCID: PMC11184902 DOI: 10.1093/bib/bbae288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 05/01/2024] [Accepted: 06/03/2024] [Indexed: 06/20/2024] Open
Abstract
MOTIVATION The advent of multimodal omics data has provided an unprecedented opportunity to systematically investigate underlying biological mechanisms from distinct yet complementary angles. However, the joint analysis of multi-omics data remains challenging because it requires modeling interactions between multiple sets of high-throughput variables. Furthermore, these interaction patterns may vary across different clinical groups, reflecting disease-related biological processes. RESULTS We propose a novel approach called Differential Canonical Correlation Analysis (dCCA) to capture differential covariation patterns between two multivariate vectors across clinical groups. Unlike classical Canonical Correlation Analysis, which maximizes the correlation between two multivariate vectors, dCCA aims to maximally recover differentially expressed multivariate-to-multivariate covariation patterns between groups. We have developed computational algorithms and a toolkit to sparsely select paired subsets of variables from two sets of multivariate variables while maximizing the differential covariation. Extensive simulation analyses demonstrate the superior performance of dCCA in selecting variables of interest and recovering differential correlations. We applied dCCA to the Pan-Kidney cohort from the Cancer Genome Atlas Program database and identified differentially expressed covariations between noncoding RNAs and gene expressions. AVAILABILITY AND IMPLEMENTATION The R package that implements dCCA is available at https://github.com/hwiyoungstat/dCCA.
Collapse
Affiliation(s)
- Hwiyoung Lee
- Maryland Psychiatric Research Center, School of Medicine, University of Maryland, Baltimore, MD 21201, United States
- The University of Maryland Institute for Health Computing (UM-IHC), North Bethesda, MD 20852, United States
| | - Tianzhou Ma
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, United States
| | - Hongjie Ke
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, United States
| | - Zhenyao Ye
- The University of Maryland Institute for Health Computing (UM-IHC), North Bethesda, MD 20852, United States
- Division of Biostatistics and Bioinformatics, Department of Epidemiology and Public Health, School of Medicine, University of Maryland, Baltimore, MD 21201, United States
| | - Shuo Chen
- Maryland Psychiatric Research Center, School of Medicine, University of Maryland, Baltimore, MD 21201, United States
- The University of Maryland Institute for Health Computing (UM-IHC), North Bethesda, MD 20852, United States
- Division of Biostatistics and Bioinformatics, Department of Epidemiology and Public Health, School of Medicine, University of Maryland, Baltimore, MD 21201, United States
| |
Collapse
|
23
|
Planat M, Chester D. Topology and Dynamics of Transcriptome (Dys)Regulation. Int J Mol Sci 2024; 25:4971. [PMID: 38732192 PMCID: PMC11084388 DOI: 10.3390/ijms25094971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 04/29/2024] [Accepted: 04/30/2024] [Indexed: 05/13/2024] Open
Abstract
RNA transcripts play a crucial role as witnesses of gene expression health. Identifying disruptive short sequences in RNA transcription and regulation is essential for potentially treating diseases. Let us delve into the mathematical intricacies of these sequences. We have previously devised a mathematical approach for defining a "healthy" sequence. This sequence is characterized by having at most four distinct nucleotides (denoted as nt≤4). It serves as the generator of a group denoted as fp. The desired properties of this sequence are as follows: fp should be close to a free group of rank nt-1, it must be aperiodic, and fp should not have isolated singularities within its SL2(C) character variety (specifically within the corresponding Groebner basis). Now, let us explore the concept of singularities. There are cubic surfaces associated with the character variety of a four-punctured sphere denoted as S24. When we encounter these singularities, we find ourselves dealing with some algebraic solutions of a dynamical second-order differential (and transcendental) equation known as the Painlevé VI Equation. In certain cases, S24 degenerates, in the sense that two punctures collapse, resulting in a "wild" dynamics governed by the Painlevé equations of an index lower than VI. In our paper, we provide examples of these fascinating mathematical structures within the context of miRNAs. Specifically, we find a clear relationship between decorated character varieties of Painlevé equations and the character variety calculated from the seed of oncomirs. These findings should find many applications including cancer research and the investigation of neurodegenative diseases.
Collapse
Affiliation(s)
- Michel Planat
- Institut FEMTO-ST CNRS UMR 6174, Université de Franche-Comté, 15 B Avenue des Montboucons, F-25044 Besançon, France
| | - David Chester
- Quantum Gravity Research, Los Angeles, CA 90290, USA;
| |
Collapse
|
24
|
Sheng N, Xie X, Wang Y, Huang L, Zhang S, Gao L, Wang H. A Survey of Deep Learning for Detecting miRNA- Disease Associations: Databases, Computational Methods, Challenges, and Future Directions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:328-347. [PMID: 38194377 DOI: 10.1109/tcbb.2024.3351752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2024]
Abstract
MicroRNAs (miRNAs) are an important class of non-coding RNAs that play an essential role in the occurrence and development of various diseases. Identifying the potential miRNA-disease associations (MDAs) can be beneficial in understanding disease pathogenesis. Traditional laboratory experiments are expensive and time-consuming. Computational models have enabled systematic large-scale prediction of potential MDAs, greatly improving the research efficiency. With recent advances in deep learning, it has become an attractive and powerful technique for uncovering novel MDAs. Consequently, numerous MDA prediction methods based on deep learning have emerged. In this review, we first summarize publicly available databases related to miRNAs and diseases for MDA prediction. Next, we outline commonly used miRNA and disease similarity calculation and integration methods. Then, we comprehensively review the 48 existing deep learning-based MDA computation methods, categorizing them into classical deep learning and graph neural network-based techniques. Subsequently, we investigate the evaluation methods and metrics that are frequently used to assess MDA prediction performance. Finally, we discuss the performance trends of different computational methods, point out some problems in current research, and propose 9 potential future research directions. Data resources and recent advances in MDA prediction methods are summarized in the GitHub repository https://github.com/sheng-n/DL-miRNA-disease-association-methods.
Collapse
|
25
|
Yu P, Han Y, Meng L, Tian Y, Jin Z, Luo J, Han C, Xu W, Kong L, Zhang C. Exosomes derived from pulmonary metastatic sites enhance osteosarcoma lung metastasis by transferring the miR-194/215 cluster targeting MARCKS. Acta Pharm Sin B 2024; 14:2039-2056. [PMID: 38799644 PMCID: PMC11119511 DOI: 10.1016/j.apsb.2024.01.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2023] [Revised: 11/22/2023] [Accepted: 01/05/2024] [Indexed: 05/29/2024] Open
Abstract
Osteosarcoma, a prevalent primary malignant bone tumor, often presents with lung metastases, severely impacting patient survival rates. Extracellular vesicles, particularly exosomes, play a pivotal role in the formation and progression of osteosarcoma-related pulmonary lesions. However, the communication between primary osteosarcoma and exosome-mediated pulmonary lesions remains obscure, with the potential impact of pulmonary metastatic foci on osteosarcoma progression largely unknown. This study unveils an innovative mechanism by which exosomes originating from osteosarcoma pulmonary metastatic sites transport the miR-194/215 cluster to the primary tumor site. This transportation enhances lung metastatic capability by downregulating myristoylated alanine-rich C-kinase substrate (MARCKS) expression. Addressing this phenomenon, in this study we employ cationic bovine serum albumin (CBSA) to form nanoparticles (CBSA-anta-194/215) via electrostatic interaction with antagomir-miR-194/215. These nanoparticles are loaded into nucleic acid-depleted exosomal membrane vesicles (anta-194/215@Exo) targeting osteosarcoma lung metastatic sites. Intervention with bioengineered exosome mimetics (anta-194/215@Exo) not only impedes osteosarcoma progression but also significantly prolongs the lifespan of tumor-bearing mice. These findings suggest that pulmonary metastatic foci-derived exosomes initiate primary osteosarcoma lung metastasis by transferring the miR-194/215 cluster targeting MARCKS, making the miR-194/215 cluster a promising therapeutic target for inhibiting the progression of patients with osteosarcoma lung metastases.
Collapse
Affiliation(s)
- Pei Yu
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Yubao Han
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Lulu Meng
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Yanyuan Tian
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Zhiwei Jin
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Jun Luo
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Chao Han
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Wenjun Xu
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Lingyi Kong
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Chao Zhang
- State Key Laboratory of Natural Medicines, Jiangsu Key Laboratory of Bioactive Natural Product Research, School of Traditional Chinese Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| |
Collapse
|
26
|
Nazari A, Ghasemi T, Khalaj-Kondori M, Fathi R. Promoter of lncRNA MORT is aberrantly methylated in colorectal cancer. NUCLEOSIDES, NUCLEOTIDES & NUCLEIC ACIDS 2024; 44:111-123. [PMID: 38619194 DOI: 10.1080/15257770.2024.2328732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 03/04/2024] [Accepted: 03/04/2024] [Indexed: 04/16/2024]
Abstract
Aberrant DNA methylation plays essential roles in the colorectal cancer (CRC) carcinogenesis and has been demonstrated as a promising marker for cancer early detection. In this project, methylation status of the MORT promoter was studied in CRC and their marginal tissues using qMSP assay. Furthermore, we investigated the molecular function of MORT in CRC progression using computational analysis. The results showed a high methylation level of MORT promoter in CRC tissues. By in silico analysis, we found that MORT downregulation could promote the proliferation of CRC cells via sponging of has-miR-574-5p and has-miR-31-5p, and alteration of their targets expression pattern such as MYOCD and FOXP2. In conclusion, based on our results, promoter hypermethylation of MORT might be considered as a potential biomarker for CRC detection.
Collapse
Affiliation(s)
- Aylar Nazari
- Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, Iran
| | - Tayyebeh Ghasemi
- Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, Iran
| | - Mohammad Khalaj-Kondori
- Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, Iran
| | - Ramin Fathi
- Department of Genetics, Molecular Cell Group, Faculty of Basic Science, Islamic Azad University of Ahar, Ahar, Iran
| |
Collapse
|
27
|
Morishita EC, Nakamura S. Recent applications of artificial intelligence in RNA-targeted small molecule drug discovery. Expert Opin Drug Discov 2024; 19:415-431. [PMID: 38321848 DOI: 10.1080/17460441.2024.2313455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 01/30/2024] [Indexed: 02/08/2024]
Abstract
INTRODUCTION Targeting RNAs with small molecules offers an alternative to the conventional protein-targeted drug discovery and can potentially address unmet and emerging medical needs. The recent rise of interest in the strategy has already resulted in large amounts of data on disease associated RNAs, as well as on small molecules that bind to such RNAs. Artificial intelligence (AI) approaches, including machine learning and deep learning, present an opportunity to speed up the discovery of RNA-targeted small molecules by improving decision-making efficiency and quality. AREAS COVERED The topics described in this review include the recent applications of AI in the identification of RNA targets, RNA structure determination, screening of chemical compound libraries, and hit-to-lead optimization. The impact and limitations of the recent AI applications are discussed, along with an outlook on the possible applications of next-generation AI tools for the discovery of novel RNA-targeted small molecule drugs. EXPERT OPINION Key areas for improvement include developing AI tools for understanding RNA dynamics and RNA - small molecule interactions. High-quality and comprehensive data still need to be generated especially on the biological activity of small molecules that target RNAs.
Collapse
|
28
|
Chowdhury D, Mistry A, Maity D, Bhatia R, Priyadarshi S, Wadan S, Chakraborty S, Haldar S. Pan-cancer analyses suggest kindlin-associated global mechanochemical alterations. Commun Biol 2024; 7:372. [PMID: 38548811 PMCID: PMC10978987 DOI: 10.1038/s42003-024-06044-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 03/11/2024] [Indexed: 04/01/2024] Open
Abstract
Kindlins serve as mechanosensitive adapters, transducing extracellular mechanical cues to intracellular biochemical signals and thus, their perturbations potentially lead to cancer progressions. Despite the kindlin involvement in tumor development, understanding their genetic and mechanochemical characteristics across different cancers remains elusive. Here, we thoroughly examined genetic alterations in kindlins across more than 10,000 patients with 33 cancer types. Our findings reveal cancer-specific alterations, particularly prevalent in advanced tumor stage and during metastatic onset. We observed a significant co-alteration between kindlins and mechanochemical proteome in various tumors through the activation of cancer-related pathways and adverse survival outcomes. Leveraging normal mode analysis, we predicted structural consequences of cancer-specific kindlin mutations, highlighting potential impacts on stability and downstream signaling pathways. Our study unraveled alterations in epithelial-mesenchymal transition markers associated with kindlin activity. This comprehensive analysis provides a resource for guiding future mechanistic investigations and therapeutic strategies targeting the roles of kindlins in cancer treatment.
Collapse
Affiliation(s)
- Debojyoti Chowdhury
- Department of Chemical and Biological Sciences, S.N. Bose National Centre for Basic Sciences, Kolkata, West Bengal, 700106, India.
| | - Ayush Mistry
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India
| | - Debashruti Maity
- Department of Chemical and Biological Sciences, S.N. Bose National Centre for Basic Sciences, Kolkata, West Bengal, 700106, India
| | - Riti Bhatia
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India
| | - Shreyansh Priyadarshi
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India
| | - Simran Wadan
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India
| | - Soham Chakraborty
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India
| | - Shubhasis Haldar
- Department of Chemical and Biological Sciences, S.N. Bose National Centre for Basic Sciences, Kolkata, West Bengal, 700106, India.
- Department of Biology, Trivedi School of Biosciences, Ashoka University, Sonepat, Haryana, 131029, India.
- Technical Research Centre, S.N. Bose National Centre for Basic Sciences, Kolkata, West Bengal, 700106, India.
| |
Collapse
|
29
|
Ji B, Zou H, Xu L, Xie X, Peng S. MUSCLE: multi-view and multi-scale attentional feature fusion for microRNA-disease associations prediction. Brief Bioinform 2024; 25:bbae167. [PMID: 38605642 PMCID: PMC11009512 DOI: 10.1093/bib/bbae167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 03/02/2024] [Accepted: 03/31/2024] [Indexed: 04/13/2024] Open
Abstract
MicroRNAs (miRNAs) synergize with various biomolecules in human cells resulting in diverse functions in regulating a wide range of biological processes. Predicting potential disease-associated miRNAs as valuable biomarkers contributes to the treatment of human diseases. However, few previous methods take a holistic perspective and only concentrate on isolated miRNA and disease objects, thereby ignoring that human cells are responsible for multiple relationships. In this work, we first constructed a multi-view graph based on the relationships between miRNAs and various biomolecules, and then utilized graph attention neural network to learn the graph topology features of miRNAs and diseases for each view. Next, we added an attention mechanism again, and developed a multi-scale feature fusion module, aiming to determine the optimal fusion results for the multi-view topology features of miRNAs and diseases. In addition, the prior attribute knowledge of miRNAs and diseases was simultaneously added to achieve better prediction results and solve the cold start problem. Finally, the learned miRNA and disease representations were then concatenated and fed into a multi-layer perceptron for end-to-end training and predicting potential miRNA-disease associations. To assess the efficacy of our model (called MUSCLE), we performed 5- and 10-fold cross-validation (CV), which got average the Area under ROC curves of 0.966${\pm }$0.0102 and 0.973${\pm }$0.0135, respectively, outperforming most current state-of-the-art models. We then examined the impact of crucial parameters on prediction performance and performed ablation experiments on the feature combination and model architecture. Furthermore, the case studies about colon cancer, lung cancer and breast cancer also fully demonstrate the good inductive capability of MUSCLE. Our data and code are free available at a public GitHub repository: https://github.com/zht-code/MUSCLE.git.
Collapse
Affiliation(s)
- Boya Ji
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
| | - Haitao Zou
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
- College of Information Science and Engineering, Guilin University of Technology, Guilin 541006, China
| | - Liwen Xu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
| | - Xiaolan Xie
- College of Information Science and Engineering, Guilin University of Technology, Guilin 541006, China
| | - Shaoliang Peng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
| |
Collapse
|
30
|
Suszynska M, Machowska M, Fraszczyk E, Michalczyk M, Philips A, Galka-Marciniak P, Kozlowski P. CMC: Cancer miRNA Census - a list of cancer-related miRNA genes. Nucleic Acids Res 2024; 52:1628-1644. [PMID: 38261968 PMCID: PMC10899758 DOI: 10.1093/nar/gkae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 01/03/2024] [Indexed: 01/25/2024] Open
Abstract
A growing body of evidence indicates an important role of miRNAs in cancer; however, there is no definitive, convenient-to-use list of cancer-related miRNAs or miRNA genes that may serve as a reference for analyses of miRNAs in cancer. To this end, we created a list of 165 cancer-related miRNA genes called the Cancer miRNA Census (CMC). The list is based on a score, built on various types of functional and genetic evidence for the role of particular miRNAs in cancer, e.g. miRNA-cancer associations reported in databases, associations of miRNAs with cancer hallmarks, or signals of positive selection of genetic alterations in cancer. The presence of well-recognized cancer-related miRNA genes, such as MIR21, MIR155, MIR15A, MIR17 or MIRLET7s, at the top of the CMC ranking directly confirms the accuracy and robustness of the list. Additionally, to verify and indicate the reliability of CMC, we performed a validation of criteria used to build CMC, comparison of CMC with various cancer data (publications and databases), and enrichment analyses of biological pathways and processes such as Gene Ontology or DisGeNET. All validation steps showed a strong association of CMC with cancer/cancer-related processes confirming its usefulness as a reference list of miRNA genes associated with cancer.
Collapse
Affiliation(s)
- Malwina Suszynska
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Magdalena Machowska
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Eliza Fraszczyk
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Maciej Michalczyk
- Laboratory of Bioinformatics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Anna Philips
- Laboratory of Bioinformatics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Paulina Galka-Marciniak
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| | - Piotr Kozlowski
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, 61-704, Poland
| |
Collapse
|
31
|
Xuan P, Xiu J, Cui H, Zhang X, Nakaguchi T, Zhang T. Complementary feature learning across multiple heterogeneous networks and multimodal attribute learning for predicting disease-related miRNAs. iScience 2024; 27:108639. [PMID: 38303724 PMCID: PMC10831890 DOI: 10.1016/j.isci.2023.108639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 11/02/2023] [Accepted: 12/01/2023] [Indexed: 02/03/2024] Open
Abstract
Inferring the latent disease-related miRNAs is helpful for providing a deep insight into observing the disease pathogenesis. We propose a method, CMMDA, to encode and integrate the context relationship among multiple heterogeneous networks, the complementary information across these networks, and the pairwise multimodal attributes. We first established multiple heterogeneous networks according to the diverse disease similarities. The feature representation embedding the context relationship is formulated for each miRNA (disease) node based on transformer. We designed a co-attention fusion mechanism to encode the complementary information among multiple networks. In terms of a pair of miRNA and disease nodes, the pairwise attributes from multiple networks form a multimodal attribute embedding. A module based on depthwise separable convolution is constructed to enhance the encoding of the specific features from each modality. The experimental results and the ablation studies show that CMMDA's superior performance and the effectiveness of its major innovations.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Department of Computer Science, Shantou University, Shantou 515063, China
| | - Jinshan Xiu
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3083, Australia
| | - Xiaowen Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan
| | - Tiangang Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
32
|
Cui C, Zhong B, Fan R, Cui Q. HMDD v4.0: a database for experimentally supported human microRNA-disease associations. Nucleic Acids Res 2024; 52:D1327-D1332. [PMID: 37650649 PMCID: PMC10767894 DOI: 10.1093/nar/gkad717] [Citation(s) in RCA: 41] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 08/07/2023] [Accepted: 08/19/2023] [Indexed: 09/01/2023] Open
Abstract
MicroRNAs (miRNAs) are a class of important small non-coding RNAs with critical molecular functions in almost all biological processes, and thus, they play important roles in disease diagnosis and therapy. Human MicroRNA Disease Database (HMDD) represents an important and comprehensive resource for biomedical researchers in miRNA-related medicine. Here, we introduce HMDD v4.0, which curates 53530 miRNA-disease association entries from literatures. In comparison to HMDD v3.0 released five years ago, HMDD v4.0 contains 1.5 times more entries. In addition, some new categories have been curated, including exosomal miRNAs implicated in diseases, virus-encoded miRNAs involved in human diseases, and entries containing miRNA-circRNA interactions. We also curated sex-biased miRNAs in diseases. Furthermore, in a case study, disease similarity analysis successfully revealed that sex-biased miRNAs related to developmental anomalies are associated with a number of human diseases with sex bias. HMDD can be freely visited at http://www.cuilab.cn/hmdd.
Collapse
Affiliation(s)
- Chunmei Cui
- Department of Biomedical Informatics, Center for Noncoding RNA Medicine, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing 100191, China
| | - Bitao Zhong
- Department of Biomedical Informatics, Center for Noncoding RNA Medicine, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing 100191, China
| | - Rui Fan
- Department of Biomedical Informatics, Center for Noncoding RNA Medicine, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing 100191, China
| | - Qinghua Cui
- Department of Biomedical Informatics, Center for Noncoding RNA Medicine, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Rd, Beijing 100191, China
- School of Sports Medicine, Wuhan Institute of Physical Education, No. 461 Luoyu Rd. Wuchang District, Wuhan 430079, Hubei Province, China
| |
Collapse
|
33
|
Galluzzo Y. A comprehensive review of the data and knowledge graphs approaches in bioinformatics. COMPUTER SCIENCE AND INFORMATION SYSTEMS 2024; 21:1055-1075. [DOI: 10.2298/csis230530027g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
The scientific community is currently showing strong interest in constructing knowledge graphs from heterogeneous domains (genomic, pharmaceutical, clinical etc.). The main goal here is to support researchers in gaining an immediate overview of the biomedical and clinical data that can be utilized to construct and extend KGs. A in-depth overview of the available biomedical data and the latest applications of knowledge graphs, from the biological to the clinical context, is provided showing the most recent methods of representing biomedical knowledge with embeddings (KGEs). Furthermore, this review, differentiates biomedical databases based on their construction process (whether manually curated by experts or not), aiming to offer a detailed overview and guide researchers in selecting the appropriate database for their research considering to the specific project needs, available resources, and data complexity. In conclusion, the review highlights current challenges: integration of different knowledge graphs and the interpretability of predictions of new relations.
Collapse
|
34
|
Dong B, Sun W, Xu D, Wang G, Zhang T. MDformer: A transformer-based method for predicting miRNA-Disease associations using multi-source feature fusion and maximal meta-path instances encoding. Comput Biol Med 2023; 167:107585. [PMID: 37890424 DOI: 10.1016/j.compbiomed.2023.107585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 09/15/2023] [Accepted: 10/15/2023] [Indexed: 10/29/2023]
Abstract
There is a growing body of evidence suggesting that microRNAs (miRNAs), small biological molecules, play a crucial role in the diagnosis, treatment, and prognostic assessment of diseases. However, it is often inefficient to verify the association between miRNAs and diseases (MDA) through traditional experimental methods. Based on this situation, researchers have proposed various computational-based methods, but the existing methods often have many drawbacks in terms of predictive effectiveness and accuracy. Therefore, in order to improve the prediction performance of computational methods, we propose a transformer-based prediction model (MDformer) for multi-source feature information. Specifically, first, we consider multiple features of miRNAs and diseases from the molecular biology perspective and utilize them in a fusion. Then high-quality node feature embeddings were generated using a feature encoder based on the transformer architecture and meta-path instances. Finally, a deep neural network was built for MDA prediction. To evaluate the performance of our model, we performed multiple 5-fold cross-validations as well as comparison experiments on HMDD v3.2 and HMDD v2.0 databases, and the experimental results of the average ROC area under the curve (AUC) were higher than the comparative methods for both databases at 0.9506 and 0.9369. We conducted case studies on five highly lethal cancers (breast, lung, colorectal, gastric, and hepatocellular cancers), and the first 30 predictions for these five diseases achieved 97.3% accuracy. In conclusion, MDformer is a reliable and scientifically sound tool that can be used to accurately predict MDA. In addition, the source code is available at https://github.com/Linda908/MDformer.
Collapse
Affiliation(s)
- Benzhi Dong
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Weidong Sun
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Dali Xu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China.
| | - Tianjiao Zhang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China.
| |
Collapse
|
35
|
Liu C, Yu C, Song G, Fan X, Peng S, Zhang S, Zhou X, Zhang C, Geng X, Wang T, Cheng W, Zhu W. Comprehensive analysis of miRNA-mRNA regulatory pairs associated with colorectal cancer and the role in tumor immunity. BMC Genomics 2023; 24:724. [PMID: 38036953 PMCID: PMC10688136 DOI: 10.1186/s12864-023-09635-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 08/29/2023] [Indexed: 12/02/2023] Open
Abstract
BACKGROUND MicroRNA (miRNA) which can act as post-transcriptional regulators of mRNAs via base-pairing with complementary sequences within mRNAs is involved in processes of the complex interaction between immune system and tumors. In this research, we elucidated the profiles of miRNAs and target mRNAs expression and their associations with the phenotypic hallmarks of colorectal cancers (CRC) by integrating transcriptomic, immunophenotype, methylation, mutation and survival data. RESULTS We conducted the analysis of differential miRNA/mRNA expression profile by GEO, TCGA and GTEx databases and the correlation between miRNA and targeted mRNA by miRTarBase and TarBase. Then we detected using qRT-PCR and validated the diagnostic value of miRNA-mRNA regulator pairs by the ROC, calibration curve and DCA. Phenotypic hallmarks of regulatory pairs including tumor-infiltrating lymphocytes, tumor microenvironment, tumor mutation burden, global methylation and gene mutation were also described. The expression levels of miRNAs and target mRNAs were detected in 80 paired colon tissue samples. Ultimately, we picked up two pivotal regulatory pairs (miR-139-5p/ STC1 and miR-20a-5p/ FGL2) and verified the diagnostic value of the complex model which is the combination of 4 signatures above-mentioned in 3 testing GEO datasets and an external validation cohort. CONCLUSIONS We found that 2 miRNAs by targeting 2 metastasis-related mRNAs were correlated with tumor-infiltrating macrophages, HRAS, and BRAF gene mutation status. Our results established the diagnostic model containing 2 miRNAs and their respective targeted mRNAs to distinguish CRCs and normal controls and displayed their complex roles in CRC pathogenesis especially tumor immunity.
Collapse
Affiliation(s)
- Cheng Liu
- Department of Gastroenterology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, Jiangsu, China
| | - Chun Yu
- Department of Gastroenterology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, Jiangsu, China
| | - Guoxin Song
- Department of Pathology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, China, Jiangsu
| | - Xingchen Fan
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu
| | - Shuang Peng
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu
| | - Shiyu Zhang
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu
| | - Xin Zhou
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu
| | - Cheng Zhang
- Department of Science and Technology, the First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, China, Jiangsu
| | - Xiangnan Geng
- Department of Clinical Engineer, the First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, China, Jiangsu
| | - Tongshan Wang
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu
| | - Wenfang Cheng
- Department of Gastroenterology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, Jiangsu, China.
| | - Wei Zhu
- Department of Oncology, the First Affiliated Hospital of Nanjing Medical University, 300 Guangzhou Road, Nanjing, 210029, China, Jiangsu.
| |
Collapse
|
36
|
Su B, Wang W, Lin X, Liu S, Huang X. Identifying the potential miRNA biomarkers based on multi-view networks and reinforcement learning for diseases. Brief Bioinform 2023; 25:bbad427. [PMID: 38018913 PMCID: PMC10753537 DOI: 10.1093/bib/bbad427] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 09/24/2023] [Accepted: 10/30/2023] [Indexed: 11/30/2023] Open
Abstract
MicroRNAs (miRNAs) play important roles in the occurrence and development of diseases. However, it is still challenging to identify the effective miRNA biomarkers for improving the disease diagnosis and prognosis. In this study, we proposed the miRNA data analysis method based on multi-view miRNA networks and reinforcement learning, miRMarker, to define the potential miRNA disease biomarkers. miRMarker constructs the cooperative regulation network and functional similarity network based on the expression data and known miRNA-disease relations, respectively. The cooperative regulation of miRNAs was evaluated by measuring the changes of relative expression. Natural language processing was introduced for calculating the miRNA functional similarity. Then, miRMarker integrates the multi-view miRNA networks and defines the informative miRNA modules through a reinforcement learning strategy. We compared miRMarker with eight efficient data analysis methods on nine transcriptomics datasets to show its superiority in disease sample discrimination. The comparison results suggested that miRMarker outperformed other data analysis methods in receiver operating characteristic analysis. Furthermore, the defined miRNA modules of miRMarker on colorectal cancer data not only show the excellent performance of cancer sample discrimination but also play significant roles in the cancer-related pathway disturbances. The experimental results indicate that miRMarker can build the robust miRNA interaction network by integrating the multi-view networks. Besides, exploring the miRNA interaction network using reinforcement learning favors defining the important miRNA modules. In summary, miRMarker can be a hopeful tool in biomarker identification for human diseases.
Collapse
Affiliation(s)
- Benzhe Su
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Weiwei Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Xiaohui Lin
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Shenglan Liu
- School of Innovation and Entrepreneurship, Dalian University of Technology, Dalian 116024, Liaoning, China
| | - Xin Huang
- School of Mathematics and Information Science, Anshan Normal University, Anshan 114007, Liaoning, China
| |
Collapse
|
37
|
Dou R, Kang S, Yang H, Zhang W, Zhang Y, Liu Y, Ping Y, Pang B. Identifying the driver miRNAs with somatic copy number alterations driving dysregulated ceRNA networks in cancers. Biol Direct 2023; 18:79. [PMID: 37993951 PMCID: PMC10666415 DOI: 10.1186/s13062-023-00438-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/15/2023] [Indexed: 11/24/2023] Open
Abstract
BACKGROUND MicroRNAs (miRNAs) play critical roles in cancer initiation and progression, which were critical components to maintain the dynamic balance of competing endogenous RNA (ceRNA) networks. Somatic copy number alterations (SCNAs) in the cancer genome could disturb the transcriptome level of miRNA to deregulate this balance. However, the driving effects of SCNAs of miRNAs were insufficiently understood. METHODS In this study, we proposed a method to dissect the functional roles of miRNAs under different copy number states and identify driver miRNAs by integrating miRNA SCNAs profile, miRNA-target relationships and expression profiles of miRNA, mRNA and lncRNA. RESULTS Applying our method to 813 TCGA breast cancer (BRCA) samples, we identified 29 driver miRNAs whose SCNAs significantly and concordantly regulated their own expression levels and further inversely dysregulated expression levels of their targets or disturbed the miRNA-target networks they directly involved. Based on miRNA-target networks, we further constructed dynamic ceRNA networks driven by driver SCNAs of miRNAs and identified three different patterns of SCNA interference in the miRNA-mediated dynamic ceRNA networks. Survival analysis of driver miRNAs showed that high-level amplifications of four driver miRNAs (including has-miR-30d-3p, has-mir-30b-5p, has-miR-30d-5p and has-miR-151a-3p) in 8q24 characterized a new BRCA subtype with poor prognosis and contributed to the dysfunction of cancer-associated hallmarks in a complementary way. The SCNAs of driver miRNAs across different cancer types contributed to the cancer development by dysregulating different components of the same cancer hallmarks, suggesting the cancer specificity of driver miRNA. CONCLUSIONS These results demonstrate the efficacy of our method in identifying driver miRNAs and elucidating their functional roles driven by endogenous SCNAs, which is useful for interpreting cancer genomes and pathogenic mechanisms.
Collapse
Affiliation(s)
- Renjie Dou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Shaobo Kang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Huan Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Wanmei Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Yijing Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Yuanyuan Liu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China
| | - Yanyan Ping
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China.
| | - Bo Pang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150081, Heilongjiang, China.
| |
Collapse
|
38
|
Ning Z, Wu J, Ding Y, Wang Y, Peng Q, Fu L. BertNDA: A Model Based on Graph-Bert and Multi-Scale Information Fusion for ncRNA-Disease Association Prediction. IEEE J Biomed Health Inform 2023; 27:5655-5664. [PMID: 37669210 DOI: 10.1109/jbhi.2023.3311808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
Non-coding RNAs (ncRNAs) are a class of RNA molecules that lack the ability to encode proteins in human cells, but play crucial roles in various biological process. Understanding the interactions between different ncRNAs and their impact on diseases can significantly contribute to diagnosis, prevention, and treatment of diseases. However, predicting tertiary interactions between ncRNAs and diseases based on structural information in multiple scales remains a challenging task. To address this challenge, we propose a method called BertNDA, aiming to predict potential relationships between miRNAs, lncRNAs, and diseases. The framework identifies the local information through connectionless subgraph, which aggregate neighbor nodes' feature. And global information is extracted by leveraging Laplace transform of graph structures and WL (Weisfeiler-Lehman) absolute role coding. Additionally, an EMLP (Element-wise MLP) structure is designed to fuse pairwise global information. The transformer-encoder is employed as the backbone of our approach, followed by a prediction-layer to output the final correlation score. Extensive experiments demonstrate that BertNDA outperforms state-of-the-art methods in prediction assignment and exhibits significant potential for various biological applications. Moreover, we develop an online prediction platform that incorporates the prediction model, providing users with an intuitive and interactive experience. Overall, our model offers an efficient, accurate, and comprehensive tool for predicting tertiary associations between ncRNAs and diseases.
Collapse
|
39
|
Sindhoo A, Sipy S, Khan A, Selvaraj G, Alshammari A, Casida ME, Wei DQ. ESOMIR: a curated database of biomarker genes and miRNAs associated with esophageal cancer. Database (Oxford) 2023; 2023:baad063. [PMID: 37815872 PMCID: PMC10563827 DOI: 10.1093/database/baad063] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 07/10/2023] [Accepted: 09/16/2023] [Indexed: 10/12/2023]
Abstract
'Esophageal cancer' (EC) is a highly aggressive and deadly complex disease. It comprises two types, esophageal adenocarcinoma (EAC) and esophageal squamous cell carcinoma (ESCC), with Barrett's esophagus (BE) being the only known precursor. Recent research has revealed that microRNAs (miRNAs) play a crucial role in the development, prognosis and treatment of EC and are involved in various human diseases. Biological databases have become essential for cancer research as they provide information on genes, proteins, pathways and their interactions. These databases collect, store and manage large amounts of molecular data, which can be used to identify patterns, predict outcomes and generate hypotheses. However, no comprehensive database exists for EC and miRNA relationships. To address this gap, we developed a dynamic database named 'ESOMIR (miRNA in esophageal cancer) (https://esomir.dqweilab-sjtu.com)', which includes information about targeted genes and miRNAs associated with EC. The database uses analysis and prediction methods, including experimentally endorsed miRNA(s) information. ESOMIR is a user-friendly interface that allows easy access to EC-associated data by searching for miRNAs, target genes, sequences, chromosomal positions and associated signaling pathways. The search modules are designed to provide specific data access to users based on their requirements. Additionally, the database provides information about network interactions, signaling pathways and region information of chromosomes associated with the 3'untranslated region (3'UTR) or 5'UTR and exon sites. Users can also access energy levels of specific miRNAs with targeted genes. A fuzzy term search is included in each module to enhance the ease of use for researchers. ESOMIR can be a valuable tool for researchers and clinicians to gain insight into EC, including identifying biomarkers and treatments for this aggressive tumor. Database URL https://esomir.dqweilab-sjtu.com.
Collapse
Affiliation(s)
- Asma Sindhoo
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Dongchuan Road Minhang District, Shanghai 200240, PR China
| | - Saima Sipy
- Sindh Madressatul Islam University, Karachi, Sindh 74600, Pakistan
| | - Abbas Khan
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Dongchuan Road Minhang District, Shanghai 200240, PR China
- State Key Laboratory of Microbial Metabolism, Shanghai–Islamabad–Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai, Minhang 200030, PR China
| | - Gurudeeban Selvaraj
- Centre for Research in Molecular Modelling (CERMM), Department of Chemistry and Biochemistry, Concordia University, Montreal, Quebec H4B 1R6, Canada
| | - Abdulrahman Alshammari
- Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia
| | - Mark Earl Casida
- Laboratoire de Spectrom´etrie, Interactions et Chimie th´eorique (SITh), D´epartement de Chimie Mol´eculaire (DCM, UMR CNRS/UGA 5250), Institut de Chimie Mol´eculaire de Grenoble (ICMG, FR2607), Universit´e Grenoble Alpes (UGA), 301 rue de la Chimie BP 53, Grenoble Cedex F-38041, France
| | - Dong-Qing Wei
- Department of Bioinformatics and Biological Statistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Dongchuan Road Minhang District, Shanghai 200240, PR China
- State Key Laboratory of Microbial Metabolism, Shanghai–Islamabad–Belgrade Joint Innovation Center on Antibacterial Resistances, Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education and School of Life Sciences and Biotechnology, Shanghai, Minhang 200030, PR China
- Peng Cheng Laboratory, Phase I Building 8, Xili Street, Montreal, Vanke Cloud City, Nashan District, Shenzhen, Guangdong 518055, PR China
| |
Collapse
|
40
|
Hu X, Liu D, Zhang J, Fan Y, Ouyang T, Luo Y, Zhang Y, Deng L. A comprehensive review and evaluation of graph neural networks for non-coding RNA and complex disease associations. Brief Bioinform 2023; 24:bbad410. [PMID: 37985451 DOI: 10.1093/bib/bbad410] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 10/07/2023] [Accepted: 10/25/2023] [Indexed: 11/22/2023] Open
Abstract
Non-coding RNAs (ncRNAs) play a critical role in the occurrence and development of numerous human diseases. Consequently, studying the associations between ncRNAs and diseases has garnered significant attention from researchers in recent years. Various computational methods have been proposed to explore ncRNA-disease relationships, with Graph Neural Network (GNN) emerging as a state-of-the-art approach for ncRNA-disease association prediction. In this survey, we present a comprehensive review of GNN-based models for ncRNA-disease associations. Firstly, we provide a detailed introduction to ncRNAs and GNNs. Next, we delve into the motivations behind adopting GNNs for predicting ncRNA-disease associations, focusing on data structure, high-order connectivity in graphs and sparse supervision signals. Subsequently, we analyze the challenges associated with using GNNs in predicting ncRNA-disease associations, covering graph construction, feature propagation and aggregation, and model optimization. We then present a detailed summary and performance evaluation of existing GNN-based models in the context of ncRNA-disease associations. Lastly, we explore potential future research directions in this rapidly evolving field. This survey serves as a valuable resource for researchers interested in leveraging GNNs to uncover the complex relationships between ncRNAs and diseases.
Collapse
Affiliation(s)
- Xiaowen Hu
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Dayun Liu
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Jiaxuan Zhang
- Department of Electrical and Computer Engineering, University of California, San Diego,92093 CA, USA
| | - Yanhao Fan
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Tianxiang Ouyang
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Yue Luo
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Yuanpeng Zhang
- school of software, Xinjiang University, 830046 Urumqi, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| |
Collapse
|
41
|
Wang S, Li Y, Zhang Y, Pang S, Qiao S, Zhang Y, Wang F. Generative Adversarial Matrix Completion Network based on Multi-Source Data Fusion for miRNA-Disease Associations Prediction. Brief Bioinform 2023; 24:bbad270. [PMID: 37482409 DOI: 10.1093/bib/bbad270] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Revised: 06/16/2023] [Accepted: 07/04/2023] [Indexed: 07/25/2023] Open
Abstract
Numerous biological studies have shown that considering disease-associated micro RNAs (miRNAs) as potential biomarkers or therapeutic targets offers new avenues for the diagnosis of complex diseases. Computational methods have gradually been introduced to reveal disease-related miRNAs. Considering that previous models have not fused sufficiently diverse similarities, that their inappropriate fusion methods may lead to poor quality of the comprehensive similarity network and that their results are often limited by insufficiently known associations, we propose a computational model called Generative Adversarial Matrix Completion Network based on Multi-source Data Fusion (GAMCNMDF) for miRNA-disease association prediction. We create a diverse network connecting miRNAs and diseases, which is then represented using a matrix. The main task of GAMCNMDF is to complete the matrix and obtain the predicted results. The main innovations of GAMCNMDF are reflected in two aspects: GAMCNMDF integrates diverse data sources and employs a nonlinear fusion approach to update the similarity networks of miRNAs and diseases. Also, some additional information is provided to GAMCNMDF in the form of a 'hint' so that GAMCNMDF can work successfully even when complete data are not available. Compared with other methods, the outcomes of 10-fold cross-validation on two distinct databases validate the superior performance of GAMCNMDF with statistically significant results. It is worth mentioning that we apply GAMCNMDF in the identification of underlying small molecule-related miRNAs, yielding outstanding performance results in this specific domain. In addition, two case studies about two important neoplasms show that GAMCNMDF is a promising prediction method.
Collapse
Affiliation(s)
- ShuDong Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - YunYin Li
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - YuanYuan Zhang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - ShanChen Pang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - SiBo Qiao
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - Yu Zhang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| | - FuYu Wang
- College of Computer Science and Technology, Qingdao Institute of Software, China University of Petroleum (East China), 66 Changjiang Xi Lu, 266580, Shandong, China
| |
Collapse
|
42
|
Wu J, Ning Z, Ding Y, Wang Y, Peng Q, Fu L. KGETCDA: an efficient representation learning framework based on knowledge graph encoder from transformer for predicting circRNA-disease associations. Brief Bioinform 2023; 24:bbad292. [PMID: 37587836 DOI: 10.1093/bib/bbad292] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 07/27/2023] [Accepted: 07/27/2023] [Indexed: 08/18/2023] Open
Abstract
Recent studies have demonstrated the significant role that circRNA plays in the progression of human diseases. Identifying circRNA-disease associations (CDA) in an efficient manner can offer crucial insights into disease diagnosis. While traditional biological experiments can be time-consuming and labor-intensive, computational methods have emerged as a viable alternative in recent years. However, these methods are often limited by data sparsity and their inability to explore high-order information. In this paper, we introduce a novel method named Knowledge Graph Encoder from Transformer for predicting CDA (KGETCDA). Specifically, KGETCDA first integrates more than 10 databases to construct a large heterogeneous non-coding RNA dataset, which contains multiple relationships between circRNA, miRNA, lncRNA and disease. Then, a biological knowledge graph is created based on this dataset and Transformer-based knowledge representation learning and attentive propagation layers are applied to obtain high-quality embeddings with accurately captured high-order interaction information. Finally, multilayer perceptron is utilized to predict the matching scores of CDA based on their embeddings. Our empirical results demonstrate that KGETCDA significantly outperforms other state-of-the-art models. To enhance user experience, we have developed an interactive web-based platform named HNRBase that allows users to visualize, download data and make predictions using KGETCDA with ease. The code and datasets are publicly available at https://github.com/jinyangwu/KGETCDA.
Collapse
Affiliation(s)
- Jinyang Wu
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
| | - Zhiwei Ning
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
| | - Yidong Ding
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
| | - Ying Wang
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
| | - Qinke Peng
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
| | - Laiyi Fu
- School of Automation Science and Engineering, Xi'an Jiaotong University, 710049, Shaanxi, China
- Research Institute of Xi'an Jiaotong University, 311200, Zhejiang, China
- Sichuan Digital Economy Industry Development Research Institute, 610036, Sichuan, China
| |
Collapse
|
43
|
Gao S, Kuang Z, Duan T, Deng L. DEJKMDR: miRNA-disease association prediction method based on graph convolutional network. Front Med (Lausanne) 2023; 10:1234050. [PMID: 37780568 PMCID: PMC10536249 DOI: 10.3389/fmed.2023.1234050] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2023] [Accepted: 08/16/2023] [Indexed: 10/03/2023] Open
Abstract
Numerous studies have shown that miRNAs play a crucial role in the investigation of complex human diseases. Identifying the connection between miRNAs and diseases is crucial for advancing the treatment of complex diseases. However, traditional methods are frequently constrained by the small sample size and high cost, so computational simulations are urgently required to rapidly and accurately forecast the potential correlation between miRNA and disease. In this paper, the DEJKMDR, a graph convolutional network (GCN)-based miRNA-disease association prediction model is proposed. The novelty of this model lies in the fact that DEJKMDR integrates biomolecular information on miRNA and illness, including functional miRNA similarity, disease semantic similarity, and miRNA and disease similarity, according to their Gaussian interaction attribute. In order to minimize overfitting, some edges are randomly destroyed during the training phase after DropEdge has been used to regularize the edges. JK-Net, meanwhile, is employed to combine various domain scopes through the adaptive learning of nodes in various placements. The experimental results demonstrate that this strategy has superior accuracy and dependability than previous algorithms in terms of predicting an unknown miRNA-disease relationship. In a 10-fold cross-validation, the average AUC of DEJKMDR is determined to be 0.9772.
Collapse
Affiliation(s)
- Shiyuan Gao
- School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha, China
| | - Zhufang Kuang
- School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha, China
| | - Tao Duan
- School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
44
|
Sun J, Xu M, Ru J, James-Bott A, Xiong D, Wang X, Cribbs AP. Small molecule-mediated targeting of microRNAs for drug discovery: Experiments, computational techniques, and disease implications. Eur J Med Chem 2023; 257:115500. [PMID: 37262996 PMCID: PMC11554572 DOI: 10.1016/j.ejmech.2023.115500] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 05/05/2023] [Accepted: 05/15/2023] [Indexed: 06/03/2023]
Abstract
Small molecules have been providing medical breakthroughs for human diseases for more than a century. Recently, identifying small molecule inhibitors that target microRNAs (miRNAs) has gained importance, despite the challenges posed by labour-intensive screening experiments and the significant efforts required for medicinal chemistry optimization. Numerous experimentally-verified cases have demonstrated the potential of miRNA-targeted small molecule inhibitors for disease treatment. This new approach is grounded in their posttranscriptional regulation of the expression of disease-associated genes. Reversing dysregulated gene expression using this mechanism may help control dysfunctional pathways. Furthermore, the ongoing improvement of algorithms has allowed for the integration of computational strategies built on top of laboratory-based data, facilitating a more precise and rational design and discovery of lead compounds. To complement the use of extensive pharmacogenomics data in prioritising potential drugs, our previous work introduced a computational approach based on only molecular sequences. Moreover, various computational tools for predicting molecular interactions in biological networks using similarity-based inference techniques have been accumulated in established studies. However, there are a limited number of comprehensive reviews covering both computational and experimental drug discovery processes. In this review, we outline a cohesive overview of both biological and computational applications in miRNA-targeted drug discovery, along with their disease implications and clinical significance. Finally, utilizing drug-target interaction (DTIs) data from DrugBank, we showcase the effectiveness of deep learning for obtaining the physicochemical characterization of DTIs.
Collapse
Affiliation(s)
- Jianfeng Sun
- Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
| | - Miaoer Xu
- Department of Biology, Emory University, Atlanta, GA, 30322, USA
| | - Jinlong Ru
- Chair of Prevention of Microbial Diseases, School of Life Sciences Weihenstephan, Technical University of Munich, Freising, 85354, Germany
| | - Anna James-Bott
- Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK
| | - Dapeng Xiong
- Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
| | - Xia Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
| | - Adam P Cribbs
- Botnar Research Centre, Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
| |
Collapse
|
45
|
Mahajan M, Sarkar A, Mondal S. Cell cycle protein BORA is associated with colorectal cancer progression by AURORA-PLK1 cascades: a bioinformatics analysis. J Cell Commun Signal 2023; 17:773-791. [PMID: 36538275 PMCID: PMC10409947 DOI: 10.1007/s12079-022-00719-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 12/14/2022] [Indexed: 12/24/2022] Open
Abstract
Colorectal cancer (CRC) is the third most diagnosed cancer in the world. A better understanding of the molecular mechanism of CRC is essential for making novel strategies for the CRC management and its prevention. The present study aims to explore the molecular mechanism through integrated bioinformatics analysis by analyzing genes and their co-expression pattern in normal and CRC states. GSE110223, GSE110224 and GSE113513 gene expression profiles were analyzed in this study. The co-expression networks for normal and tumor samples were constructed separately and analyzed to identify the modules, sub-networks and key genes. Gene regulatory network analysis was done to understand the regulatory mechanism of selected genes. Survival analysis was performed for the identified sub-networks and key genes to understand their role in CRC progression. A total of seven modules were detected and the KEGG pathway analysis revealed these modules were mainly enriched with cell cycle, metabolism and signaling-related pathways. E2F6 and ETV4 transcription factors regulating the activity of multiple genes of identified modules were found to be up-regulated in CRC. Six Sub-networks and seven key genes, BORA, CCT7, DTL, RUVBL1, RUVBL2, THEM6 and TMEM97 associated with the CRC progression were identified. Disease-gene association analysis identified a novel association of the BORA gene with CRC that activates and regulates the AURORA-PLK1 cascades in the cell cycle. Survival analysis indicates that the overexpressed BORA is associated with unfavourable overall survival in CRC. The mechanistic role of BORA in the regulation of cell cycle progression suggests that BORA might act as a potential therapeutic target for CRC.
Collapse
Affiliation(s)
- Mohita Mahajan
- Department of Biological Sciences, Birla Institute of Technology and Science, Pilani, K.K. Birla Goa Campus, Zuarinagar, Goa 403726 India
| | - Angshuman Sarkar
- Department of Biological Sciences, Birla Institute of Technology and Science, Pilani, K.K. Birla Goa Campus, Zuarinagar, Goa 403726 India
| | - Sukanta Mondal
- Department of Biological Sciences, Birla Institute of Technology and Science, Pilani, K.K. Birla Goa Campus, Zuarinagar, Goa 403726 India
| |
Collapse
|
46
|
Shen Y, Gao YL, Wang J, Guan BX, Liu JX. Identification of Disease-Associated MicroRNAs Via Locality-Constrained Linear Coding-Based Ensemble Learning. J Comput Biol 2023; 30:926-936. [PMID: 37466461 DOI: 10.1089/cmb.2023.0084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2023] Open
Abstract
Clinical trials indicate that the dysregulation of microRNAs (miRNAs) is closely associated with the development of diseases. Therefore, predicting miRNA-disease associations is significant for studying the pathogenesis of diseases. Since traditional wet-lab methods are resource-intensive, cost-saving computational models can be an effective complementary tool in biological experiments. In this work, a locality-constrained linear coding is proposed to predict associations (ILLCEL). Among them, ILLCEL adopts miRNA sequence similarity, miRNA functional similarity, disease semantic similarity, and interaction profile similarity obtained by locality-constrained linear coding (LLC) as the priori information. Next, features and similarities extracted from multiperspectives are input to the ensemble learning framework to improve the comprehensiveness of the prediction. Significantly, the introduction of hypergraph-regular terms improves the accuracy of prediction by describing complex associations between samples. The results under fivefold cross validation indicate that ILLCEL achieves superior prediction performance. In case studies, known associations are accurately predicted and novel associations are verified in HMDD v3.2, miRCancer, and existing literature. It is concluded that ILLCEL can be served as a powerful tool for inferring potential associations.
Collapse
Affiliation(s)
- Yi Shen
- School of Computer Science, Qufu Normal University, Rizhao, China
| | - Ying-Lian Gao
- Qufu Normal University Library, Qufu Normal University, Rizhao, China
| | - Juan Wang
- School of Computer Science, Qufu Normal University, Rizhao, China
| | - Bo-Xin Guan
- School of Computer Science, Qufu Normal University, Rizhao, China
| | - Jin-Xing Liu
- School of Computer Science, Qufu Normal University, Rizhao, China
| |
Collapse
|
47
|
Jamali E, Safarzadeh A, Hussen BM, Liehr T, Ghafouri-Fard S, Taheri M. Single cell RNA-seq analysis with a systems biology approach to recognize important differentially expressed genes in pancreatic ductal adenocarcinoma compared to adjacent non-cancerous samples by targeting pancreatic endothelial cells. Pathol Res Pract 2023; 248:154614. [PMID: 37329816 DOI: 10.1016/j.prp.2023.154614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/03/2023] [Accepted: 06/10/2023] [Indexed: 06/19/2023]
Abstract
Pancreatic ductal adenocarcinoma (PDAC) is a cancer that is usually diagnosed at late stages. This highly aggressive tumor is resistant to most therapeutic approaches, necessitating identification of differentially expressed genes to design new therapies. Herein, we have analyzed single cell RNA-seq data with a systems biology approach to identify important differentially expressed genes in PDAC samples compared to adjacent non-cancerous samples. Our approach revealed 1462 DEmRNAs, including 1389 downregulated DEmRNAs (like PRSS1 and CLPS) and 73 upregulated DEmRNAs (like HSPA1A and SOCS3), 27 DElncRNAs, including 26 downregulated DElncRNAs (like LINC00472 and SNHG7) and 1 upregulated DElncRNA (SNHG5). We also listed a number of dysregulated signaling pathways, abnormally expressed genes and aberrant cellular functions in PDAC which can be used as possible biomarkers and therapeutic targets in this type of cancer.
Collapse
Affiliation(s)
- Elena Jamali
- Department of Pathology, Loghman Hakim Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Arash Safarzadeh
- Phytochemistry Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Bashdar Mahmud Hussen
- Department of Clinical Analysis, College of Pharmacy, Hawler Medical University, Kurdistan Region, Iraq
| | - Thomas Liehr
- Institute of Human Genetics, Jena University Hospital, Jena, Germany.
| | - Soudeh Ghafouri-Fard
- Department of Medical Genetics, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Mohammad Taheri
- Institute of Human Genetics, Jena University Hospital, Jena, Germany; Urology and Nephrology Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
48
|
He Q, Qiao W, Fang H, Bao Y. Improving the identification of miRNA-disease associations with multi-task learning on gene-disease networks. Brief Bioinform 2023; 24:bbad203. [PMID: 37287133 DOI: 10.1093/bib/bbad203] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 04/24/2023] [Accepted: 05/10/2023] [Indexed: 06/09/2023] Open
Abstract
MicroRNAs (miRNAs) are a family of non-coding RNA molecules with vital roles in regulating gene expression. Although researchers have recognized the importance of miRNAs in the development of human diseases, it is very resource-consuming to use experimental methods for identifying which dysregulated miRNA is associated with a specific disease. To reduce the cost of human effort, a growing body of studies has leveraged computational methods for predicting the potential miRNA-disease associations. However, the extant computational methods usually ignore the crucial mediating role of genes and suffer from the data sparsity problem. To address this limitation, we introduce the multi-task learning technique and develop a new model called MTLMDA (Multi-Task Learning model for predicting potential MicroRNA-Disease Associations). Different from existing models that only learn from the miRNA-disease network, our MTLMDA model exploits both miRNA-disease and gene-disease networks for improving the identification of miRNA-disease associations. To evaluate model performance, we compare our model with competitive baselines on a real-world dataset of experimentally supported miRNA-disease associations. Empirical results show that our model performs best using various performance metrics. We also examine the effectiveness of model components via ablation study and further showcase the predictive power of our model for six types of common cancers. The data and source code are available from https://github.com/qwslle/MTLMDA.
Collapse
Affiliation(s)
- Qiang He
- College of Medicine and Biological Information Engineering, Northeastern University, 110169 Shenyang, China
| | - Wei Qiao
- College of Medicine and Biological Information Engineering, Northeastern University, 110169 Shenyang, China
| | - Hui Fang
- Research Institute for Interdisciplinary Science and School of Information Management and Engineering, Shanghai University of Finance and Economics, 200434 Shanghai, China
| | - Yang Bao
- Antai College of Economics and Management, Shanghai Jiao Tong University, 200030 Shanghai, China
| |
Collapse
|
49
|
Firouzjaei AA, Sharifi K, Khazaei M, Mohammadi-Yeganeh S, Aghaee-Bakhtiari SH. Screening and introduction of key cell cycle microRNAs deregulated in colorectal cancer by integrated bioinformatics analysis. Chem Biol Drug Des 2023; 102:137-152. [PMID: 37081586 DOI: 10.1111/cbdd.14242] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 03/05/2023] [Accepted: 04/03/2023] [Indexed: 04/22/2023]
Abstract
Colorectal cancer (CRC) is the second most common cancer in women and the third most common in men worldwide. Impaired cell cycle regulation leads to many cancers and is also approved in CRC. Therefore, cell cycle regulation is a critical therapeutic target for CRC. Furthermore, miRNAs have been discovered as regulators in a variety of cancer-related pathways. This study is designed to investigate how miRNAs and mRNAs interact to regulate the cell cycle in CRC patients. Utilizing the Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Expression Omnibus (GEO), and Therapeutic Target Database (TTD), cell cycle-associated genes were identified and evaluated. Seven of the 22 differentially expressed genes (DEGs) implicated in the cell cycle in three GSEs (GSE24514, GSE10950, and GSE74604) were identified as potential therapeutic targets. Then, using PyRx software, we performed docking proteins with selected drugs. The results demonstrated that these drugs are appropriate molecules for targeting cell cycle DEGs. Tarbase, miRTarbase, miRDIP, and miRCancer databases were used to find miRNAs that target the indicated genes. The ability of these six miRNAs to impact the cell cycle in colorectal cancer may be concluded. These miRNAs were found to be downregulated in SW480 cells when compared to the normal tissue. Our data imply that a precise selection of bioinformatics tools can facilitate the identification of miRNAs that impact mRNA translation at different stages of the cell cycle. The candidates can be investigated more as targets for cell cycle arrest in cancers.
Collapse
Affiliation(s)
- Ali Ahmadizad Firouzjaei
- Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Kazem Sharifi
- Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Majid Khazaei
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
- Department of Medical Physiology, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Samira Mohammadi-Yeganeh
- Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Cellular and Molecular Biology Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Seyed Hamid Aghaee-Bakhtiari
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad, Iran
- Department of Medical Biotechnology, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| |
Collapse
|
50
|
Darvish L, Bahreyni-Toossi MT, Aghaee-Bakhtiari SH, Firouzjaei AA, Amraee A, Tarighatnia A, Azimian H. Inducing apoptosis by using microRNA in radio-resistant prostate cancer: an in-silico study with an in-vitro validation. Mol Biol Rep 2023:10.1007/s11033-023-08545-8. [PMID: 37294470 DOI: 10.1007/s11033-023-08545-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 05/22/2023] [Indexed: 06/10/2023]
Abstract
BACKGROUND One of the problems with radiation therapy (RT) is that prostate tumor cells are often radio-resistant, which results in treatment failure. This study aimed to determine the procedure involved in radio-resistant prostate cancer apoptosis. For a deeper insight, we devoted a novel bioinformatics approach to analyze the targeting between microRNAs and radio-resistant prostate cancer genes. METHOD This study uses the Tarbase, and the Mirtarbase databases as validated experimental databases and mirDIP as a predicted database to identify microRNAs that target radio-resistant anti-apoptotic genes. These genes are used to construct the radio-resistant prostate cancer genes network using the online tool STRING. The validation of causing apoptosis by using microRNA was confirmed with flow cytometry of Annexin V. RESULTS The anti-apoptotic gene of radio-resistant prostate cancer included BCL-2, MCL1, XIAP, STAT3, NOTCH1, REL, REL B, BIRC3, and AKT1 genes. These genes were identified as anti-apoptotic genes for radio-resistant prostate cancer. The crucial microRNA that knockdown all of these genes was hsa-miR-7-5p. The highest rate of apoptotic cells in a cell transfected with hsa-miR-7-5p was (32.90 ± 1.49), plenti III (21.99 ± 3.72), and the control group (5.08 ± 0.88) in 0 Gy (P < 0.001); also, this rate was in miR-7-5p (47.01 ± 2.48), plenti III (33.79 ± 3.40), and the control group (16.98 ± 3.11) (P < 0.001) for 4 Gy. CONCLUSION The use of this new treatment such as gene therapy to suppress genes involved in apoptosis can help to improve the treatment results and increase the quality of life of patients with prostate cancer.
Collapse
Affiliation(s)
- Leili Darvish
- Department of Medical Physics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | | | - Seyed Hamid Aghaee-Bakhtiari
- Department of Medical Biotechnology and Nanotechnology, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Bioinformatics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Ali Ahmadizad Firouzjaei
- Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Azadeh Amraee
- Department of Medical Physics, Faculty of Medicine, School of Medicine, Lorestan University of Medical Sciences, khorramabad, Iran
| | - Ali Tarighatnia
- Department of Medical Physics, Faculty of Medicine, Ardabil University of Medical Sciences, Ardabil, Iran
| | - Hosein Azimian
- Medical Physics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.
| |
Collapse
|