1
|
Liu T, Chen Q, Liu R, Sun Y, Wang Y, Zhu Y, Zhao T. DMGAT: predicting ncRNA-drug resistance associations based on diffusion map and heterogeneous graph attention network. Brief Bioinform 2025; 26:bbaf179. [PMID: 40251829 PMCID: PMC12008124 DOI: 10.1093/bib/bbaf179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2025] [Revised: 03/26/2025] [Accepted: 03/30/2025] [Indexed: 04/21/2025] Open
Abstract
Non-coding RNAs (ncRNAs) play crucial roles in drug resistance and sensitivity, making them important biomarkers and therapeutic targets. However, predicting ncRNA-drug associations is challenging due to issues such as dataset imbalance and sparsity, limiting the identification of robust biomarkers. Existing models often fall short in capturing local and global sequence information, limiting the reliability of predictions. This study introduces DMGAT (diffusion map and heterogeneous graph attention network), a novel deep learning model designed to predict ncRNA-drug associations. DMGAT integrates diffusion maps for sequence embedding, graph convolutional networks for feature extraction, and GAT for heterogeneous information fusion. To address dataset imbalance, the model incorporates sensitivity associations and employs a random forest classifier to select reliable negative samples. DMGAT embeds ncRNA sequences and drug SMILES using the word2vec technique, capturing local and global sequence information. The model constructs a heterogeneous network by combining sequence similarity and Gaussian Interaction Profile kernel similarity, providing a comprehensive representation of ncRNA-drug interactions. Evaluated through five-fold cross-validation on a curated dataset from NoncoRNA and ncDR, DMGAT outperforms seven state-of-the-art methods, achieving the highest area under the receiver operating characteristic curve (0.8964), area under the precision-recall curve (0.8984), recall (0.9576), and F1-score (0.8285). The raw data are released to Zenodo with identifier 13929676. The source code of DMGAT is available at https://github.com/liutingyu0616/DMGAT/tree/main.
Collapse
Affiliation(s)
- Tingyu Liu
- School of Medicine and Heath, Harbin Institute of Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, China
| | - Qiuhao Chen
- Zhengzhou Research Institute, Harbin Instituteof Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China
| | - Renjie Liu
- Zhengzhou Research Institute, Harbin Instituteof Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China
| | - Yuzhi Sun
- School of Computer Science and Technology, Harbin Institute of Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China
| | - Yadong Wang
- School of Computer Science and Technology, Harbin Institute of Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China
| | - Yan Zhu
- College of Veterinary Medicine, Northeast Agricultural University, 150038, Xiangfang District, Changjiang Road No. 600, Harbin, China
| | - Tianyi Zhao
- School of Medicine and Heath, Harbin Institute of Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, China
- Zhengzhou Research Institute, Harbin Instituteof Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China
| |
Collapse
|
2
|
Aswathy R, Chalos VA, Suganya K, Sumathi S. Advancing miRNA cancer research through artificial intelligence: from biomarker discovery to therapeutic targeting. Med Oncol 2024; 42:30. [PMID: 39688780 DOI: 10.1007/s12032-024-02579-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Accepted: 12/03/2024] [Indexed: 12/18/2024]
Abstract
MicroRNAs (miRNAs), a class of small non-coding RNAs, play a vital role in regulating gene expression at the post-transcriptional level. Their discovery has profoundly impacted therapeutic strategies, particularly in cancer treatment, where RNA therapeutics, including miRNA-based targeted therapies, have gained prominence. Advances in RNA sequencing technologies have facilitated a comprehensive exploration of miRNAs-from fundamental research to their diagnostic and prognostic potential in various diseases, notably cancers. However, the manual handling and interpretation of vast RNA datasets pose significant challenges. The advent of artificial intelligence (AI) has revolutionized biological research by efficiently extracting insights from complex data. Machine learning algorithms, particularly deep learning techniques are effective for identifying critical miRNAs across different cancers and developing prognostic models. Moreover, the integration of AI has led to the creation of comprehensive miRNA databases for identifying mRNA and gene targets, thus facilitating deeper understanding and application in cancer research. This review comprehensively examines current developments in the application of machine learning techniques in miRNA research across diverse cancers. We discuss their roles in identifying biomarkers, elucidating miRNA targets, establishing disease associations, predicting prognostic outcomes, and exploring broader AI applications in cancer research. This review aims to guide researchers in leveraging AI techniques effectively within the miRNA field, thereby accelerating advancements in cancer diagnostics and therapeutics.
Collapse
Affiliation(s)
- Raghu Aswathy
- Department of Biochemistry, Biotechnology and Bioinformatics, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, 641043, India
| | - Varghese Angel Chalos
- Department of Biochemistry, Biotechnology and Bioinformatics, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, 641043, India
| | - Kanagaraj Suganya
- Department of Biochemistry, Biotechnology and Bioinformatics, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, 641043, India
| | - Sundaravadivelu Sumathi
- Department of Biochemistry, Biotechnology and Bioinformatics, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, 641043, India.
| |
Collapse
|
3
|
Panicucci C, Sahin E, Bartolucci M, Casalini S, Brolatti N, Pedemonte M, Baratto S, Pintus S, Principi E, D'Amico A, Pane M, Sframeli M, Messina S, Albamonte E, Sansone VA, Mercuri E, Bertini E, Sezerman U, Petretto A, Bruno C. Proteomics profiling and machine learning in nusinersen-treated patients with spinal muscular atrophy. Cell Mol Life Sci 2024; 81:393. [PMID: 39254732 PMCID: PMC11387582 DOI: 10.1007/s00018-024-05426-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Revised: 08/11/2024] [Accepted: 08/25/2024] [Indexed: 09/11/2024]
Abstract
AIM The availability of disease-modifying therapies and newborn screening programs for spinal muscular atrophy (SMA) has generated an urgent need for reliable prognostic biomarkers to classify patients according to disease severity. We aim to identify cerebrospinal fluid (CSF) prognostic protein biomarkers in CSF samples of SMA patients collected at baseline (T0), and to describe proteomic profile changes and biological pathways influenced by nusinersen before the sixth nusinersen infusion (T302). METHODS In this multicenter retrospective longitudinal study, we employed an untargeted liquid chromatography mass spectrometry (LC-MS)-based proteomic approach on CSF samples collected from 61 SMA patients treated with nusinersen (SMA1 n=19, SMA2 n=19, SMA3 n=23) at T0 at T302. The Random Forest (RF) machine learning algorithm and pathway enrichment analysis were applied for analysis. RESULTS The RF algorithm, applied to the protein expression profile of naïve patients, revealed several proteins that could classify the different types of SMA according to their differential abundance at T0. Analysis of changes in proteomic profiles identified a total of 147 differentially expressed proteins after nusinersen treatment in SMA1, 135 in SMA2, and 289 in SMA3. Overall, nusinersen-induced changes on proteomic profile were consistent with i) common effects observed in allSMA types (i.e. regulation of axonogenesis), and ii) disease severity-specific changes, namely regulation of glucose metabolism in SMA1, of coagulation processes in SMA2, and of complement cascade in SMA3. CONCLUSIONS This untargeted LC-MS proteomic profiling in the CSF of SMA patients revealed differences in protein expression in naïve patients and showed nusinersen-related modulation in several biological processes after 10 months of treatment. Further confirmatory studies are needed to validate these results in larger number of patients and over abroader timeframe.
Collapse
Affiliation(s)
- Chiara Panicucci
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Eray Sahin
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Turkey
| | - Martina Bartolucci
- Core Facilities-Clinical Proteomics and Metabolomics, IRCCS Istituto Giannina Gaslini, Genova, Italy
| | - Sara Casalini
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Noemi Brolatti
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Marina Pedemonte
- Pediatric Neurology Unit, IRCCS Istituto Giannina Gaslini, Genova, Italy
| | - Serena Baratto
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Sara Pintus
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Elisa Principi
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy
| | - Adele D'Amico
- Unit of Neuromuscular and Neurodegenerative Disorders, IRCCS Bambino Gesù Children's Hospital, Rome, Italy
| | - Marika Pane
- Centro Clinico Nemo, IRCCS Fondazione Policlinico Universitario Agostino Gemelli, Rome, Italy
| | - Marina Sframeli
- Department of Neurosciences, University of Messina, Messina, Italy
| | - Sonia Messina
- Department of Neurosciences, University of Messina, Messina, Italy
| | - Emilio Albamonte
- Neurorehabilitation Unit, Centro Clinico NeMO, University of Milan, Milan, Italy
| | - Valeria A Sansone
- Neurorehabilitation Unit, Centro Clinico NeMO, University of Milan, Milan, Italy
| | - Eugenio Mercuri
- Centro Clinico Nemo, IRCCS Fondazione Policlinico Universitario Agostino Gemelli, Rome, Italy
| | - Enrico Bertini
- Unit of Neuromuscular and Neurodegenerative Disorders, IRCCS Bambino Gesù Children's Hospital, Rome, Italy
| | - Ugur Sezerman
- Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Turkey
| | - Andrea Petretto
- Core Facilities-Clinical Proteomics and Metabolomics, IRCCS Istituto Giannina Gaslini, Genova, Italy
| | - Claudio Bruno
- Center of Translational and Experimental Myology, IRCCS Istituto Giannina Gaslini, Via G. Gaslini, 5, I-16147, Genova, Italy.
- Department of Neuroscience, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health- DINOGMI, University of Genova, Genova, Italy.
| |
Collapse
|
4
|
Daniel Thomas S, Vijayakumar K, John L, Krishnan D, Rehman N, Revikumar A, Kandel Codi JA, Prasad TSK, S S V, Raju R. Machine Learning Strategies in MicroRNA Research: Bridging Genome to Phenome. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2024; 28:213-233. [PMID: 38752932 DOI: 10.1089/omi.2024.0047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2024]
Abstract
MicroRNAs (miRNAs) have emerged as a prominent layer of regulation of gene expression. This article offers the salient and current aspects of machine learning (ML) tools and approaches from genome to phenome in miRNA research. First, we underline that the complexity in the analysis of miRNA function ranges from their modes of biogenesis to the target diversity in diverse biological conditions. Therefore, it is imperative to first ascertain the miRNA coding potential of genomes and understand the regulatory mechanisms of their expression. This knowledge enables the efficient classification of miRNA precursors and the identification of their mature forms and respective target genes. Second, and because one miRNA can target multiple mRNAs and vice versa, another challenge is the assessment of the miRNA-mRNA target interaction network. Furthermore, long-noncoding RNA (lncRNA)and circular RNAs (circRNAs) also contribute to this complexity. ML has been used to tackle these challenges at the high-dimensional data level. The present expert review covers more than 100 tools adopting various ML approaches pertaining to, for example, (1) miRNA promoter prediction, (2) precursor classification, (3) mature miRNA prediction, (4) miRNA target prediction, (5) miRNA- lncRNA and miRNA-circRNA interactions, (6) miRNA-mRNA expression profiling, (7) miRNA regulatory module detection, (8) miRNA-disease association, and (9) miRNA essentiality prediction. Taken together, we unpack, critically examine, and highlight the cutting-edge synergy of ML approaches and miRNA research so as to develop a dynamic and microlevel understanding of human health and diseases.
Collapse
Affiliation(s)
- Sonet Daniel Thomas
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Krithika Vijayakumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Levin John
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Deepak Krishnan
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Niyas Rehman
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Amjesh Revikumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Kerala Genome Data Centre, Kerala Development and Innovation Strategic Council, Thiruvananthapuram, Kerala, India
| | - Jalaluddin Akbar Kandel Codi
- Department of Surgical Oncology, Yenepoya Medical College, Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | | | - Vinodchandra S S
- Department of Computer Science, University of Kerala, Thiruvananthapuram, Kerala, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| |
Collapse
|
5
|
Oh YM, Lee SW. Patient-derived neuron model: Capturing age-dependent adult-onset degenerative pathology in Huntington's disease. Mol Cells 2024; 47:100046. [PMID: 38492889 PMCID: PMC11021366 DOI: 10.1016/j.mocell.2024.100046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 02/27/2024] [Accepted: 03/08/2024] [Indexed: 03/18/2024] Open
Abstract
MicroRNAs play a crucial role in directly reprogramming (converting) human fibroblasts into neurons. Specifically, miR-9/9* and miR-124 (miR-9/9*-124) display neurogenic and cell fate-switching activities when ectopically expressed in human fibroblasts by erasing fibroblast identity and inducing a pan-neuronal state. These converted neurons maintain the biological age of the starting fibroblasts and thus provide a human neuron-based platform to study cellular properties in aged neurons and model adult-onset neurodegenerative disorders using patient-derived cells. Furthermore, the expression of striatal-enriched transcription factors in conjunction with miR-9/9*-124 guides the identity of medium spiny neurons (MSNs), the primary targets in Huntington's disease (HD). Converted MSNs from HD patient-derived fibroblasts (HD-MSNs) can replicate HD-related phenotypes including neurodegeneration associated with age-related declines in critical cellular functions such as autophagy. Here, we review the role of microRNAs in the direct conversion of patient-derived fibroblasts into MSNs and the practical application of converted HD-MSNs as a model for studying adult-onset neuropathology in HD. We provide valuable insights into age-related, cell-intrinsic changes contributing to neurodegeneration in HD-MSNs. Ultimately, we address a comprehensive understanding of the complex molecular landscape underlying HD pathology, offering potential avenues for therapeutic application.
Collapse
Affiliation(s)
- Young Mi Oh
- Department of Biomedical Sciences, Mercer University School of Medicine, Columbus, GA 31901, USA
| | - Seong Won Lee
- Department of Biomedical Sciences, Mercer University School of Medicine, Columbus, GA 31901, USA
| |
Collapse
|