1
|
Khemka N, Morris G, Kazemzadeh L, Costard LS, Neubert V, Bauer S, Rosenow F, Venø MT, Kjems J, Henshall DC, Prehn JHM, Connolly NMC. Integrative network analysis of miRNA-mRNA expression profiles during epileptogenesis in rats reveals therapeutic targets after emergence of first spontaneous seizure. Sci Rep 2024; 14:15313. [PMID: 38961125 PMCID: PMC11222454 DOI: 10.1038/s41598-024-66117-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 06/27/2024] [Indexed: 07/05/2024] Open
Abstract
Epileptogenesis is the process by which a normal brain becomes hyperexcitable and capable of generating spontaneous recurrent seizures. The extensive dysregulation of gene expression associated with epileptogenesis is shaped, in part, by microRNAs (miRNAs) - short, non-coding RNAs that negatively regulate protein levels. Functional miRNA-mediated regulation can, however, be difficult to elucidate due to the complexity of miRNA-mRNA interactions. Here, we integrated miRNA and mRNA expression profiles sampled over multiple time-points during and after epileptogenesis in rats, and applied bi-clustering and Bayesian modelling to construct temporal miRNA-mRNA-mRNA interaction networks. Network analysis and enrichment of network inference with sequence- and human disease-specific information identified key regulatory miRNAs with the strongest influence on the mRNA landscape, and miRNA-mRNA interactions closely associated with epileptogenesis and subsequent epilepsy. Our findings underscore the complexity of miRNA-mRNA regulation, can be used to prioritise miRNA targets in specific systems, and offer insights into key regulatory processes in epileptogenesis with therapeutic potential for further investigation.
Collapse
Affiliation(s)
- Niraj Khemka
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Gareth Morris
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland
- Neuroscience, Physiology and Pharmacology, University College London, London, UK
- Division of Neuroscience, University of Manchester, Manchester, UK
| | - Laleh Kazemzadeh
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Lara S Costard
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Valentin Neubert
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Sebastian Bauer
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Felix Rosenow
- Epilepsy Center, Department of Neurology, Philipps University Marburg, Marburg, Germany
- Epilepsy Center Frankfurt Rhine-Main, Neurocenter, University Hospital Frankfurt and Center for Personalized Translational Epilepsy Research, Goethe-University, Frankfurt, Germany
| | - Morten T Venø
- Interdisciplinary Nanoscience Center, Dept. of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
- Omiics ApS, Aarhus, Denmark
| | - Jørgen Kjems
- Interdisciplinary Nanoscience Center, Dept. of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - David C Henshall
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland
| | - Jochen H M Prehn
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
| | - Niamh M C Connolly
- Centre for Systems Medicine & Dept. of Physiology & Medical Physics, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
- FutureNeuro SFI Research Centre, RCSI University of Medicine and Health Sciences, Dublin, Ireland.
| |
Collapse
|
2
|
Arora A, Tsigelny IF, Kouznetsova VL. Laryngeal cancer diagnosis via miRNA-based decision tree model. Eur Arch Otorhinolaryngol 2024; 281:1391-1399. [PMID: 38147113 DOI: 10.1007/s00405-023-08383-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 11/27/2023] [Indexed: 12/27/2023]
Abstract
PURPOSE Laryngeal cancer (LC) is the most common head and neck cancer, which often goes undiagnosed due to the inaccessible nature of current diagnosis methods in some parts of the world. Many recent studies have shown that microRNAs (miRNAs) are crucial biomarkers for a variety of cancers. METHODS In this study, we create a decision tree model for the diagnosis of laryngeal cancer using a created series of miRNA attributes, such as sequence-based characteristics, predicted miRNA target genes, and gene pathways. This series of attributes is extracted from both differentially expressed blood-based miRNAs in laryngeal cancer and random, non-associated with cancer miRNAs. RESULTS Several machine-learning (ML) algorithms were tested in the ML model, and the Hoeffding Tree classifier yields the highest accuracy (86.8%) in miRNAs-based recognition of laryngeal cancer. Furthermore, our model is validated with the independent laryngeal cancer datasets and can accurately diagnose laryngeal cancer with 86% accuracy. We also explored the biological relationships of the attributes used in our model to understand their relationship with cancer proliferation or suppression pathways. CONCLUSION Our study demonstrates that the proposed model and an inexpensive miRNA testing strategy have the potential to serve as an additional method for diagnosing laryngeal cancer.
Collapse
Affiliation(s)
- Aarav Arora
- REHS Program, San Diego Supercomputer Center, UC San Diego, La Jolla, CA, USA
| | - Igor F Tsigelny
- San Diego Supercomputer Center, UC San Diego, La Jolla, CA, USA.
- BiAna, La Jolla, CA, USA.
- Department of Neurosciences, UC San Diego, La Jolla, CA, USA.
- CureScience, San Diego, CA, USA.
| | - Valentina L Kouznetsova
- San Diego Supercomputer Center, UC San Diego, La Jolla, CA, USA
- BiAna, La Jolla, CA, USA
- CureScience, San Diego, CA, USA
| |
Collapse
|
3
|
Chen J, Han G, Xu A, Akutsu T, Cai H. Identifying miRNA-Gene Common and Specific Regulatory Modules for Cancer Subtyping by a High-Order Graph Matching Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:421-431. [PMID: 35320104 DOI: 10.1109/tcbb.2022.3161635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Identifying regulatory modules between miRNAs and genes is crucial in cancer research. It promotes a comprehensive understanding of the molecular mechanisms of cancer. The genomic data collected from subjects usually relate to different cancer statuses, such as different TNM Classifications of Malignant Tumors (TNM) or histological subtypes. Simple integrated analyses generally identify the core of the tumorigenesis (common modules) but miss the subtype-specific regulatory mechanisms (specific modules). In contrast, separate analyses can only report the differences and ignore important common modules. Therefore, there is an urgent need to develop a novel method to jointly analyze miRNA and gene data of different cancer statuses to identify common and specific modules. To that end, we developed a High-Order Graph Matching model to identify Common and Specific modules (HOGMCS) between miRNA and gene data of different cancer statuses. We first demonstrate the superiority of HOGMCS through a comparison with four state-of-the-art techniques using a set of simulated data. Then, we apply HOGMCS on stomach adenocarcinoma data with four TNM stages and two histological types, and breast invasive carcinoma data with four PAM50 subtypes. The experimental results demonstrate that HOGMCS can accurately extract common and subtype-specific miRNA-gene regulatory modules, where many identified miRNA-gene interactions have been confirmed in several public databases.
Collapse
|
4
|
Lu L, Xu A, Gao F, Tian C, Wang H, Zhang J, Xie Y, Liu P, Liu S, Yang C, Ye Z, Wu X. Mesenchymal Stem Cell-Derived Exosomes as a Novel Strategy for the Treatment of Intervertebral Disc Degeneration. Front Cell Dev Biol 2022; 9:770510. [PMID: 35141231 PMCID: PMC8818990 DOI: 10.3389/fcell.2021.770510] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 12/31/2021] [Indexed: 12/15/2022] Open
Abstract
Intervertebral disc degeneration (IVDD) has been reported to be the most prevalent contributor to low back pain, posing a significant strain on the healthcare systems on a global scale. Currently, there are no approved therapies available for the prevention of the progressive degeneration of intervertebral disc (IVD); however, emerging regenerative strategies that aim to restore the normal structure of the disc have been fundamentally promising. In the last decade, mesenchymal stem cells (MSCs) have received a significant deal of interest for the treatment of IVDD due to their differentiation potential, immunoregulatory capabilities, and capability to be cultured and regulated in a favorable environment. Recent investigations show that the pleiotropic impacts of MSCs are regulated by the production of soluble paracrine factors. Exosomes play an important role in regulating such effects. In this review, we have summarized the current treatments for disc degenerative diseases and their limitations and highlighted the therapeutic role and its underlying mechanism of MSC-derived exosomes in IVDD, as well as the possible future developments for exosomes.
Collapse
Affiliation(s)
- Lin Lu
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Aoshuang Xu
- Institute of Hematology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Fei Gao
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Chenjun Tian
- The First Clinical Medical College of Lanzhou University, Lanzhou, China
- The First Hospital of Lanzhou University, Lanzhou, China
| | - Honglin Wang
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Jiayao Zhang
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yi Xie
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Pengran Liu
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Songxiang Liu
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Cao Yang
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Zhewei Ye
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- *Correspondence: Zhewei Ye, ; Xinghuo Wu,
| | - Xinghuo Wu
- Department of Orthopaedics Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- *Correspondence: Zhewei Ye, ; Xinghuo Wu,
| |
Collapse
|
5
|
Zhu F, Li J, Liu J, Min W. Network-based cancer genomic data integration for pattern discovery. BMC Genom Data 2021; 22:54. [PMID: 34886811 PMCID: PMC8662848 DOI: 10.1186/s12863-021-01004-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. RESULTS In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. CONCLUSIONS All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer.
Collapse
Affiliation(s)
- Fangfang Zhu
- State Key Laboratory of Nuclear Resources and Environment and School of Water Resources and Environmental Engineering, East China University of Technology, Nanchang, 330013, China
- State Key Laboratory of Nuclear Resources and Environment and School of Chemistry, Biology and Materials Science, East China University of Technology, Nanchang, 330013, China
| | - Jiang Li
- State Key Laboratory of Nuclear Resources and Environment and School of Chemistry, Biology and Materials Science, East China University of Technology, Nanchang, 330013, China.
| | - Juan Liu
- School of Computer Science, Wuhan University, Wuhan, 430072, China.
| | - Wenwen Min
- School of Mathematics and Computer Science, Jiangxi Science and Technology Normal University, Nanchang, 330038, China.
- Information School, Yunnan University, Kunming, 650091, China.
| |
Collapse
|
6
|
Soremekun O, Ezenwa C, Paimo O, Madu C, Omotoso O, Akinifesi O, Ntuenibok N. Integrative analysis of microRNA expression level profile identifies microRNAs associated with prostate cancer. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2021.101376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
7
|
Wani N, Barh D, Raza K. Modular network inference between miRNA-mRNA expression profiles using weighted co-expression network analysis. J Integr Bioinform 2021; 18:20210029. [PMID: 34800012 PMCID: PMC8709739 DOI: 10.1515/jib-2021-0029] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 10/20/2021] [Accepted: 10/28/2021] [Indexed: 12/14/2022] Open
Abstract
Connecting transcriptional and post-transcriptional regulatory networks solves an important puzzle in the elucidation of gene regulatory mechanisms. To decipher the complexity of these connections, we build co-expression network modules for mRNA as well as miRNA expression profiles of breast cancer data. We construct gene and miRNA co-expression modules using the weighted gene co-expression network analysis (WGCNA) method and establish the significance of these modules (Genes/miRNAs) for cancer phenotype. This work also infers an interaction network between the genes of the turquoise module from mRNA expression data and hubs of the turquoise module from miRNA expression data. A pathway enrichment analysis using a miRsystem web tool for miRNA hubs and some of their targets, reveal their enrichment in several important pathways associated with the progression of cancer.
Collapse
Affiliation(s)
- Nisar Wani
- Computer Science and Engineering Department, Govt. College of Engineering and Technology Safapora, Ganderbal Kashmir, J&K, India
| | - Debmalya Barh
- Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, WB, India
- Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Khalid Raza
- Department of Computer Science, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
8
|
TSCCA: A tensor sparse CCA method for detecting microRNA-gene patterns from multiple cancers. PLoS Comput Biol 2021; 17:e1009044. [PMID: 34061840 PMCID: PMC8195367 DOI: 10.1371/journal.pcbi.1009044] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Revised: 06/11/2021] [Accepted: 05/05/2021] [Indexed: 12/22/2022] Open
Abstract
Existing studies have demonstrated that dysregulation of microRNAs (miRNAs or miRs) is involved in the initiation and progression of cancer. Many efforts have been devoted to identify microRNAs as potential biomarkers for cancer diagnosis, prognosis and therapeutic targets. With the rapid development of miRNA sequencing technology, a vast amount of miRNA expression data for multiple cancers has been collected. These invaluable data repositories provide new paradigms to explore the relationship between miRNAs and cancer. Thus, there is an urgent need to explore the complex cancer-related miRNA-gene patterns by integrating multi-omics data in a pan-cancer paradigm. In this study, we present a tensor sparse canonical correlation analysis (TSCCA) method for identifying cancer-related miRNA-gene modules across multiple cancers. TSCCA is able to overcome the drawbacks of existing solutions and capture both the cancer-shared and specific miRNA-gene co-expressed modules with better biological interpretations. We comprehensively evaluate the performance of TSCCA using a set of simulated data and matched miRNA/gene expression data across 33 cancer types from the TCGA database. We uncover several dysfunctional miRNA-gene modules with important biological functions and statistical significance. These modules can advance our understanding of miRNA regulatory mechanisms of cancer and provide insights into miRNA-based treatments for cancer. MicroRNAs (miRNAs) are a class of small non-coding RNAs. Previous studies have revealed that miRNA-gene regulatory modules play key roles in the occurrence and development of cancer. However, little has been done to discover miRNA-gene regulatory modules from a pan-cancer view. Thus, it is urgently needed to develop new methods to explore the complex cancer-related miRNA-gene patterns by integrating multi-omics data of multi-cancers. To build the connections between miRNA-gene regulatory modules across different cancer types, we propose a tensor sparse canonical correlation analysis (TSCCA) method. Our specific contributions are two-fold: (1) We propose a sparse statistical learning model TSCCA and an efficient block-coordinate descent algorithm to solve it. (2) We apply TSCCA to a multi-omics data set of 33 cancer types from TCGA and identify some cancer-related miRNA-gene modules with important biological functions and statistical significance.
Collapse
|
9
|
Fang R, Yang H, Gao Y, Cao H, Goode EL, Cui Y. Gene-based mediation analysis in epigenetic studies. Brief Bioinform 2021; 22:bbaa113. [PMID: 32608480 PMCID: PMC8660163 DOI: 10.1093/bib/bbaa113] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 04/07/2020] [Accepted: 05/12/2020] [Indexed: 12/15/2022] Open
Abstract
Mediation analysis has been a useful tool for investigating the effect of mediators that lie in the path from the independent variable to the outcome. With the increasing dimensionality of mediators such as in (epi)genomics studies, high-dimensional mediation model is needed. In this work, we focus on epigenetic studies with the goal to identify important DNA methylations that act as mediators between an exposure disease outcome. Specifically, we focus on gene-based high-dimensional mediation analysis implemented with kernel principal component analysis to capture potential nonlinear mediation effect. We first review the current high-dimensional mediation models and then propose two gene-based analytical approaches: gene-based high-dimensional mediation analysis based on linearity assumption between mediators and outcome (gHMA-L) and gene-based high-dimensional mediation analysis based on nonlinearity assumption (gHMA-NL). Since the underlying true mediation relationship is unknown in practice, we further propose an omnibus test of gene-based high-dimensional mediation analysis (gHMA-O) by combing gHMA-L and gHMA-NL. Extensive simulation studies show that gHMA-L performs better under the model linear assumption and gHMA-NL does better under the model nonlinear assumption, while gHMA-O is a more powerful and robust method by combining the two. We apply the proposed methods to two datasets to investigate genes whose methylation levels act as important mediators in the relationship: (1) between alcohol consumption and epithelial ovarian cancer risk using data from the Mayo Clinic Ovarian Cancer Case-Control Study and (2) between childhood maltreatment and comorbid post-traumatic stress disorder and depression in adulthood using data from the Gray Trauma Project.
Collapse
|
10
|
Paul S. RFCM 3: Computational Method for Identification of miRNA-mRNA Regulatory Modules in Cervical Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1729-1740. [PMID: 30990434 DOI: 10.1109/tcbb.2019.2910851] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Cervical cancer is a leading severe malignancy throughout the world. Molecular processes and biomarkers leading to tumor progression in cervical cancer are either unknown or only partially understood. An increasing number of studies have shown that microRNAs play an important role in tumorigenesis so understanding the regulatory mechanism of miRNAs in gene-regulatory network will help elucidate the complex biological processes that occur during malignancy. Functional genomics data provides opportunities to study the aberrant microRNA-messenger RNA (miRNA-mRNA) interaction. Identification of miRNA-mRNA regulatory modules will aid deciphering aberrant transcriptional regulatory network in cervical cancer but is computationally challenging. In this regard, an algorithm, termed as relevant and functionally consistent miRNA-mRNA modules (RFCM3), is proposed. It integrates miRNA and mRNA expression data of cervical cancer for identification of potential miRNA-mRNA modules. It selects set of miRNA-mRNA modules by maximizing relation of mRNAs with miRNA and functional similarity between selected mRNAs. Later, using the knowledge of the miRNA-miRNA synergistic network different modules are fused and finally a set of modules are generated containing several miRNAs as well as mRNAs. This type of module explains the underlying biological pathways containing multiple miRNAs and mRNAs. The effectiveness of the proposed approach over other existing methods has been demonstrated on a miRNA and mRNA expression data of cervical cancer with respect to enrichment analyses and other standard metrices. The prognostic value of the genes in a module with respect to cervical cancer is also demonstrated. The approach was found to generate more robust, integrated, and functionally enriched miRNA-mRNA modules in cervical cancer.
Collapse
|
11
|
Huang J, Chen J, Zhang B, Zhu L, Cai H. Evaluation of gene-drug common module identification methods using pharmacogenomics data. Brief Bioinform 2020; 22:5860683. [PMID: 32591780 DOI: 10.1093/bib/bbaa087] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 04/06/2020] [Accepted: 04/23/2020] [Indexed: 01/21/2023] Open
Abstract
Accurately identifying the interactions between genomic factors and the response of cancer drugs plays important roles in drug discovery, drug repositioning and cancer treatment. A number of studies revealed that interactions between genes and drugs were 'many-genes-to-many drugs' interactions, i.e. common modules, opposed to 'one-gene-to-one-drug' interactions. Such modules fully explain the interactions between complex biological regulatory mechanisms and cancer drugs. However, strategies for effectively and robustly identifying the underlying common modules among pharmacogenomics data remain to be improved. In this paper, we aim to provide a detailed evaluation of three categories of state-of-the-art common module identification techniques from a machine learning perspective, including non-negative matrix factorization (NMF), partial least squares (PLS) and network analyses. We first evaluate the performance of six methods, namely SNMNMF, NetNMF, SNPLS, O2PLS, NSBM and HOGMMNC, using two series of simulated data sets with different noise levels and outlier ratios. Then, we conduct experiments using a real world data set of 2091 genes and 101 drugs in 392 cancer cell lines and compare the real experimental results from the aspect of biological process term enrichment, gene-drug and drug-drug interactions. Finally, we present interesting findings from our evaluation study and discuss the advantages and drawbacks of each method. Supplementary information: Supplementary file is available at Briefings in Bioinformatics online.
Collapse
Affiliation(s)
- Jie Huang
- South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510006, China
| | - Jiazhou Chen
- South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510006, China
| | - Bin Zhang
- South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510006, China
| | - Lei Zhu
- South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510006, China
| | - Hongmin Cai
- South China University of Technology, School of Computer Science and Engineering, Guangzhou, 510006, China
| |
Collapse
|
12
|
Rahmanian S, Murad R, Breschi A, Zeng W, Mackiewicz M, Williams B, Davis CA, Roberts B, Meadows S, Moore D, Trout D, Zaleski C, Dobin A, Sei LH, Drenkow J, Scavelli A, Gingeras TR, Wold BJ, Myers RM, Guigó R, Mortazavi A. Dynamics of microRNA expression during mouse prenatal development. Genome Res 2019; 29:1900-1909. [PMID: 31645363 PMCID: PMC6836743 DOI: 10.1101/gr.248997.119] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 08/29/2019] [Indexed: 12/15/2022]
Abstract
MicroRNAs (miRNAs) play a critical role as posttranscriptional regulators of gene expression. The ENCODE Project profiled the expression of miRNAs in an extensive set of organs during a time-course of mouse embryonic development and captured the expression dynamics of 785 miRNAs. We found distinct organ-specific and developmental stage-specific miRNA expression clusters, with an overall pattern of increasing organ-specific expression as embryonic development proceeds. Comparative analysis of conserved miRNAs in mouse and human revealed stronger clustering of expression patterns by organ type rather than by species. An analysis of messenger RNA expression clusters compared with miRNA expression clusters identifies the potential role of specific miRNA expression clusters in suppressing the expression of mRNAs specific to other developmental programs in the organ in which these miRNAs are expressed during embryonic development. Our results provide the most comprehensive time-course of miRNA expression as part of an integrated ENCODE reference data set for mouse embryonic development.
Collapse
Affiliation(s)
- Sorena Rahmanian
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| | - Rabi Murad
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| | - Alessandra Breschi
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Barcelona 08003, Catalonia, Spain
| | - Weihua Zeng
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Brian Williams
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Carrie A Davis
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Brian Roberts
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Sarah Meadows
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Dianna Moore
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Diane Trout
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Chris Zaleski
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Alex Dobin
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Lei-Hoon Sei
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Jorg Drenkow
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Alex Scavelli
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Thomas R Gingeras
- Functional Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Barbara J Wold
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Roderic Guigó
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG) and UPF, Barcelona 08003, Catalonia, Spain
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
13
|
Su L, Liu G, Wang J, Xu D. A rectified factor network based biclustering method for detecting cancer-related coding genes and miRNAs, and their interactions. Methods 2019; 166:22-30. [PMID: 31121299 PMCID: PMC6708461 DOI: 10.1016/j.ymeth.2019.05.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 04/14/2019] [Accepted: 05/13/2019] [Indexed: 12/12/2022] Open
Abstract
Detecting cancer-related genes and their interactions is a crucial task in cancer research. For this purpose, we proposed an efficient method, to detect coding genes, microRNAs (miRNAs), and their interactions related to a particular cancer or a cancer subtype using their expression data from the same set of samples. Firstly, biclusters specific to a particular type of cancer are detected based on rectified factor networks and ranked according to their associations with general cancers. Secondly, coding genes and miRNAs in each bicluster are prioritized by considering their differential expression and differential correlation values, protein-protein interaction data, and potential cancer markers. Finally, a rank fusion process is used to obtain the final comprehensive rank by combining multiple ranking results. We applied our proposed method on breast cancer datasets. Results show that our method outperforms other methods in detecting breast cancer-related coding genes and miRNAs. Furthermore, our method is very efficient in computing time, which can handle tens of thousands genes/miRNAs and hundreds of patients in hours on a desktop. This work may aid researchers in studying the genetic architecture of complex diseases, and improving the accuracy of diagnosis.
Collapse
Affiliation(s)
- Lingtao Su
- Department of Computer Science and Technology, Jilin University, Changchun 130012, China; Department of Electrical Engineering & Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Guixia Liu
- Department of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Juexin Wang
- Department of Electrical Engineering & Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Dong Xu
- Department of Electrical Engineering & Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
14
|
Xiao Q, Luo J, Liang C, Cai J, Li G, Cao B. CeModule: an integrative framework for discovering regulatory patterns from genomic data in cancer. BMC Bioinformatics 2019; 20:67. [PMID: 30732558 PMCID: PMC6367773 DOI: 10.1186/s12859-019-2654-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Accepted: 01/24/2019] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Non-coding RNAs (ncRNAs) are emerging as key regulators and play critical roles in a wide range of tumorigenesis. Recent studies have suggested that long non-coding RNAs (lncRNAs) could interact with microRNAs (miRNAs) and indirectly regulate miRNA targets through competing interactions. Therefore, uncovering the competing endogenous RNA (ceRNA) regulatory mechanism of lncRNAs, miRNAs and mRNAs in post-transcriptional level will aid in deciphering the underlying pathogenesis of human polygenic diseases and may unveil new diagnostic and therapeutic opportunities. However, the functional roles of vast majority of cancer specific ncRNAs and their combinational regulation patterns are still insufficiently understood. RESULTS Here we develop an integrative framework called CeModule to discover lncRNA, miRNA and mRNA-associated regulatory modules. We fully utilize the matched expression profiles of lncRNAs, miRNAs and mRNAs and establish a model based on joint orthogonality non-negative matrix factorization for identifying modules. Meanwhile, we impose the experimentally verified miRNA-lncRNA interactions, the validated miRNA-mRNA interactions and the weighted gene-gene network into this framework to improve the module accuracy through the network-based penalties. The sparse regularizations are also used to help this model obtain modular sparse solutions. Finally, an iterative multiplicative updating algorithm is adopted to solve the optimization problem. CONCLUSIONS We applied CeModule to two cancer datasets including ovarian cancer (OV) and uterine corpus endometrial carcinoma (UCEC) obtained from TCGA. The modular analysis indicated that the identified modules involving lncRNAs, miRNAs and mRNAs are significantly associated and functionally enriched in cancer-related biological processes and pathways, which may provide new insights into the complex regulatory mechanism of human diseases at the system level.
Collapse
Affiliation(s)
- Qiu Xiao
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
- Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Hunan Normal University, Changsha, 410081, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China.
| | - Cheng Liang
- College of Information Science and Engineering, Shandong Normal University, Jinan, 250000, China
| | - Jie Cai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Guanghui Li
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Buwen Cao
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| |
Collapse
|
15
|
Zhang Y, Liu C, Wang J, Li X. Application of Monte Carlo cross-validation to identify pathway cross-talk in neonatal sepsis. Exp Biol Med (Maywood) 2018; 243:444-450. [PMID: 29513099 PMCID: PMC5882034 DOI: 10.1177/1535370218759635] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 01/17/2018] [Indexed: 01/06/2023] Open
Abstract
To explore genetic pathway cross-talk in neonates with sepsis, an integrated approach was used in this paper. To explore the potential relationships between differently expressed genes between normal uninfected neonates and neonates with sepsis and pathways, genetic profiling and biologic signaling pathway were first integrated. For different pathways, the score was obtained based upon the genetic expression by quantitatively analyzing the pathway cross-talk. The paired pathways with high cross-talk were identified by random forest classification. The purpose of the work was to find the best pairs of pathways able to discriminate sepsis samples versus normal samples. The results found 10 pairs of pathways, which were probably able to discriminate neonates with sepsis versus normal uninfected neonates. Among them, the best two paired pathways were identified according to analysis of extensive literature. Impact statement To find the best pairs of pathways able to discriminate sepsis samples versus normal samples, an RF classifier, the DS obtained by DEGs of paired pathways significantly associated, and Monte Carlo cross-validation were applied in this paper. Ten pairs of pathways were probably able to discriminate neonates with sepsis versus normal uninfected neonates. Among them, the best two paired pathways ((7) IL-6 Signaling and Phospholipase C Signaling (PLC); (8) Glucocorticoid Receptor (GR) Signaling and Dendritic Cell Maturation) were identified according to analysis of extensive literature.
Collapse
Affiliation(s)
- Yuxia Zhang
- Department of Neonatal, Qilu Hospital of Shandong
University, Jinan, Shandong 250012, China
| | - Cui Liu
- Department of Neonatal, Qilu Hospital of Shandong
University, Jinan, Shandong 250012, China
| | - Jingna Wang
- Department of Neonatal, Qilu Hospital of Shandong
University, Jinan, Shandong 250012, China
| | - Xingxia Li
- Department of Neonatal, Qilu Hospital of Shandong
University, Jinan, Shandong 250012, China
| |
Collapse
|
16
|
Luo J, Huang W, Cao B. A novel approach to identify the miRNA-mRNA causal regulatory modules in Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:309-315. [PMID: 28113985 DOI: 10.1109/tcbb.2016.2612199] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
MicroRNAs (miRNAs) play an essential role in many biological processes by regulating the target genes, especially in the initiation and development of cancers. Therefore, the identification of the miRNA-mRNA regulatory modules is important for understanding the regulatory mechanisms. Most computational methods only used statistical correlations in predicting miRNA-mRNA modules, and neglected the fact there are causal relationships between miRNAs and their target genes. In this paper, we propose a novel approach called CALM(the causal regulatory modules) to identify the miRNA-mRNA regulatory modules through integrating the causal interactions and statistical correlations between the miRNAs and their target genes. Our algorithm largely consists of three steps: it first forms the causal regulatory relationships of miRNAs and genes from gene expression profiles and detects the miRNA clusters according to the GO function information of their target genes, then expands each miRNA cluster by greedy adding(discarding) the target genes to maximize the modularity score. To show the performance of our method, we apply CALM on four datasets including EMT, breast, ovarian, thyroid cancer and validate our results. The experiment results show that our method can not only outperform the compared method, but also achieve ideal overall performance in terms of the functional enrichment.
Collapse
|
17
|
Jin D, Lee H. FGMD: A novel approach for functional gene module detection in cancer. PLoS One 2017; 12:e0188900. [PMID: 29244808 PMCID: PMC5731741 DOI: 10.1371/journal.pone.0188900] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 11/15/2017] [Indexed: 12/26/2022] Open
Abstract
With the increasing availability of multi-dimensional biological datasets for the same samples (i.e., gene expression, microRNAs, copy numbers, mutations, methylations), it has now become possible to systematically understand the regulatory mechanisms operating in a cancer cell. For this task, it is important to discover a set of co-expressed genes with functions, representing a so-called functional gene module, because co-expressed genes tend to be co-regulated by the same regulators, including transcription factors, microRNAs, and copy number aberrations. Several algorithms have been used to identify such gene modules, including hierarchical clustering and non-negative matrix factorization. Although these algorithms have been applied to many microarray datasets, only a few systematic analyses of these algorithms have been performed for RNA-sequencing (RNA-Seq) data to date. Although gene expression levels determined based on microarray and RNA-Seq datasets tend to be highly correlated, the expression levels of some genes differ depending on the platforms used for analysis, which may result in the construction of different gene modules for the same samples. Here, we compare several module detection algorithms applied to both microarray and RNA-seq datasets. We further propose a new functional gene module detection algorithm (FGMD), which is based on a hierarchical clustering algorithm that was modified to reflect actual biological observations, including the fact that a single gene may be involved in multiple biological pathways. Application of existing algorithms and the new FGMD algorithm to breast cancer and ovarian cancer datasets from The Cancer Genome Atlas showed that the FGMD algorithm had the best performance for most of the functional pathway enrichment tests and in the transcription factor enrichment test. We expect that the FGMD algorithm will contribute to improving the identification of functional gene modules related to cancer.
Collapse
Affiliation(s)
- Daeyong Jin
- Korea Environment Institute, Sejong, South Korea
| | - Hyunju Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, South Korea
- * E-mail:
| |
Collapse
|
18
|
Seo J, Jin D, Choi CH, Lee H. Integration of MicroRNA, mRNA, and Protein Expression Data for the Identification of Cancer-Related MicroRNAs. PLoS One 2017; 12:e0168412. [PMID: 28056026 PMCID: PMC5215789 DOI: 10.1371/journal.pone.0168412] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Accepted: 11/29/2016] [Indexed: 12/20/2022] Open
Abstract
MicroRNAs (miRNAs) are responsible for the regulation of target genes involved in various biological processes, and may play oncogenic or tumor suppressive roles. Many studies have investigated the relationships between miRNAs and their target genes, using mRNA and miRNA expression data. However, mRNA expression levels do not necessarily represent the exact gene expression profiles, since protein translation may be regulated in several different ways. Despite this, large-scale protein expression data have been integrated rarely when predicting gene-miRNA relationships. This study explores two approaches for the investigation of gene-miRNA relationships by integrating mRNA expression and protein expression data. First, miRNAs were ranked according to their effects on cancer development. We calculated influence scores for each miRNA, based on the number of significant mRNA-miRNA and protein-miRNA correlations. Furthermore, we constructed modules containing mRNAs, proteins, and miRNAs, in which these three molecular types are highly correlated. The regulatory interactions between miRNA and genes in these modules have been validated based on the direct regulations, indirect regulations, and co-regulations through transcription factors. We applied our approaches to glioblastomas (GBMs), ranked miRNAs depending on their effects on GBM, and obtained 52 GBM-related modules. Compared with the miRNA rankings and modules constructed using only mRNA expression data, the rankings and modules constructed using mRNA and protein expression data were shown to have better performance. Additionally, we experimentally verified that miR-504, highly ranked and included in the identified modules, plays a suppressive role in GBM development. We demonstrated that the integration of both expression profiles allows a more precise analysis of gene-miRNA interactions and the identification of a higher number of cancer-related miRNAs and regulatory mechanisms.
Collapse
Affiliation(s)
- Jiyoun Seo
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwanjgu, Republic of Korea
| | - Daeyong Jin
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwanjgu, Republic of Korea
| | - Chan-Hun Choi
- College of Korean Medicine, Dongshin University, Naju-si, Jeollanam-do, Republic of Korea
| | - Hyunju Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwanjgu, Republic of Korea
| |
Collapse
|
19
|
Prioritizing cancer-related microRNAs by integrating microRNA and mRNA datasets. Sci Rep 2016; 6:35350. [PMID: 27734929 PMCID: PMC5062133 DOI: 10.1038/srep35350] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Accepted: 09/28/2016] [Indexed: 12/29/2022] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs regulating the expression of target genes, and they are involved in cancer initiation and progression. Even though many cancer-related miRNAs were identified, their functional impact may vary, depending on their effects on the regulation of other miRNAs and genes. In this study, we propose a novel method for the prioritization of candidate cancer-related miRNAs that may affect the expression of other miRNAs and genes across the entire biological network. For this, we propose three important features: the average expression of a miRNA in multiple cancer samples, the average of the absolute correlation values between the expression of a miRNA and expression of all genes, and the number of predicted miRNA target genes. These three features were integrated using order statistics. By applying the proposed approach to four cancer types, glioblastoma, ovarian cancer, prostate cancer, and breast cancer, we prioritized candidate cancer-related miRNAs and determined their functional roles in cancer-related pathways. The proposed approach can be used to identify miRNAs that play crucial roles in driving cancer development, and the elucidation of novel potential therapeutic targets for cancer treatment.
Collapse
|
20
|
Walsh CJ, Hu P, Batt J, Dos Santos CC. Discovering MicroRNA-Regulatory Modules in Multi-Dimensional Cancer Genomic Data: A Survey of Computational Methods. Cancer Inform 2016; 15:25-42. [PMID: 27721651 PMCID: PMC5051584 DOI: 10.4137/cin.s39369] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Revised: 08/14/2016] [Accepted: 08/16/2016] [Indexed: 12/20/2022] Open
Abstract
MicroRNAs (miRs) are small single-stranded noncoding RNA that function in RNA silencing and post-transcriptional regulation of gene expression. An increasing number of studies have shown that miRs play an important role in tumorigenesis, and understanding the regulatory mechanism of miRs in this gene regulatory network will help elucidate the complex biological processes at play during malignancy. Despite advances, determination of miR–target interactions (MTIs) and identification of functional modules composed of miRs and their specific targets remain a challenge. A large amount of data generated by high-throughput methods from various sources are available to investigate MTIs. The development of data-driven tools to harness these multi-dimensional data has resulted in significant progress over the past decade. In parallel, large-scale cancer genomic projects are allowing new insights into the commonalities and disparities of miR–target regulation across cancers. In the first half of this review, we explore methods for identification of pairwise MTIs, and in the second half, we explore computational tools for discovery of miR-regulatory modules in a cancer-specific and pan-cancer context. We highlight strengths and limitations of each of these tools as a practical guide for the computational biologists.
Collapse
Affiliation(s)
- Christopher J Walsh
- Keenan and Li Ka Shing Knowledge Institute of Saint Michael's Hospital, Toronto, ON, Canada.; Institute of Medical Sciences and Department of Medicine, University of Toronto, Toronto, ON, Canada
| | - Pingzhao Hu
- Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, MB, Canada
| | - Jane Batt
- Keenan and Li Ka Shing Knowledge Institute of Saint Michael's Hospital, Toronto, ON, Canada.; Institute of Medical Sciences and Department of Medicine, University of Toronto, Toronto, ON, Canada
| | - Claudia C Dos Santos
- Keenan and Li Ka Shing Knowledge Institute of Saint Michael's Hospital, Toronto, ON, Canada.; Institute of Medical Sciences and Department of Medicine, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
21
|
Sridharan K, Gogtay NJ. Therapeutic nucleic acids: current clinical status. Br J Clin Pharmacol 2016; 82:659-72. [PMID: 27111518 DOI: 10.1111/bcp.12987] [Citation(s) in RCA: 165] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Revised: 04/20/2016] [Accepted: 04/21/2016] [Indexed: 02/06/2023] Open
Abstract
Deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) are simple linear polymers that have been the subject of considerable research in the last two decades and have now moved into the realm of being stand-alone therapeutic agents. Much of this has stemmed from the appreciation that they carry out myriad functions that go beyond mere storage of genetic information and protein synthesis. Therapy with nucleic acids either uses unmodified DNA or RNA or closely related compounds. From both a development and regulatory perspective, they fall somewhere between small molecules and biologics. Several of these compounds are in clinical development and many have received regulatory approval for human use. This review addresses therapeutic uses of DNA based on antisense oligonucleotides, DNA aptamers and gene therapy; and therapeutic uses of RNA including micro RNAs, short interfering RNAs, ribozymes, RNA decoys and circular RNAs. With their specificity, functional diversity and limited toxicity, therapeutic nucleic acids hold enormous promise. However, challenges that need to be addressed include targeted delivery, mass production at low cost, sustaining efficacy and minimizing off-target toxicity. Technological developments will hold the key to this and help accelerate drug approvals in the years to come.
Collapse
Affiliation(s)
- Kannan Sridharan
- Department of Health Sciences, College of Medicine, Nursing and Health Sciences, Fiji National University, Suva, Fiji
| | - Nithya Jaideep Gogtay
- Department of Clinical Pharmacology, Seth GS Medical College and KEM Hospital, Mumbai, India
| |
Collapse
|
22
|
A Two-Stage Method to Identify Joint Modules From Matched MicroRNA and mRNA Expression Data. IEEE Trans Nanobioscience 2016. [DOI: 10.1109/tnb.2016.2556744] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
23
|
Yang Z, Michailidis G. A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data. Bioinformatics 2016; 32:1-8. [PMID: 26377073 PMCID: PMC5006236 DOI: 10.1093/bioinformatics/btv544] [Citation(s) in RCA: 112] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2015] [Revised: 09/08/2015] [Accepted: 09/09/2015] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION Recent advances in high-throughput omics technologies have enabled biomedical researchers to collect large-scale genomic data. As a consequence, there has been growing interest in developing methods to integrate such data to obtain deeper insights regarding the underlying biological system. A key challenge for integrative studies is the heterogeneity present in the different omics data sources, which makes it difficult to discern the coordinated signal of interest from source-specific noise or extraneous effects. RESULTS We introduce a novel method of multi-modal data analysis that is designed for heterogeneous data based on non-negative matrix factorization. We provide an algorithm for jointly decomposing the data matrices involved that also includes a sparsity option for high-dimensional settings. The performance of the proposed method is evaluated on synthetic data and on real DNA methylation, gene expression and miRNA expression data from ovarian cancer samples obtained from The Cancer Genome Atlas. The results show the presence of common modules across patient samples linked to cancer-related pathways, as well as previously established ovarian cancer subtypes. AVAILABILITY AND IMPLEMENTATION The source code repository is publicly available at https://github.com/yangzi4/iNMF. CONTACT gmichail@umich.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zi Yang
- Department of Statistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - George Michailidis
- Department of Statistics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
24
|
Yao X, Yan J, Kim S, Nho K, Risacher SL, Inlow M, Moore JH, Saykin AJ, Shen L, for the Alzheimer’s Disease Neuroimaging Initiative **. Two-dimensional Enrichment Analysis for Mining High-level Imaging Genetic Associations. BRAIN INFORMATICS AND HEALTH : 8TH INTERNATIONAL CONFERENCE, BIH 2015, LONDON, UK, AUGUST 30-SEPTEMBER 2, 2015 : PROCEEDINGS. BIH (CONFERENCE) (8TH : 2015 : LONDON, ENGLAND) 2015; 9250:115-124. [PMID: 26568986 PMCID: PMC4640356 DOI: 10.1007/978-3-319-23344-4_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Enrichment analysis has been widely applied in the genome-wide association studies (GWAS), where gene sets corresponding to biological pathways are examined for significant associations with a phenotype to help increase statistical power and improve biological interpretation. In this work, we expand the scope of enrichment analysis into brain imaging genetics, an emerging field that studies how genetic variation influences brain structure and function measured by neuroimaging quantitative traits (QT). Given the high dimensionality of both imaging and genetic data, we propose to study Imaging Genetic Enrichment Analysis (IGEA), a new enrichment analysis paradigm that jointly considers meaningful gene sets (GS) and brain circuits (BC) and examines whether any given GS-BC pair is enriched in a list of gene-QT findings. Using gene expression data from Allen Human Brain Atlas and imaging genetics data from Alzheimer's Disease Neuroimaging Initiative as test beds, we present an IGEA framework and conduct a proof-of-concept study. This empirical study identifies 12 significant high level two dimensional imaging genetics modules. Many of these modules are relevant to a variety of neurobiological pathways or neurodegenerative diseases, showing the promise of the proposal framework for providing insight into the mechanism of complex diseases.
Collapse
Affiliation(s)
- Xiaohui Yao
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
- School of Informatics and Computing, Indiana University Indianapolis, IN, USA
| | - Jingwen Yan
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
- School of Informatics and Computing, Indiana University Indianapolis, IN, USA
| | - Sungeun Kim
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
| | - Kwangsik Nho
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
| | - Shannon L Risacher
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
| | - Mark Inlow
- Mathematics, Rose-Hulman Institute of Technology, IN, USA
| | - Jason H. Moore
- Biomedical Informatics, School of Medicine, University of Pennsylvania, PA, USA
| | - Andrew J. Saykin
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
| | - Li Shen
- Radiology and Imaging Sciences, Indiana University School of Medicine, IN, USA
- School of Informatics and Computing, Indiana University Indianapolis, IN, USA
| | | |
Collapse
|
25
|
Integrative Analysis with Monte Carlo Cross-Validation Reveals miRNAs Regulating Pathways Cross-Talk in Aggressive Breast Cancer. BIOMED RESEARCH INTERNATIONAL 2015; 2015:831314. [PMID: 26240829 PMCID: PMC4512830 DOI: 10.1155/2015/831314] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Revised: 05/31/2015] [Accepted: 06/08/2015] [Indexed: 12/11/2022]
Abstract
In this work an integrated approach was used to identify functional miRNAs regulating gene pathway cross-talk in breast cancer (BC). We first integrated gene expression profiles and biological pathway information to explore the underlying associations between genes differently expressed among normal and BC samples and pathways enriched from these genes. For each pair of pathways, a score was derived from the distribution of gene expression levels by quantifying their pathway cross-talk. Random forest classification allowed the identification of pairs of pathways with high cross-talk. We assessed miRNAs regulating the identified gene pathways by a mutual information analysis. A Fisher test was applied to demonstrate their significance in the regulated pathways. Our results suggest interesting networks of pathways that could be key regulatory of target genes in BC, including stem cell pluripotency, coagulation, and hypoxia pathways and miRNAs that control these networks could be potential biomarkers for diagnostic, prognostic, and therapeutic development in BC. This work shows that standard methods of predicting normal and tumor classes such as differentially expressed miRNAs or transcription factors could lose intrinsic features; instead our approach revealed the responsible molecules of the disease.
Collapse
|