1
|
Zhang J, Xiong C, Wei X, Yang H, Zhao C. Modeling ncRNA Synergistic Regulation in Cancer. Methods Mol Biol 2025; 2883:377-402. [PMID: 39702718 DOI: 10.1007/978-1-0716-4290-0_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2024]
Abstract
Cancer seriously threatens human life and health, and the structure and function of genes within cancer cells have changed relative to normal cells. Essentially, cancer is a polygenic disorder, and the core of its occurrence and development is caused by polygenic synergy. Non-coding RNAs (ncRNAs) act as regulators to modulate gene expression levels, and they provide theoretical basis and new technology for the diagnosis and preventive treatment of cancer. However, the study of ncRNA regulation and its role as biomarkers in cancer remain largely unearthed. Driven by multi-omics data, an abundance of computational methods, tools, and databases have been developed for predicting ncRNA-cancer association/causality, inferring ncRNA regulation, and modeling ncRNA synergistic regulation. This chapter aims to provide a comprehensive perspective of modeling ncRNA synergistic regulation. Since the ncRNAs involved in cancer contribute to modeling cancer-associated ncRNA synergistic regulation, we first review the databases and tools of cancer-related ncRNAs. Then we investigate the existing tools or methods for modeling ncRNA-directed and ncRNA-mediated regulation. In addition, we survey the available computational tools or methods for modeling ncRNA synergistic regulation, including synergistic interaction and synergistic competition. Finally, we discuss the future directions and challenges in modeling ncRNA synergistic regulation.
Collapse
Affiliation(s)
- Junpeng Zhang
- School of Engineering, Dali University, Dali, Yunnan, China
| | - Chenchen Xiong
- School of Engineering, Dali University, Dali, Yunnan, China
- Beijing CapitalBio Pharma Technology Co., Ltd., Beijing, China
| | - Xuemei Wei
- School of Engineering, Dali University, Dali, Yunnan, China
| | - Haolin Yang
- School of Engineering, Dali University, Dali, Yunnan, China
| | - Chunwen Zhao
- School of Engineering, Dali University, Dali, Yunnan, China
| |
Collapse
|
2
|
Zhang J, Liu L, Wei X, Zhao C, Luo Y, Li J, Le TD. Scanning sample-specific miRNA regulation from bulk and single-cell RNA-sequencing data. BMC Biol 2024; 22:218. [PMID: 39334271 PMCID: PMC11438147 DOI: 10.1186/s12915-024-02020-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 09/24/2024] [Indexed: 09/30/2024] Open
Abstract
BACKGROUND RNA-sequencing technology provides an effective tool for understanding miRNA regulation in complex human diseases, including cancers. A large number of computational methods have been developed to make use of bulk and single-cell RNA-sequencing data to identify miRNA regulations at the resolution of multiple samples (i.e. group of cells or tissues). However, due to the heterogeneity of individual samples, there is a strong need to infer miRNA regulation specific to individual samples to uncover miRNA regulation at the single-sample resolution level. RESULTS Here, we develop a framework, Scan, for scanning sample-specific miRNA regulation. Since a single network inference method or strategy cannot perform well for all types of new data, Scan incorporates 27 network inference methods and two strategies to infer tissue-specific or cell-specific miRNA regulation from bulk or single-cell RNA-sequencing data. Results on bulk and single-cell RNA-sequencing data demonstrate the effectiveness of Scan in inferring sample-specific miRNA regulation. Moreover, we have found that incorporating the prior information of miRNA targets can generally improve the accuracy of miRNA target prediction. In addition, Scan can contribute to construct cell/tissue correlation networks and recover aggregate miRNA regulatory networks. Finally, the comparison results have shown that the performance of network inference methods is likely to be data-specific, and selecting optimal network inference methods is required for more accurate prediction of miRNA targets. CONCLUSIONS Scan provides a useful method to help infer sample-specific miRNA regulation for new data, benchmark new network inference methods and deepen the understanding of miRNA regulation at the resolution of individual samples.
Collapse
Affiliation(s)
- Junpeng Zhang
- School of Engineering, Dali University, Dali, 671003, Yunnan, China.
| | - Lin Liu
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Xuemei Wei
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Chunwen Zhao
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Yanbi Luo
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Jiuyong Li
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Thuc Duy Le
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia.
| |
Collapse
|
3
|
Fan Z, Kernan KF, Sriram A, Benos PV, Canna SW, Carcillo JA, Kim S, Park HJ. Deep neural networks with knockoff features identify nonlinear causal relations and estimate effect sizes in complex biological systems. Gigascience 2022; 12:giad044. [PMID: 37395630 PMCID: PMC10316696 DOI: 10.1093/gigascience/giad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 01/31/2023] [Accepted: 05/29/2023] [Indexed: 07/04/2023] Open
Abstract
BACKGROUND Learning the causal structure helps identify risk factors, disease mechanisms, and candidate therapeutics for complex diseases. However, although complex biological systems are characterized by nonlinear associations, existing bioinformatic methods of causal inference cannot identify the nonlinear relationships and estimate their effect size. RESULTS To overcome these limitations, we developed the first computational method that explicitly learns nonlinear causal relations and estimates the effect size using a deep neural network approach coupled with the knockoff framework, named causal directed acyclic graphs using deep learning variable selection (DAG-deepVASE). Using simulation data of diverse scenarios and identifying known and novel causal relations in molecular and clinical data of various diseases, we demonstrated that DAG-deepVASE consistently outperforms existing methods in identifying true and known causal relations. In the analyses, we also illustrate how identifying nonlinear causal relations and estimating their effect size help understand the complex disease pathobiology, which is not possible using other methods. CONCLUSIONS With these advantages, the application of DAG-deepVASE can help identify driver genes and therapeutic agents in biomedical studies and clinical trials.
Collapse
Affiliation(s)
- Zhenjiang Fan
- Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Kate F Kernan
- Division of Pediatric Critical Care Medicine, Department of Critical Care Medicine, Children's Hospital of Pittsburgh, Center for Critical Care Nephrology and Clinical Research Investigation and Systems Modeling of Acute Illness Center, University of Pittsburgh, Pittsburgh, PA 15260,USA
| | - Aditya Sriram
- Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Panayiotis V Benos
- Department of Epidemiology, University of Florida, Gainesville, FL 32610, USA
| | - Scott W Canna
- Pediatric Rheumatology, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Joseph A Carcillo
- Division of Pediatric Critical Care Medicine, Department of Critical Care Medicine, Children's Hospital of Pittsburgh, Center for Critical Care Nephrology and Clinical Research Investigation and Systems Modeling of Acute Illness Center, University of Pittsburgh, Pittsburgh, PA 15260,USA
| | - Soyeon Kim
- Division of Pediatric Pulmonary Medicine, Children's Hospital of Pittsburgh, Pittsburgh, PA 15224, USA
- Department of Pediatrics, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15224, USA
| | - Hyun Jung Park
- Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA 15213, USA
| |
Collapse
|
4
|
Kelly J, Berzuini C, Keavney B, Tomaszewski M, Guo H. A review of causal discovery methods for molecular network analysis. Mol Genet Genomic Med 2022; 10:e2055. [PMID: 36087049 PMCID: PMC9544222 DOI: 10.1002/mgg3.2055] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 07/12/2022] [Accepted: 08/18/2022] [Indexed: 11/08/2022] Open
Abstract
BACKGROUND With the increasing availability and size of multi-omics datasets, investigating the casual relationships between molecular phenotypes has become an important aspect of exploring underlying biology andgenetics. There are an increasing number of methodlogies that have been developed and applied to moleular networks to investigate these causal interactions. METHODS We have introduced and reviewed the available methods for building large-scale causal molecular networks that have been developed and applied in the past decade. RESULTS In this review we have identified and summarized the existing methods for infering causality in large-scale causal molecular networks, and discussed important factors that will need to be considered in future research in this area. CONCLUSION Existing methods to infering causal molecular networks have their own strengths and limitations so there is no one best approach, and it is instead down to the discretion of the researcher. This review also to discusses some of the current limitations to biological interpretation of these networks, and important factors to consider for future studies on molecular networks.
Collapse
Affiliation(s)
- Jack Kelly
- Centre for Biostatistics, School of Health Sciences, Faculty of Medicine, Biology and HealthUniversity of ManchesterManchesterUK
| | - Carlo Berzuini
- Centre for Biostatistics, School of Health Sciences, Faculty of Medicine, Biology and HealthUniversity of ManchesterManchesterUK
| | - Bernard Keavney
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and HealthUniversity of ManchesterManchesterUK
- Division of Cardiology and Manchester Academic Health Science CentreManchester University NHS Foundation TrustManchesterUK
| | - Maciej Tomaszewski
- Division of Cardiovascular Sciences, Faculty of Medicine, Biology and HealthUniversity of ManchesterManchesterUK
- Manchester Heart Centre and Manchester Academic Health Science CentreManchester University NHS Foundation TrustManchesterUK
| | - Hui Guo
- Centre for Biostatistics, School of Health Sciences, Faculty of Medicine, Biology and HealthUniversity of ManchesterManchesterUK
| |
Collapse
|
5
|
Zhang J, Liu L, Xu T, Zhang W, Zhao C, Li S, Li J, Rao N, Le TD. Exploring cell-specific miRNA regulation with single-cell miRNA-mRNA co-sequencing data. BMC Bioinformatics 2021; 22:578. [PMID: 34856921 PMCID: PMC8641245 DOI: 10.1186/s12859-021-04498-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 11/19/2021] [Indexed: 11/13/2022] Open
Abstract
Background Existing computational methods for studying miRNA regulation are mostly based on bulk miRNA and mRNA expression data. However, bulk data only allows the analysis of miRNA regulation regarding a group of cells, rather than the miRNA regulation unique to individual cells. Recent advance in single-cell miRNA-mRNA co-sequencing technology has opened a way for investigating miRNA regulation at single-cell level. However, as currently single-cell miRNA-mRNA co-sequencing data is just emerging and only available at small-scale, there is a strong need of novel methods to exploit existing single-cell data for the study of cell-specific miRNA regulation. Results In this work, we propose a new method, CSmiR (Cell-Specific miRNA regulation) to combine single-cell miRNA-mRNA co-sequencing data and putative miRNA-mRNA binding information to identify miRNA regulatory networks at the resolution of individual cells. We apply CSmiR to the miRNA-mRNA co-sequencing data in 19 K562 single-cells to identify cell-specific miRNA-mRNA regulatory networks for understanding miRNA regulation in each K562 single-cell. By analyzing the obtained cell-specific miRNA-mRNA regulatory networks, we observe that the miRNA regulation in each K562 single-cell is unique. Moreover, we conduct detailed analysis on the cell-specific miRNA regulation associated with the miR-17/92 family as a case study. The comparison results indicate that CSmiR is effective in predicting cell-specific miRNA targets. Finally, through exploring cell–cell similarity matrix characterized by cell-specific miRNA regulation, CSmiR provides a novel strategy for clustering single-cells and helps to understand cell–cell crosstalk. Conclusions To the best of our knowledge, CSmiR is the first method to explore miRNA regulation at a single-cell resolution level, and we believe that it can be a useful method to enhance the understanding of cell-specific miRNA regulation. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04498-6.
Collapse
Affiliation(s)
- Junpeng Zhang
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 610054, Sichuan, China. .,School of Engineering, Dali University, Dali, 671003, Yunnan, China.
| | - Lin Liu
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Taosheng Xu
- Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, China
| | - Wu Zhang
- School of Agriculture and Biological Sciences, Dali University, Dali, 671003, Yunnan, China
| | - Chunwen Zhao
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Sijing Li
- School of Engineering, Dali University, Dali, 671003, Yunnan, China
| | - Jiuyong Li
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
| | - Nini Rao
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, 610054, Sichuan, China.
| | - Thuc Duy Le
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia.
| |
Collapse
|
6
|
Cifuentes-Bernal AM, Pham VV, Li X, Liu L, Li J, Le TD. A pseudotemporal causality approach to identifying miRNA-mRNA interactions during biological processes. Bioinformatics 2021; 37:807-814. [PMID: 33070184 DOI: 10.1093/bioinformatics/btaa899] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 09/02/2020] [Accepted: 10/06/2020] [Indexed: 12/19/2022] Open
Abstract
MOTIVATION microRNAs (miRNAs) are important gene regulators and they are involved in many biological processes, including cancer progression. Therefore, correctly identifying miRNA-mRNA interactions is a crucial task. To this end, a huge number of computational methods has been developed, but they mainly use the data at one snapshot and ignore the dynamics of a biological process. The recent development of single cell data and the booming of the exploration of cell trajectories using 'pseudotime' concept have inspired us to develop a pseudotime-based method to infer the miRNA-mRNA relationships characterizing a biological process by taking into account the temporal aspect of the process. RESULTS We have developed a novel approach, called pseudotime causality, to find the causal relationships between miRNAs and mRNAs during a biological process. We have applied the proposed method to both single cell and bulk sequencing datasets for Epithelia to Mesenchymal Transition, a key process in cancer metastasis. The evaluation results show that our method significantly outperforms existing methods in finding miRNA-mRNA interactions in both single cell and bulk data. The results suggest that utilizing the pseudotemporal information from the data helps reveal the gene regulation in a biological process much better than using the static information. AVAILABILITY AND IMPLEMENTATION R scripts and datasets can be found at https://github.com/AndresMCB/PTC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Vu Vh Pham
- UniSA STEM, University of South Australia, Adelaide, South Australia, 5095 Mawson Lakes, Australia
| | - Xiaomei Li
- UniSA STEM, University of South Australia, Adelaide, South Australia, 5095 Mawson Lakes, Australia
| | - Lin Liu
- UniSA STEM, University of South Australia, Adelaide, South Australia, 5095 Mawson Lakes, Australia
| | - Jiuyong Li
- UniSA STEM, University of South Australia, Adelaide, South Australia, 5095 Mawson Lakes, Australia
| | - Thuc Duy Le
- UniSA STEM, University of South Australia, Adelaide, South Australia, 5095 Mawson Lakes, Australia
| |
Collapse
|
7
|
Gjorgjieva M, Sobolewski C, Dolicka D, Correia de Sousa M, Foti M. miRNAs and NAFLD: from pathophysiology to therapy. Gut 2019; 68:2065-2079. [PMID: 31300518 DOI: 10.1136/gutjnl-2018-318146] [Citation(s) in RCA: 167] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/25/2019] [Accepted: 05/29/2019] [Indexed: 12/11/2022]
Abstract
Non-alcoholic fatty liver disease (NAFLD) is associated with a thorough reprogramming of hepatic metabolism. Epigenetic mechanisms, in particular those associated with deregulation of the expressions and activities of microRNAs (miRNAs), play a major role in metabolic disorders associated with NAFLD and their progression towards more severe stages of the disease. In this review, we discuss the recent progress addressing the role of the many facets of complex miRNA regulatory networks in the development and progression of NAFLD. The basic concepts and mechanisms of miRNA-mediated gene regulation as well as the various setbacks encountered in basic and translational research in this field are debated. miRNAs identified so far, whose expressions/activities are deregulated in NAFLD, and which contribute to the outcomes of this pathology are further reviewed. Finally, the potential therapeutic usages in a short to medium term of miRNA-based strategies in NAFLD, in particular to identify non-invasive biomarkers, or to design pharmacological analogues/inhibitors having a broad range of actions on hepatic metabolism, are highlighted.
Collapse
Affiliation(s)
- Monika Gjorgjieva
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Cyril Sobolewski
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Dobrochna Dolicka
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Marta Correia de Sousa
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Michelangelo Foti
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| |
Collapse
|
8
|
Le TD, Hoang T, Li J, Liu L, Liu H, Hu S. A Fast PC Algorithm for High Dimensional Causal Discovery with Multi-Core PCs. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:1483-1495. [PMID: 27429444 DOI: 10.1109/tcbb.2016.2591526] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Discovering causal relationships from observational data is a crucial problem and it has applications in many research areas. The PC algorithm is the state-of-the-art constraint based method for causal discovery. However, runtime of the PC algorithm, in the worst-case, is exponential to the number of nodes (variables), and thus it is inefficient when being applied to high dimensional data, e.g., gene expression datasets. On another note, the advancement of computer hardware in the last decade has resulted in the widespread availability of multi-core personal computers. There is a significant motivation for designing a parallelized PC algorithm that is suitable for personal computers and does not require end users' parallel computing knowledge beyond their competency in using the PC algorithm. In this paper, we develop parallel-PC, a fast and memory efficient PC algorithm using the parallel computing technique. We apply our method to a range of synthetic and real-world high dimensional datasets. Experimental results on a dataset from the DREAM 5 challenge show that the original PC algorithm could not produce any results after running more than 24 hours; meanwhile, our parallel-PC algorithm managed to finish within around 12 hours with a 4-core CPU computer, and less than six hours with a 8-core CPU computer. Furthermore, we integrate parallel-PC into a causal inference method for inferring miRNA-mRNA regulatory relationships. The experimental results show that parallel-PC helps improve both the efficiency and accuracy of the causal inference algorithm.
Collapse
|
9
|
Pham VV, Zhang J, Liu L, Truong B, Xu T, Nguyen TT, Li J, Le TD. Identifying miRNA-mRNA regulatory relationships in breast cancer with invariant causal prediction. BMC Bioinformatics 2019; 20:143. [PMID: 30876399 PMCID: PMC6419852 DOI: 10.1186/s12859-019-2668-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 02/05/2019] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND microRNAs (miRNAs) regulate gene expression at the post-transcriptional level and they play an important role in various biological processes in the human body. Therefore, identifying their regulation mechanisms is essential for the diagnostics and therapeutics for a wide range of diseases. There have been a large number of researches which use gene expression profiles to resolve this problem. However, the current methods have their own limitations. Some of them only identify the correlation of miRNA and mRNA expression levels instead of the causal or regulatory relationships while others infer the causality but with a high computational complexity. To overcome these issues, in this study, we propose a method to identify miRNA-mRNA regulatory relationships in breast cancer using the invariant causal prediction. The key idea of invariant causal prediction is that the cause miRNAs of their target mRNAs are the ones which have persistent causal relationships with the target mRNAs across different environments. RESULTS In this research, we aim to find miRNA targets which are consistent across different breast cancer subtypes. Thus, first of all, we apply the Pam50 method to categorize BRCA samples into different "environment" groups based on different cancer subtypes. Then we use the invariant causal prediction method to find miRNA-mRNA regulatory relationships across subtypes. We validate the results with the miRNA-transfected experimental data and the results show that our method outperforms the state-of-the-art methods. In addition, we also integrate this new method with the Pearson correlation analysis method and Lasso in an ensemble method to take the advantages of these methods. We then validate the results of the ensemble method with the experimentally confirmed data and the ensemble method shows the best performance, even comparing to the proposed causal method. CONCLUSIONS This research found miRNA targets which are consistent across different breast cancer subtypes. Further functional enrichment analysis shows that miRNAs involved in the regulatory relationships predicated by the proposed methods tend to synergistically regulate target genes, indicating the usefulness of these methods, and the identified miRNA targets could be used in the design of wet-lab experiments to discover the causes of breast cancer.
Collapse
Affiliation(s)
- Vu Vh Pham
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia
| | - Junpeng Zhang
- School of Engineering, Dali University, Dali, Yunnan, China
| | - Lin Liu
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia
| | - Buu Truong
- Pham Ngoc Thach University of Medicine, Ho Chi Minh, Vietnam
| | - Taosheng Xu
- Institute of Intelligent Machines, Heifei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
| | - Trung T Nguyen
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia
| | - Jiuyong Li
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia
| | - Thuc D Le
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, Australia.
| |
Collapse
|
10
|
Luo J, Huang W, Cao B. A novel approach to identify the miRNA-mRNA causal regulatory modules in Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:309-315. [PMID: 28113985 DOI: 10.1109/tcbb.2016.2612199] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
MicroRNAs (miRNAs) play an essential role in many biological processes by regulating the target genes, especially in the initiation and development of cancers. Therefore, the identification of the miRNA-mRNA regulatory modules is important for understanding the regulatory mechanisms. Most computational methods only used statistical correlations in predicting miRNA-mRNA modules, and neglected the fact there are causal relationships between miRNAs and their target genes. In this paper, we propose a novel approach called CALM(the causal regulatory modules) to identify the miRNA-mRNA regulatory modules through integrating the causal interactions and statistical correlations between the miRNAs and their target genes. Our algorithm largely consists of three steps: it first forms the causal regulatory relationships of miRNAs and genes from gene expression profiles and detects the miRNA clusters according to the GO function information of their target genes, then expands each miRNA cluster by greedy adding(discarding) the target genes to maximize the modularity score. To show the performance of our method, we apply CALM on four datasets including EMT, breast, ovarian, thyroid cancer and validate our results. The experiment results show that our method can not only outperform the compared method, but also achieve ideal overall performance in terms of the functional enrichment.
Collapse
|
11
|
Liu Y, Du Q, Wang Q, Yu H, Liu J, Tian Y, Chang C, Lei J. Causal inference between bioavailability of heavy metals and environmental factors in a large-scale region. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2017; 226:370-378. [PMID: 28457732 DOI: 10.1016/j.envpol.2017.03.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Revised: 02/16/2017] [Accepted: 03/08/2017] [Indexed: 06/07/2023]
Abstract
The causation between bioavailability of heavy metals and environmental factors are generally obtained from field experiments at local scales at present, and lack sufficient evidence from large scales. However, inferring causation between bioavailability of heavy metals and environmental factors across large-scale regions is challenging. Because the conventional correlation-based approaches used for causation assessments across large-scale regions, at the expense of actual causation, can result in spurious insights. In this study, a general approach framework, Intervention calculus when the directed acyclic graph (DAG) is absent (IDA) combined with the backdoor criterion (BC), was introduced to identify causation between the bioavailability of heavy metals and the potential environmental factors across large-scale regions. We take the Pearl River Delta (PRD) in China as a case study. The causal structures and effects were identified based on the concentrations of heavy metals (Zn, As, Cu, Hg, Pb, Cr, Ni and Cd) in soil (0-20 cm depth) and vegetable (lettuce) and 40 environmental factors (soil properties, extractable heavy metals and weathering indices) in 94 samples across the PRD. Results show that the bioavailability of heavy metals (Cd, Zn, Cr, Ni and As) was causally influenced by soil properties and soil weathering factors, whereas no causal factor impacted the bioavailability of Cu, Hg and Pb. No latent factor was found between the bioavailability of heavy metals and environmental factors. The causation between the bioavailability of heavy metals and environmental factors at field experiments is consistent with that on a large scale. The IDA combined with the BC provides a powerful tool to identify causation between the bioavailability of heavy metals and environmental factors across large-scale regions. Causal inference in a large system with the dynamic changes has great implications for system-based risk management.
Collapse
Affiliation(s)
- Yuqiong Liu
- School of Resource and Environmental Science, Wuhan University, 129 Luoyu Road, Wuhan 430079, China; Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Guangdong Institute of Eco-environmental Science & Technology, Guangzhou 510650, China; Hunan Hydro&Power Design Institute, Changsha, 410007, China
| | - Qingyun Du
- School of Resource and Environmental Science, Wuhan University, 129 Luoyu Road, Wuhan 430079, China; Key Laboratory of Geographic Information System, Ministry of Education, Wuhan University, 129 Luoyu Road, Wuhan 430079, China.
| | - Qi Wang
- Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Guangdong Institute of Eco-environmental Science & Technology, Guangzhou 510650, China.
| | - Huanyun Yu
- Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Guangdong Institute of Eco-environmental Science & Technology, Guangzhou 510650, China
| | - Jianfeng Liu
- Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, China
| | - Yu Tian
- Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Guangdong Institute of Eco-environmental Science & Technology, Guangzhou 510650, China
| | - Chunying Chang
- Guangdong Provincial Academy of Environmental Science, Guangzhou 510045, China
| | - Jing Lei
- College of Agriculture, Guangxi University, Nanning 530005, China
| |
Collapse
|
12
|
Oh M, Rhee S, Moon JH, Chae H, Lee S, Kang J, Kim S. Literature-based condition-specific miRNA-mRNA target prediction. PLoS One 2017; 12:e0174999. [PMID: 28362846 PMCID: PMC5376335 DOI: 10.1371/journal.pone.0174999] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2016] [Accepted: 03/17/2017] [Indexed: 01/20/2023] Open
Abstract
miRNAs are small non-coding RNAs that regulate gene expression by binding to the 3'-UTR of genes. Many recent studies have reported that miRNAs play important biological roles by regulating specific mRNAs or genes. Many sequence-based target prediction algorithms have been developed to predict miRNA targets. However, these methods are not designed for condition-specific target predictions and produce many false positives; thus, expression-based target prediction algorithms have been developed for condition-specific target predictions. A typical strategy to utilize expression data is to leverage the negative control roles of miRNAs on genes. To control false positives, a stringent cutoff value is typically set, but in this case, these methods tend to reject many true target relationships, i.e., false negatives. To overcome these limitations, additional information should be utilized. The literature is probably the best resource that we can utilize. Recent literature mining systems compile millions of articles with experiments designed for specific biological questions, and the systems provide a function to search for specific information. To utilize the literature information, we used a literature mining system, BEST, that automatically extracts information from the literature in PubMed and that allows the user to perform searches of the literature with any English words. By integrating omics data analysis methods and BEST, we developed Context-MMIA, a miRNA-mRNA target prediction method that combines expression data analysis results and the literature information extracted based on the user-specified context. In the pathway enrichment analysis using genes included in the top 200 miRNA-targets, Context-MMIA outperformed the four existing target prediction methods that we tested. In another test on whether prediction methods can re-produce experimentally validated target relationships, Context-MMIA outperformed the four existing target prediction methods. In summary, Context-MMIA allows the user to specify a context of the experimental data to predict miRNA targets, and we believe that Context-MMIA is very useful for predicting condition-specific miRNA targets.
Collapse
Affiliation(s)
- Minsik Oh
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Sungmin Rhee
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Ji Hwan Moon
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Heejoon Chae
- Division of Computer Science, Sookmyung Women’s University, Seoul, Republic of Korea
| | - Sunwon Lee
- Department of Computer Science and Engineering, Korea University, Seoul, Republic of Korea
| | - Jaewoo Kang
- Department of Computer Science and Engineering, Korea University, Seoul, Republic of Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea
- * E-mail:
| |
Collapse
|
13
|
Nguyen T, Diaz D, Tagett R, Draghici S. Overcoming the matched-sample bottleneck: an orthogonal approach to integrate omic data. Sci Rep 2016; 6:29251. [PMID: 27403564 PMCID: PMC4941544 DOI: 10.1038/srep29251] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2016] [Accepted: 06/14/2016] [Indexed: 01/22/2023] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNA molecules whose primary function is to regulate the expression of gene products via hybridization to mRNA transcripts, resulting in suppression of translation or mRNA degradation. Although miRNAs have been implicated in complex diseases, including cancer, their impact on distinct biological pathways and phenotypes is largely unknown. Current integration approaches require sample-matched miRNA/mRNA datasets, resulting in limited applicability in practice. Since these approaches cannot integrate heterogeneous information available across independent experiments, they neither account for bias inherent in individual studies, nor do they benefit from increased sample size. Here we present a novel framework able to integrate miRNA and mRNA data (vertical data integration) available in independent studies (horizontal meta-analysis) allowing for a comprehensive analysis of the given phenotypes. To demonstrate the utility of our method, we conducted a meta-analysis of pancreatic and colorectal cancer, using 1,471 samples from 15 mRNA and 14 miRNA expression datasets. Our two-dimensional data integration approach greatly increases the power of statistical analysis and correctly identifies pathways known to be implicated in the phenotypes. The proposed framework is sufficiently general to integrate other types of data obtained from high-throughput assays.
Collapse
Affiliation(s)
- Tin Nguyen
- Wayne State University, Department of Computer Science, Detroit, 48202, Michigan, USA
| | - Diana Diaz
- Wayne State University, Department of Computer Science, Detroit, 48202, Michigan, USA
| | - Rebecca Tagett
- Wayne State University, Department of Computer Science, Detroit, 48202, Michigan, USA
| | - Sorin Draghici
- Wayne State University, Department of Computer Science, Detroit, 48202, Michigan, USA.,Wayne State University, Department of Obstetrics and Gynecology, Detroit, 48202, Michigan, USA
| |
Collapse
|
14
|
Le TD, Zhang J, Liu L, Liu H, Li J. miRLAB: An R Based Dry Lab for Exploring miRNA-mRNA Regulatory Relationships. PLoS One 2015; 10:e0145386. [PMID: 26716983 PMCID: PMC4696828 DOI: 10.1371/journal.pone.0145386] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 12/03/2015] [Indexed: 11/19/2022] Open
Abstract
microRNAs (miRNAs) are important gene regulators at post-transcriptional level, and inferring miRNA-mRNA regulatory relationships is a crucial problem. Consequently, several computational methods of predicting miRNA targets have been proposed using expression data with or without sequence based miRNA target information. A typical procedure for applying and evaluating such a method is i) collecting matched miRNA and mRNA expression profiles in a specific condition, e.g. a cancer dataset from The Cancer Genome Atlas (TCGA), ii) applying the new computational method to the selected dataset, iii) validating the predictions against knowledge from literature and third-party databases, and comparing the performance of the method with some existing methods. This procedure is time consuming given the time elapsed when collecting and processing data, repeating the work from existing methods, searching for knowledge from literature and third-party databases to validate the results, and comparing the results from different methods. The time consuming procedure prevents researchers from quickly testing new computational models, analysing new datasets, and selecting suitable methods for assisting with the experiment design. Here, we present an R package, miRLAB, for automating the procedure of inferring and validating miRNA-mRNA regulatory relationships. The package provides a complete set of pipelines for testing new methods and analysing new datasets. miRLAB includes a pipeline to obtain matched miRNA and mRNA expression datasets directly from TCGA, 12 benchmark computational methods for inferring miRNA-mRNA regulatory relationships, the functions for validating the predictions using experimentally validated miRNA target data and miRNA perturbation data, and the tools for comparing the results from different computational methods.
Collapse
Affiliation(s)
- Thuc Duy Le
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
- * E-mail: (TDL); (JL)
| | - Junpeng Zhang
- Faculty of Engineering, Dali University, Dali, China
| | - Lin Liu
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
| | - Huawen Liu
- Department of Computer Science, Zhejiang Normal University, China
| | - Jiuyong Li
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
- * E-mail: (TDL); (JL)
| |
Collapse
|
15
|
Holcomb M, Ding YH, Dai D, McDonald RJ, McDonald JS, Kallmes DF, Kadirvel R. RNA-Sequencing Analysis of Messenger RNA/MicroRNA in a Rabbit Aneurysm Model Identifies Pathways and Genes of Interest. AJNR Am J Neuroradiol 2015; 36:1710-5. [PMID: 26228879 DOI: 10.3174/ajnr.a4390] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2014] [Accepted: 01/30/2015] [Indexed: 12/23/2022]
Abstract
BACKGROUND AND PURPOSE Rabbit aneurysm models are used for the testing of embolization devices and elucidating the mechanisms of human intracranial aneurysm growth and healing. We used RNA-sequencing technology to identify genes relevant to induced rabbit aneurysm biology and to identify genes and pathways of potential clinical interest. This process included sequencing microRNAs, which are important regulatory noncoding RNAs. MATERIALS AND METHODS Elastase-induced saccular aneurysms were created at the origin of the right common carotid artery in 6 rabbits. Messenger RNA and microRNA were isolated from the aneurysm and from the control left common carotid artery at 12 weeks and processed by using RNA-sequencing technology. The results from RNA sequencing were analyzed by using the Ingenuity Pathway Analysis tool. RESULTS A total of 9396 genes were analyzed by using RNA sequencing, 648 (6.9%) of which were found to be significantly differentially expressed between the aneurysms and control tissues (P < .05; false-discovery rate, <0.01; fold change, >2 or <.5). Of these genes, 614 were mapped successfully, 143 were down-regulated, and 471 were up-regulated in the aneurysms as compared with controls. Using the same criteria for significance, 3 microRNAs were identified as down-regulated and 5 were identified as up-regulated. Pathway analysis associated these genes with inflammatory response, cellular migration, and coagulation, among other functions and pathologies. CONCLUSIONS RNA-sequencing analysis of rabbit aneurysms revealed differential regulation of some key pathways, including inflammation and antigen presentation. ANKRD1 and TACR1 were identified as genes of interest in the regulation of matrix metalloproteinases.
Collapse
Affiliation(s)
- M Holcomb
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - Y-H Ding
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - D Dai
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - R J McDonald
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - J S McDonald
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - D F Kallmes
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota
| | - R Kadirvel
- From the Neuroradiology Research Laboratory, Mayo Clinic, Rochester, Minnesota.
| |
Collapse
|
16
|
Le TD, Zhang J, Liu L, Li J. Ensemble Methods for MiRNA Target Prediction from Expression Data. PLoS One 2015; 10:e0131627. [PMID: 26114448 PMCID: PMC4482624 DOI: 10.1371/journal.pone.0131627] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 06/04/2015] [Indexed: 01/23/2023] Open
Abstract
Background microRNAs (miRNAs) are short regulatory RNAs that are involved in several diseases, including cancers. Identifying miRNA functions is very important in understanding disease mechanisms and determining the efficacy of drugs. An increasing number of computational methods have been developed to explore miRNA functions by inferring the miRNA-mRNA regulatory relationships from data. Each of the methods is developed based on some assumptions and constraints, for instance, assuming linear relationships between variables. For such reasons, computational methods are often subject to the problem of inconsistent performance across different datasets. On the other hand, ensemble methods integrate the results from individual methods and have been proved to outperform each of their individual component methods in theory. Results In this paper, we investigate the performance of some ensemble methods over the commonly used miRNA target prediction methods. We apply eight different popular miRNA target prediction methods to three cancer datasets, and compare their performance with the ensemble methods which integrate the results from each combination of the individual methods. The validation results using experimentally confirmed databases show that the results of the ensemble methods complement those obtained by the individual methods and the ensemble methods perform better than the individual methods across different datasets. The ensemble method, Pearson+IDA+Lasso, which combines methods in different approaches, including a correlation method, a causal inference method, and a regression method, is the best performed ensemble method in this study. Further analysis of the results of this ensemble method shows that the ensemble method can obtain more targets which could not be found by any of the single methods, and the discovered targets are more statistically significant and functionally enriched. The source codes, datasets, miRNA target predictions by all methods, and the ground truth for validation are available in the Supplementary materials.
Collapse
Affiliation(s)
- Thuc Duy Le
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
- * E-mail: (TDL), (JL)
| | | | - Lin Liu
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
| | - Jiuyong Li
- School of Information Technology and Mathematical Sciences, University of South Australia, Adelaide, South Australia, Australia
- * E-mail: (TDL), (JL)
| |
Collapse
|