101
|
Basu S, Duren W, Evans CR, Burant CF, Michailidis G, Karnovsky A. Sparse network modeling and metscape-based visualization methods for the analysis of large-scale metabolomics data. Bioinformatics 2018; 33:1545-1553. [PMID: 28137712 DOI: 10.1093/bioinformatics/btx012] [Citation(s) in RCA: 119] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 01/11/2017] [Indexed: 02/01/2023] Open
Abstract
Motivation Recent technological advances in mass spectrometry, development of richer mass spectral libraries and data processing tools have enabled large scale metabolic profiling. Biological interpretation of metabolomics studies heavily relies on knowledge-based tools that contain information about metabolic pathways. Incomplete coverage of different areas of metabolism and lack of information about non-canonical connections between metabolites limits the scope of applications of such tools. Furthermore, the presence of a large number of unknown features, which cannot be readily identified, but nonetheless can represent bona fide compounds, also considerably complicates biological interpretation of the data. Results Leveraging recent developments in the statistical analysis of high-dimensional data, we developed a new Debiased Sparse Partial Correlation algorithm (DSPC) for estimating partial correlation networks and implemented it as a Java-based CorrelationCalculator program. We also introduce a new version of our previously developed tool Metscape that enables building and visualization of correlation networks. We demonstrate the utility of these tools by constructing biologically relevant networks and in aiding identification of unknown compounds. Availability and Implementation http://metscape.med.umich.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sumanta Basu
- Department of Statistics, University of California, Berkeley, CA, USA.,Department of Genome Dynamics, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - William Duren
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Charles R Evans
- Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Charles F Burant
- Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Alla Karnovsky
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| |
Collapse
|
102
|
Nontargeted Metabolomics Reveals the Multilevel Response to Antibiotic Perturbations. Cell Rep 2018; 19:1214-1228. [PMID: 28494870 DOI: 10.1016/j.celrep.2017.04.002] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2016] [Revised: 09/27/2016] [Accepted: 03/31/2017] [Indexed: 11/21/2022] Open
Abstract
Microbes have shown a remarkable ability in evading the killing actions of antimicrobial agents, such that treatment of bacterial infections represents once more an urgent global challenge. Understanding the initial bacterial response to antimicrobials may reveal intrinsic tolerance mechanisms to antibiotics and suggest alternative and less conventional therapeutic strategies. Here, we used mass spectrometry-based metabolomics to monitor the immediate metabolic response of Escherichia coli to a variety of antibiotic perturbations. We show that rapid metabolic changes can reflect drug mechanisms of action and reveal the active role of metabolism in mediating the first stress response to antimicrobials. We uncovered a role for ammonium imbalance in aggravating chloramphenicol toxicity and the essential function of deoxythymidine 5'-diphosphate (dTDP)-rhamnose synthesis for the immediate transcriptional upregulation of GyrA in response to quinolone antibiotics. Our results suggest bacterial metabolism as an attractive target to interfere with the early bacterial response to antibiotic treatments and reduce the probability for survival and eventual evolution of antibiotic resistance.
Collapse
|
103
|
Zampieri M, Szappanos B, Buchieri MV, Trauner A, Piazza I, Picotti P, Gagneux S, Borrell S, Gicquel B, Lelievre J, Papp B, Sauer U. High-throughput metabolomic analysis predicts mode of action of uncharacterized antimicrobial compounds. Sci Transl Med 2018; 10:eaal3973. [PMID: 29467300 PMCID: PMC6544516 DOI: 10.1126/scitranslmed.aal3973] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Revised: 04/11/2017] [Accepted: 09/27/2017] [Indexed: 12/19/2022]
Abstract
Rapidly spreading antibiotic resistance and the low discovery rate of new antimicrobial compounds demand more effective strategies for early drug discovery. One bottleneck in the drug discovery pipeline is the identification of the modes of action (MoAs) of new compounds. We have developed a rapid systematic metabolome profiling strategy to classify the MoAs of bioactive compounds. The method predicted MoA-specific metabolic responses in the nonpathogenic bacterium Mycobacterium smegmatis after treatment with 62 reference compounds with known MoAs and different metabolic and nonmetabolic targets. We then analyzed a library of 212 new antimycobacterial compounds with unknown MoAs from a drug discovery effort by the pharmaceutical company GlaxoSmithKline (GSK). More than 70% of these new compounds induced metabolic responses in M. smegmatis indicative of known MoAs, seven of which were experimentally validated. Only 8% (16) of the compounds appeared to target unconventional cellular processes, illustrating the difficulty in discovering new antibiotics with different MoAs among compounds used as monotherapies. For six of the GSK compounds with potentially new MoAs, the metabolome profiles suggested their ability to interfere with trehalose and lipid metabolism. This was supported by whole-genome sequencing of spontaneous drug-resistant mutants of the pathogen Mycobacterium tuberculosis and in vitro compound-proteome interaction analysis for one of these compounds. Our compendium of drug-metabolome profiles can be used to rapidly query the MoAs of uncharacterized antimicrobial compounds and should be a useful resource for the drug discovery community.
Collapse
Affiliation(s)
- Mattia Zampieri
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland.
| | - Balazs Szappanos
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Maria Virginia Buchieri
- Mycobacterial Genetics Unit, Institut Pasteur, 25-28 Rue du Docteur Roux, 75015 Paris, France
| | - Andrej Trauner
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Ilaria Piazza
- Institute of Biochemistry, Department of Biology, ETH Zürich, Zürich, Switzerland
| | - Paola Picotti
- Institute of Biochemistry, Department of Biology, ETH Zürich, Zürich, Switzerland
| | - Sébastien Gagneux
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Sonia Borrell
- Swiss Tropical and Public Health Institute, Basel, Switzerland
- University of Basel, Basel, Switzerland
| | - Brigitte Gicquel
- Mycobacterial Genetics Unit, Institut Pasteur, 25-28 Rue du Docteur Roux, 75015 Paris, France
| | - Joel Lelievre
- Disease of the Developing World, GlaxoSmithKline, Severo Ochoa, Tres Cantos, Madrid 28760, Spain
| | - Balazs Papp
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Uwe Sauer
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| |
Collapse
|
104
|
Ehsan Elahi F, Hasan A. A method for estimating Hill function-based dynamic models of gene regulatory networks. ROYAL SOCIETY OPEN SCIENCE 2018; 5:171226. [PMID: 29515843 PMCID: PMC5830732 DOI: 10.1098/rsos.171226] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 01/25/2018] [Indexed: 08/24/2023]
Abstract
Gene regulatory networks (GRNs) are quite large and complex. To better understand and analyse GRNs, mathematical models are being employed. Different types of models, such as logical, continuous and stochastic models, can be used to describe GRNs. In this paper, we present a new approach to identify continuous models, because they are more suitable for large number of genes and quantitative analysis. One of the most promising techniques for identifying continuous models of GRNs is based on Hill functions and the generalized profiling method (GPM). The advantage of this approach is low computational cost and insensitivity to initial conditions. In the GPM, a constrained nonlinear optimization problem has to be solved that is usually underdetermined. In this paper, we propose a new optimization approach in which we reformulate the optimization problem such that constraints are embedded implicitly in the cost function. Moreover, we propose to split the unknown parameter in two sets based on the structure of Hill functions. These two sets are estimated separately to resolve the issue of the underdetermined problem. As a case study, we apply the proposed technique on the SOS response in Escherichia coli and compare the results with the existing literature.
Collapse
Affiliation(s)
| | - Ammar Hasan
- National University of Sciences and Technology (NUST), H-12, 44000, Islamabad, Pakistan
| |
Collapse
|
105
|
Model-free inference of direct network interactions from nonlinear collective dynamics. Nat Commun 2017; 8:2192. [PMID: 29259167 PMCID: PMC5736722 DOI: 10.1038/s41467-017-02288-4] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Accepted: 11/17/2017] [Indexed: 12/13/2022] Open
Abstract
The topology of interactions in network dynamical systems fundamentally underlies their function. Accelerating technological progress creates massively available data about collective nonlinear dynamics in physical, biological, and technological systems. Detecting direct interaction patterns from those dynamics still constitutes a major open problem. In particular, current nonlinear dynamics approaches mostly require to know a priori a model of the (often high dimensional) system dynamics. Here we develop a model-independent framework for inferring direct interactions solely from recording the nonlinear collective dynamics generated. Introducing an explicit dependency matrix in combination with a block-orthogonal regression algorithm, the approach works reliably across many dynamical regimes, including transient dynamics toward steady states, periodic and non-periodic dynamics, and chaos. Together with its capabilities to reveal network (two point) as well as hypernetwork (e.g., three point) interactions, this framework may thus open up nonlinear dynamics options of inferring direct interaction patterns across systems where no model is known. Network dynamical systems can represent the interactions involved in the collective dynamics of gene regulatory networks or metabolic circuits. Here Casadiego et al. present a method for inferring these types of interactions directly from observed time series without relying on their model.
Collapse
|
106
|
Lünsmann BJ, Kirst C, Timme M. Transition to reconstructibility in weakly coupled networks. PLoS One 2017; 12:e0186624. [PMID: 29053744 PMCID: PMC5650155 DOI: 10.1371/journal.pone.0186624] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Accepted: 10/04/2017] [Indexed: 11/25/2022] Open
Abstract
Across scientific disciplines, thresholded pairwise measures of statistical dependence between time series are taken as proxies for the interactions between the dynamical units of a network. Yet such correlation measures often fail to reflect the underlying physical interactions accurately. Here we systematically study the problem of reconstructing direct physical interaction networks from thresholding correlations. We explicate how local common cause and relay structures, heterogeneous in-degrees and non-local structural properties of the network generally hinder reconstructibility. However, in the limit of weak coupling strengths we prove that stationary systems with dynamics close to a given operating point transition to universal reconstructiblity across all network topologies.
Collapse
Affiliation(s)
- Benedict J. Lünsmann
- Network Dynamics, Max Planck Institute for Dynamics and Self-Organization (MPIDS), 37077 Göttingen, Germany
- Max Planck Institute for the Physics of Complex Systems (MPIPKS), 01187 Dresden, Germany
| | - Christoph Kirst
- Network Dynamics, Max Planck Institute for Dynamics and Self-Organization (MPIDS), 37077 Göttingen, Germany
- Rockefeller University, NY 10065-6399 New York, United States of America
| | - Marc Timme
- Network Dynamics, Max Planck Institute for Dynamics and Self-Organization (MPIDS), 37077 Göttingen, Germany
- Max Planck Institute for the Physics of Complex Systems (MPIPKS), 01187 Dresden, Germany
- Bernstein Center for Computational Neuroscience (BCCN), 37077 Göttingen, Germany
- Chair for Network Dynamics, Center for Advancing Electronics Dresden (cfaed) and Institute for Theoretical Physics, Technical University of Dresden, 01062 Dresden, Germany
- Department of Physics, Technical University of Darmstadt, 64289 Darmstadt, Germany
| |
Collapse
|
107
|
Fang X, Sastry A, Mih N, Kim D, Tan J, Yurkovich JT, Lloyd CJ, Gao Y, Yang L, Palsson BO. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc Natl Acad Sci U S A 2017; 114:10286-10291. [PMID: 28874552 PMCID: PMC5617254 DOI: 10.1073/pnas.1702581114] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the Escherichia coli TRN-probably the best characterized TRN-several questions remain. Here, we address three questions: (i) How complete is our knowledge of the E. coli TRN; (ii) how well can we predict gene expression using this TRN; and (iii) how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism's TRN from disparate data types.
Collapse
Affiliation(s)
- Xin Fang
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
| | - Anand Sastry
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
| | - Nathan Mih
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
- Bioinformatics and Systems Biology Program, University of California at San Diego, La Jolla, CA 92093
| | - Donghyuk Kim
- Department of Genetic Engineering, Kyung Hee University, Yongin 17104, South Korea
| | - Justin Tan
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
| | - James T Yurkovich
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
- Bioinformatics and Systems Biology Program, University of California at San Diego, La Jolla, CA 92093
| | - Colton J Lloyd
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093
| | - Ye Gao
- Division of Biological Sciences, University of California at San Diego, La Jolla, CA 92093
| | - Laurence Yang
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093;
| | - Bernhard O Palsson
- Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093;
- Bioinformatics and Systems Biology Program, University of California at San Diego, La Jolla, CA 92093
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2970 Horsholm, Denmark
- Department of Pediatrics, University of California at San Diego, La Jolla, CA 92093
| |
Collapse
|
108
|
Ma C, Zhang HF, Lai YC. Reconstructing complex networks without time series. Phys Rev E 2017; 96:022320. [PMID: 28950596 DOI: 10.1103/physreve.96.022320] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Indexed: 06/07/2023]
Abstract
In the real world there are situations where the network dynamics are transient (e.g., various spreading processes) and the final nodal states represent the available data. Can the network topology be reconstructed based on data that are not time series? Assuming that an ensemble of the final nodal states resulting from statistically independent initial triggers (signals) of the spreading dynamics is available, we develop a maximum likelihood estimation-based framework to accurately infer the interaction topology. For dynamical processes that result in a binary final state, the framework enables network reconstruction based solely on the final nodal states. Additional information, such as the first arrival time of each signal at each node, can improve the reconstruction accuracy. For processes with a uniform final state, the first arrival times can be exploited to reconstruct the network. We derive a mathematical theory for our framework and validate its performance and robustness using various combinations of spreading dynamics and real-world network topologies.
Collapse
Affiliation(s)
- Chuang Ma
- School of Mathematical Science, Anhui University, Hefei 230601, China
| | - Hai-Feng Zhang
- School of Mathematical Science, Anhui University, Hefei 230601, China
- Center of Information Support &Assurance Technology, Anhui University, Hefei 230601, China
- Department of Communication Engineering, North University of China, Taiyuan, Shan'xi 030051, China
| | - Ying-Cheng Lai
- School of Electrical, Computer and Energy Engineering, Department of Physics, Arizona State University, Tempe, Arizona 85287, USA
| |
Collapse
|
109
|
Shao B, Yuan H, Zhang R, Wang X, Zhang S, Ouyang Q, Hao N, Luo C. Reconstructing the regulatory circuit of cell fate determination in yeast mating response. PLoS Comput Biol 2017; 13:e1005671. [PMID: 28742153 PMCID: PMC5546706 DOI: 10.1371/journal.pcbi.1005671] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Revised: 08/07/2017] [Accepted: 07/09/2017] [Indexed: 12/22/2022] Open
Abstract
Massive technological advances enabled high-throughput measurements of proteomic changes in biological processes. However, retrieving biological insights from large-scale protein dynamics data remains a challenging task. Here we used the mating differentiation in yeast Saccharomyces cerevisiae as a model and developed integrated experimental and computational approaches to analyze the proteomic dynamics during the process of cell fate determination. When exposed to a high dose of mating pheromone, the yeast cell undergoes growth arrest and forms a shmoo-like morphology; however, at intermediate doses, chemotropic elongated growth is initialized. To understand the gene regulatory networks that control this differentiation switch, we employed a high-throughput microfluidic imaging system that allows real-time and simultaneous measurements of cell growth and protein expression. Using kinetic modeling of protein dynamics, we classified the stimulus-dependent changes in protein abundance into two sources: global changes due to physiological alterations and gene-specific changes. A quantitative framework was proposed to decouple gene-specific regulatory modes from the growth-dependent global modulation of protein abundance. Based on the temporal patterns of gene-specific regulation, we established the network architectures underlying distinct cell fates using a reverse engineering method and uncovered the dose-dependent rewiring of gene regulatory network during mating differentiation. Furthermore, our results suggested a potential crosstalk between the pheromone response pathway and the target of rapamycin (TOR)-regulated ribosomal biogenesis pathway, which might underlie a cell differentiation switch in yeast mating response. In summary, our modeling approach addresses the distinct impacts of the global and gene-specific regulation on the control of protein dynamics and provides new insights into the mechanisms of cell fate determination. We anticipate that our integrated experimental and modeling strategies could be widely applicable to other biological systems. A systematic characterization of the proteomic changes during the process of cell differentiation is critical for understanding the underlying molecular mechanisms. However, protein expression can be largely affected by changes in cell physiological state, which hampers the detection of regulatory interactions. Here we proposed an integrated experimental and computational framework to reconstruct regulatory circuits in mating differentiation of budding yeast Saccharomyces cerevisiae, in which distinct cell fates are triggered by alteration in pheromone concentration. A modeling approach was developed to decouple gene-specific regulation from growth-dependent global regulation of protein expression, allowing us to reverse engineering the gene regulatory circuits underlying distinct cell fates. Our work highlights the importance of model-based analysis of proteomic data and delivers new insight into dose-dependent differentiation behavior of budding yeast.
Collapse
Affiliation(s)
- Bin Shao
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
| | - Haiyu Yuan
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
| | - Rongfei Zhang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Xuan Wang
- School of Informatics and Computing, Indiana University, Bloomington, Indiana, United States of America
| | - Shuwen Zhang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Qi Ouyang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Nan Hao
- Section of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, California, United States of America
- * E-mail: (CL); (NH)
| | - Chunxiong Luo
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
- * E-mail: (CL); (NH)
| |
Collapse
|
110
|
Cestnik R, Rosenblum M. Reconstructing networks of pulse-coupled oscillators from spike trains. Phys Rev E 2017; 96:012209. [PMID: 29347231 DOI: 10.1103/physreve.96.012209] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Indexed: 11/07/2022]
Abstract
We present an approach for reconstructing networks of pulse-coupled neuronlike oscillators from passive observation of pulse trains of all nodes. It is assumed that units are described by their phase response curves and that their phases are instantaneously reset by incoming pulses. Using an iterative procedure, we recover the properties of all nodes, namely their phase response curves and natural frequencies, as well as strengths of all directed connections.
Collapse
Affiliation(s)
- Rok Cestnik
- Department of Physics and Astronomy, University of Potsdam, Karl-Liebknecht-Strasse 24/25, D-14476 Potsdam-Golm, Germany.,Department of Human Movement Sciences, MOVE Research Institute Amsterdam, Vrije Universiteit Amsterdam, van der Boechorststraat 9, Amsterdam, Netherlands
| | - Michael Rosenblum
- Department of Physics and Astronomy, University of Potsdam, Karl-Liebknecht-Strasse 24/25, D-14476 Potsdam-Golm, Germany.,The Research Institute of Supercomputing, Lobachevsky National Research State University of Nizhny Novgorod, Russia
| |
Collapse
|
111
|
Yu L, Su R, Wang B, Zhang L, Zou Y, Zhang J, Gao L. Prediction of Novel Drugs for Hepatocellular Carcinoma Based on Multi-Source Random Walk. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:966-977. [PMID: 27076463 DOI: 10.1109/tcbb.2016.2550453] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Computational approaches for predicting drug-disease associations by integrating gene expression and biological network provide great insights to the complex relationships among drugs, targets, disease genes, and diseases at a system level. Hepatocellular carcinoma (HCC) is one of the most common malignant tumors with a high rate of morbidity and mortality. We provide an integrative framework to predict novel d rugs for HCC based on multi-source random walk (PD-MRW). Firstly, based on gene expression and protein interaction network, we construct a gene-gene weighted i nteraction network (GWIN). Then, based on multi-source random walk in GWIN, we build a drug-drug similarity network. Finally, based on the known drugs for HCC, we score all drugs in the drug-drug similarity network. The robustness of our predictions, their overlap with those reported in Comparative Toxicogenomics Database (CTD) and literatures, and their enriched KEGG pathway demonstrate our approach can effectively identify new drug indications. Specifically, regorafenib (Rank = 9 in top-20 list) is proven to be effective in Phase I and II clinical trials of HCC, and the Phase III trial is ongoing. And, it has 11 overlapping pathways with HCC with lower p-values. Focusing on a particular disease, we believe our approach is more accurate and possesses better scalability.
Collapse
|
112
|
Vilar S, Hripcsak G. The role of drug profiles as similarity metrics: applications to repurposing, adverse effects detection and drug-drug interactions. Brief Bioinform 2017; 18:670-681. [PMID: 27273288 PMCID: PMC6078166 DOI: 10.1093/bib/bbw048] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Revised: 04/18/2016] [Indexed: 12/30/2022] Open
Abstract
Explosion of the availability of big data sources along with the development in computational methods provides a useful framework to study drugs' actions, such as interactions with pharmacological targets and off-targets. Databases related to protein interactions, adverse effects and genomic profiles are available to be used for the construction of computational models. In this article, we focus on the description of biological profiles for drugs that can be used as a system to compare similarity and create methods to predict and analyze drugs' actions. We highlight profiles constructed with different biological data, such as target-protein interactions, gene expression measurements, adverse effects and disease profiles. We focus on the discovery of new targets or pathways for drugs already in the pharmaceutical market, also called drug repurposing, in the interaction with off-targets responsible for adverse reactions and in drug-drug interaction analysis. The current and future applications, strengths and challenges facing all these methods are also discussed. Biological profiles or signatures are an important source of data generation to deeply analyze biological actions with important implications in drug-related studies.
Collapse
Affiliation(s)
- Santiago Vilar
- Corresponding author: Santiago Vilar, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA. E-mail: ; George Hripcsak, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA. E-mail:
| | - George Hripcsak
- Corresponding author: Santiago Vilar, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA. E-mail: ; George Hripcsak, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA. E-mail:
| |
Collapse
|
113
|
Sisto M, Lorusso L, Ingravallo G, Lisi S. Exocrine Gland Morphogenesis: Insights into the Role of Amphiregulin from Development to Disease. Arch Immunol Ther Exp (Warsz) 2017; 65:477-499. [DOI: 10.1007/s00005-017-0478-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2016] [Accepted: 06/02/2017] [Indexed: 12/12/2022]
|
114
|
Li J, Shen Z, Wang WX, Grebogi C, Lai YC. Universal data-based method for reconstructing complex networks with binary-state dynamics. Phys Rev E 2017; 95:032303. [PMID: 28415181 DOI: 10.1103/physreve.95.032303] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2016] [Indexed: 11/07/2022]
Abstract
To understand, predict, and control complex networked systems, a prerequisite is to reconstruct the network structure from observable data. Despite recent progress in network reconstruction, binary-state dynamics that are ubiquitous in nature, technology, and society still present an outstanding challenge in this field. Here we offer a framework for reconstructing complex networks with binary-state dynamics by developing a universal data-based linearization approach that is applicable to systems with linear, nonlinear, discontinuous, or stochastic dynamics governed by monotonic functions. The linearization procedure enables us to convert the network reconstruction into a sparse signal reconstruction problem that can be resolved through convex optimization. We demonstrate generally high reconstruction accuracy for a number of complex networks associated with distinct binary-state dynamics from using binary data contaminated by noise and missing data. Our framework is completely data driven, efficient, and robust, and does not require any a priori knowledge about the detailed dynamical process on the network. The framework represents a general paradigm for reconstructing, understanding, and exploiting complex networked systems with binary-state dynamics.
Collapse
Affiliation(s)
- Jingwen Li
- School of Systems Science, Beijing Normal University, Beijing 100875, China
| | - Zhesi Shen
- School of Systems Science, Beijing Normal University, Beijing 100875, China
| | - Wen-Xu Wang
- School of Systems Science, Beijing Normal University, Beijing 100875, China.,Business School, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Celso Grebogi
- Institute for Complex Systems and Mathematical Biology, King's College, University of Aberdeen, Aberdeen AB24 3UE, United Kingdom
| | - Ying-Cheng Lai
- Institute for Complex Systems and Mathematical Biology, King's College, University of Aberdeen, Aberdeen AB24 3UE, United Kingdom.,School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, Arizona 85287, USA.,Department of Physics, Arizona State University, Tempe, Arizona 85287, USA
| |
Collapse
|
115
|
Knüppel R, Kuttenberger C, Ferreira-Cerca S. Toward Time-Resolved Analysis of RNA Metabolism in Archaea Using 4-Thiouracil. Front Microbiol 2017; 8:286. [PMID: 28286499 PMCID: PMC5323407 DOI: 10.3389/fmicb.2017.00286] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 02/13/2017] [Indexed: 11/13/2022] Open
Abstract
Archaea are widespread organisms colonizing almost every habitat on Earth. However, the molecular biology of archaea still remains relatively uncharacterized. RNA metabolism is a central cellular process, which has been extensively analyzed in both bacteria and eukarya. In contrast, analysis of RNA metabolism dynamic in archaea has been limited to date. To facilitate analysis of the RNA metabolism dynamic at a system-wide scale in archaea, we have established non-radioactive pulse labeling of RNA, using the nucleotide analog 4-thiouracil (4TU) in two commonly used model archaea: the halophile Euryarchaeota Haloferax volcanii, and the thermo-acidophile Crenarchaeota Sulfolobus acidocaldarius. In this work, we show that 4TU pulse labeling can be efficiently performed in these two organisms in a dose- and time-dependent manner. In addition, our results suggest that uracil prototrophy had no critical impact on the overall 4TU incorporation in RNA molecules. Accordingly, our work suggests that 4TU incorporation can be widely performed in archaea, thereby expanding the molecular toolkit to analyze archaeal gene expression network dynamic in unprecedented detail.
Collapse
Affiliation(s)
- Robert Knüppel
- Biochemistry III, Institute for Biochemistry, Genetics and Microbiology, University of Regensburg Regensburg, Germany
| | - Corinna Kuttenberger
- Biochemistry III, Institute for Biochemistry, Genetics and Microbiology, University of Regensburg Regensburg, Germany
| | - Sébastien Ferreira-Cerca
- Biochemistry III, Institute for Biochemistry, Genetics and Microbiology, University of Regensburg Regensburg, Germany
| |
Collapse
|
116
|
Sikdar S, Datta S. A novel statistical approach for identification of the master regulator transcription factor. BMC Bioinformatics 2017; 18:79. [PMID: 28148240 PMCID: PMC5288875 DOI: 10.1186/s12859-017-1499-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 01/27/2017] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Transcription factors are known to play key roles in carcinogenesis and therefore, are gaining popularity as potential therapeutic targets in drug development. A 'master regulator' transcription factor often appears to control most of the regulatory activities of the other transcription factors and the associated genes. This 'master regulator' transcription factor is at the top of the hierarchy of the transcriptomic regulation. Therefore, it is important to identify and target the master regulator transcription factor for proper understanding of the associated disease process and identifying the best therapeutic option. METHODS We present a novel two-step computational approach for identification of master regulator transcription factor in a genome. At the first step of our method we test whether there exists any master regulator transcription factor in the system. We evaluate the concordance of two ranked lists of transcription factors using a statistical measure. In case the concordance measure is statistically significant, we conclude that there is a master regulator. At the second step, our method identifies the master regulator transcription factor, if there exists one. RESULTS In the simulation scenario, our method performs reasonably well in validating the existence of a master regulator when the number of subjects in each treatment group is reasonably large. In application to two real datasets, our method ensures the existence of master regulators and identifies biologically meaningful master regulators. An R code for implementing our method in a sample test data can be found in http://www.somnathdatta.org/software . CONCLUSION We have developed a screening method of identifying the 'master regulator' transcription factor just using only the gene expression data. Understanding the regulatory structure and finding the master regulator help narrowing the search space for identifying biomarkers for complex diseases such as cancer. In addition to identifying the master regulator our method provides an overview of the regulatory structure of the transcription factors which control the global gene expression profiles and consequently the cell functioning.
Collapse
Affiliation(s)
- Sinjini Sikdar
- Department of Biostatistics, University of Florida, Gainesville, FL, 32611, USA
| | - Susmita Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, 32611, USA.
| |
Collapse
|
117
|
Nitzan M, Casadiego J, Timme M. Revealing physical interaction networks from statistics of collective dynamics. SCIENCE ADVANCES 2017; 3:e1600396. [PMID: 28246630 PMCID: PMC5302872 DOI: 10.1126/sciadv.1600396] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2016] [Accepted: 12/10/2016] [Indexed: 05/22/2023]
Abstract
Revealing physical interactions in complex systems from observed collective dynamics constitutes a fundamental inverse problem in science. Current reconstruction methods require access to a system's model or dynamical data at a level of detail often not available. We exploit changes in invariant measures, in particular distributions of sampled states of the system in response to driving signals, and use compressed sensing to reveal physical interaction networks. Dynamical observations following driving suffice to infer physical connectivity even if they are temporally disordered, are acquired at large sampling intervals, and stem from different experiments. Testing various nonlinear dynamic processes emerging on artificial and real network topologies indicates high reconstruction quality for existence as well as type of interactions. These results advance our ability to reveal physical interaction networks in complex synthetic and natural systems.
Collapse
Affiliation(s)
- Mor Nitzan
- Racah Institute of Physics, Hebrew University of Jerusalem, 9190401 Jerusalem, Israel
- Faculty of Medicine, Hebrew University of Jerusalem, 9112001 Jerusalem, Israel
- School of Computer Science, Hebrew University of Jerusalem, 9190401 Jerusalem, Israel
| | - Jose Casadiego
- Network Dynamics, Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
- International Max Planck Research School for Physics of Biological and Complex Systems, 37077 Göttingen, Germany
| | - Marc Timme
- Network Dynamics, Max Planck Institute for Dynamics and Self-Organization, 37077 Göttingen, Germany
- International Max Planck Research School for Physics of Biological and Complex Systems, 37077 Göttingen, Germany
- Technical University of Dresden, Institute for Theoretical Physics, 01062 Dresden, Germany
- Bernstein Center for Computational Neuroscience, 37077 Göttingen, Germany
- Department of Physics, Technical University of Darmstadt, 64289 Darmstadt, Germany
| |
Collapse
|
118
|
Lobo D, Levin M. Computing a Worm: Reverse-Engineering Planarian Regeneration. EMERGENCE, COMPLEXITY AND COMPUTATION 2017. [DOI: 10.1007/978-3-319-33921-4_24] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
119
|
Tjärnberg A, Morgan DC, Studham M, Nordling TEM, Sonnhammer ELL. GeneSPIDER – gene regulatory network inference benchmarking with controlled network and data properties. MOLECULAR BIOSYSTEMS 2017; 13:1304-1312. [DOI: 10.1039/c7mb00058h] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
A key question in network inference, that has not been properly answered, is what accuracy can be expected for a given biological dataset and inference method.
Collapse
Affiliation(s)
- Andreas Tjärnberg
- Stockholm Bioinformatics Center
- Science for Life Laboratory
- Sweden
- Department of Biochemistry and Biophysics
- Stockholm University
| | - Daniel C. Morgan
- Stockholm Bioinformatics Center
- Science for Life Laboratory
- Sweden
- Department of Biochemistry and Biophysics
- Stockholm University
| | - Matthew Studham
- Stockholm Bioinformatics Center
- Science for Life Laboratory
- Sweden
- Department of Biochemistry and Biophysics
- Stockholm University
| | - Torbjörn E. M. Nordling
- Stockholm Bioinformatics Center
- Science for Life Laboratory
- Sweden
- Department of Mechanical Engineering
- National Cheng Kung University
| | - Erik L. L. Sonnhammer
- Stockholm Bioinformatics Center
- Science for Life Laboratory
- Sweden
- Department of Biochemistry and Biophysics
- Stockholm University
| |
Collapse
|
120
|
Zheng G, Xu Y, Zhang X, Liu ZP, Wang Z, Chen L, Zhu XG. CMIP: a software package capable of reconstructing genome-wide regulatory networks using gene expression data. BMC Bioinformatics 2016; 17:535. [PMID: 28155637 PMCID: PMC5260056 DOI: 10.1186/s12859-016-1324-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A gene regulatory network (GRN) represents interactions of genes inside a cell or tissue, in which vertexes and edges stand for genes and their regulatory interactions respectively. Reconstruction of gene regulatory networks, in particular, genome-scale networks, is essential for comparative exploration of different species and mechanistic investigation of biological processes. Currently, most of network inference methods are computationally intensive, which are usually effective for small-scale tasks (e.g., networks with a few hundred genes), but are difficult to construct GRNs at genome-scale. RESULTS Here, we present a software package for gene regulatory network reconstruction at a genomic level, in which gene interaction is measured by the conditional mutual information measurement using a parallel computing framework (so the package is named CMIP). The package is a greatly improved implementation of our previous PCA-CMI algorithm. In CMIP, we provide not only an automatic threshold determination method but also an effective parallel computing framework for network inference. Performance tests on benchmark datasets show that the accuracy of CMIP is comparable to most current network inference methods. Moreover, running tests on synthetic datasets demonstrate that CMIP can handle large datasets especially genome-wide datasets within an acceptable time period. In addition, successful application on a real genomic dataset confirms its practical applicability of the package. CONCLUSIONS This new software package provides a powerful tool for genomic network reconstruction to biological community. The software can be accessed at http://www.picb.ac.cn/CMIP/ .
Collapse
Affiliation(s)
- Guangyong Zheng
- CAS Key Laboratory of Computational Biology and State Key Laboratory of Hybrid Rice, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 20031, China.
| | - Yaochen Xu
- CAS Key Laboratory of Computational Biology and State Key Laboratory of Hybrid Rice, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 20031, China.,Software Engineering Institute, East China Normal University, 3663 North Zhongshan Road, Shanghai, 200062, China
| | - Xiujun Zhang
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 200031, China
| | - Zhi-Ping Liu
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 200031, China
| | - Zhuo Wang
- College of Life Science and Biotechnology, Shanghai Jiaotong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 200031, China.
| | - Xin-Guang Zhu
- CAS Key Laboratory of Computational Biology and State Key Laboratory of Hybrid Rice, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai, 20031, China.
| |
Collapse
|
121
|
Variable neighborhood search for reverse engineering of gene regulatory networks. J Biomed Inform 2016; 65:120-131. [PMID: 27919733 DOI: 10.1016/j.jbi.2016.11.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 11/16/2016] [Accepted: 11/27/2016] [Indexed: 01/08/2023]
Abstract
A new search heuristic, Divided Neighborhood Exploration Search, designed to be used with inference algorithms such as Bayesian networks to improve on the reverse engineering of gene regulatory networks is presented. The approach systematically moves through the search space to find topologies representative of gene regulatory networks that are more likely to explain microarray data. In empirical testing it is demonstrated that the novel method is superior to the widely employed greedy search techniques in both the quality of the inferred networks and computational time.
Collapse
|
122
|
Paul D, Peng J, Burman P. Nonparametric estimation of dynamics of monotone trajectories. Ann Stat 2016. [DOI: 10.1214/15-aos1409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
123
|
Wu K, Liu J, Wang S. Reconstructing Networks from Profit Sequences in Evolutionary Games via a Multiobjective Optimization Approach with Lasso Initialization. Sci Rep 2016; 6:37771. [PMID: 27886244 PMCID: PMC5122890 DOI: 10.1038/srep37771] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 11/01/2016] [Indexed: 12/02/2022] Open
Abstract
Evolutionary games (EG) model a common type of interactions in various complex, networked, natural and social systems. Given such a system with only profit sequences being available, reconstructing the interacting structure of EG networks is fundamental to understand and control its collective dynamics. Existing approaches used to handle this problem, such as the lasso, a convex optimization method, need a user-defined constant to control the tradeoff between the natural sparsity of networks and measurement error (the difference between observed data and simulated data). However, a shortcoming of these approaches is that it is not easy to determine these key parameters which can maximize the performance. In contrast to these approaches, we first model the EG network reconstruction problem as a multiobjective optimization problem (MOP), and then develop a framework which involves multiobjective evolutionary algorithm (MOEA), followed by solution selection based on knee regions, termed as MOEANet, to solve this MOP. We also design an effective initialization operator based on the lasso for MOEA. We apply the proposed method to reconstruct various types of synthetic and real-world networks, and the results show that our approach is effective to avoid the above parameter selecting problem and can reconstruct EG networks with high accuracy.
Collapse
Affiliation(s)
- Kai Wu
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi'an 710071, China
| | - Jing Liu
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi'an 710071, China
| | - Shuai Wang
- Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, Xidian University, Xi'an 710071, China
| |
Collapse
|
124
|
BGRMI: A method for inferring gene regulatory networks from time-course gene expression data and its application in breast cancer research. Sci Rep 2016; 6:37140. [PMID: 27876826 PMCID: PMC5120305 DOI: 10.1038/srep37140] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 10/24/2016] [Indexed: 02/06/2023] Open
Abstract
Reconstructing gene regulatory networks (GRNs) from gene expression data is a challenging problem. Existing GRN reconstruction algorithms can be broadly divided into model-free and model–based methods. Typically, model-free methods have high accuracy but are computation intensive whereas model-based methods are fast but less accurate. We propose Bayesian Gene Regulation Model Inference (BGRMI), a model-based method for inferring GRNs from time-course gene expression data. BGRMI uses a Bayesian framework to calculate the probability of different models of GRNs and a heuristic search strategy to scan the model space efficiently. Using benchmark datasets, we show that BGRMI has higher/comparable accuracy at a fraction of the computational cost of competing algorithms. Additionally, it can incorporate prior knowledge of potential gene regulation mechanisms and TF hetero-dimerization processes in the GRN reconstruction process. We incorporated existing ChIP-seq data and known protein interactions between TFs in BGRMI as sources of prior knowledge to reconstruct transcription regulatory networks of proliferating and differentiating breast cancer (BC) cells from time-course gene expression data. The reconstructed networks revealed key driver genes of proliferation and differentiation in BC cells. Some of these genes were not previously studied in the context of BC, but may have clinical relevance in BC treatment.
Collapse
|
125
|
de Anda-Jáuregui G, Velázquez-Caldelas TE, Espinal-Enríquez J, Hernández-Lemus E. Transcriptional Network Architecture of Breast Cancer Molecular Subtypes. Front Physiol 2016; 7:568. [PMID: 27920729 PMCID: PMC5118907 DOI: 10.3389/fphys.2016.00568] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2016] [Accepted: 11/08/2016] [Indexed: 12/22/2022] Open
Abstract
Breast cancer heterogeneity is evident at the clinical, histological and molecular level. High throughput technologies allowed the identification of intrinsic subtypes that capture transcriptional differences among tumors. A remaining question is whether said differences are associated to a particular transcriptional program which involves different connections between the same molecules. In other words, whether particular transcriptional network architectures can be linked to specific phenotypes. In this work we infer, construct and analyze transcriptional networks from whole-genome gene expression microarrays, by using an information theory approach. We use 493 samples of primary breast cancer tissue classified in four molecular subtypes: Luminal A, Luminal B, Basal and HER2-enriched. For comparison, a network for non-tumoral mammary tissue (61 samples) is also inferred and analyzed. Transcriptional networks present particular architectures in each breast cancer subtype as well as in the non-tumor breast tissue. We find substantial differences between the non-tumor network and those networks inferred from cancer samples, in both structure and gene composition. More importantly, we find specific network architectural features associated to each breast cancer subtype. Based on breast cancer networks' centrality, we identify genes previously associated to the disease, either, generally (i.e., CNR2) or to a particular subtype (such as LCK). Similarly, we identify LUZP4, a gene barely explored in breast cancer, playing a role in transcriptional networks with subtype-specific relevance. With this approach we observe architectural differences between cancer and non-cancer at network level, as well as differences between cancer subtype networks which might be associated with breast cancer heterogeneity. The centrality measures of these networks allow us to identify genes with potential biomedical implications to breast cancer.
Collapse
Affiliation(s)
| | | | - Jesús Espinal-Enríquez
- Computational Genomics, National Institute of Genomic MedicineMexico City, Mexico
- Complejidad en Biología de Sistemas, Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de MéxicoMexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics, National Institute of Genomic MedicineMexico City, Mexico
- Complejidad en Biología de Sistemas, Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de MéxicoMexico City, Mexico
| |
Collapse
|
126
|
Zhou Y, Wang P, Wang X, Zhu J, Song PXK. Sparse multivariate factor analysis regression models and its applications to integrative genomics analysis. Genet Epidemiol 2016; 41:70-80. [PMID: 27862229 DOI: 10.1002/gepi.22018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Revised: 09/16/2016] [Accepted: 09/19/2016] [Indexed: 01/25/2023]
Abstract
The multivariate regression model is a useful tool to explore complex associations between two kinds of molecular markers, which enables the understanding of the biological pathways underlying disease etiology. For a set of correlated response variables, accounting for such dependency can increase statistical power. Motivated by integrative genomic data analyses, we propose a new methodology-sparse multivariate factor analysis regression model (smFARM), in which correlations of response variables are assumed to follow a factor analysis model with latent factors. This proposed method not only allows us to address the challenge that the number of association parameters is larger than the sample size, but also to adjust for unobserved genetic and/or nongenetic factors that potentially conceal the underlying response-predictor associations. The proposed smFARM is implemented by the EM algorithm and the blockwise coordinate descent algorithm. The proposed methodology is evaluated and compared to the existing methods through extensive simulation studies. Our results show that accounting for latent factors through the proposed smFARM can improve sensitivity of signal detection and accuracy of sparse association map estimation. We illustrate smFARM by two integrative genomics analysis examples, a breast cancer dataset, and an ovarian cancer dataset, to assess the relationship between DNA copy numbers and gene expression arrays to understand genetic regulatory patterns relevant to the disease. We identify two trans-hub regions: one in cytoband 17q12 whose amplification influences the RNA expression levels of important breast cancer genes, and the other in cytoband 9q21.32-33, which is associated with chemoresistance in ovarian cancer.
Collapse
Affiliation(s)
- Yan Zhou
- Merck & Co, North Wales, PA, USA
| | - Pei Wang
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Xianlong Wang
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Ji Zhu
- University of Michigan, Ann Arbor, MI, USA
| | | |
Collapse
|
127
|
Jover LF, Romberg J, Weitz JS. Inferring phage-bacteria infection networks from time-series data. ROYAL SOCIETY OPEN SCIENCE 2016; 3:160654. [PMID: 28018655 PMCID: PMC5180153 DOI: 10.1098/rsos.160654] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 10/03/2016] [Indexed: 05/09/2023]
Abstract
In communities with bacterial viruses (phage) and bacteria, the phage-bacteria infection network establishes which virus types infect which host types. The structure of the infection network is a key element in understanding community dynamics. Yet, this infection network is often difficult to ascertain. Introduced over 60 years ago, the plaque assay remains the gold standard for establishing who infects whom in a community. This culture-based approach does not scale to environmental samples with increased levels of phage and bacterial diversity, much of which is currently unculturable. Here, we propose an alternative method of inferring phage-bacteria infection networks. This method uses time-series data of fluctuating population densities to estimate the complete interaction network without having to test each phage-bacteria pair individually. We use in silico experiments to analyse the factors affecting the quality of network reconstruction and find robust regimes where accurate reconstructions are possible. In addition, we present a multi-experiment approach where time series from different experiments are combined to improve estimates of the infection network. This approach also mitigates against the possibility of evolutionary changes to relevant phenotypes during the time course of measurement.
Collapse
Affiliation(s)
| | - Justin Romberg
- School of Electrical and Computer Engineering, Atlanta, GA, USA
| | - Joshua S. Weitz
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- Author for correspondence: Joshua S. Weitz e-mail:
| |
Collapse
|
128
|
McGoff KA, Guo X, Deckard A, Kelliher CM, Leman AR, Francey LJ, Hogenesch JB, Haase SB, Harer JL. The Local Edge Machine: inference of dynamic models of gene regulation. Genome Biol 2016; 17:214. [PMID: 27760556 PMCID: PMC5072315 DOI: 10.1186/s13059-016-1076-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 10/03/2016] [Indexed: 12/31/2022] Open
Abstract
We present a novel approach, the Local Edge Machine, for the inference of regulatory interactions directly from time-series gene expression data. We demonstrate its performance, robustness, and scalability on in silico datasets with varying behaviors, sizes, and degrees of complexity. Moreover, we demonstrate its ability to incorporate biological prior information and make informative predictions on a well-characterized in vivo system using data from budding yeast that have been synchronized in the cell cycle. Finally, we use an atlas of transcription data in a mammalian circadian system to illustrate how the method can be used for discovery in the context of large complex networks.
Collapse
Affiliation(s)
- Kevin A McGoff
- Department of Mathematics and Statistics, UNC Charlotte, 9201 University City Blvd., Charlotte, 28269, NC, USA.
| | - Xin Guo
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong, China
| | | | | | - Adam R Leman
- Department of Biology, Duke University, Durham, NC, USA
| | - Lauren J Francey
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | - John B Hogenesch
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | | | - John L Harer
- Department of Mathematics, Duke University, Durham, NC, USA
| |
Collapse
|
129
|
Wang J, Wu Q, Hu XT, Tian T. An integrated approach to infer dynamic protein-gene interactions - A case study of the human P53 protein. Methods 2016; 110:3-13. [PMID: 27514497 DOI: 10.1016/j.ymeth.2016.08.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Revised: 07/18/2016] [Accepted: 08/01/2016] [Indexed: 11/19/2022] Open
Abstract
Investigating the dynamics of genetic regulatory networks through high throughput experimental data, such as microarray gene expression profiles, is a very important but challenging task. One of the major hindrances in building detailed mathematical models for genetic regulation is the large number of unknown model parameters. To tackle this challenge, a new integrated method is proposed by combining a top-down approach and a bottom-up approach. First, the top-down approach uses probabilistic graphical models to predict the network structure of DNA repair pathway that is regulated by the p53 protein. Two networks are predicted, namely a network of eight genes with eight inferred interactions and an extended network of 21 genes with 17 interactions. Then, the bottom-up approach using differential equation models is developed to study the detailed genetic regulations based on either a fully connected regulatory network or a gene network obtained by the top-down approach. Model simulation error, parameter identifiability and robustness property are used as criteria to select the optimal network. Simulation results together with permutation tests of input gene network structures indicate that the prediction accuracy and robustness property of the two predicted networks using the top-down approach are better than those of the corresponding fully connected networks. In particular, the proposed approach reduces computational cost significantly for inferring model parameters. Overall, the new integrated method is a promising approach for investigating the dynamics of genetic regulation.
Collapse
Affiliation(s)
- Junbai Wang
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Montebello, 0310 Oslo, Norway
| | - Qianqian Wu
- School of Mathematical Sciences, Monash University, Melbourne 3800, Victoria, Australia; School of Mathematics, Hefei University of Technology, Hefei, Anhui 230009, China
| | - Xiaohua Tony Hu
- College of Computing and Informatics, Drexel University, Philadelphia, PA 19104, USA
| | - Tianhai Tian
- School of Mathematical Sciences, Monash University, Melbourne 3800, Victoria, Australia.
| |
Collapse
|
130
|
Grabow C, Macinko J, Silver D, Porfiri M. Detecting causality in policy diffusion processes. CHAOS (WOODBURY, N.Y.) 2016; 26:083113. [PMID: 27586609 PMCID: PMC4991992 DOI: 10.1063/1.4961067] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 08/01/2016] [Indexed: 06/06/2023]
Abstract
A universal question in network science entails learning about the topology of interaction from collective dynamics. Here, we address this question by examining diffusion of laws across US states. We propose two complementary techniques to unravel determinants of this diffusion process: information-theoretic union transfer entropy and event synchronization. In order to systematically investigate their performance on law activity data, we establish a new stochastic model to generate synthetic law activity data based on plausible networks of interactions. Through extensive parametric studies, we demonstrate the ability of these methods to reconstruct networks, varying in size, link density, and degree heterogeneity. Our results suggest that union transfer entropy should be preferred for slowly varying processes, which may be associated with policies attending to specific local problems that occur only rarely or with policies facing high levels of opposition. In contrast, event synchronization is effective for faster enactment rates, which may be related to policies involving Federal mandates or incentives. This study puts forward a data-driven toolbox to explain the determinants of legal activity applicable to political science, across dynamical systems, information theory, and complex networks.
Collapse
Affiliation(s)
- Carsten Grabow
- Department of Mechanical and Aerospace Engineering, New York University, Tandon School of Engineering, Brooklyn, New York 11201, USA
| | - James Macinko
- Department of Community Health Sciences and Department of Health Policy and Management, Fielding School of Public Health, University of California, 650 Charles Young Dr., Los Angeles, California 90095, USA
| | - Diana Silver
- Department of Nutrition, Food Studies, and Public Health, 411 Lafayette Street, New York University, Steinhardt School of Culture, Education, and Human Development, New York, New York 10003, USA
| | - Maurizio Porfiri
- Department of Mechanical and Aerospace Engineering, New York University, Tandon School of Engineering, Brooklyn, New York 11201, USA
| |
Collapse
|
131
|
Han X, Shen Z, Wang WX, Lai YC, Grebogi C. Reconstructing direct and indirect interactions in networked public goods game. Sci Rep 2016; 6:30241. [PMID: 27444774 PMCID: PMC4996070 DOI: 10.1038/srep30241] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 07/01/2016] [Indexed: 11/13/2022] Open
Abstract
Network reconstruction is a fundamental problem for understanding many complex
systems with unknown interaction structures. In many complex systems, there are
indirect interactions between two individuals without immediate connection but with
common neighbors. Despite recent advances in network reconstruction, we continue to
lack an approach for reconstructing complex networks with indirect interactions.
Here we introduce a two-step strategy to resolve the reconstruction problem, where
in the first step, we recover both direct and indirect interactions by employing the
Lasso to solve a sparse signal reconstruction problem, and in the second step, we
use matrix transformation and optimization to distinguish between direct and
indirect interactions. The network structure corresponding to direct interactions
can be fully uncovered. We exploit the public goods game occurring on complex
networks as a paradigm for characterizing indirect interactions and test our
reconstruction approach. We find that high reconstruction accuracy can be achieved
for both homogeneous and heterogeneous networks, and a number of empirical networks
in spite of insufficient data measurement contaminated by noise. Although a general
framework for reconstructing complex networks with arbitrary types of indirect
interactions is yet lacking, our approach opens new routes to separate direct and
indirect interactions in a representative complex system.
Collapse
Affiliation(s)
- Xiao Han
- School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China
| | - Zhesi Shen
- School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China
| | - Wen-Xu Wang
- School of Systems Science, Beijing Normal University, Beijing, 100875, P. R. China.,Business School, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China
| | - Ying-Cheng Lai
- School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, Arizona 85287, USA
| | - Celso Grebogi
- Institute for Complex Systems and Mathematical Biology, Kings College, University of Aberdeen, Aberdeen AB24 3UE, UK
| |
Collapse
|
132
|
Jang H, Kim KKK, Braatz RD, Gopaluni RB, Lee JH. Regularized maximum likelihood estimation of sparse stochastic monomolecular biochemical reaction networks. Comput Chem Eng 2016. [DOI: 10.1016/j.compchemeng.2016.03.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
133
|
Dong J, Yao ZJ, Wen M, Zhu MF, Wang NN, Miao HY, Lu AP, Zeng WB, Cao DS. BioTriangle: a web-accessible platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions. J Cheminform 2016; 8:34. [PMID: 27330567 PMCID: PMC4915156 DOI: 10.1186/s13321-016-0146-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 06/14/2016] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND More and more evidences from network biology indicate that most cellular components exert their functions through interactions with other cellular components, such as proteins, DNAs, RNAs and small molecules. The rapidly increasing amount of publicly available data in biology and chemistry enables researchers to revisit interaction problems by systematic integration and analysis of heterogeneous data. Currently, some tools have been developed to represent these components. However, they have some limitations and only focus on the analysis of either small molecules or proteins or DNAs/RNAs. To the best of our knowledge, there is still a lack of freely-available, easy-to-use and integrated platforms for generating molecular descriptors of DNAs/RNAs, proteins, small molecules and their interactions. RESULTS Herein, we developed a comprehensive molecular representation platform, called BioTriangle, to emphasize the integration of cheminformatics and bioinformatics into a molecular informatics platform for computational biology study. It contains a feature-rich toolkit used for the characterization of various biological molecules and complex interaction samples including chemicals, proteins, DNAs/RNAs and even their interactions. By using BioTriangle, users are able to start a full pipelining from getting molecular data, molecular representation to constructing machine learning models conveniently. CONCLUSION BioTriangle provides a user-friendly interface to calculate various features of biological molecules and complex interaction samples conveniently. The computing tasks can be submitted and performed simply in a browser without any sophisticated installation and configuration process. BioTriangle is freely available at http://biotriangle.scbdd.com.Graphical abstractAn overview of BioTriangle. A platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions.
Collapse
Affiliation(s)
- Jie Dong
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Zhi-Jiang Yao
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Ming Wen
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Min-Feng Zhu
- School of Mathematics and Statistics, Central South University, Changsha, People's Republic of China
| | - Ning-Ning Wang
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Hong-Yu Miao
- School of Public Health, University of Texas Health Science Center, Houston, TX USA
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR People's Republic of China
| | - Wen-Bin Zeng
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China ; Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR People's Republic of China
| |
Collapse
|
134
|
De Cremer K, De Brucker K, Staes I, Peeters A, Van den Driessche F, Coenye T, Cammue BPA, Thevissen K. Stimulation of superoxide production increases fungicidal action of miconazole against Candida albicans biofilms. Sci Rep 2016; 6:27463. [PMID: 27272719 PMCID: PMC4895440 DOI: 10.1038/srep27463] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Accepted: 05/17/2016] [Indexed: 12/20/2022] Open
Abstract
We performed a whole-transcriptome analysis of miconazole-treated Candida albicans biofilms, using RNA-sequencing. Our aim was to identify molecular pathways employed by biofilm cells of this pathogen to resist action of the commonly used antifungal miconazole. As expected, genes involved in sterol biosynthesis and genes encoding drug efflux pumps were highly induced in biofilm cells upon miconazole treatment. Other processes were affected as well, including the electron transport chain (ETC), of which eight components were transcriptionally downregulated. Within a diverse set of 17 inhibitors/inducers of the transcriptionally affected pathways, the ETC inhibitors acted most synergistically with miconazole against C. albicans biofilm cells. Synergy was not observed for planktonically growing C. albicans cultures or when biofilms were treated in oxygen-deprived conditions, pointing to a biofilm-specific oxygen-dependent tolerance mechanism. In line, a correlation between miconazole's fungicidal action against C. albicans biofilm cells and the levels of superoxide radicals was observed, and confirmed both genetically and pharmacologically using a triple superoxide dismutase mutant and a superoxide dismutase inhibitor N-N'-diethyldithiocarbamate, respectively. Consequently, ETC inhibitors that result in mitochondrial dysfunction and affect production of reactive oxygen species can increase miconazole's fungicidal activity against C. albicans biofilm cells.
Collapse
Affiliation(s)
- Kaat De Cremer
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Gent, Belgium
| | - Katrijn De Brucker
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
| | - Ines Staes
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
| | - Annelies Peeters
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
| | - Freija Van den Driessche
- Laboratory of Pharmaceutical Microbiology, Ghent University, Ottergemsesteenweg 460, 9000 Gent, Belgium
| | - Tom Coenye
- Laboratory of Pharmaceutical Microbiology, Ghent University, Ottergemsesteenweg 460, 9000 Gent, Belgium
| | - Bruno P. A. Cammue
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
- Department of Plant Systems Biology, VIB, Technologiepark 927, 9052 Gent, Belgium
| | - Karin Thevissen
- Centre of Microbial and Plant Genetics, KU Leuven, Kasteelpark Arenberg 20, box 2460, 3001 Leuven, Belgium
| |
Collapse
|
135
|
Effective gene expression data generation framework based on multi-model approach. Artif Intell Med 2016; 70:41-61. [PMID: 27431036 DOI: 10.1016/j.artmed.2016.05.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Accepted: 05/27/2016] [Indexed: 11/20/2022]
Abstract
OBJECTIVE Overcome the lack of enough samples in gene expression data sets having thousands of genes but a small number of samples challenging the computational methods using them. METHODS AND MATERIAL This paper introduces a multi-model artificial gene expression data generation framework where different gene regulatory network (GRN) models contribute to the final set of samples based on the characteristics of their underlying paradigms. In the first stage, we build different GRN models, and sample data from each of them separately. Then, we pool the generated samples into a rich set of gene expression samples, and finally try to select the best of the generated samples based on a multi-objective selection method measuring the quality of the generated samples from three different aspects such as compatibility, diversity and coverage. We use four alternative GRN models, namely, ordinary differential equations, probabilistic Boolean networks, multi-objective genetic algorithm and hierarchical Markov model. RESULTS We conducted a comprehensive set of experiments based on both real-life biological and synthetic gene expression data sets. We show that our multi-objective sample selection mechanism effectively combines samples from different models having up to 95% compatibility, 10% diversity and 50% coverage. We show that the samples generated by our framework has up to 1.5x higher compatibility, 2x higher diversity and 2x higher coverage than the samples generated by the individual models that the multi-model framework uses. Moreover, the results show that the GRNs inferred from the samples generated by our framework can have 2.4x higher precision, 12x higher recall, and 5.4x higher f-measure values than the GRNs inferred from the original gene expression samples. CONCLUSIONS Therefore, we show that, we can significantly improve the quality of generated gene expression samples by integrating different computational models into one unified framework without dealing with complex internal details of each individual model. Moreover, the rich set of artificial gene expression samples is able to capture some biological relations that can even not be captured by the original gene expression data set.
Collapse
|
136
|
Barranca VJ, Zhou D, Cai D. Compressive sensing reconstruction of feed-forward connectivity in pulse-coupled nonlinear networks. Phys Rev E 2016; 93:060201. [PMID: 27415190 DOI: 10.1103/physreve.93.060201] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2015] [Indexed: 06/06/2023]
Abstract
Utilizing the sparsity ubiquitous in real-world network connectivity, we develop a theoretical framework for efficiently reconstructing sparse feed-forward connections in a pulse-coupled nonlinear network through its output activities. Using only a small ensemble of random inputs, we solve this inverse problem through the compressive sensing theory based on a hidden linear structure intrinsic to the nonlinear network dynamics. The accuracy of the reconstruction is further verified by the fact that complex inputs can be well recovered using the reconstructed connectivity. We expect this Rapid Communication provides a new perspective for understanding the structure-function relationship as well as compressive sensing principle in nonlinear network dynamics.
Collapse
Affiliation(s)
- Victor J Barranca
- Department of Mathematics and Statistics, Swarthmore College, Swarthmore, Pennsylvania 19081, USA
| | - Douglas Zhou
- Department of Mathematics, MOE-LSC, Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - David Cai
- Department of Mathematics, MOE-LSC, Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
- Courant Institute of Mathematical Sciences and Center for Neural Science, New York University, New York, New York 10012, USA
- NYUAD Institute, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates
| |
Collapse
|
137
|
Chatterjee P, Pal NR. Construction of synergy networks from gene expression data related to disease. Gene 2016; 590:250-62. [PMID: 27222483 DOI: 10.1016/j.gene.2016.05.029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2015] [Revised: 03/11/2016] [Accepted: 05/17/2016] [Indexed: 02/07/2023]
Abstract
A few methods have been developed to determine whether genes collaborate with each other in relation to a particular disease using an information theoretic measure of synergy. Here, we propose an alternative definition of synergy and justify that our definition improves upon the existing measures of synergy in the context of gene interactions. We use this definition on a prostate cancer data set consisting of gene expression levels in both cancerous and non-cancerous samples and identify pairs of genes which are unable to discriminate between cancerous and non-cancerous samples individually but can do so jointly when we take their synergistic property into account. We also propose a very simple yet effective technique for computation of conditional entropy at a very low cost. The worst case complexity of our method is O(n) while the best case complexity of a state-of-the-art method is O(n(2)). Furthermore, our method can also be extended to find synergistic relation among triplets or even among a larger number of genes. Finally, we validate our results by demonstrating that these findings cannot be due to pure chance and provide the relevance of the synergistic pairs in cancer biology.
Collapse
Affiliation(s)
- Prantik Chatterjee
- Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, India
| | - Nikhil Ranjan Pal
- Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, India.
| |
Collapse
|
138
|
Xiao F, Gao L, Ye Y, Hu Y, He R. Inferring Gene Regulatory Networks Using Conditional Regulation Pattern to Guide Candidate Genes. PLoS One 2016; 11:e0154953. [PMID: 27171286 PMCID: PMC4865039 DOI: 10.1371/journal.pone.0154953] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 04/21/2016] [Indexed: 12/13/2022] Open
Abstract
Combining path consistency (PC) algorithms with conditional mutual information (CMI) are widely used in reconstruction of gene regulatory networks. CMI has many advantages over Pearson correlation coefficient in measuring non-linear dependence to infer gene regulatory networks. It can also discriminate the direct regulations from indirect ones. However, it is still a challenge to select the conditional genes in an optimal way, which affects the performance and computation complexity of the PC algorithm. In this study, we develop a novel conditional mutual information-based algorithm, namely RPNI (Regulation Pattern based Network Inference), to infer gene regulatory networks. For conditional gene selection, we define the co-regulation pattern, indirect-regulation pattern and mixture-regulation pattern as three candidate patterns to guide the selection of candidate genes. To demonstrate the potential of our algorithm, we apply it to gene expression data from DREAM challenge. Experimental results show that RPNI outperforms existing conditional mutual information-based methods in both accuracy and time complexity for different sizes of gene samples. Furthermore, the robustness of our algorithm is demonstrated by noisy interference analysis using different types of noise.
Collapse
Affiliation(s)
- Fei Xiao
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
- * E-mail:
| | - Yusen Ye
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Yuxuan Hu
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| | - Ruijie He
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710071, China
| |
Collapse
|
139
|
Abstract
Systems metabolic engineering, which recently emerged as metabolic engineering integrated with systems biology, synthetic biology, and evolutionary engineering, allows engineering of microorganisms on a systemic level for the production of valuable chemicals far beyond its native capabilities. Here, we review the strategies for systems metabolic engineering and particularly its applications in Escherichia coli. First, we cover the various tools developed for genetic manipulation in E. coli to increase the production titers of desired chemicals. Next, we detail the strategies for systems metabolic engineering in E. coli, covering the engineering of the native metabolism, the expansion of metabolism with synthetic pathways, and the process engineering aspects undertaken to achieve higher production titers of desired chemicals. Finally, we examine a couple of notable products as case studies produced in E. coli strains developed by systems metabolic engineering. The large portfolio of chemical products successfully produced by engineered E. coli listed here demonstrates the sheer capacity of what can be envisioned and achieved with respect to microbial production of chemicals. Systems metabolic engineering is no longer in its infancy; it is now widely employed and is also positioned to further embrace next-generation interdisciplinary principles and innovation for its upgrade. Systems metabolic engineering will play increasingly important roles in developing industrial strains including E. coli that are capable of efficiently producing natural and nonnatural chemicals and materials from renewable nonfood biomass.
Collapse
Affiliation(s)
- Kyeong Rok Choi
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| | - Jae Ho Shin
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| | - Jae Sung Cho
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| | - Dongsoo Yang
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| | - Sang Yup Lee
- BioInformatics Research Center, KAIST, Daejeon 34141, Republic of Korea
- BioProcess Engineering Research Center, KAIST, Daejeon 34141, Republic of Korea
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| |
Collapse
|
140
|
A method for analysis and design of metabolism using metabolomics data and kinetic models: Application on lipidomics using a novel kinetic model of sphingolipid metabolism. Metab Eng 2016; 37:46-62. [PMID: 27113440 DOI: 10.1016/j.ymben.2016.04.002] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Revised: 01/05/2016] [Accepted: 04/20/2016] [Indexed: 11/22/2022]
Abstract
We present a model-based method, designated Inverse Metabolic Control Analysis (IMCA), which can be used in conjunction with classical Metabolic Control Analysis for the analysis and design of cellular metabolism. We demonstrate the capabilities of the method by first developing a comprehensively curated kinetic model of sphingolipid biosynthesis in the yeast Saccharomyces cerevisiae. Next we apply IMCA using the model and integrating lipidomics data. The combinatorial complexity of the synthesis of sphingolipid molecules, along with the operational complexity of the participating enzymes of the pathway, presents an excellent case study for testing the capabilities of the IMCA. The exceptional agreement of the predictions of the method with genome-wide data highlights the importance and value of a comprehensive and consistent engineering approach for the development of such methods and models. Based on the analysis, we identified the class of enzymes regulating the distribution of sphingolipids among species and hydroxylation states, with the D-phospholipase SPO14 being one of the most prominent. The method and the applications presented here can be used for a broader, model-based inverse metabolic engineering approach.
Collapse
|
141
|
Zhao Y, Jiang M, Chen Y. Inferring gene regulatory networks using a time-delayed mass action model. J Bioinform Comput Biol 2016; 14:1650012. [PMID: 27093908 DOI: 10.1142/s0219720016500128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
This paper demonstrates a new time-delayed mass action model which applies a set of delay differential equations (DDEs) to represent the dynamics of gene regulatory networks (GRNs). The mass action model is a classical model which is often used to describe the kinetics of biochemical processes, so it is fit for GRN modeling. The ability to incorporate time-delayed parameters in this model enables different time delays of interaction between genes. Moreover, an efficient learning method which employs population-based incremental learning (PBIL) algorithm and trigonometric differential evolution (TDE) algorithm TDE is proposed to automatically evolve the structure of the network and infer the optimal parameters from observed time-series gene expression data. Experiments on three well-known motifs of GRN and a real budding yeast cell cycle network show that the proposal can not only successfully infer the network structure and parameters but also has a strong anti-noise ability. Compared with other works, this method also has a great improvement in performances.
Collapse
Affiliation(s)
- Yaou Zhao
- * School of Information Science and Engineering, University of Jinan, Jinan, Shandong Province 250100, P. R. China.,† Shandong Provincial Key Laboratory of Network Based, Intelligent Computing, University of Jinan, Jinan, Shandong Province 250022, P. R. China
| | - Mingyan Jiang
- * School of Information Science and Engineering, University of Jinan, Jinan, Shandong Province 250100, P. R. China
| | - Yuehui Chen
- † Shandong Provincial Key Laboratory of Network Based, Intelligent Computing, University of Jinan, Jinan, Shandong Province 250022, P. R. China
| |
Collapse
|
142
|
Stable Gene Regulatory Network Modeling From Steady-State Data. Bioengineering (Basel) 2016; 3:bioengineering3020012. [PMID: 28952574 PMCID: PMC5597136 DOI: 10.3390/bioengineering3020012] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 03/09/2016] [Accepted: 04/06/2016] [Indexed: 12/19/2022] Open
Abstract
Gene regulatory networks represent an abstract mapping of gene regulations in living cells. They aim to capture dependencies among molecular entities such as transcription factors, proteins and metabolites. In most applications, the regulatory network structure is unknown, and has to be reverse engineered from experimental data consisting of expression levels of the genes usually measured as messenger RNA concentrations in microarray experiments. Steady-state gene expression data are obtained from measurements of the variations in expression activity following the application of small perturbations to equilibrium states in genetic perturbation experiments. In this paper, the least absolute shrinkage and selection operator-vector autoregressive (LASSO-VAR) originally proposed for the analysis of economic time series data is adapted to include a stability constraint for the recovery of a sparse and stable regulatory network that describes data obtained from noisy perturbation experiments. The approach is applied to real experimental data obtained for the SOS pathway in Escherichia coli and the cell cycle pathway for yeast Saccharomyces cerevisiae. Significant features of this method are the ability to recover networks without inputting prior knowledge of the network topology, and the ability to be efficiently applied to large scale networks due to the convex nature of the method.
Collapse
|
143
|
Schulze S, Schleicher J, Guthke R, Linde J. How to Predict Molecular Interactions between Species? Front Microbiol 2016; 7:442. [PMID: 27065992 PMCID: PMC4814556 DOI: 10.3389/fmicb.2016.00442] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2015] [Accepted: 03/18/2016] [Indexed: 12/21/2022] Open
Abstract
Organisms constantly interact with other species through physical contact which leads to changes on the molecular level, for example the transcriptome. These changes can be monitored for all genes, with the help of high-throughput experiments such as RNA-seq or microarrays. The adaptation of the gene expression to environmental changes within cells is mediated through complex gene regulatory networks. Often, our knowledge of these networks is incomplete. Network inference predicts gene regulatory interactions based on transcriptome data. An emerging application of high-throughput transcriptome studies are dual transcriptomics experiments. Here, the transcriptome of two or more interacting species is measured simultaneously. Based on a dual RNA-seq data set of murine dendritic cells infected with the fungal pathogen Candida albicans, the software tool NetGenerator was applied to predict an inter-species gene regulatory network. To promote further investigations of molecular inter-species interactions, we recently discussed dual RNA-seq experiments for host-pathogen interactions and extended the applied tool NetGenerator (Schulze et al., 2015). The updated version of NetGenerator makes use of measurement variances in the algorithmic procedure and accepts gene expression time series data with missing values. Additionally, we tested multiple modeling scenarios regarding the stimuli functions of the gene regulatory network. Here, we summarize the work by Schulze et al. (2015) and put it into a broader context. We review various studies making use of the dual transcriptomics approach to investigate the molecular basis of interacting species. Besides the application to host-pathogen interactions, dual transcriptomics data are also utilized to study mutualistic and commensalistic interactions. Furthermore, we give a short introduction into additional approaches for the prediction of gene regulatory networks and discuss their application to dual transcriptomics data. We conclude that the application of network inference on dual-transcriptomics data is a promising approach to predict molecular inter-species interactions.
Collapse
Affiliation(s)
- Sylvie Schulze
- Research Group Systems Biology and Bioinformatics, Leibniz-Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute Jena, Germany
| | - Jana Schleicher
- Research Group Systems Biology and Bioinformatics, Leibniz-Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute Jena, Germany
| | - Reinhard Guthke
- Research Group Systems Biology and Bioinformatics, Leibniz-Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute Jena, Germany
| | - Jörg Linde
- Research Group Systems Biology and Bioinformatics, Leibniz-Institute for Natural Product Research and Infection Biology - Hans-Knöll-Institute Jena, Germany
| |
Collapse
|
144
|
Noh H, Gunawan R. Inferring gene targets of drugs and chemical compounds from gene expression profiles. Bioinformatics 2016; 32:2120-7. [PMID: 27153589 PMCID: PMC4937192 DOI: 10.1093/bioinformatics/btw148] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Accepted: 03/11/2016] [Indexed: 01/08/2023] Open
Abstract
Motivation: Finding genes which are directly perturbed or targeted by drugs is of great interest and importance in drug discovery. Several network filtering methods have been created to predict the gene targets of drugs from gene expression data based on an ordinary differential equation model of the gene regulatory network (GRN). A critical step in these methods involves inferring the GRN from the expression data, which is a very challenging problem on its own. In addition, existing network filtering methods require computationally intensive parameter tuning or expression data from experiments with known genetic perturbations or both. Results: We developed a method called DeltaNet for the identification of drug targets from gene expression data. Here, the gene target predictions were directly inferred from the data without a separate step of GRN inference. DeltaNet formulation led to solving an underdetermined linear regression problem, for which we employed least angle regression (DeltaNet-LAR) or LASSO regularization (DeltaNet-LASSO). The predictions using DeltaNet for expression data of Escherichia coli, yeast, fruit fly and human were significantly more accurate than those using network filtering methods, namely mode of action by network identification (MNI) and sparse simultaneous equation model (SSEM). Furthermore, DeltaNet using LAR did not require any parameter tuning and could provide computational speed-up over existing methods. Conclusion: DeltaNet is a robust and numerically efficient tool for identifying gene perturbations from gene expression data. Importantly, the method requires little to no expert supervision, while providing accurate gene target predictions. Availability and implementation: DeltaNet is available on http://www.cabsel.ethz.ch/tools/DeltaNet. Contact:rudi.gunawan@chem.ethz.ch Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Heeju Noh
- Institute for Chemical and Bioengineering, Zurich, ETH Zurich, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Rudiyanto Gunawan
- Institute for Chemical and Bioengineering, Zurich, ETH Zurich, Switzerland Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
145
|
Lobo D, Hammelman J, Levin M. MoCha: Molecular Characterization of Unknown Pathways. J Comput Biol 2016; 23:291-7. [PMID: 26950055 DOI: 10.1089/cmb.2015.0211] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Automated methods for the reverse-engineering of complex regulatory networks are paving the way for the inference of mechanistic comprehensive models directly from experimental data. These novel methods can infer not only the relations and parameters of the known molecules defined in their input datasets, but also unknown components and pathways identified as necessary by the automated algorithms. Identifying the molecular nature of these unknown components is a crucial step for making testable predictions and experimentally validating the models, yet no specific and efficient tools exist to aid in this process. To this end, we present here MoCha (Molecular Characterization), a tool optimized for the search of unknown proteins and their pathways from a given set of known interacting proteins. MoCha uses the comprehensive dataset of protein-protein interactions provided by the STRING database, which currently includes more than a billion interactions from over 2,000 organisms. MoCha is highly optimized, performing typical searches within seconds. We demonstrate the use of MoCha with the characterization of unknown components from reverse-engineered models from the literature. MoCha is useful for working on network models by hand or as a downstream step of a model inference engine workflow and represents a valuable and efficient tool for the characterization of unknown pathways using known data from thousands of organisms. MoCha and its source code are freely available online under the GPLv3 license.
Collapse
Affiliation(s)
- Daniel Lobo
- 1 Department of Biological Sciences, University of Maryland , Baltimore County, Baltimore, Maryland
| | - Jennifer Hammelman
- 2 Center for Regenerative and Developmental Biology, and Department of Biology, Tufts University , Medford, Massachusetts
| | - Michael Levin
- 2 Center for Regenerative and Developmental Biology, and Department of Biology, Tufts University , Medford, Massachusetts
| |
Collapse
|
146
|
He B, Tan K. Understanding transcriptional regulatory networks using computational models. Curr Opin Genet Dev 2016; 37:101-108. [PMID: 26950762 DOI: 10.1016/j.gde.2016.02.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Revised: 01/29/2016] [Accepted: 02/08/2016] [Indexed: 01/06/2023]
Abstract
Transcriptional regulatory networks (TRNs) encode instructions for animal development and physiological responses. Recent advances in genomic technologies and computational modeling have revolutionized our ability to construct models of TRNs. Here, we survey current computational methods for inferring TRN models using genome-scale data. We discuss their advantages and limitations. We summarize representative TRNs constructed using genome-scale data in both normal and disease development. We discuss lessons learned about the structure/function relationship of TRNs, based on examining various large-scale TRN models. Finally, we outline some open questions regarding TRNs, including how to improve model accuracy by integrating complementary data types, how to infer condition-specific TRNs, and how to compare TRNs across conditions and species in order to understand their structure/function relationship.
Collapse
Affiliation(s)
- Bing He
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA
| | - Kai Tan
- Interdisciplinary Graduate Program in Genetics, University of Iowa, Iowa City, IA 52242, USA; Department of Internal Medicine, University of Iowa, Iowa City, IA 52242, USA.
| |
Collapse
|
147
|
Su RQ, Wang WX, Wang X, Lai YC. Data-based reconstruction of complex geospatial networks, nodal positioning and detection of hidden nodes. ROYAL SOCIETY OPEN SCIENCE 2016; 3:150577. [PMID: 26909187 PMCID: PMC4736942 DOI: 10.1098/rsos.150577] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Accepted: 11/26/2015] [Indexed: 06/05/2023]
Abstract
Given a complex geospatial network with nodes distributed in a two-dimensional region of physical space, can the locations of the nodes be determined and their connection patterns be uncovered based solely on data? We consider the realistic situation where time series/signals can be collected from a single location. A key challenge is that the signals collected are necessarily time delayed, due to the varying physical distances from the nodes to the data collection centre. To meet this challenge, we develop a compressive-sensing-based approach enabling reconstruction of the full topology of the underlying geospatial network and more importantly, accurate estimate of the time delays. A standard triangularization algorithm can then be employed to find the physical locations of the nodes in the network. We further demonstrate successful detection of a hidden node (or a hidden source or threat), from which no signal can be obtained, through accurate detection of all its neighbouring nodes. As a geospatial network has the feature that a node tends to connect with geophysically nearby nodes, the localized region that contains the hidden node can be identified.
Collapse
Affiliation(s)
- Ri-Qi Su
- School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe, AZ 85287, USA
| | - Wen-Xu Wang
- Department of Systems Science, School of Management and Center for Complexity Research, Beijing Normal University, Beijing 100875, People’s Republic of China
| | - Xiao Wang
- School of Biological and Health Systems Engineering, Arizona State University, Tempe, AZ 85287, USA
| | - Ying-Cheng Lai
- School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
148
|
Bayesian Computation Methods for Inferring Regulatory Network Models Using Biomedical Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 939:289-307. [DOI: 10.1007/978-981-10-1503-8_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
149
|
Acharya L, Reynolds R, Zhu D. Network inference through synergistic subnetwork evolution. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2015; 2015:12. [PMID: 26640480 PMCID: PMC4662719 DOI: 10.1186/s13637-015-0027-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 08/21/2015] [Indexed: 12/02/2022]
Abstract
Study of signaling networks is important for a better understanding of cell behaviors e.g., growth, differentiation, metabolism, proptosis, and gaining deeper insights into the molecular mechanisms of complex diseases. While there have been many successes in developing computational approaches for identifying potential genes and proteins involved in cell signaling, new methods are needed for identifying network structures that depict underlying signal cascading mechanisms. In this paper, we propose a new computational approach for inferring signaling network structures from overlapping gene sets related to the networks. In the proposed approach, a signaling network is represented as a directed graph and is viewed as a union of many active paths representing linear and overlapping chains of signal cascading activities in the network. Gene sets represent the sets of genes participating in active paths without prior knowledge of the order in which genes occur within each path. From a compendium of unordered gene sets, the proposed algorithm reconstructs the underlying network structure through evolution of synergistic active paths. In our context, the extent of edge overlapping among active paths is used to define the synergy present in a network. We evaluated the performance of the proposed algorithm in terms of its convergence and recovering true active paths by utilizing four gene set compendiums derived from the KEGG database. Evaluation of results demonstrate the ability of the algorithm in reconstructing the underlying networks with high accuracy and precision.
Collapse
Affiliation(s)
- Lipi Acharya
- Dow AgroSciences, 9330 Zionsville Road, Indianapolis, IN 46268 USA
| | - Robert Reynolds
- Department of Computer Science, Wayne State University, 5057 Woodward Avenue, Detroit, MI 48202 USA
| | - Dongxiao Zhu
- Department of Computer Science, Wayne State University, 5057 Woodward Avenue, Detroit, MI 48202 USA
| |
Collapse
|
150
|
Chatterjee P, Pal NR. Discovery of synergistic genetic network: A minimum spanning tree-based approach. J Bioinform Comput Biol 2015; 14:1650003. [PMID: 26620041 DOI: 10.1142/s0219720016500037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Identification of gene interactions is one of the very well-known and important problems in the field of genetics. However, discovering synergistic gene interactions is a relatively new problem which has been proven to be as significant as the former in genetics. Several approaches have been proposed in this regard and most of them depend upon information theoretic measures. These approaches quantize the gene expression levels, explicitly or implicitly and therefore, may lose information. Here, we have proposed a novel approach for identifying synergistic gene interactions directly from the continuous expression levels, using a minimum spanning tree (MST)-based algorithm. We have used this approach to find pairs of synergistically interacting genes in prostate cancer. The advantages of our method are that it does not need any discretization and it can be extended straightway to find synergistically interacting sets of genes having three or more elements as per the requirement of the situation. We have demonstrated the relevance of the synergistic genes in cancer biology using KEGG pathway analysis and otherwise.
Collapse
|