1
|
Xi X, Ruffieux H. A modeling framework for detecting and leveraging node-level information in Bayesian network inference. Biostatistics 2024; 26:kxae021. [PMID: 38916966 PMCID: PMC11823055 DOI: 10.1093/biostatistics/kxae021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 03/11/2024] [Accepted: 06/02/2024] [Indexed: 06/27/2024] Open
Abstract
Bayesian graphical models are powerful tools to infer complex relationships in high dimension, yet are often fraught with computational and statistical challenges. If exploited in a principled way, the increasing information collected alongside the data of primary interest constitutes an opportunity to mitigate these difficulties by guiding the detection of dependence structures. For instance, gene network inference may be informed by the use of publicly available summary statistics on the regulation of genes by genetic variants. Here we present a novel Gaussian graphical modeling framework to identify and leverage information on the centrality of nodes in conditional independence graphs. Specifically, we consider a fully joint hierarchical model to simultaneously infer (i) sparse precision matrices and (ii) the relevance of node-level information for uncovering the sought-after network structure. We encode such information as candidate auxiliary variables using a spike-and-slab submodel on the propensity of nodes to be hubs, which allows hypothesis-free selection and interpretation of a sparse subset of relevant variables. As efficient exploration of large posterior spaces is needed for real-world applications, we develop a variational expectation conditional maximization algorithm that scales inference to hundreds of samples, nodes and auxiliary variables. We illustrate and exploit the advantages of our approach in simulations and in a gene network study which identifies hub genes involved in biological pathways relevant to immune-mediated diseases.
Collapse
Affiliation(s)
- Xiaoyue Xi
- MRC Biostatistics Unit, University of Cambridge, East Forvie Building, Forvie Site, Robinson Way, Cambridge CB2 0SR, United Kingdom
| | - Hélène Ruffieux
- MRC Biostatistics Unit, University of Cambridge, East Forvie Building, Forvie Site, Robinson Way, Cambridge CB2 0SR, United Kingdom
| |
Collapse
|
2
|
Barba-Ostria C, Carrera-Pacheco SE, Gonzalez-Pastor R, Heredia-Moya J, Mayorga-Ramos A, Rodríguez-Pólit C, Zúñiga-Miranda J, Arias-Almeida B, Guamán LP. Evaluation of Biological Activity of Natural Compounds: Current Trends and Methods. Molecules 2022; 27:4490. [PMID: 35889361 PMCID: PMC9324072 DOI: 10.3390/molecules27144490] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 06/26/2022] [Accepted: 07/04/2022] [Indexed: 02/08/2023] Open
Abstract
Natural compounds have diverse structures and are present in different forms of life. Metabolites such as tannins, anthocyanins, and alkaloids, among others, serve as a defense mechanism in live organisms and are undoubtedly compounds of interest for the food, cosmetic, and pharmaceutical industries. Plants, bacteria, and insects represent sources of biomolecules with diverse activities, which are in many cases poorly studied. To use these molecules for different applications, it is essential to know their structure, concentrations, and biological activity potential. In vitro techniques that evaluate the biological activity of the molecules of interest have been developed since the 1950s. Currently, different methodologies have emerged to overcome some of the limitations of these traditional techniques, mainly via reductions in time and costs. These emerging technologies continue to appear due to the urgent need to expand the analysis capacity of a growing number of reported biomolecules. This review presents an updated summary of the conventional and relevant methods to evaluate the natural compounds' biological activity in vitro.
Collapse
Affiliation(s)
- Carlos Barba-Ostria
- Escuela de Medicina, Colegio de Ciencias de la Salud Quito, Universidad San Francisco de Quito USFQ, Quito 170901, Ecuador;
| | - Saskya E. Carrera-Pacheco
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Rebeca Gonzalez-Pastor
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Jorge Heredia-Moya
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Arianna Mayorga-Ramos
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Cristina Rodríguez-Pólit
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Johana Zúñiga-Miranda
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Benjamin Arias-Almeida
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| | - Linda P. Guamán
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170527, Ecuador; (S.E.C.-P.); (R.G.-P.); (J.H.-M.); (A.M.-R.); (C.R.-P.); (J.Z.-M.); (B.A.-A.)
| |
Collapse
|
3
|
Gaglio D, Bonanomi M, Valtorta S, Bharat R, Ripamonti M, Conte F, Fiscon G, Righi N, Napodano E, Papa F, Raccagni I, Parker SJ, Cifola I, Camboni T, Paci P, Colangelo AM, Vanoni M, Metallo CM, Moresco RM, Alberghina L. Disruption of redox homeostasis for combinatorial drug efficacy in K-Ras tumors as revealed by metabolic connectivity profiling. Cancer Metab 2020; 8:22. [PMID: 33005401 PMCID: PMC7523077 DOI: 10.1186/s40170-020-00227-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 09/06/2020] [Indexed: 12/14/2022] Open
Abstract
Abstract Background Rewiring of metabolism induced by oncogenic K-Ras in cancer cells involves both glucose and glutamine utilization sustaining enhanced, unrestricted growth. The development of effective anti-cancer treatments targeting metabolism may be facilitated by the identification and rational combinatorial targeting of metabolic pathways. Methods We performed mass spectrometric metabolomics analysis in vitro and in vivo experiments to evaluate the efficacy of drugs and identify metabolic connectivity. Results We show that K-Ras-mutant lung and colon cancer cells exhibit a distinct metabolic rewiring, the latter being more dependent on respiration. Combined treatment with the glutaminase inhibitor CB-839 and the PI3K/aldolase inhibitor NVP-BKM120 more consistently reduces cell growth of tumor xenografts. Maximal growth inhibition correlates with the disruption of redox homeostasis, involving loss of reduced glutathione regeneration, redox cofactors, and a decreased connectivity among metabolites primarily involved in nucleic acid metabolism. Conclusions Our findings open the way to develop metabolic connectivity profiling as a tool for a selective strategy of combined drug repositioning in precision oncology.
Collapse
Affiliation(s)
- Daniela Gaglio
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy
| | - Marcella Bonanomi
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Silvia Valtorta
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Medicine and Surgery and Tecnomed Foundation, University of Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Rohit Bharat
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Marilena Ripamonti
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy
| | - Federica Conte
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Giulia Fiscon
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Nicole Righi
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Elisabetta Napodano
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy
| | - Federico Papa
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Isabella Raccagni
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Nuclear Medicine Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Seth J Parker
- Department of Bioengineering, University of California, San Diego, La Jolla, CA USA.,Moores Cancer Center, University of California, San Diego, La Jolla, CA USA
| | - Ingrid Cifola
- Institute for Biomedical Technologies (ITB), National Research Council (CNR), Segrate, Milan, Italy
| | - Tania Camboni
- Institute for Biomedical Technologies (ITB), National Research Council (CNR), Segrate, Milan, Italy
| | - Paola Paci
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy.,Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Anna Maria Colangelo
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Marco Vanoni
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| | - Christian M Metallo
- Department of Bioengineering, University of California, San Diego, La Jolla, CA USA.,Moores Cancer Center, University of California, San Diego, La Jolla, CA USA
| | - Rosa Maria Moresco
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), Segrate, MI Italy.,ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Medicine and Surgery and Tecnomed Foundation, University of Milano-Bicocca, Via Cadore 48, 20900 Monza, Italy
| | - Lilia Alberghina
- ISBE. IT/Centre of Systems Biology, Piazza della Scienza 4, 20126 Milan, Italy.,Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza 2, 20126 Milan, Italy
| |
Collapse
|
4
|
Chanda P, Costa E, Hu J, Sukumar S, Van Hemert J, Walia R. Information Theory in Computational Biology: Where We Stand Today. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E627. [PMID: 33286399 PMCID: PMC7517167 DOI: 10.3390/e22060627] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 05/31/2020] [Accepted: 06/03/2020] [Indexed: 12/30/2022]
Abstract
"A Mathematical Theory of Communication" was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon's work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology-gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.
Collapse
Affiliation(s)
- Pritam Chanda
- Corteva Agriscience™, Indianapolis, IN 46268, USA
- Computer and Information Science, Indiana University-Purdue University, Indianapolis, IN 46202, USA
| | - Eduardo Costa
- Corteva Agriscience™, Mogi Mirim, Sao Paulo 13801-540, Brazil
| | - Jie Hu
- Corteva Agriscience™, Indianapolis, IN 46268, USA
| | | | | | - Rasna Walia
- Corteva Agriscience™, Johnston, IA 50131, USA
| |
Collapse
|
5
|
Altenbuchinger M, Weihs A, Quackenbush J, Grabe HJ, Zacharias HU. Gaussian and Mixed Graphical Models as (multi-)omics data analysis tools. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020; 1863:194418. [PMID: 31639475 PMCID: PMC7166149 DOI: 10.1016/j.bbagrm.2019.194418] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 08/21/2019] [Accepted: 08/21/2019] [Indexed: 11/30/2022]
Abstract
Gaussian Graphical Models (GGMs) are tools to infer dependencies between biological variables. Popular applications are the reconstruction of gene, protein, and metabolite association networks. GGMs are an exploratory research tool that can be useful to discover interesting relations between genes (functional clusters) or to identify therapeutically interesting genes, but do not necessarily infer a network in the mechanistic sense. Although GGMs are well investigated from a theoretical and applied perspective, important extensions are not well known within the biological community. GGMs assume, for instance, multivariate normal distributed data. If this assumption is violated Mixed Graphical Models (MGMs) can be the better choice. In this review, we provide the theoretical foundations of GGMs, present extensions such as MGMs or multi-class GGMs, and illustrate how those methods can provide insight in biological mechanisms. We summarize several applications and present user-friendly estimation software. This article is part of a Special Issue entitled: Transcriptional Profiles and Regulatory Gene Networks edited by Dr. Dr. Federico Manuel Giorgi and Dr. Shaun Mahony.
Collapse
Affiliation(s)
- Michael Altenbuchinger
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, MA Boston, 02115, USA.
| | - Antoine Weihs
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, 17475 Greifswald, Germany
| | - John Quackenbush
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, MA Boston, 02115, USA; Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, USA; Department of Medicine, Harvard Medical School, Boston, MA 02115, USA
| | - Hans Jörgen Grabe
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, 17475 Greifswald, Germany; German Center for Neurodegenerative Diseases DZNE, Site Rostock/Greifswald, 17475 Greifswald, Germany
| | - Helena U Zacharias
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, 17475 Greifswald, Germany.
| |
Collapse
|
6
|
Emwas AH, Roy R, McKay RT, Tenori L, Saccenti E, Gowda GAN, Raftery D, Alahmari F, Jaremko L, Jaremko M, Wishart DS. NMR Spectroscopy for Metabolomics Research. Metabolites 2019; 9:E123. [PMID: 31252628 PMCID: PMC6680826 DOI: 10.3390/metabo9070123] [Citation(s) in RCA: 612] [Impact Index Per Article: 102.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 06/14/2019] [Accepted: 06/18/2019] [Indexed: 12/14/2022] Open
Abstract
Over the past two decades, nuclear magnetic resonance (NMR) has emerged as one of the three principal analytical techniques used in metabolomics (the other two being gas chromatography coupled to mass spectrometry (GC-MS) and liquid chromatography coupled with single-stage mass spectrometry (LC-MS)). The relative ease of sample preparation, the ability to quantify metabolite levels, the high level of experimental reproducibility, and the inherently nondestructive nature of NMR spectroscopy have made it the preferred platform for long-term or large-scale clinical metabolomic studies. These advantages, however, are often outweighed by the fact that most other analytical techniques, including both LC-MS and GC-MS, are inherently more sensitive than NMR, with lower limits of detection typically being 10 to 100 times better. This review is intended to introduce readers to the field of NMR-based metabolomics and to highlight both the advantages and disadvantages of NMR spectroscopy for metabolomic studies. It will also explore some of the unique strengths of NMR-based metabolomics, particularly with regard to isotope selection/detection, mixture deconvolution via 2D spectroscopy, automation, and the ability to noninvasively analyze native tissue specimens. Finally, this review will highlight a number of emerging NMR techniques and technologies that are being used to strengthen its utility and overcome its inherent limitations in metabolomic applications.
Collapse
Affiliation(s)
- Abdul-Hamid Emwas
- Core Labs, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Raja Roy
- Centre of Biomedical Research, Formerly, Centre of Biomedical Magnetic Resonance, Sanjay Gandhi Post-Graduate Institute of Medical Sciences Campus, Uttar Pradesh 226014, India
| | - Ryan T McKay
- Department of Chemistry, University of Alberta, Edmonton, AB T6G 2W2, Canada
| | - Leonardo Tenori
- Department of Experimental and Clinical Medicine, University of Florence, Largo Brambilla 3, 50134 Florence, Italy
| | - Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology Wageningen University & Research, Stippeneng 4, 6708 WE Wageningen, The Netherlands
| | - G A Nagana Gowda
- Northwest Metabolomics Research Center, Department of Anesthesiology and Pain Medicine, University of Washington, 850 Republican St., Seattle, WA 98109, USA
| | - Daniel Raftery
- Northwest Metabolomics Research Center, Department of Anesthesiology and Pain Medicine, University of Washington, 850 Republican St., Seattle, WA 98109, USA
- Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue, Seattle, WA 98109, USA
| | - Fatimah Alahmari
- Department of NanoMedicine Research, Institute for Research and Medical Consultations (IRMC), Imam Abdulrahman bin Faisal University, Dammam 31441, Saudi Arabia
| | - Lukasz Jaremko
- Division of Biological and Environmental Sciences and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Mariusz Jaremko
- Division of Biological and Environmental Sciences and Engineering (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - David S Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E8, Canada
| |
Collapse
|
7
|
Khatibipour MJ, Kurtoğlu F, Çakır T. JacLy: a Jacobian-based method for the inference of metabolic interactions from the covariance of steady-state metabolome data. PeerJ 2018; 6:e6034. [PMID: 30564518 PMCID: PMC6286809 DOI: 10.7717/peerj.6034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Accepted: 10/30/2018] [Indexed: 11/20/2022] Open
Abstract
Reverse engineering metabolome data to infer metabolic interactions is a challenging research topic. Here we introduce JacLy, a Jacobian-based method to infer metabolic interactions of small networks (<20 metabolites) from the covariance of steady-state metabolome data. The approach was applied to two different in silico small-scale metabolome datasets. The power of JacLy lies on the use of steady-state metabolome data to predict the Jacobian matrix of the system, which is a source of information on structure and dynamic characteristics of the system. Besides its advantage of inferring directed interactions, its superiority over correlation-based network inference was especially clear in terms of the required number of replicates and the effect of the use of priori knowledge in the inference. Additionally, we showed the use of standard deviation of the replicate data as a suitable approximation for the magnitudes of metabolite fluctuations inherent in the system.
Collapse
Affiliation(s)
- Mohammad Jafar Khatibipour
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Gebze, Kocaeli, Turkey.,Department of Chemical Engineering, Gebze Technical University, Gebze, Kocaeli, Turkey
| | - Furkan Kurtoğlu
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Gebze, Kocaeli, Turkey
| | - Tunahan Çakır
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Gebze, Kocaeli, Turkey
| |
Collapse
|
8
|
Rosato A, Tenori L, Cascante M, De Atauri Carulla PR, Martins Dos Santos VAP, Saccenti E. From correlation to causation: analysis of metabolomics data using systems biology approaches. Metabolomics 2018; 14:37. [PMID: 29503602 PMCID: PMC5829120 DOI: 10.1007/s11306-018-1335-y] [Citation(s) in RCA: 135] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Accepted: 01/31/2018] [Indexed: 12/26/2022]
Abstract
INTRODUCTION Metabolomics is a well-established tool in systems biology, especially in the top-down approach. Metabolomics experiments often results in discovery studies that provide intriguing biological hypotheses but rarely offer mechanistic explanation of such findings. In this light, the interpretation of metabolomics data can be boosted by deploying systems biology approaches. OBJECTIVES This review aims to provide an overview of systems biology approaches that are relevant to metabolomics and to discuss some successful applications of these methods. METHODS We review the most recent applications of systems biology tools in the field of metabolomics, such as network inference and analysis, metabolic modelling and pathways analysis. RESULTS We offer an ample overview of systems biology tools that can be applied to address metabolomics problems. The characteristics and application results of these tools are discussed also in a comparative manner. CONCLUSIONS Systems biology-enhanced analysis of metabolomics data can provide insights into the molecular mechanisms originating the observed metabolic profiles and enhance the scientific impact of metabolomics studies.
Collapse
Affiliation(s)
- Antonio Rosato
- Magnetic Resonance Center and Department of Chemistry "Ugo Schiff", University of Florence, Florence, Italy.
| | - Leonardo Tenori
- Department of Experimental and Clinical Medicine, University of Florence, Florence, Italy
| | - Marta Cascante
- CIBER de Enfermedades hepáticas y digestivas (CIBERHD, Madrid) and Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona, Barcelona, Spain
| | - Pedro Ramon De Atauri Carulla
- CIBER de Enfermedades hepáticas y digestivas (CIBERHD, Madrid) and Department of Biochemistry and Molecular Biomedicine, Universitat de Barcelona, Barcelona, Spain
| | - Vitor A P Martins Dos Santos
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, The Netherlands
- LifeGlimmer GmbH, Berlin, Germany
| | - Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, The Netherlands.
| |
Collapse
|
9
|
Schwahn K, Beleggia R, Omranian N, Nikoloski Z. Stoichiometric Correlation Analysis: Principles of Metabolic Functionality from Metabolomics Data. FRONTIERS IN PLANT SCIENCE 2017; 8:2152. [PMID: 29326746 PMCID: PMC5741659 DOI: 10.3389/fpls.2017.02152] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Accepted: 12/05/2017] [Indexed: 06/07/2023]
Abstract
Recent advances in metabolomics technologies have resulted in high-quality (time-resolved) metabolic profiles with an increasing coverage of metabolic pathways. These data profiles represent read-outs from often non-linear dynamics of metabolic networks. Yet, metabolic profiles have largely been explored with regression-based approaches that only capture linear relationships, rendering it difficult to determine the extent to which the data reflect the underlying reaction rates and their couplings. Here we propose an approach termed Stoichiometric Correlation Analysis (SCA) based on correlation between positive linear combinations of log-transformed metabolic profiles. The log-transformation is due to the evidence that metabolic networks can be modeled by mass action law and kinetics derived from it. Unlike the existing approaches which establish a relation between pairs of metabolites, SCA facilitates the discovery of higher-order dependence between more than two metabolites. By using a paradigmatic model of the tricarboxylic acid cycle we show that the higher-order dependence reflects the coupling of concentration of reactant complexes, capturing the subtle difference between the employed enzyme kinetics. Using time-resolved metabolic profiles from Arabidopsis thaliana and Escherichia coli, we show that SCA can be used to quantify the difference in coupling of reactant complexes, and hence, reaction rates, underlying the stringent response in these model organisms. By using SCA with data from natural variation of wild and domesticated wheat and tomato accession, we demonstrate that the domestication is accompanied by loss of such couplings, in these species. Therefore, application of SCA to metabolomics data from natural variation in wild and domesticated populations provides a mechanistic way to understanding domestication and its relation to metabolic networks.
Collapse
Affiliation(s)
- Kevin Schwahn
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Romina Beleggia
- Consiglio per la Ricerca in Agricoltura e L'analisi Dell'economia Agraria, Centro di Ricerca per la Cerealicoltura e le Colture Industriali (CREA-CI), Foggia, Italy
| | - Nooshin Omranian
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
- Center of Plant Systems Biology and Biotechnology, Plovdiv, Bulgaria
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
- Center of Plant Systems Biology and Biotechnology, Plovdiv, Bulgaria
| |
Collapse
|
10
|
Voit EO. The best models of metabolism. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2017; 9:10.1002/wsbm.1391. [PMID: 28544810 PMCID: PMC5643013 DOI: 10.1002/wsbm.1391] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Revised: 03/31/2017] [Accepted: 04/01/2017] [Indexed: 12/25/2022]
Abstract
Biochemical systems are among of the oldest application areas of mathematical modeling. Spanning a time period of over one hundred years, the repertoire of options for structuring a model and for formulating reactions has been constantly growing, and yet, it is still unclear whether or to what degree some models are better than others and how the modeler is to choose among them. In fact, the variety of options has become overwhelming and difficult to maneuver for novices and experts alike. This review outlines the metabolic model design process and discusses the numerous choices for modeling frameworks and mathematical representations. It tries to be inclusive, even though it cannot be complete, and introduces the various modeling options in a manner that is as unbiased as that is feasible. However, the review does end with personal recommendations for the choices of default models. WIREs Syst Biol Med 2017, 9:e1391. doi: 10.1002/wsbm.1391 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
11
|
Suarez-Diez M, Adam J, Adamski J, Chasapi SA, Luchinat C, Peters A, Prehn C, Santucci C, Spyridonidis A, Spyroulias GA, Tenori L, Wang-Sattler R, Saccenti E. Plasma and Serum Metabolite Association Networks: Comparability within and between Studies Using NMR and MS Profiling. J Proteome Res 2017; 16:2547-2559. [PMID: 28517934 PMCID: PMC5645760 DOI: 10.1021/acs.jproteome.7b00106] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
![]()
Blood is one of the most used biofluids
in metabolomics studies,
and the serum and plasma fractions are routinely used as a proxy for
blood itself. Here we investigated the association networks of an
array of 29 metabolites identified and quantified via NMR in the plasma
and serum samples of two cohorts of ∼1000 healthy blood donors
each. A second study of 377 individuals was used to extract plasma
and serum samples from the same individual on which a set of 122 metabolites
were detected and quantified using FIA–MS/MS. Four different
inference algorithms (ARANCE, CLR, CORR, and PCLRC) were used to obtain
consensus networks. The plasma and serum networks obtained from different
studies showed different topological properties with the serum network
being more connected than the plasma network. On a global level, metabolite
association networks from plasma and serum fractions obtained from
the same blood sample of healthy people show similar topologies, and
at a local level, some differences arise like in the case of amino
acids.
Collapse
Affiliation(s)
- Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research , Stippeneng 4, 6708 WE Wageningen, The Netherlands
| | - Jonathan Adam
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Institute of Epidemiology II, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,German Center for Diabetes Research (DZD), Helmholtz Zentrum München , 85764 München-Neuherberg, Germany
| | - Jerzy Adamski
- German Center for Diabetes Research (DZD), Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Institute of Experimental Genetics, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Chair of Experimental Genetics, Center of Life and Food Sciences Weihenstephan, Technische Universität München , 85353 Freising, Germany
| | | | - Claudio Luchinat
- Magnetic Resonance Center (CERM), University of Florence , Via L. Sacconi 6, 50019 Sesto Fiorentino, Italy.,Department of Chemistry, University of Florence , Via della Lastruccia 3, 50019 Sesto Fiorentino, Italy
| | - Annette Peters
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Institute of Epidemiology II, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,German Center for Diabetes Research (DZD), Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Department of Environmental Health, Harvard School of Public Health , Boston, Massachusetts 02115, United States
| | - Cornelia Prehn
- Institute of Experimental Genetics, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany
| | - Claudio Santucci
- Magnetic Resonance Center (CERM), University of Florence , Via L. Sacconi 6, 50019 Sesto Fiorentino, Italy
| | | | | | - Leonardo Tenori
- Department of Experimental and Clinical Medicine, University of Florence , Largo Brambilla 3, 501134 Florence, Italy
| | - Rui Wang-Sattler
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,Institute of Epidemiology II, Helmholtz Zentrum München , 85764 München-Neuherberg, Germany.,German Center for Diabetes Research (DZD), Helmholtz Zentrum München , 85764 München-Neuherberg, Germany
| | - Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research , Stippeneng 4, 6708 WE Wageningen, The Netherlands
| |
Collapse
|
12
|
Sigman ME, Williams MR. Assessing evidentiary value in fire debris analysis by chemometric and likelihood ratio approaches. Forensic Sci Int 2016; 264:113-21. [DOI: 10.1016/j.forsciint.2016.03.051] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2015] [Revised: 03/18/2016] [Accepted: 03/25/2016] [Indexed: 11/16/2022]
|
13
|
Information theory in systems biology. Part I: Gene regulatory and metabolic networks. Semin Cell Dev Biol 2015; 51:3-13. [PMID: 26701126 DOI: 10.1016/j.semcdb.2015.12.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 12/07/2015] [Indexed: 11/22/2022]
Abstract
"A Mathematical Theory of Communication", was published in 1948 by Claude Shannon to establish a framework that is now known as information theory. In recent decades, information theory has gained much attention in the area of systems biology. The aim of this paper is to provide a systematic review of those contributions that have applied information theory in inferring or understanding of biological systems. Based on the type of system components and the interactions between them, we classify the biological systems into 4 main classes: gene regulatory, metabolic, protein-protein interaction and signaling networks. In the first part of this review, we attempt to introduce most of the existing studies on two types of biological networks, including gene regulatory and metabolic networks, which are founded on the concepts of information theory.
Collapse
|
14
|
Suarez-Diez M, Saccenti E. Effects of Sample Size and Dimensionality on the Performance of Four Algorithms for Inference of Association Networks in Metabonomics. J Proteome Res 2015; 14:5119-30. [PMID: 26496246 DOI: 10.1021/acs.jproteome.5b00344] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
We investigated the effect of sample size and dimensionality on the performance of four algorithms (ARACNE, CLR, CORR, and PCLRC) when they are used for the inference of metabolite association networks. We report that as many as 100-400 samples may be necessary to obtain stable network estimations, depending on the algorithm and the number of measured metabolites. The CLR and PCLRC methods produce similar results, whereas network inference based on correlations provides sparse networks; we found ARACNE to be unsuitable for this application, being unable to recover the underlying metabolite association network. We recommend the PCLRC algorithm for the inference on metabolite association networks.
Collapse
Affiliation(s)
- Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University and Research Center , Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| | - Edoardo Saccenti
- Laboratory of Systems and Synthetic Biology, Wageningen University and Research Center , Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| |
Collapse
|
15
|
Lopatka M, Sigman ME, Sjerps MJ, Williams MR, Vivó-Truyols G. Class-conditional feature modeling for ignitable liquid classification with substantial substrate contribution in fire debris analysis. Forensic Sci Int 2015; 252:177-86. [DOI: 10.1016/j.forsciint.2015.04.035] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Revised: 04/24/2015] [Accepted: 04/30/2015] [Indexed: 11/24/2022]
|
16
|
Cakır T, Khatibipour MJ. Metabolic network discovery by top-down and bottom-up approaches and paths for reconciliation. Front Bioeng Biotechnol 2014; 2:62. [PMID: 25520953 PMCID: PMC4253960 DOI: 10.3389/fbioe.2014.00062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 11/14/2014] [Indexed: 11/13/2022] Open
Abstract
The primary focus in the network-centric analysis of cellular metabolism by systems biology approaches is to identify the active metabolic network for the condition of interest. Two major approaches are available for the discovery of the condition-specific metabolic networks. One approach starts from genome-scale metabolic networks, which cover all possible reactions known to occur in the related organism in a condition-independent manner, and applies methods such as the optimization-based Flux-Balance Analysis to elucidate the active network. The other approach starts from the condition-specific metabolome data, and processes the data with statistical or optimization-based methods to extract information content of the data such that the active network is inferred. These approaches, termed bottom-up and top-down, respectively, are currently employed independently. However, considering that both approaches have the same goal, they can both benefit from each other paving the way for the novel integrative analysis methods of metabolome data- and flux-analysis approaches in the post-genomic era. This study reviews the strengths of constraint-based analysis and network inference methods reported in the metabolic systems biology field; then elaborates on the potential paths to reconcile the two approaches to shed better light on how the metabolism functions.
Collapse
Affiliation(s)
- Tunahan Cakır
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey
| | - Mohammad Jafar Khatibipour
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey ; Department of Chemical Engineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey
| |
Collapse
|
17
|
Cazzaniga P, Damiani C, Besozzi D, Colombo R, Nobile MS, Gaglio D, Pescini D, Molinari S, Mauri G, Alberghina L, Vanoni M. Computational strategies for a system-level understanding of metabolism. Metabolites 2014; 4:1034-87. [PMID: 25427076 PMCID: PMC4279158 DOI: 10.3390/metabo4041034] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 11/05/2014] [Accepted: 11/12/2014] [Indexed: 12/20/2022] Open
Abstract
Cell metabolism is the biochemical machinery that provides energy and building blocks to sustain life. Understanding its fine regulation is of pivotal relevance in several fields, from metabolic engineering applications to the treatment of metabolic disorders and cancer. Sophisticated computational approaches are needed to unravel the complexity of metabolism. To this aim, a plethora of methods have been developed, yet it is generally hard to identify which computational strategy is most suited for the investigation of a specific aspect of metabolism. This review provides an up-to-date description of the computational methods available for the analysis of metabolic pathways, discussing their main advantages and drawbacks. In particular, attention is devoted to the identification of the appropriate scale and level of accuracy in the reconstruction of metabolic networks, and to the inference of model structure and parameters, especially when dealing with a shortage of experimental measurements. The choice of the proper computational methods to derive in silico data is then addressed, including topological analyses, constraint-based modeling and simulation of the system dynamics. A description of some computational approaches to gain new biological knowledge or to formulate hypotheses is finally provided.
Collapse
Affiliation(s)
- Paolo Cazzaniga
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Chiara Damiani
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Besozzi
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Riccardo Colombo
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco S Nobile
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Gaglio
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Dario Pescini
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Sara Molinari
- Dipartimento di Biotecnologie e Bioscienze, Università degli Studi di Milano-Bicocca, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Giancarlo Mauri
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Lilia Alberghina
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco Vanoni
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| |
Collapse
|
18
|
Kaduk M, Hoefsloot HC, Vis DJ, Reijmers T, van der Greef J, Smilde AK, Hendriks MM. Correlated measurement error hampers association network inference. J Chromatogr B Analyt Technol Biomed Life Sci 2014; 966:93-9. [DOI: 10.1016/j.jchromb.2014.04.048] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 04/18/2014] [Accepted: 04/24/2014] [Indexed: 10/25/2022]
|
19
|
Tarlak F, Sadıkoğlu H, Çakır T. The role of flexibility and optimality in the prediction of intracellular fluxes of microbial central carbon metabolism. MOLECULAR BIOSYSTEMS 2014; 10:2459-65. [DOI: 10.1039/c4mb00117f] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
20
|
Kurt Z, Aydin N, Altay G. A comprehensive comparison of association estimators for gene network inference algorithms. Bioinformatics 2014; 30:2142-9. [DOI: 10.1093/bioinformatics/btu182] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
|
21
|
Öksüz M, Sadıkoğlu H, Çakır T. Sparsity as cellular objective to infer directed metabolic networks from steady-state metabolome data: a theoretical analysis. PLoS One 2013; 8:e84505. [PMID: 24391961 PMCID: PMC3877278 DOI: 10.1371/journal.pone.0084505] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 11/21/2013] [Indexed: 12/11/2022] Open
Abstract
Since metabolome data are derived from the underlying metabolic network, reverse engineering of such data to recover the network topology is of wide interest. Lyapunov equation puts a constraint to the link between data and network by coupling the covariance of data with the strength of interactions (Jacobian matrix). This equation, when expressed as a linear set of equations at steady state, constitutes a basis to infer the network structure given the covariance matrix of data. The sparse structure of metabolic networks points to reactions which are active based on minimal enzyme production, hinting at sparsity as a cellular objective. Therefore, for a given covariance matrix, we solved Lyapunov equation to calculate Jacobian matrix by a simultaneous use of minimization of Euclidean norm of residuals and maximization of sparsity (the number of zeros in Jacobian matrix) as objective functions to infer directed small-scale networks from three kingdoms of life (bacteria, fungi, mammalian). The inference performance of the approach was found to be promising, with zero False Positive Rate, and almost one True positive Rate. The effect of missing data on results was additionally analyzed, revealing superiority over similarity-based approaches which infer undirected networks. Our findings suggest that the covariance of metabolome data implies an underlying network with sparsest pattern. The theoretical analysis forms a framework for further investigation of sparsity-based inference of metabolic networks from real metabolome data.
Collapse
Affiliation(s)
- Melik Öksüz
- Department of Bioengineering, Gebze Institute of Technology, Gebze, Kocaeli, Turkey
- Department of Chemical Engineering, Gebze Institute of Technology, Gebze, Kocaeli, Turkey
| | - Hasan Sadıkoğlu
- Department of Chemical Engineering, Gebze Institute of Technology, Gebze, Kocaeli, Turkey
| | - Tunahan Çakır
- Department of Bioengineering, Gebze Institute of Technology, Gebze, Kocaeli, Turkey
- * E-mail:
| |
Collapse
|
22
|
Villaverde AF, Ross J, Banga JR. Reverse engineering cellular networks with information theoretic methods. Cells 2013; 2:306-29. [PMID: 24709703 PMCID: PMC3972682 DOI: 10.3390/cells2020306] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Revised: 04/22/2013] [Accepted: 04/27/2013] [Indexed: 11/16/2022] Open
Abstract
Building mathematical models of cellular networks lies at the core of systems biology. It involves, among other tasks, the reconstruction of the structure of interactions between molecular components, which is known as network inference or reverse engineering. Information theory can help in the goal of extracting as much information as possible from the available data. A large number of methods founded on these concepts have been proposed in the literature, not only in biology journals, but in a wide range of areas. Their critical comparison is difficult due to the different focuses and the adoption of different terminologies. Here we attempt to review some of the existing information theoretic methodologies for network inference, and clarify their differences. While some of these methods have achieved notable success, many challenges remain, among which we can mention dealing with incomplete measurements, noisy data, counterintuitive behaviour emerging from nonlinear relations or feedback loops, and computational burden of dealing with large data sets.
Collapse
Affiliation(s)
| | - John Ross
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA.
| | - Julio R Banga
- Bioprocess Engineering Group, IIM-CSIC, Eduardo Cabello 6, Vigo 36208, Spain.
| |
Collapse
|
23
|
Bartel J, Krumsiek J, Theis FJ. Statistical methods for the analysis of high-throughput metabolomics data. Comput Struct Biotechnol J 2013; 4:e201301009. [PMID: 24688690 PMCID: PMC3962125 DOI: 10.5936/csbj.201301009] [Citation(s) in RCA: 185] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 03/05/2013] [Accepted: 03/07/2013] [Indexed: 11/24/2022] Open
Abstract
Metabolomics is a relatively new high-throughput technology that aims at measuring all endogenous metabolites within a biological sample in an unbiased fashion. The resulting metabolic profiles may be regarded as functional signatures of the physiological state, and have been shown to comprise effects of genetic regulation as well as environmental factors. This potential to connect genotypic to phenotypic information promises new insights and biomarkers for different research fields, including biomedical and pharmaceutical research. In the statistical analysis of metabolomics data, many techniques from other omics fields can be reused. However recently, a number of tools specific for metabolomics data have been developed as well. The focus of this mini review will be on recent advancements in the analysis of metabolomics data especially by utilizing Gaussian graphical models and independent component analysis.
Collapse
Affiliation(s)
- Jörg Bartel
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Jan Krumsiek
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Fabian J Theis
- Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany ; Department of Mathematics, Technische Universität München, Boltzmannstr. 3, 85747 Garching, Germany
| |
Collapse
|
24
|
Cicek AE, Bederman I, Henderson L, Drumm ML, Ozsoyoglu G. ADEMA: an algorithm to determine expected metabolite level alterations using mutual information. PLoS Comput Biol 2013; 9:e1002859. [PMID: 23341761 PMCID: PMC3547803 DOI: 10.1371/journal.pcbi.1002859] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2012] [Accepted: 10/23/2012] [Indexed: 01/07/2023] Open
Abstract
Metabolomics is a relatively new “omics” platform, which analyzes a discrete set of metabolites detected in bio-fluids or tissue samples of organisms. It has been used in a diverse array of studies to detect biomarkers and to determine activity rates for pathways based on changes due to disease or drugs. Recent improvements in analytical methodology and large sample throughput allow for creation of large datasets of metabolites that reflect changes in metabolic dynamics due to disease or a perturbation in the metabolic network. However, current methods of comprehensive analyses of large metabolic datasets (metabolomics) are limited, unlike other “omics” approaches where complex techniques for analyzing coexpression/coregulation of multiple variables are applied. This paper discusses the shortcomings of current metabolomics data analysis techniques, and proposes a new multivariate technique (ADEMA) based on mutual information to identify expected metabolite level changes with respect to a specific condition. We show that ADEMA better predicts De Novo Lipogenesis pathway metabolite level changes in samples with Cystic Fibrosis (CF) than prediction based on the significance of individual metabolite level changes. We also applied ADEMA's classification scheme on three different cohorts of CF and wildtype mice. ADEMA was able to predict whether an unknown mouse has a CF or a wildtype genotype with 1.0, 0.84, and 0.9 accuracy for each respective dataset. ADEMA results had up to 31% higher accuracy as compared to other classification algorithms. In conclusion, ADEMA advances the state-of-the-art in metabolomics analysis, by providing accurate and interpretable classification results. Metabolomics is an experimental approach that analyzes differences in metabolite levels detected in experimental samples. It has been used in the literature to understand the changes in metabolism with respect to diseases or drugs. Unlike transcriptomics or proteomics, which analyze gene and protein expression levels respectively, the techniques that consider co-regulation of multiple metabolites are quite limited. In this paper, we propose a novel technique, called ADEMA, which computes the expected level changes for each metabolite with respect to a given condition. ADEMA considers multiple metabolites at the same time and is mutual information (MI)-based. We show that ADEMA predicts metabolite level changes for young mice with Cystic Fibrosis (CF) better than significance testing that considers one metabolite at a time. Using three different datasets that contain CF and wild-type (WT) mice, we show that ADEMA can classify an individual as being CF or WT based on the metabolic profiles (with 1.0, 0.84, and 0.9 accuracy, respectively). Compared to other well-known classification algorithms, ADEMA's accuracy is higher by up to 31%.
Collapse
Affiliation(s)
- A Ercument Cicek
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, Ohio, USA.
| | | | | | | | | |
Collapse
|
25
|
Lecca P, Priami C. Biological network inference for drug discovery. Drug Discov Today 2012; 18:256-64. [PMID: 23147668 DOI: 10.1016/j.drudis.2012.11.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Revised: 10/04/2012] [Accepted: 11/05/2012] [Indexed: 12/31/2022]
Abstract
A better understanding of the pathophysiology should help deliver drugs whose targets are involved in the causative processes underlying a disease. Biological network inference uses computational methods for deducing from high-throughput experimental data, the topology and the causal structure of the interactions among the drugs and their targets. Therefore, biological network inference can support and contribute to the experimental identification of both gene and protein networks causing a disease as well as the biochemical networks of drugs metabolism and mechanisms of action. The resulting high-level networks serve as a foundational basis for more detailed mechanistic models and are increasingly used in drug discovery by pharmaceutical and biotechnology companies. We review and compare recent computational technologies for network inference applied to drug discovery.
Collapse
Affiliation(s)
- Paola Lecca
- The Microsoft Research, University of Trento, Centre for Computational and Systems Biology, Piazza Manifattura 1 - 38068 Rovereto, Italy.
| | | |
Collapse
|
26
|
Vertès AA, Inui M, Yukawa H. Postgenomic Approaches to Using Corynebacteria as Biocatalysts. Annu Rev Microbiol 2012; 66:521-50. [DOI: 10.1146/annurev-micro-010312-105506] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Alain A. Vertès
- Research Institute of Innovative Technology for the Earth, Kizugawadai, Kizugawa, Kyoto 619-0292, Japan;
| | - Masayuki Inui
- Research Institute of Innovative Technology for the Earth, Kizugawadai, Kizugawa, Kyoto 619-0292, Japan;
| | - Hideaki Yukawa
- Research Institute of Innovative Technology for the Earth, Kizugawadai, Kizugawa, Kyoto 619-0292, Japan;
| |
Collapse
|
27
|
Sugimoto M, Kawakami M, Robert M, Soga T, Tomita M. Bioinformatics Tools for Mass Spectroscopy-Based Metabolomic Data Processing and Analysis. Curr Bioinform 2012; 7:96-108. [PMID: 22438836 PMCID: PMC3299976 DOI: 10.2174/157489312799304431] [Citation(s) in RCA: 199] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2011] [Revised: 10/25/2011] [Accepted: 12/07/2011] [Indexed: 01/04/2023]
Abstract
Biological systems are increasingly being studied in a holistic manner, using omics approaches, to provide quantitative and qualitative descriptions of the diverse collection of cellular components. Among the omics approaches, metabolomics, which deals with the quantitative global profiling of small molecules or metabolites, is being used extensively to explore the dynamic response of living systems, such as organelles, cells, tissues, organs and whole organisms, under diverse physiological and pathological conditions. This technology is now used routinely in a number of applications, including basic and clinical research, agriculture, microbiology, food science, nutrition, pharmaceutical research, environmental science and the development of biofuels. Of the multiple analytical platforms available to perform such analyses, nuclear magnetic resonance and mass spectrometry have come to dominate, owing to the high resolution and large datasets that can be generated with these techniques. The large multidimensional datasets that result from such studies must be processed and analyzed to render this data meaningful. Thus, bioinformatics tools are essential for the efficient processing of huge datasets, the characterization of the detected signals, and to align multiple datasets and their features. This paper provides a state-of-the-art overview of the data processing tools available, and reviews a collection of recent reports on the topic. Data conversion, pre-processing, alignment, normalization and statistical analysis are introduced, with their advantages and disadvantages, and comparisons are made to guide the reader.
Collapse
Affiliation(s)
- Masahiro Sugimoto
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa 252-8520, Japan
- Graduate School of Medicine and Faculty of Medicine Kyoto University, Yoshida-Konoe-cho, Sakyo-ku, Kyoto 606-8501, Japan
| | - Masato Kawakami
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Martin Robert
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Tomoyoshi Soga
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| | - Masaru Tomita
- Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan
- Department of Environment and Information Studies, Keio University, Fujisawa, Kanagawa 252-8520, Japan
| |
Collapse
|
28
|
Liu L, Zhu Y, Li J, Wang M, Lee P, Du G, Chen J. Microbial production of propionic acid from propionibacteria: current state, challenges and perspectives. Crit Rev Biotechnol 2012; 32:374-81. [PMID: 22299651 DOI: 10.3109/07388551.2011.651428] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Propionic acid (PA) is an important building block chemical and finds a variety of applications in organic synthesis, food, feeding stuffs, perfume, paint and pharmaceutical industries. Presently, PA is mainly produced by petrochemical route. With the continuous increase in oil prices, public concern about environmental pollution, and the consumers' desire for bio-based natural and green ingredients in foods and pharmaceuticals, PA production from propionibacteria has attracted considerable attention, and substantial progresses have been made on microbial PA production. However, production of PA by propionibacteria is facing challenges such as severe inhibition of end-products during cell growth and the formation of by-products (acetic acid and succinic acid). The integration of reverse metabolic engineering and systematic metabolic engineering provides an opportunity to significantly improve the acid tolerance of propionibacteria and reduce the formation of by-products, and makes it feasible to strengthen the commercial competition of biotechnological PA production from propionibacteria to be comparable to the petrochemical route.
Collapse
Affiliation(s)
- Long Liu
- Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | | | | | | | | | | | | |
Collapse
|
29
|
Metabolomics-assisted synthetic biology. Curr Opin Biotechnol 2011; 23:22-8. [PMID: 22104721 DOI: 10.1016/j.copbio.2011.10.014] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Revised: 10/27/2011] [Accepted: 10/30/2011] [Indexed: 12/19/2022]
Abstract
As the world progresses from a fossil-fuel based economy to a more sustainable one, synthetic biology will become increasingly important for the production of high-value fine chemicals as well as low-value commodities in bulk. The integration of metabolomics and fluxomics within synthetic biology projects will be vital at all levels, including the initial design of the pathways to be generated, through to the optimisation of those pathways so that more efficient conversion of low-cost starting materials into highly desirable products can be achieved. This review highlights these areas and details the most important and exciting advances being made in this area.
Collapse
|
30
|
Hendriks MM, Eeuwijk FA, Jellema RH, Westerhuis JA, Reijmers TH, Hoefsloot HC, Smilde AK. Data-processing strategies for metabolomics studies. Trends Analyt Chem 2011. [DOI: 10.1016/j.trac.2011.04.019] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
31
|
Hendrickx DM, Hendriks MMWB, Eilers PHC, Smilde AK, Hoefsloot HCJ. Reverse engineering of metabolic networks, a critical assessment. MOLECULAR BIOSYSTEMS 2010; 7:511-20. [PMID: 21069230 DOI: 10.1039/c0mb00083c] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Inferring metabolic networks from metabolite concentration data is a central topic in systems biology. Mathematical techniques to extract information about the network from data have been proposed in the literature. This paper presents a critical assessment of the feasibility of reverse engineering of metabolic networks, illustrated with a selection of methods. Appropriate data are simulated to study the performance of four representative methods. An overview of sampling and measurement methods currently in use for generating time-resolved metabolomics data is given and contrasted with the needs of the discussed reverse engineering methods. The results of this assessment show that if full inference of a real-world metabolic network is the goal there is a large discrepancy between the requirements of reverse engineering of metabolic networks and contemporary measurement practice. Recommendations for improved time-resolved experimental designs are given.
Collapse
Affiliation(s)
- Diana M Hendrickx
- Biosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, The Netherlands.
| | | | | | | | | |
Collapse
|
32
|
Soltow QA, Jones DP, Promislow DEL. A network perspective on metabolism and aging. Integr Comp Biol 2010; 50:844-54. [PMID: 21031036 DOI: 10.1093/icb/icq094] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Aging affects a myriad of genetic, biochemical, and metabolic processes, and efforts to understand the underlying molecular basis of aging are often thwarted by the complexity of the aging process. By taking a systems biology approach, network analysis is well-suited to study the decline in function with age. Network analysis has already been utilized in describing other complex processes such as development, evolution, and robustness. Networks of gene expression and protein-protein interaction have provided valuable insight into the loss of connectivity and network structure throughout lifespan. Here, we advocate the use of metabolic networks to expand the work from genomics and proteomics. As metabolism is the final fingerprint of functionality and has been implicated in multiple theories of aging, metabolomic methods combined with metabolite network analyses should pave the way to investigate how relationships of metabolites change with age and how these interactions affect phenotype and function of the aging individual. The metabolomic network approaches highlighted in this review are fundamental for an understanding of systematic declines and of failure to function with age.
Collapse
Affiliation(s)
- Quinlyn A Soltow
- Department of Medicine, Emory University, Atlanta, GA 30322, USA.
| | | | | |
Collapse
|
33
|
Dauner M. From fluxes and isotope labeling patterns towards in silico cells. Curr Opin Biotechnol 2010; 21:55-62. [DOI: 10.1016/j.copbio.2010.01.014] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2009] [Revised: 01/23/2010] [Accepted: 01/31/2010] [Indexed: 10/19/2022]
|