1
|
De Filippis GM, Amalfitano D, Russo C, Tommasino C, Rinaldi AM. A systematic mapping study of semantic technologies in multi-omics data integration. J Biomed Inform 2025; 165:104809. [PMID: 40154721 DOI: 10.1016/j.jbi.2025.104809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2024] [Revised: 02/03/2025] [Accepted: 03/07/2025] [Indexed: 04/01/2025]
Abstract
OBJECTIVE The integration of multi-omics data is essential for understanding complex biological systems, providing insights beyond single-omics approaches. However, challenges related to data heterogeneity, standardization, and computational scalability persist. This study explores the interdisciplinary application of semantic technologies to enhance data integration, standardization, and analysis in multi-omics research. METHODS We performed a systematic mapping study assessing literature from 2014 to 2024, focusing on the utilization of ontologies, knowledge graphs, and graph-based methods for multi-omics integration. RESULTS Our findings indicate a growing number of publications in this field, predominantly appearing in high-impact journals. The deployment of semantic technologies has notably improved data visualization, querying, and management, thus enhancing gene and pathway discovery, and providing deeper disease insights and more accurate predictive modeling. CONCLUSION The study underscores the significance of semantic technologies in overcoming multi-omics integration challenges. Future research should focus on integrating diverse data types, developing advanced computational tools, and incorporating AI and machine learning to foster personalized medicine applications.
Collapse
Affiliation(s)
- Giovanni Maria De Filippis
- Department of Electrical Engineering and Information Technology DIETI, University of Naples Federico II, Via Claudio, 21, Naples, 80125, Italy.
| | - Domenico Amalfitano
- Department of Electrical Engineering and Information Technology DIETI, University of Naples Federico II, Via Claudio, 21, Naples, 80125, Italy.
| | - Cristiano Russo
- Department of Electrical Engineering and Information Technology DIETI, University of Naples Federico II, Via Claudio, 21, Naples, 80125, Italy.
| | - Cristian Tommasino
- Department of Electrical Engineering and Information Technology DIETI, University of Naples Federico II, Via Claudio, 21, Naples, 80125, Italy.
| | - Antonio Maria Rinaldi
- Department of Electrical Engineering and Information Technology DIETI, University of Naples Federico II, Via Claudio, 21, Naples, 80125, Italy.
| |
Collapse
|
2
|
Razalli II, Abdullah-Zawawi MR, Tamizi AA, Harun S, Zainal-Abidin RA, Jalal MIA, Ullah MA, Zainal Z. Accelerating crop improvement via integration of transcriptome-based network biology and genome editing. PLANTA 2025; 261:92. [PMID: 40095140 DOI: 10.1007/s00425-025-04666-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 03/03/2025] [Indexed: 03/19/2025]
Abstract
MAIN CONCLUSION Big data and network biology infer functional coupling between genes. In combination with machine learning, network biology can dramatically accelerate the pace of gene discovery using modern transcriptomics approaches and be validated via genome editing technology for improving crops to stresses. Unlike other living things, plants are sessile and frequently face various environmental challenges due to climate change. The cumulative effects of combined stresses can significantly influence both plant growth and yields. In navigating the complexities of climate change, ensuring the nourishment of our growing population hinges on implementing precise agricultural systems. Conventional breeding methods have been commonly employed; however, their efficacy has been impeded by limitations in terms of time, cost, and infrastructure. Cutting-edge tools focussing on big data are being championed to usher in a new era in stress biology, aiming to cultivate crops that exhibit enhanced resilience to multifactorial stresses. Transcriptomics, combined with network biology and machine learning, is proving to be a powerful approach for identifying potential genes to target for gene editing, specifically to enhance stress tolerance. The integration of transcriptomic data with genome editing can yield significant benefits, such as gaining insights into gene function by modifying or manipulating of specific genes in the target plant. This review provides valuable insights into the use of transcriptomics platforms and the application of biological network analysis and machine learning in the discovery of novel genes, thereby enhancing the understanding of plant responses to combined or sequential stress. The transcriptomics as a forefront omics platform and how it is employed through biological networks and machine learning that lead to novel gene discoveries for producing multi-stress-tolerant crops, limitations, and future directions have also been discussed.
Collapse
Affiliation(s)
- Izreen Izzati Razalli
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia
| | - Muhammad-Redha Abdullah-Zawawi
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Jalan Ya'acob Latiff, Bandar Tun Razak, 56000, Cheras, Kuala Lumpur, Malaysia
| | - Amin-Asyraf Tamizi
- Malaysian Agricultural Research and Development Institute (MARDI), 43400, Serdang, Selangor, Malaysia
| | - Sarahani Harun
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia
| | | | - Muhammad Irfan Abdul Jalal
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Jalan Ya'acob Latiff, Bandar Tun Razak, 56000, Cheras, Kuala Lumpur, Malaysia
| | - Mohammad Asad Ullah
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia
- Bangladesh Institute of Nuclear Agriculture (BINA), BAU Campus, Mymensingh, 2202, Bangladesh
| | - Zamri Zainal
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.
| |
Collapse
|
3
|
Park SY, Kwon O, van den Broek T, Bouwman J, Kim JY. A toolkit for quantifying individual response to herbal extracts in metabolic and inflammatory stress. NPJ Sci Food 2025; 9:14. [PMID: 39900584 PMCID: PMC11791183 DOI: 10.1038/s41538-024-00354-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 12/16/2024] [Indexed: 02/05/2025] Open
Abstract
This study developed a health assessment tool to analyze dynamic stress responses and resilience with the PhenFlex challenge. This study integrated a health space model and machine learning to quantify and visualize the impact of herbal extracts on inflammatory and metabolic health at the individual level. Two randomized, double-blind, placebo-controlled crossover trials were conducted involving participants with PhenFlex challenge after overnight fasting. Blood samples were collected, and a machine learning algorithm was used to predict health estimation scores based on metabolic and inflammatory responses. The resulting health space model visually represents individuals' health status in a 2-D space. Intervention with herbal extracts (e.g., Angelica keiskei, AK, and Capsosiphon fulvescens, CF) resulted in higher health scores in the health space, indicating improved health. This research emphasizes the quantification of phenotypic changes for personalized nutrition and health optimization. Overall, this study provides a valuable toolkit for validating herbal extract efficacy and extends its application to personalized nutrition.
Collapse
Affiliation(s)
- Soo-Yeon Park
- Department of Food Science and Biotechnology, Seoul National University of Science and Technology, Seoul, Republic of Korea
| | - Oran Kwon
- Department of Nutritional Science and Food Management, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, Republic of Korea
- Logme Inc., Seoul, Republic of Korea
| | - Tim van den Broek
- The Netherlands Organization for Applied Scientific Research (TNO), Leiden, The Netherlands
| | - Jildau Bouwman
- The Netherlands Organization for Applied Scientific Research (TNO), Leiden, The Netherlands
| | - Ji Yeon Kim
- Department of Food Science and Biotechnology, Seoul National University of Science and Technology, Seoul, Republic of Korea.
- Department of Nano Bio Engineering, Seoul National University of Science and Technology, Seoul, Republic of Korea.
| |
Collapse
|
4
|
Dieckmann L, Lahti-Pulkkinen M, Cruceanu C, Räikkönen K, Binder EB, Czamara D. Quantitative trait locus mapping in placenta: A comparative study of chorionic villus and birth placenta. HGG ADVANCES 2024; 5:100326. [PMID: 38993113 PMCID: PMC11365441 DOI: 10.1016/j.xhgg.2024.100326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 07/08/2024] [Accepted: 07/08/2024] [Indexed: 07/13/2024] Open
Abstract
The placenta, a pivotal player in the prenatal environment, holds crucial insights into early developmental pathways and future health outcomes. In this study, we explored genetic molecular regulation in chorionic villus samples (CVS) from the first trimester and placenta tissue at birth. We assessed quantitative trait locus (QTL) mapping on DNA methylation and gene expression data in a Finnish cohort of 574 individuals. We found more QTLs in birth placenta than in first-trimester placenta. Nevertheless, a substantial amount of associations overlapped in their effects and showed consistent direction in both tissues, with increasing molecular genetic effects from early pregnancy to birth placenta. The identified QTLs in birth placenta were most enriched in genes with placenta-specific expression. Conducting a phenome-wide-association study (PheWAS) on the associated SNPs, we observed numerous overlaps with genome-wide association study (GWAS) hits (spanning 57 distinct traits and 23 SNPs), with notable enrichments for immunological, skeletal, and respiratory traits. The QTL-SNP rs1737028 (chr6:29737993) presented with the highest number of GWAS hits. This SNP was related to HLA-G expression via DNA methylation and was associated with various immune, respiratory, and psychiatric traits. Our findings implicate increasing genetic molecular regulation during the course of pregnancy and support the involvement of placenta gene regulation, particularly in immunological traits. This study presents a framework for understanding placenta-specific gene regulation during pregnancy and its connection to health-related traits.
Collapse
Affiliation(s)
- Linda Dieckmann
- Department Genes and Environment, Max Planck Institute of Psychiatry, 80804 Munich, Germany; International Max Planck Research School for Translational Psychiatry, 80804 Munich, Germany
| | - Marius Lahti-Pulkkinen
- Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland; Finnish Institute for Health and Welfare, 00271 Helsinki, Finland; Centre for Cardiovascular Science, Queen's Medical Research Institute, University of Edinburgh, Edinburgh EH8 9YL, UK
| | - Cristiana Cruceanu
- Department of Physiology and Pharmacology, Karolinska Institute, 17177 Stockholm, Sweden
| | - Katri Räikkönen
- Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland; Department of Obstetrics and Gynecology, Helsinki University Hospital, University of Helsinki, 00014 Helsinki, Finland
| | - Elisabeth B Binder
- Department Genes and Environment, Max Planck Institute of Psychiatry, 80804 Munich, Germany; Department of Psychiatry and Behavioral Sciences, School of Medicine, Emory University, Atlanta, GA 30322, USA.
| | - Darina Czamara
- Department Genes and Environment, Max Planck Institute of Psychiatry, 80804 Munich, Germany.
| |
Collapse
|
5
|
Kobel CM, Merkesvik J, Burgos IMT, Lai W, Øyås O, Pope PB, Hvidsten TR, Aho VTE. Integrating host and microbiome biology using holo-omics. Mol Omics 2024; 20:438-452. [PMID: 38963125 DOI: 10.1039/d4mo00017j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
Holo-omics is the use of omics data to study a host and its inherent microbiomes - a biological system known as a "holobiont". A microbiome that exists in such a space often encounters habitat stability and in return provides metabolic capacities that can benefit their host. Here we present an overview of beneficial host-microbiome systems and propose and discuss several methodological frameworks that can be used to investigate the intricacies of the many as yet undefined host-microbiome interactions that influence holobiont homeostasis. While this is an emerging field, we anticipate that ongoing methodological advancements will enhance the biological resolution that is necessary to improve our understanding of host-microbiome interplay to make meaningful interpretations and biotechnological applications.
Collapse
Affiliation(s)
- Carl M Kobel
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
| | - Jenny Merkesvik
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | | | - Wanxin Lai
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Ove Øyås
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
| | - Phillip B Pope
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
- Centre for Microbiome Research, School of Biomedical Sciences, Queensland University of Technology (QUT), Translational Research Institute, Woolloongabba, Queensland, Australia
| | - Torgeir R Hvidsten
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Velma T E Aho
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
| |
Collapse
|
6
|
Dong L, Zhang Y, Fu B, Swart C, Jiang H, Liu Y, Huggett J, Wielgosz R, Niu C, Li Q, Zhang Y, Park SR, Sui Z, Yu L, Liu Y, Xie Q, Zhang H, Yang Y, Dai X, Shi L, Yin Y, Fang X. Reliable biological and multi-omics research through biometrology. Anal Bioanal Chem 2024; 416:3645-3663. [PMID: 38507042 DOI: 10.1007/s00216-024-05239-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 02/28/2024] [Accepted: 03/04/2024] [Indexed: 03/22/2024]
Abstract
Metrology is the science of measurement and its applications, whereas biometrology is the science of biological measurement and its applications. Biometrology aims to achieve accuracy and consistency of biological measurements by focusing on the development of metrological traceability, biological reference measurement procedures, and reference materials. Irreproducibility of biological and multi-omics research results from different laboratories, platforms, and analysis methods is hampering the translation of research into clinical uses and can often be attributed to the lack of biologists' attention to the general principles of metrology. In this paper, the progresses of biometrology including metrology on nucleic acid, protein, and cell measurements and its impacts on the improvement of reliability and comparability in biological research are reviewed. Challenges in obtaining more reliable biological and multi-omics measurements due to the lack of primary reference measurement procedures and new standards for biological reference materials faced by biometrology are discussed. In the future, in addition to establishing reliable reference measurement procedures, developing reference materials from single or multiple parameters to multi-omics scale should be emphasized. Thinking in way of biometrology is warranted for facilitating the translation of high-throughput omics research into clinical practices.
Collapse
Affiliation(s)
- Lianhua Dong
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China.
| | - Yu Zhang
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Boqiang Fu
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Claudia Swart
- Physikalisch-Technische Bundesanstalt, 38116, Braunschweig, Germany
| | | | - Yahui Liu
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Jim Huggett
- National Measurement Laboratory at LGC (NML), Teddington, Middlesex, UK
| | - Robert Wielgosz
- Bureau International Des Poids Et Mesures (BIPM), Pavillon de Breteuil, 92312, Sèvres Cedex, France
| | - Chunyan Niu
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Qianyi Li
- BGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Yongzhuo Zhang
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Sang-Ryoul Park
- Korea Research Institute of Standards and Science, Daejeon, Republic of Korea
| | - Zhiwei Sui
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | - Lianchao Yu
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China
| | | | - Qing Xie
- BGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Hongfu Zhang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | | | - Xinhua Dai
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China.
| | - Leming Shi
- State Key Laboratory of Genetic Engineering, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, 200438, China
| | - Ye Yin
- BGI, BGI-Shenzhen, Shenzhen, 518083, China.
| | - Xiang Fang
- Center for Advanced Measurement of Science, National Institute of Metrology, Beijing, 100029, China.
| |
Collapse
|
7
|
Holbrook-Smith D, Trouillon J, Sauer U. Metabolomics and Microbial Metabolism: Toward a Systematic Understanding. Annu Rev Biophys 2024; 53:41-64. [PMID: 38109374 DOI: 10.1146/annurev-biophys-030722-021957] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Over the past decades, our understanding of microbial metabolism has increased dramatically. Metabolomics, a family of techniques that are used to measure the quantities of small molecules in biological samples, has been central to these efforts. Advances in analytical chemistry have made it possible to measure the relative and absolute concentrations of more and more compounds with increasing levels of certainty. In this review, we highlight how metabolomics has contributed to understanding microbial metabolism and in what ways it can still be deployed to expand our systematic understanding of metabolism. To that end, we explain how metabolomics was used to (a) characterize network topologies of metabolism and its regulation networks, (b) elucidate the control of metabolic function, and (c) understand the molecular basis of higher-order phenomena. We also discuss areas of inquiry where technological advances should continue to increase the impact of metabolomics, as well as areas where our understanding is bottlenecked by other factors such as the availability of statistical and modeling frameworks that can extract biological meaning from metabolomics data.
Collapse
Affiliation(s)
| | - Julian Trouillon
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland;
| | - Uwe Sauer
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland;
| |
Collapse
|
8
|
Chen Y, Gustafsson J, Yang J, Nielsen J, Kerkhoven EJ. Single-cell omics analysis with genome-scale metabolic modeling. Curr Opin Biotechnol 2024; 86:103078. [PMID: 38359604 DOI: 10.1016/j.copbio.2024.103078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Accepted: 01/19/2024] [Indexed: 02/17/2024]
Abstract
Single-cell technologies have been widely used in biological studies and generated a plethora of single-cell data to be interpreted. Due to the inclusion of the priori metabolic network knowledge as well as gene-protein-reaction associations, genome-scale metabolic models (GEMs) have been a powerful tool to integrate and thereby interpret various omics data mostly from bulk samples. Here, we first review two common ways to leverage bulk omics data with GEMs and then discuss advances on integrative analysis of single-cell omics data with GEMs. We end by presenting our views on current challenges and perspectives in this field.
Collapse
Affiliation(s)
- Yu Chen
- Key Laboratory of Quantitative Synthetic Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| | - Johan Gustafsson
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, SE-405 30 Gothenburg, Sweden; Department of Life Sciences, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden
| | - Jingyu Yang
- Key Laboratory of Quantitative Synthetic Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Jens Nielsen
- Department of Life Sciences, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden; BioInnovation Institute, DK-2200 Copenhagen, Denmark
| | - Eduard J Kerkhoven
- Department of Life Sciences, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Technology University of Denmark, DK-2800 Kgs. Lyngby, Denmark; SciLifeLab, Chalmers University of Technology, SE-412 96 Gothenburg, Sweden.
| |
Collapse
|
9
|
Murmu S, Sinha D, Chaurasia H, Sharma S, Das R, Jha GK, Archak S. A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions. FRONTIERS IN PLANT SCIENCE 2024; 15:1292054. [PMID: 38504888 PMCID: PMC10948452 DOI: 10.3389/fpls.2024.1292054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 01/24/2024] [Indexed: 03/21/2024]
Abstract
Plants intricately deploy defense systems to counter diverse biotic and abiotic stresses. Omics technologies, spanning genomics, transcriptomics, proteomics, and metabolomics, have revolutionized the exploration of plant defense mechanisms, unraveling molecular intricacies in response to various stressors. However, the complexity and scale of omics data necessitate sophisticated analytical tools for meaningful insights. This review delves into the application of artificial intelligence algorithms, particularly machine learning and deep learning, as promising approaches for deciphering complex omics data in plant defense research. The overview encompasses key omics techniques and addresses the challenges and limitations inherent in current AI-assisted omics approaches. Moreover, it contemplates potential future directions in this dynamic field. In summary, AI-assisted omics techniques present a robust toolkit, enabling a profound understanding of the molecular foundations of plant defense and paving the way for more effective crop protection strategies amidst climate change and emerging diseases.
Collapse
Affiliation(s)
- Sneha Murmu
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Dipro Sinha
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Himanshushekhar Chaurasia
- Central Institute for Research on Cotton Technology, Indian Council of Agricultural Research (ICAR), Mumbai, India
| | - Soumya Sharma
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Ritwika Das
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Girish Kumar Jha
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Sunil Archak
- National Bureau of Plant Genetic Resources, Indian Council of Agricultural Research (ICAR), New Delhi, India
| |
Collapse
|
10
|
Biswas A, Kumari A, Gaikwad DS, Pandey DK. Revolutionizing Biological Science: The Synergy of Genomics in Health, Bioinformatics, Agriculture, and Artificial Intelligence. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2023; 27:550-569. [PMID: 38100404 DOI: 10.1089/omi.2023.0197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2023]
Abstract
With climate emergency, COVID-19, and the rise of planetary health scholarship, the binary of human and ecosystem health has been deeply challenged. The interdependence of human and nonhuman animal health is increasingly acknowledged and paving the way for new frontiers in integrative biology. The convergence of genomics in health, bioinformatics, agriculture, and artificial intelligence (AI) has ushered in a new era of possibilities and applications. However, the sheer volume of genomic/multiomics big data generated also presents formidable sociotechnical challenges in extracting meaningful biological, planetary health and ecological insights. Over the past few years, AI-guided bioinformatics has emerged as a powerful tool for managing, analyzing, and interpreting complex biological datasets. The advances in AI, particularly in machine learning and deep learning, have been transforming the fields of genomics, planetary health, and agriculture. This article aims to unpack and explore the formidable range of possibilities and challenges that result from such transdisciplinary integration, and emphasizes its radically transformative potential for human and ecosystem health. The integration of these disciplines is also driving significant advancements in precision medicine and personalized health care. This presents an unprecedented opportunity to deepen our understanding of complex biological systems and advance the well-being of all life in planetary ecosystems. Notwithstanding in mind its sociotechnical, ethical, and critical policy challenges, the integration of genomics, multiomics, planetary health, and agriculture with AI-guided bioinformatics opens up vast opportunities for transnational collaborative efforts, data sharing, analysis, valorization, and interdisciplinary innovations in life sciences and integrative biology.
Collapse
Affiliation(s)
- Aakanksha Biswas
- Amity Institute of Biotechnology, Amity University Jharkhand, Ranchi, India
| | - Aditi Kumari
- Amity Institute of Biotechnology, Amity University Jharkhand, Ranchi, India
| | - D S Gaikwad
- Amity Institute of Organic Agriculture, Amity University, Noida, India
| | - Dhananjay K Pandey
- Amity Institute of Biotechnology, Amity University Jharkhand, Ranchi, India
| |
Collapse
|
11
|
Bodinier B, Filippi S, Nøst TH, Chiquet J, Chadeau-Hyam M. Automated calibration for stability selection in penalised regression and graphical models. J R Stat Soc Ser C Appl Stat 2023; 72:1375-1393. [PMID: 38143734 PMCID: PMC10746547 DOI: 10.1093/jrsssc/qlad058] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 06/01/2023] [Accepted: 06/12/2023] [Indexed: 12/26/2023]
Abstract
Stability selection represents an attractive approach to identify sparse sets of features jointly associated with an outcome in high-dimensional contexts. We introduce an automated calibration procedure via maximisation of an in-house stability score and accommodating a priori-known block structure (e.g. multi-OMIC) data. It applies to [Least Absolute Shrinkage Selection Operator (LASSO)] penalised regression and graphical models. Simulations show our approach outperforms non-stability-based and stability selection approaches using the original calibration. Application to multi-block graphical LASSO on real (epigenetic and transcriptomic) data from the Norwegian Women and Cancer study reveals a central/credible and novel cross-OMIC role of LRRN3 in the biological response to smoking. Proposed approaches were implemented in the R package sharp.
Collapse
Affiliation(s)
- Barbara Bodinier
- Department of Epidemiology and Biostatistics, MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| | - Sarah Filippi
- Department of Mathematics, Imperial College London, London, UK
| | - Therese Haugdahl Nøst
- Department of Community Medicine, Faculty of Health Sciences, UiT, The Arctic University of Norway, NO-9037 Tromsø, Norway
| | - Julien Chiquet
- Université Paris-Saclay, AgroParisTech INRAE, UMR MIA, SolsTIS team, Paris, France
| | - Marc Chadeau-Hyam
- Department of Epidemiology and Biostatistics, MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| |
Collapse
|
12
|
Molversmyr H, Øyås O, Rotnes F, Vik JO. Extracting functionally accurate context-specific models of Atlantic salmon metabolism. NPJ Syst Biol Appl 2023; 9:19. [PMID: 37244928 DOI: 10.1038/s41540-023-00280-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 05/05/2023] [Indexed: 05/29/2023] Open
Abstract
Constraint-based models (CBMs) are used to study metabolic network structure and function in organisms ranging from microbes to multicellular eukaryotes. Published CBMs are usually generic rather than context-specific, meaning that they do not capture differences in reaction activities, which, in turn, determine metabolic capabilities, between cell types, tissues, environments, or other conditions. Only a subset of a CBM's metabolic reactions and capabilities are likely to be active in any given context, and several methods have therefore been developed to extract context-specific models from generic CBMs through integration of omics data. We tested the ability of six model extraction methods (MEMs) to create functionally accurate context-specific models of Atlantic salmon using a generic CBM (SALARECON) and liver transcriptomics data from contexts differing in water salinity (life stage) and dietary lipids. Three MEMs (iMAT, INIT, and GIMME) outperformed the others in terms of functional accuracy, which we defined as the extracted models' ability to perform context-specific metabolic tasks inferred directly from the data, and one MEM (GIMME) was faster than the others. Context-specific versions of SALARECON consistently outperformed the generic version, showing that context-specific modeling better captures salmon metabolism. Thus, we demonstrate that results from human studies also hold for a non-mammalian animal and major livestock species.
Collapse
Affiliation(s)
- Håvard Molversmyr
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Ove Øyås
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Filip Rotnes
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Jon Olav Vik
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway.
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway.
| |
Collapse
|
13
|
Song S, Jin J, Li M, Kong D, Cao M, Wang X, Li Y, Chen X, Zhang X, Pang X, Bo W, Hao Q. The Key Metabolic Network and Genes Regulating the Fresh Fruit Texture of Jujube ( Ziziphus jujuba Mill.) Revealed via Metabolomic and Transcriptomic Analysis. PLANTS (BASEL, SWITZERLAND) 2023; 12:plants12112087. [PMID: 37299066 DOI: 10.3390/plants12112087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 05/11/2023] [Accepted: 05/22/2023] [Indexed: 06/12/2023]
Abstract
The texture of fresh jujube fruit is related to its popularity and commercial value. The metabolic networks and essential genes that regulate the texture of jujube (Ziziphus jujuba) fruit are still unknown. In this study, two jujube cultivars with significantly different textures were selected by a texture analyzer. The four developmental stages of the exocarp and mesocarp of jujube fruit were studied separately using metabolomic and transcriptomic analyses. Differentially accumulated metabolites were enriched in several critical pathways related to cell wall substance synthesis and metabolism. Transcriptome analysis confirmed this by finding enriched differential expression genes in these pathways. Combined analysis showed that 'Galactose metabolism' was the most overlapping pathway in two omics. Genes such as β-Gal, MYB and DOF may affect fruit texture by regulating cell wall substances. Overall, this study provides an essential reference for the establishment of texture-related metabolic and gene networks of jujube fruit.
Collapse
Affiliation(s)
- Shuang Song
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticulture Crops, Xinjiang Academy of Agricultural Sciences, Urumqi 830091, China
| | - Juan Jin
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticulture Crops, Xinjiang Academy of Agricultural Sciences, Urumqi 830091, China
| | - Meiyu Li
- Henan Key Laboratory of Germplasm Innovation and Utilization of Eco-Economic Woody Plant, Pingdingshan University, Pingdingshan 467000, China
| | - Decang Kong
- National Foundation for Improved Cultivar of Chinese Jujube, Cangzhou 061000, China
| | - Ming Cao
- National Foundation for Improved Cultivar of Chinese Jujube, Cangzhou 061000, China
| | - Xue Wang
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Yingyue Li
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Xuexun Chen
- Bureau of Forestry of Aohan, Chifeng 028000, China
| | - Xiuli Zhang
- Bureau of Forestry of Aohan, Chifeng 028000, China
| | - Xiaoming Pang
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Wenhao Bo
- State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Biotechnology, Beijing Forestry University, Beijing 100083, China
| | - Qing Hao
- The State Key Laboratory of Genetic Improvement and Germplasm Innovation of Crop Resistance in Arid Desert Regions (Preparation), Key Laboratory of Genome Research and Genetic Improvement of Xinjiang Characteristic Fruits and Vegetables, Institute of Horticulture Crops, Xinjiang Academy of Agricultural Sciences, Urumqi 830091, China
| |
Collapse
|
14
|
Donati S, Mattanovich M, Hjort P, Jacobsen SAB, Blomquist SD, Mangaard D, Gurdo N, Pastor FP, Maury J, Hanke R, Herrgård MJ, Wulff T, Jakočiūnas T, Nielsen LK, McCloskey D. An automated workflow for multi-omics screening of microbial model organisms. NPJ Syst Biol Appl 2023; 9:14. [PMID: 37208327 DOI: 10.1038/s41540-023-00277-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 04/19/2023] [Indexed: 05/21/2023] Open
Abstract
Multi-omics datasets are becoming of key importance to drive discovery in fundamental research as much as generating knowledge for applied biotechnology. However, the construction of such large datasets is usually time-consuming and expensive. Automation might enable to overcome these issues by streamlining workflows from sample generation to data analysis. Here, we describe the construction of a complex workflow for the generation of high-throughput microbial multi-omics datasets. The workflow comprises a custom-built platform for automated cultivation and sampling of microbes, sample preparation protocols, analytical methods for sample analysis and automated scripts for raw data processing. We demonstrate possibilities and limitations of such workflow in generating data for three biotechnologically relevant model organisms, namely Escherichia coli, Saccharomyces cerevisiae, and Pseudomonas putida.
Collapse
Affiliation(s)
- Stefano Donati
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Matthias Mattanovich
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Blegdamsvej 3B, DK-2200, Copenhagen, Denmark
| | - Pernille Hjort
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | | | - Sarah Dina Blomquist
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Drude Mangaard
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Nicolas Gurdo
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Felix Pacheco Pastor
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Jérôme Maury
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Rene Hanke
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Markus J Herrgård
- BioInnovation Institute, Ole Maaløes Vej 3, 2200, København, Denmark
| | - Tune Wulff
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Tadas Jakočiūnas
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Lars Keld Nielsen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark.
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, 4072, Australia.
| | - Douglas McCloskey
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Lyngby, Denmark.
| |
Collapse
|
15
|
Flores JE, Claborne DM, Weller ZD, Webb-Robertson BJM, Waters KM, Bramer LM. Missing data in multi-omics integration: Recent advances through artificial intelligence. Front Artif Intell 2023; 6:1098308. [PMID: 36844425 PMCID: PMC9949722 DOI: 10.3389/frai.2023.1098308] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 01/23/2023] [Indexed: 02/11/2023] Open
Abstract
Biological systems function through complex interactions between various 'omics (biomolecules), and a more complete understanding of these systems is only possible through an integrated, multi-omic perspective. This has presented the need for the development of integration approaches that are able to capture the complex, often non-linear, interactions that define these biological systems and are adapted to the challenges of combining the heterogenous data across 'omic views. A principal challenge to multi-omic integration is missing data because all biomolecules are not measured in all samples. Due to either cost, instrument sensitivity, or other experimental factors, data for a biological sample may be missing for one or more 'omic techologies. Recent methodological developments in artificial intelligence and statistical learning have greatly facilitated the analyses of multi-omics data, however many of these techniques assume access to completely observed data. A subset of these methods incorporate mechanisms for handling partially observed samples, and these methods are the focus of this review. We describe recently developed approaches, noting their primary use cases and highlighting each method's approach to handling missing data. We additionally provide an overview of the more traditional missing data workflows and their limitations; and we discuss potential avenues for further developments as well as how the missing data issue and its current solutions may generalize beyond the multi-omics context.
Collapse
Affiliation(s)
- Javier E. Flores
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Daniel M. Claborne
- Pacific Northwest National Laboratory, Artificial Intelligence and Data Analytics Division, National Security Directorate, Richland, WA, United States
| | - Zachary D. Weller
- Pacific Northwest National Laboratory, Artificial Intelligence and Data Analytics Division, National Security Directorate, Richland, WA, United States
| | - Bobbie-Jo M. Webb-Robertson
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Katrina M. Waters
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| | - Lisa M. Bramer
- Pacific Northwest National Laboratory, Biological Sciences Division, Earth and Biological Sciences Directorate, Richland, WA, United States
| |
Collapse
|
16
|
Casotti MC, Meira DD, Alves LNR, Bessa BGDO, Campanharo CV, Vicente CR, Aguiar CC, Duque DDA, Barbosa DG, dos Santos EDVW, Garcia FM, de Paula F, Santana GM, Pavan IP, Louro LS, Braga RFR, Trabach RSDR, Louro TS, de Carvalho EF, Louro ID. Translational Bioinformatics Applied to the Study of Complex Diseases. Genes (Basel) 2023; 14:419. [PMID: 36833346 PMCID: PMC9956936 DOI: 10.3390/genes14020419] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 01/29/2023] [Accepted: 01/31/2023] [Indexed: 02/10/2023] Open
Abstract
Translational Bioinformatics (TBI) is defined as the union of translational medicine and bioinformatics. It emerges as a major advance in science and technology by covering everything, from the most basic database discoveries, to the development of algorithms for molecular and cellular analysis, as well as their clinical applications. This technology makes it possible to access the knowledge of scientific evidence and apply it to clinical practice. This manuscript aims to highlight the role of TBI in the study of complex diseases, as well as its application to the understanding and treatment of cancer. An integrative literature review was carried out, obtaining articles through several websites, among them: PUBMED, Science Direct, NCBI-PMC, Scientific Electronic Library Online (SciELO), and Google Academic, published in English, Spanish, and Portuguese, indexed in the referred databases and answering the following guiding question: "How does TBI provide a scientific understanding of complex diseases?" An additional effort is aimed at the dissemination, inclusion, and perpetuation of TBI knowledge from the academic environment to society, helping the study, understanding, and elucidating of complex disease mechanics and their treatment.
Collapse
Affiliation(s)
- Matheus Correia Casotti
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Débora Dummer Meira
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Lyvia Neves Rebello Alves
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | | | - Camilly Victória Campanharo
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Creuza Rachel Vicente
- Departamento de Medicina Social, Universidade Federal do Espírito Santo, Vitória 29040-090, Espírito Santo, Brazil
| | - Carla Carvalho Aguiar
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Daniel de Almeida Duque
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Débora Gonçalves Barbosa
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | | | - Fernanda Mariano Garcia
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Flávia de Paula
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Gabriel Mendonça Santana
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Isabele Pagani Pavan
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Luana Santos Louro
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Raquel Furlani Rocon Braga
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Raquel Silva dos Reis Trabach
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| | - Thomas Santos Louro
- Escola Superior de Ciências da Santa Casa de Misericórdia de Vitória (EMESCAM), Vitória 29027-502, Espírito Santo, Brazil
| | - Elizeu Fagundes de Carvalho
- Instituto de Biologia Roberto Alcantara Gomes (IBRAG), Universidade do Estado do Rio de Janeiro (UERJ), Rio de Janeiro 20551-030, Rio de Janeiro, Brazil
| | - Iúri Drumond Louro
- Departamento de Ciências Biológicas, Universidade Federal do Espírito Santo, Vitória 29075-010, Espírito Santo, Brazil
| |
Collapse
|
17
|
Wan H, Zhou L, Wu B, Han W, Sui C, Wei J. Integrated metabolomics and transcriptomics analysis of roots of Bupleurum chinense and B. scorzonerifolium, two sources of medicinal Chaihu. Sci Rep 2022; 12:22335. [PMID: 36572795 PMCID: PMC9792521 DOI: 10.1038/s41598-022-27019-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Accepted: 12/23/2022] [Indexed: 12/28/2022] Open
Abstract
Radix Bupleuri (Chaihu in Chinese) is a traditional Chinese medicine commonly used to treat colds and fevers. The root metabolome and transcriptome of two cultivars of B. chinense (BCYC and BCZC) and one of B. scorzonerifolium (BSHC) were determined and analyzed. Compared with BSHC, 135 and 194 differential metabolites were identified in BCYC and BCZC, respectively, which were mainly fatty acyls, organooxygen metabolites. A total of 163 differential metabolites were obtained between BCYC and BCZC, including phenolic acids and lipids. Compared with BSHC, 6557 and 5621 differential expression genes (DEGs) were found in BCYC and BSHC, respectively, which were annotated into biosynthesis of unsaturated fatty acid and fatty acid metabolism. A total of 4,880 DEGs existed between the two cultivars of B. chinense. The abundance of flavonoids in B. scorzonerifolium was higher than that of B. chinense, with the latter having higher saikosaponin A and saikosaponin D than the former. Pinobanksin was the most major flavonoid which differ between the two cultivars of B. chinense. The expression of chalcone synthase gene was dramatically differential, which had a positive correlation with the biosynthesis of pinobanksin. The present study laid a foundation for further research on biosynthesis of flavonoids and terpenoids of Bupleurum L.
Collapse
Affiliation(s)
- Hefang Wan
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| | - Lei Zhou
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| | - Bin Wu
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| | - Wenjing Han
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| | - Chun Sui
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| | - Jianhe Wei
- grid.506261.60000 0001 0706 7839Institute of Medicinal Plant Development (IMPLAD), Chinese Academy of Medical Sciences & Peking Union Medical College (Key Laboratory of Bioactive Substances and Resources Utilization of Chinese Herbal Medicine, Ministry of Education & National Engineering Laboratory for Breeding of Endangered Medicinal Materials), Beijing, 100193 China
| |
Collapse
|
18
|
Combined HTR1A/1B methylation and human functional connectome to recognize patients with MDD. Psychiatry Res 2022; 317:114842. [PMID: 36150307 DOI: 10.1016/j.psychres.2022.114842] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 08/22/2022] [Accepted: 09/09/2022] [Indexed: 01/04/2023]
Abstract
OBJECTIVES This study aimed to use a machine-learning method to identify HTR1A/1B methylation and resting-state functional connectivity (rsFC) related to the diagnosis of MDD, then try to build classification models for MDD diagnosis based on the identified features. METHODS Peripheral blood samples were collected from all recruited participants, and part of the participants underwent the resting-state fMRI scan. Features including HTR1A/1B methylation and rsFC were calculated. Then, the initial feature sets of epigenetics and neuroimaging were separately input into an all-relevant feature selection to generate significant discriminative power for MDD diagnosis. Random forest classifiers were constructed and evaluated based on identified features. In addition, the SHapley Additive exPlanations (SHAP) method was adapted to interpret the diagnostic model. RESULTS A combination of selected HTR1A/1B methylation and rsFC feature sets achieved better performance than using either one alone - a distinction between MDD and healthy control groups was achieved at 81.78% classification accuracy and 0.8948 AUC. CONCLUSION A high classification accuracy can be achieved by combining multidimensional information from epigenetics and cerebral radiomic features in MDD. Our approach can be helpful for accurate clinical diagnosis of MDD and further exploring the pathogenesis of MDD.
Collapse
|
19
|
Du YH, Wang MY, Yang LH, Tong LL, Guo DS, Ji XJ. Optimization and Scale-Up of Fermentation Processes Driven by Models. Bioengineering (Basel) 2022; 9:bioengineering9090473. [PMID: 36135019 PMCID: PMC9495923 DOI: 10.3390/bioengineering9090473] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 09/05/2022] [Accepted: 09/09/2022] [Indexed: 11/16/2022] Open
Abstract
In the era of sustainable development, the use of cell factories to produce various compounds by fermentation has attracted extensive attention; however, industrial fermentation requires not only efficient production strains, but also suitable extracellular conditions and medium components, as well as scaling-up. In this regard, the use of biological models has received much attention, and this review will provide guidance for the rapid selection of biological models. This paper first introduces two mechanistic modeling methods, kinetic modeling and constraint-based modeling (CBM), and generalizes their applications in practice. Next, we review data-driven modeling based on machine learning (ML), and highlight the application scope of different learning algorithms. The combined use of ML and CBM for constructing hybrid models is further discussed. At the end, we also discuss the recent strategies for predicting bioreactor scale-up and culture behavior through a combination of biological models and computational fluid dynamics (CFD) models.
Collapse
Affiliation(s)
- Yuan-Hang Du
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Min-Yu Wang
- State Key Laboratory of Materials-Oriented Chemical Engineering, College of Biotechnology and Pharmaceutical Engineering, Nanjing Tech University, Nanjing 211816, China
| | - Lin-Hui Yang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Ling-Ling Tong
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Dong-Sheng Guo
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
- Correspondence: (D.-S.G.); (X.-J.J.)
| | - Xiao-Jun Ji
- State Key Laboratory of Materials-Oriented Chemical Engineering, College of Biotechnology and Pharmaceutical Engineering, Nanjing Tech University, Nanjing 211816, China
- Correspondence: (D.-S.G.); (X.-J.J.)
| |
Collapse
|
20
|
easyMF: A Web Platform for Matrix Factorization-Based Gene Discovery from Large-scale Transcriptome Data. Interdiscip Sci 2022; 14:746-758. [PMID: 35585280 DOI: 10.1007/s12539-022-00522-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 04/06/2022] [Accepted: 04/07/2022] [Indexed: 01/22/2023]
Abstract
With the development of high-throughput experimental technologies, large-scale RNA sequencing (RNA-Seq) data have been and continue to be produced, but have led to challenges in extracting relevant biological knowledge hidden in the produced high-dimensional gene expression matrices. Here, we develop easyMF ( https://github.com/cma2015/easyMF ), a web platform that can facilitate functional gene discovery from large-scale transcriptome data using matrix factorization (MF) algorithms. Compared with existing MF-based software packages, easyMF exhibits several promising features, such as greater functionality, flexibility and ease of use. The easyMF platform is equipped using the Big-Data-supported Galaxy system with user-friendly graphic user interfaces, allowing users with little programming experience to streamline transcriptome analysis from raw reads to gene expression, carry out multiple-scenario MF analysis, and perform multiple-way MF-based gene discovery. easyMF is also powered with the advanced packing technology to enhance ease of use under different operating systems and computational environments. We illustrated the application of easyMF for seed gene discovery from temporal, spatial, and integrated RNA-Seq datasets of maize (Zea mays L.), resulting in the identification of 3,167 seed stage-specific, 1,849 seed compartment-specific, and 774 seed-specific genes, respectively. The present results also indicated that easyMF can prioritize seed-related genes with superior prediction performance over the state-of-art network-based gene prioritization system MaizeNet. As a modular, containerized and open-source platform, easyMF can be further customized to satisfy users' specific demands of functional gene discovery and deployed as a web service for broad applications.
Collapse
|
21
|
Gliozzo J, Mesiti M, Notaro M, Petrini A, Patak A, Puertas-Gallardo A, Paccanaro A, Valentini G, Casiraghi E. Heterogeneous data integration methods for patient similarity networks. Brief Bioinform 2022; 23:6604996. [PMID: 35679533 PMCID: PMC9294435 DOI: 10.1093/bib/bbac207] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 04/14/2022] [Accepted: 05/04/2022] [Indexed: 12/29/2022] Open
Abstract
Patient similarity networks (PSNs), where patients are represented as nodes and their similarities as weighted edges, are being increasingly used in clinical research. These networks provide an insightful summary of the relationships among patients and can be exploited by inductive or transductive learning algorithms for the prediction of patient outcome, phenotype and disease risk. PSNs can also be easily visualized, thus offering a natural way to inspect complex heterogeneous patient data and providing some level of explainability of the predictions obtained by machine learning algorithms. The advent of high-throughput technologies, enabling us to acquire high-dimensional views of the same patients (e.g. omics data, laboratory data, imaging data), calls for the development of data fusion techniques for PSNs in order to leverage this rich heterogeneous information. In this article, we review existing methods for integrating multiple biomedical data views to construct PSNs, together with the different patient similarity measures that have been proposed. We also review methods that have appeared in the machine learning literature but have not yet been applied to PSNs, thus providing a resource to navigate the vast machine learning literature existing on this topic. In particular, we focus on methods that could be used to integrate very heterogeneous datasets, including multi-omics data as well as data derived from clinical information and medical imaging.
Collapse
Affiliation(s)
- Jessica Gliozzo
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,European Commission, Joint Research Centre (JRC), Ispra (VA), Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Marco Mesiti
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Marco Notaro
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Alessandro Petrini
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Alex Patak
- European Commission, Joint Research Centre (JRC), Ispra (VA), Italy
| | | | - Alberto Paccanaro
- Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX UK.,School of Applied Mathematics (EMAp), Fundação Getúlio Vargas, Rio de Janeiro Brazil
| | - Giorgio Valentini
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy.,DSRC UNIMI, Data Science Research Center, Milano, 20135, Italy.,ELLIS, European Laboratory for Learning and Intelligent Systems, Berlin, Germany
| | - Elena Casiraghi
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| |
Collapse
|
22
|
Xia Y, Liu J, Chen C, Mo X, Tan Q, He Y, Wang Z, Yin J, Zhou G. The Multifunctions and Future Prospects of Endophytes and Their Metabolites in Plant Disease Management. Microorganisms 2022; 10:microorganisms10051072. [PMID: 35630514 PMCID: PMC9146654 DOI: 10.3390/microorganisms10051072] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/19/2022] [Accepted: 05/21/2022] [Indexed: 01/27/2023] Open
Abstract
Endophytes represent a ubiquitous and magical world in plants. Almost all plant species studied by different researchers have been found to harbor one or more endophytes, which protect host plants from pathogen invasion and from adverse environmental conditions. They produce various metabolites that can directly inhibit the growth of pathogens and even promote the growth and development of the host plants. In this review, we focus on the biological control of plant diseases, aiming to elucidate the contribution and key roles of endophytes and their metabolites in this field with the latest research information. Metabolites synthesized by endophytes are part of plant disease management, and the application of endophyte metabolites to induce plant resistance is very promising. Furthermore, multi-omics should be more fully utilized in plant–microbe research, especially in mining novel bioactive metabolites. We believe that the utilization of endophytes and their metabolites for plant disease management is a meaningful and promising research direction that can lead to new breakthroughs in the development of more effective and ecosystem-friendly insecticides and fungicides in modern agriculture.
Collapse
Affiliation(s)
- Yandong Xia
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Junang Liu
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Cang Chen
- College of Life Science, Hunan Normal University, Changsha 410081, China;
| | - Xiuli Mo
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Qian Tan
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Yuan He
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Zhikai Wang
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
| | - Jia Yin
- College of Life Science, Hunan Normal University, Changsha 410081, China;
- Correspondence: (J.Y.); (G.Z.)
| | - Guoying Zhou
- Key Laboratory of National Forestry and Grassland Administration on Control of Artificial Forest Diseases and Pests in South China, Hunan Provincial Key Laboratory for Control of Forest Diseases and Pests, Key Laboratory for Non-Wood Forest Cultivation and Conservation of Ministry of Education, College of Life Science and Technology, Central South University of Forestry and Technology, Changsha 410004, China; (Y.X.); (J.L.); (X.M.); (Q.T.); (Y.H.); (Z.W.)
- Correspondence: (J.Y.); (G.Z.)
| |
Collapse
|
23
|
Kaiser T, Jahansouz C, Staley C. Network-based approaches for the investigation of microbial community structure and function using metagenomics-based data. Future Microbiol 2022; 17:621-631. [PMID: 35360922 DOI: 10.2217/fmb-2021-0219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Network-based approaches offer a powerful framework to evaluate microbial community organization and function as it relates to a variety of environmental processes. Emerging studies are exploring network theory as a method for data integration that is likely to be critical for the integration of 'omics' data using systems biology approaches. Intricacies of network theory and methodological and computational complexities in network construction, however, impede the use of these tools for translational science. We provide a perspective on the methods of network construction, interpretation and emerging uses for these techniques in understanding host-microbiota interactions.
Collapse
Affiliation(s)
- Thomas Kaiser
- Department of Surgery, University of Minnesota, Minneapolis, MN 55455, USA.,Biotechnology Institute, University of Minnesota, Saint Paul, MN 55108, USA
| | - Cyrus Jahansouz
- Department of Surgery, University of Minnesota, Minneapolis, MN 55455, USA
| | - Christopher Staley
- Department of Surgery, University of Minnesota, Minneapolis, MN 55455, USA.,Biotechnology Institute, University of Minnesota, Saint Paul, MN 55108, USA
| |
Collapse
|
24
|
Post JN, Loerakker S, Merks R, Carlier A. Implementing computational modeling in tissue engineering: where disciplines meet. Tissue Eng Part A 2022; 28:542-554. [PMID: 35345902 DOI: 10.1089/ten.tea.2021.0215] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
In recent years, the mathematical and computational sciences have developed novel methodologies and insights that can aid in designing advanced bioreactors, microfluidic set-ups or organ-on-chip devices, in optimizing culture conditions, or predicting long-term behavior of engineered tissues in vivo. In this review, we introduce the concept of computational models and how they can be integrated in an interdisciplinary workflow for Tissue Engineering and Regenerative Medicine (TERM). We specifically aim this review of general concepts and examples at experimental scientists with little or no computational modeling experience. We also describe the contribution of computational models in understanding TERM processes and in advancing the TERM field by providing novel insights.
Collapse
Affiliation(s)
- Janine Nicole Post
- University of Twente, 3230, Tissue Regeneration, Enschede, Overijssel, Netherlands;
| | - Sandra Loerakker
- Eindhoven University of Technology, 3169, Department of Biomedical Engineering, Eindhoven, Noord-Brabant, Netherlands.,Eindhoven University of Technology, 3169, Institute for Complex Molecular Systems, Eindhoven, Noord-Brabant, Netherlands;
| | - Roeland Merks
- Leiden University, 4496, Institute for Biology Leiden and Mathematical Institute, Leiden, Zuid-Holland, Netherlands;
| | - Aurélie Carlier
- Maastricht University, 5211, MERLN Institute for Technology-Inspired Regenerative Medicine, Universiteitssingel 40, 6229 ER Maastricht, Maastricht, Netherlands, 6200 MD;
| |
Collapse
|
25
|
Giulia A, Anna S, Antonia B, Dario P, Maurizio C. Extending Association Rule Mining to Microbiome Pattern Analysis: Tools and Guidelines to Support Real Applications. FRONTIERS IN BIOINFORMATICS 2022; 1:794547. [PMID: 36303759 PMCID: PMC9580939 DOI: 10.3389/fbinf.2021.794547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 12/07/2021] [Indexed: 11/24/2022] Open
Abstract
Boosted by the exponential growth of microbiome-based studies, analyzing microbiome patterns is now a hot-topic, finding different fields of application. In particular, the use of machine learning techniques is increasing in microbiome studies, providing deep insights into microbial community composition. In this context, in order to investigate microbial patterns from 16S rRNA metabarcoding data, we explored the effectiveness of Association Rule Mining (ARM) technique, a supervised-machine learning procedure, to extract patterns (in this work, intended as groups of species or taxa) from microbiome data. ARM can generate huge amounts of data, making spurious information removal and visualizing results challenging. Our work sheds light on the strengths and weaknesses of pattern mining strategy into the study of microbial patterns, in particular from 16S rRNA microbiome datasets, applying ARM on real case studies and providing guidelines for future usage. Our results highlighted issues related to the type of input and the use of metadata in microbial pattern extraction, identifying the key steps that must be considered to apply ARM consciously on 16S rRNA microbiome data. To promote the use of ARM and the visualization of microbiome patterns, specifically, we developed microFIM (microbial Frequent Itemset Mining), a versatile Python tool that facilitates the use of ARM integrating common microbiome outputs, such as taxa tables. microFIM implements interest measures to remove spurious information and merges the results of ARM analysis with the common microbiome outputs, providing similar microbiome strategies that help scientists to integrate ARM in microbiome applications. With this work, we aimed at creating a bridge between microbial ecology researchers and ARM technique, making researchers aware about the strength and weaknesses of association rule mining approach.
Collapse
Affiliation(s)
- Agostinetto Giulia
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
- *Correspondence: Agostinetto Giulia,
| | | | - Bruno Antonia
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| | - Pescini Dario
- Department of Statistics and Quantitative Methods, University of Milano-Bicocca, Milan, Italy
| | - Casiraghi Maurizio
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy
| |
Collapse
|
26
|
Viaud G, Mayilvahanan P, Cournede PH. Representation Learning for the Clustering of Multi-Omics Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:135-145. [PMID: 33600320 DOI: 10.1109/tcbb.2021.3060340] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The integration of several sources of data for the identification of subtypes of diseases has gained attention over the past few years. The heterogeneity and the high dimensions of the data sets calls for an adequate representation of the data. We summarize the field of representation learning for the multi-omics clustering problem and we investigate several techniques to learn relevant combined representations, using methods from group factor analysis (PCA, MFA and extensions) and from machine learning with autoencoders. We highlight the importance of appropriately designing and training the latter, notably with a novel combination of a disjointed deep autoencoder (DDAE) architecture and a layer-wise reconstruction loss. These different representations can then be clustered to identify biologically meaningful clusters of patients. We provide a unifying framework for model comparison between statistical and deep learning approaches with the introduction of a new weighted internal clustering index that evaluates how well the clustering information is retained from each source, favoring contributions from all data sets. We apply our methodology to two case studies for which previous works of integrative clustering exist, TCGA Breast Cancer and TARGET Neuroblastoma, and show how our method can yield good and well-balanced clusters across the different data sources.
Collapse
|
27
|
Correa R, Alonso-Pupo N, Hernández Rodríguez EW. Multi-omics data integration approaches for precision oncology. Mol Omics 2022; 18:469-479. [DOI: 10.1039/d1mo00411e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Next-generation sequencing (NGS) has been pivotal to enhance the molecular characterization of human malignancies, allowing multiple omics data types to be available for cancer researchers and practitioners. In this context,...
Collapse
|
28
|
MacDonald MA, Nöbel M, Roche Recinos D, Martínez VS, Schulz BL, Howard CB, Baker K, Shave E, Lee YY, Marcellin E, Mahler S, Nielsen LK, Munro T. Perfusion culture of Chinese Hamster Ovary cells for bioprocessing applications. Crit Rev Biotechnol 2021; 42:1099-1115. [PMID: 34844499 DOI: 10.1080/07388551.2021.1998821] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Much of the biopharmaceutical industry's success over the past 30 years has relied on products derived from Chinese Hamster Ovary (CHO) cell lines. During this time, improvements in mammalian cell cultures have come from cell line development and process optimization suited for large-scale fed-batch processes. Originally developed for high cell densities and sensitive products, perfusion processes have a long history. Driven by high volumetric titers and a small footprint, perfusion-based bioprocess research has regained an interest from academia and industry. The recent pandemic has further highlighted the need for such intensified biomanufacturing options. In this review, we outline the technical history of research in this field as it applies to biologics production in CHO cells. We demonstrate a number of emerging trends in the literature and corroborate these with underlying drivers in the commercial space. From these trends, we speculate that the future of perfusion bioprocesses is bright and that the fields of media optimization, continuous processing, and cell line engineering hold the greatest potential. Aligning in its continuous setup with the demands for Industry 4.0, perfusion biomanufacturing is likely to be a hot topic in the years to come.
Collapse
Affiliation(s)
- Michael A MacDonald
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,Thermo Fisher Scientific, Woolloongabba, Brisbane, Australia
| | - Matthias Nöbel
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,Thermo Fisher Scientific, Woolloongabba, Brisbane, Australia
| | - Dinora Roche Recinos
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,CSL Limited, Parkville, Melbourne, Australia
| | - Verónica S Martínez
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia
| | - Benjamin L Schulz
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, Australia
| | - Christopher B Howard
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia
| | - Kym Baker
- Thermo Fisher Scientific, Woolloongabba, Brisbane, Australia
| | - Evan Shave
- Thermo Fisher Scientific, Woolloongabba, Brisbane, Australia
| | | | - Esteban Marcellin
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,Metabolomics Australia, The University of Queensland, St. Lucia, Brisbane, Australia
| | - Stephen Mahler
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia
| | - Lars Keld Nielsen
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,Metabolomics Australia, The University of Queensland, St. Lucia, Brisbane, Australia.,The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Trent Munro
- ARC Training Centre for Biopharmaceutical Innovation, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, St. Lucia, Brisbane, Australia.,National Biologics Facility, The University of Queensland, St. Lucia, Brisbane, Australia
| |
Collapse
|
29
|
Chung CH, Lin DW, Eames A, Chandrasekaran S. Next-Generation Genome-Scale Metabolic Modeling through Integration of Regulatory Mechanisms. Metabolites 2021; 11:606. [PMID: 34564422 PMCID: PMC8470976 DOI: 10.3390/metabo11090606] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 09/01/2021] [Accepted: 09/03/2021] [Indexed: 12/18/2022] Open
Abstract
Genome-scale metabolic models (GEMs) are powerful tools for understanding metabolism from a systems-level perspective. However, GEMs in their most basic form fail to account for cellular regulation. A diverse set of mechanisms regulate cellular metabolism, enabling organisms to respond to a wide range of conditions. This limitation of GEMs has prompted the development of new methods to integrate regulatory mechanisms, thereby enhancing the predictive capabilities and broadening the scope of GEMs. Here, we cover integrative models encompassing six types of regulatory mechanisms: transcriptional regulatory networks (TRNs), post-translational modifications (PTMs), epigenetics, protein-protein interactions and protein stability (PPIs/PS), allostery, and signaling networks. We discuss 22 integrative GEM modeling methods and how these have been used to simulate metabolic regulation during normal and pathological conditions. While these advances have been remarkable, there remains a need for comprehensive and widespread integration of regulatory constraints into GEMs. We conclude by discussing challenges in constructing GEMs with regulation and highlight areas that need to be addressed for the successful modeling of metabolic regulation. Next-generation integrative GEMs that incorporate multiple regulatory mechanisms and their crosstalk will be invaluable for discovering cell-type and disease-specific metabolic control mechanisms.
Collapse
Affiliation(s)
- Carolina H. Chung
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA; (C.H.C.); (A.E.)
| | - Da-Wei Lin
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA;
| | - Alec Eames
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA; (C.H.C.); (A.E.)
| | - Sriram Chandrasekaran
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109, USA; (C.H.C.); (A.E.)
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA;
- Program in Chemical Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Bioinformatics and Computational Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Rogel Cancer Center, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
30
|
Khaleghi MK, Savizi ISP, Lewis NE, Shojaosadati SA. Synergisms of machine learning and constraint-based modeling of metabolism for analysis and optimization of fermentation parameters. Biotechnol J 2021; 16:e2100212. [PMID: 34390201 DOI: 10.1002/biot.202100212] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/10/2021] [Accepted: 08/11/2021] [Indexed: 11/06/2022]
Abstract
Recent noteworthy advances in the development of high-performing microbial and mammalian strains have enabled the sustainable production of bio-economically valuable substances such as bio-compounds, biofuels, and biopharmaceuticals. However, to obtain an industrially viable mass-production scheme, much time and effort are required. The robust and rational design of fermentation processes requires analysis and optimization of different extracellular conditions and medium components, which have a massive effect on growth and productivity. In this regard, knowledge- and data-driven modeling methods have received much attention. Constraint-based modeling (CBM) is a knowledge-driven mathematical approach that has been widely used in fermentation analysis and optimization due to its capabilities of predicting the cellular phenotype from genotype through high-throughput means. On the other hand, machine learning (ML) is a data-driven statistical method that identifies the data patterns within sophisticated biological systems and processes, where there is inadequate knowledge to represent underlying mechanisms. Furthermore, ML models are becoming a viable complement to constraint-based models in a reciprocal manner when one is used as a pre-step of another. As a result, more predictable model is produced. This review highlights the applications of CBM and ML independently and the combination of these two approaches for analyzing and optimizing fermentation parameters. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Mohammad Karim Khaleghi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Iman Shahidi Pour Savizi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Nathan E Lewis
- Department of Bioengineering, University of California, San Diego, USA.,Department of Pediatrics, University of California, San Diego, USA
| | - Seyed Abbas Shojaosadati
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| |
Collapse
|
31
|
Jacobson TB, Callaghan MM, Amador-Noguez D. Hostile Takeover: How Viruses Reprogram Prokaryotic Metabolism. Annu Rev Microbiol 2021; 75:515-539. [PMID: 34348026 DOI: 10.1146/annurev-micro-060621-043448] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
To reproduce, prokaryotic viruses must hijack the cellular machinery of their hosts and redirect it toward the production of viral particles. While takeover of the host replication and protein synthesis apparatus has long been considered an essential feature of infection, recent studies indicate that extensive reprogramming of host primary metabolism is a widespread phenomenon among prokaryotic viruses that is required to fulfill the biosynthetic needs of virion production. In this review we provide an overview of the most significant recent findings regarding virus-induced reprogramming of prokaryotic metabolism and suggest how quantitative systems biology approaches may be used to provide a holistic understanding of metabolic remodeling during lytic viral infection. Expected final online publication date for the Annual Review of Microbiology, Volume 75 is October 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Tyler B Jacobson
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA; , , .,Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, Wisconsin 53726, USA.,Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, USA
| | - Melanie M Callaghan
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA; , , .,Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, USA
| | - Daniel Amador-Noguez
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA; , , .,Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, Wisconsin 53726, USA.,Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, USA
| |
Collapse
|
32
|
Gawthrop PJ, Pan M, Crampin EJ. Modular dynamic biomolecular modelling with bond graphs: the unification of stoichiometry, thermodynamics, kinetics and data. J R Soc Interface 2021; 18:20210478. [PMID: 34428949 PMCID: PMC8385351 DOI: 10.1098/rsif.2021.0478] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 08/02/2021] [Indexed: 12/14/2022] Open
Abstract
Renewed interest in dynamic simulation models of biomolecular systems has arisen from advances in genome-wide measurement and applications of such models in biotechnology and synthetic biology. In particular, genome-scale models of cellular metabolism beyond the steady state are required in order to represent transient and dynamic regulatory properties of the system. Development of such whole-cell models requires new modelling approaches. Here, we propose the energy-based bond graph methodology, which integrates stoichiometric models with thermodynamic principles and kinetic modelling. We demonstrate how the bond graph approach intrinsically enforces thermodynamic constraints, provides a modular approach to modelling, and gives a basis for estimation of model parameters leading to dynamic models of biomolecular systems. The approach is illustrated using a well-established stoichiometric model of Escherichia coli and published experimental data.
Collapse
Affiliation(s)
- Peter J. Gawthrop
- Systems Biology Laboratory, School of Mathematics and Statistics, and Department of Biomedical Engineering, University of Melbourne, Victoria 3010, Australia
| | - Michael Pan
- Systems Biology Laboratory, School of Mathematics and Statistics, and Department of Biomedical Engineering, University of Melbourne, Victoria 3010, Australia
- ARC Centre of Excellence in Convergent Bio-Nano Science and Technology, School of Chemical and Biomedical Engineering, University of Melbourne, Victoria 3010, Australia
| | - Edmund J. Crampin
- Systems Biology Laboratory, School of Mathematics and Statistics, and Department of Biomedical Engineering, University of Melbourne, Victoria 3010, Australia
- ARC Centre of Excellence in Convergent Bio-Nano Science and Technology, School of Chemical and Biomedical Engineering, University of Melbourne, Victoria 3010, Australia
| |
Collapse
|
33
|
Schneidewind T, Brause A, Schölermann B, Sievers S, Pahl A, Sankar MG, Winzker M, Janning P, Kumar K, Ziegler S, Waldmann H. Combined morphological and proteome profiling reveals target-independent impairment of cholesterol homeostasis. Cell Chem Biol 2021; 28:1780-1794.e5. [PMID: 34214450 DOI: 10.1016/j.chembiol.2021.06.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 03/11/2021] [Accepted: 06/08/2021] [Indexed: 12/19/2022]
Abstract
Unbiased profiling approaches are powerful tools for small-molecule target or mode-of-action deconvolution as they generate a holistic view of the bioactivity space. This is particularly important for non-protein targets that are difficult to identify with commonly applied target identification methods. Thereby, unbiased profiling can enable identification of novel bioactivity even for annotated compounds. We report the identification of a large bioactivity cluster comprised of numerous well-characterized drugs with different primary targets using a combination of the morphological Cell Painting Assay and proteome profiling. Cluster members alter cholesterol homeostasis and localization due to their physicochemical properties that lead to protonation and accumulation in lysosomes, an increase in lysosomal pH, and a disturbed cholesterol homeostasis. The identified cluster enables identification of modulators of cholesterol homeostasis and links regulation of genes or proteins involved in cholesterol synthesis or trafficking to physicochemical properties rather than to nominal targets.
Collapse
Affiliation(s)
- Tabea Schneidewind
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany; Technical University Dortmund, Faculty of Chemistry and Chemical Biology, Otto-Hahn-Strasse 6, Dortmund 44227, Germany
| | - Alexandra Brause
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Beate Schölermann
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Sonja Sievers
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Axel Pahl
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Muthukumar G Sankar
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Michael Winzker
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Petra Janning
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Kamal Kumar
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Slava Ziegler
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany
| | - Herbert Waldmann
- Max-Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, Dortmund 44227, Germany; Technical University Dortmund, Faculty of Chemistry and Chemical Biology, Otto-Hahn-Strasse 6, Dortmund 44227, Germany.
| |
Collapse
|
34
|
Dornhaus A, Smith B, Hristova K, Buckley LB. How can we fully realize the potential of mathematical and biological models to reintegrate biology? Integr Comp Biol 2021; 61:2244-2254. [PMID: 34160617 DOI: 10.1093/icb/icab142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Both mathematical models and biological model systems stand as tractable representations of complex biological systems or behaviors. They facilitate research and provide insights, and they can describe general rules. Models that represent biological processes or formalize general hypotheses are essential to any broad understanding. Mathematical or biological models necessarily omit details of the natural systems and thus may ultimately be "incorrect" representations. A key challenge is that tractability requires relatively simple models but simplification can result in models that are incorrect in their qualitative, broad implications if the abstracted details matter. Our paper discusses this tension, and how we can improve our inferences from models. We advocate for further efforts dedicated to model development, improvement, and acceptance by the scientific community, all of which may necessitate a more explicit discussion of the purpose and power of models. We argue that models should play a central role in reintegrating biology as a way to test our integrated understanding of how molecules, cells, organs, organisms, populations, and ecosystems function.
Collapse
Affiliation(s)
- Anna Dornhaus
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ 85721
| | - Brian Smith
- School of Life Sciences, Arizona State University, Tempe, AZ 85287
| | - Kalina Hristova
- Department of Materials Science and Engineering, and Program in Molecular Biology, John Hopkins University, Baltimore, MD 21218
| | - Lauren B Buckley
- Department of Biology, University of Washington, Seattle, WA 98115
| |
Collapse
|
35
|
Martínez Arbas S, Busi SB, Queirós P, de Nies L, Herold M, May P, Wilmes P, Muller EEL, Narayanasamy S. Challenges, Strategies, and Perspectives for Reference-Independent Longitudinal Multi-Omic Microbiome Studies. Front Genet 2021; 12:666244. [PMID: 34194470 PMCID: PMC8236828 DOI: 10.3389/fgene.2021.666244] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/30/2021] [Indexed: 12/21/2022] Open
Abstract
In recent years, multi-omic studies have enabled resolving community structure and interrogating community function of microbial communities. Simultaneous generation of metagenomic, metatranscriptomic, metaproteomic, and (meta) metabolomic data is more feasible than ever before, thus enabling in-depth assessment of community structure, function, and phenotype, thus resulting in a multitude of multi-omic microbiome datasets and the development of innovative methods to integrate and interrogate those multi-omic datasets. Specifically, the application of reference-independent approaches provides opportunities in identifying novel organisms and functions. At present, most of these large-scale multi-omic datasets stem from spatial sampling (e.g., water/soil microbiomes at several depths, microbiomes in/on different parts of the human anatomy) or case-control studies (e.g., cohorts of human microbiomes). We believe that longitudinal multi-omic microbiome datasets are the logical next step in microbiome studies due to their characteristic advantages in providing a better understanding of community dynamics, including: observation of trends, inference of causality, and ultimately, prediction of community behavior. Furthermore, the acquisition of complementary host-derived omics, environmental measurements, and suitable metadata will further enhance the aforementioned advantages of longitudinal data, which will serve as the basis to resolve drivers of community structure and function to understand the biotic and abiotic factors governing communities and specific populations. Carefully setup future experiments hold great potential to further unveil ecological mechanisms to evolution, microbe-microbe interactions, or microbe-host interactions. In this article, we discuss the challenges, emerging strategies, and best-practices applicable to longitudinal microbiome studies ranging from sampling, biomolecular extraction, systematic multi-omic measurements, reference-independent data integration, modeling, and validation.
Collapse
Affiliation(s)
- Susana Martínez Arbas
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Susheel Bhanu Busi
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Pedro Queirós
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura de Nies
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Malte Herold
- Department of Environmental Research and Innovation, Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Emilie E. L. Muller
- Université de Strasbourg, UMR 7156 CNRS, Génétique Moléculaire, Génomique, Microbiologie, Strasbourg, France
| | - Shaman Narayanasamy
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
36
|
Liu Z, Ma A, Mathé E, Merling M, Ma Q, Liu B. Network analyses in microbiome based on high-throughput multi-omics data. Brief Bioinform 2021; 22:1639-1655. [PMID: 32047891 PMCID: PMC7986608 DOI: 10.1093/bib/bbaa005] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 01/07/2020] [Accepted: 01/08/2020] [Indexed: 02/06/2023] Open
Abstract
Together with various hosts and environments, ubiquitous microbes interact closely with each other forming an intertwined system or community. Of interest, shifts of the relationships between microbes and their hosts or environments are associated with critical diseases and ecological changes. While advances in high-throughput Omics technologies offer a great opportunity for understanding the structures and functions of microbiome, it is still challenging to analyse and interpret the omics data. Specifically, the heterogeneity and diversity of microbial communities, compounded with the large size of the datasets, impose a tremendous challenge to mechanistically elucidate the complex communities. Fortunately, network analyses provide an efficient way to tackle this problem, and several network approaches have been proposed to improve this understanding recently. Here, we systemically illustrate these network theories that have been used in biological and biomedical research. Then, we review existing network modelling methods of microbial studies at multiple layers from metagenomics to metabolomics and further to multi-omics. Lastly, we discuss the limitations of present studies and provide a perspective for further directions in support of the understanding of microbial communities.
Collapse
Affiliation(s)
- Zhaoqian Liu
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| | - Ewy Mathé
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| | - Marlena Merling
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| | - Bingqiang Liu
- Department of Biomedical Informatics, College of Medicine, the Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
37
|
Integration of relative metabolomics and transcriptomics time-course data in a metabolic model pinpoints effects of ribosome biogenesis defects on Arabidopsis thaliana metabolism. Sci Rep 2021; 11:4787. [PMID: 33637852 PMCID: PMC7910480 DOI: 10.1038/s41598-021-84114-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 01/11/2021] [Indexed: 01/30/2023] Open
Abstract
Ribosome biogenesis is tightly associated to plant metabolism due to the usage of ribosomes in the synthesis of proteins necessary to drive metabolic pathways. Given the central role of ribosome biogenesis in cell physiology, it is important to characterize the impact of different components involved in this process on plant metabolism. Double mutants of the Arabidopsis thaliana cytosolic 60S maturation factors REIL1 and REIL2 do not resume growth after shift to moderate 10 \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$^{\circ }\hbox {C}$$\end{document}∘C chilling conditions. To gain mechanistic insights into the metabolic effects of this ribosome biogenesis defect on metabolism, we developed TC-iReMet2, a constraint-based modelling approach that integrates relative metabolomics and transcriptomics time-course data to predict differential fluxes on a genome-scale level. We employed TC-iReMet2 with metabolomics and transcriptomics data from the Arabidopsis Columbia 0 wild type and the reil1-1 reil2-1 double mutant before and after cold shift. We identified reactions and pathways that are highly altered in a mutant relative to the wild type. These pathways include the Calvin–Benson cycle, photorespiration, gluconeogenesis, and glycolysis. Our findings also indicated differential NAD(P)/NAD(P)H ratios after cold shift. TC-iReMet2 allows for mechanistic hypothesis generation and interpretation of system biology experiments related to metabolic fluxes on a genome-scale level.
Collapse
|
38
|
Polewko-Klim A, Mnich K, Rudnicki WR. Robust Data Integration Method for Classification of Biomedical Data. J Med Syst 2021; 45:45. [PMID: 33624190 PMCID: PMC7902598 DOI: 10.1007/s10916-021-01718-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 01/26/2021] [Indexed: 10/26/2022]
Abstract
We present a protocol for integrating two types of biological data - clinical and molecular - for more effective classification of patients with cancer. The proposed approach is a hybrid between early and late data integration strategy. In this hybrid protocol, the set of informative clinical features is extended by the classification results based on molecular data sets. The results are then treated as new synthetic variables. The hybrid protocol was applied to METABRIC breast cancer samples and TCGA urothelial bladder carcinoma samples. Various data types were used for clinical endpoint prediction: clinical data, gene expression, somatic copy number aberrations, RNA-Seq, methylation, and reverse phase protein array. The performance of the hybrid data integration was evaluated with a repeated cross validation procedure and compared with other methods of data integration: early integration and late integration via super learning. The hybrid method gave similar results to those obtained by the best of the tested variants of super learning. What is more, the hybrid method allowed for further sensitivity analysis and recursive feature elimination, which led to compact predictive models for cancer clinical endpoints. For breast cancer, the final model consists of eight clinical variables and two synthetic features obtained from molecular data. For urothelial bladder carcinoma, only two clinical features and one synthetic variable were necessary to build the best predictive model. We have shown that the inclusion of the synthetic variables based on the RNA expression levels and copy number alterations can lead to improved quality of prognostic tests. Thus, it should be considered for inclusion in wider medical practice.
Collapse
Affiliation(s)
- Aneta Polewko-Klim
- Institute of Computer Science, University of Bialystok, Bialystok, Poland
| | - Krzysztof Mnich
- Computational Center, University of Bialystok, Bialystok, Poland
| | - Witold R. Rudnicki
- Institute of Computer Science, University of Bialystok, Bialystok, Poland
- Computational Center, University of Bialystok, Bialystok, Poland
| |
Collapse
|
39
|
Bernstein DB, Sulheim S, Almaas E, Segrè D. Addressing uncertainty in genome-scale metabolic model reconstruction and analysis. Genome Biol 2021; 22:64. [PMID: 33602294 PMCID: PMC7890832 DOI: 10.1186/s13059-021-02289-z] [Citation(s) in RCA: 74] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 02/04/2021] [Indexed: 02/07/2023] Open
Abstract
The reconstruction and analysis of genome-scale metabolic models constitutes a powerful systems biology approach, with applications ranging from basic understanding of genotype-phenotype mapping to solving biomedical and environmental problems. However, the biological insight obtained from these models is limited by multiple heterogeneous sources of uncertainty, which are often difficult to quantify. Here we review the major sources of uncertainty and survey existing approaches developed for representing and addressing them. A unified formal characterization of these uncertainties through probabilistic approaches and ensemble modeling will facilitate convergence towards consistent reconstruction pipelines, improved data integration algorithms, and more accurate assessment of predictive capacity.
Collapse
Affiliation(s)
- David B Bernstein
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA
| | - Snorre Sulheim
- Bioinformatics Program, Boston University, Boston, MA, USA
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- Department of Biotechnology and Nanomedicine, SINTEF Industry, Trondheim, Norway
| | - Eivind Almaas
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- K.G. Jebsen Center for Genetic Epidemiology, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Daniel Segrè
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA.
- Bioinformatics Program, Boston University, Boston, MA, USA.
- Department of Biology and Department of Physics, Boston University, Boston, MA, USA.
| |
Collapse
|
40
|
Ghosh PN, Brookes LM, Edwards HM, Fisher MC, Jervis P, Kappel D, Sewell TR, Shelton JM, Skelly E, Rhodes JL. Cross-Disciplinary Genomics Approaches to Studying Emerging Fungal Infections. Life (Basel) 2020; 10:E315. [PMID: 33260763 PMCID: PMC7761180 DOI: 10.3390/life10120315] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/15/2020] [Accepted: 11/19/2020] [Indexed: 11/16/2022] Open
Abstract
Emerging fungal pathogens pose a serious, global and growing threat to food supply systems, wild ecosystems, and human health. However, historic chronic underinvestment in their research has resulted in a limited understanding of their epidemiology relative to bacterial and viral pathogens. Therefore, the untargeted nature of genomics and, more widely, -omics approaches is particularly attractive in addressing the threats posed by and illuminating the biology of these pathogens. Typically, research into plant, human and wildlife mycoses have been largely separated, with limited dialogue between disciplines. However, many serious mycoses facing the world today have common traits irrespective of host species, such as plastic genomes; wide host ranges; large population sizes and an ability to persist outside the host. These commonalities mean that -omics approaches that have been productively applied in one sphere and may also provide important insights in others, where these approaches may have historically been underutilised. In this review, we consider the advances made with genomics approaches in the fields of plant pathology, human medicine and wildlife health and the progress made in linking genomes to other -omics datatypes and sets; we identify the current barriers to linking -omics approaches and how these are being underutilised in each field; and we consider how and which -omics methodologies it is most crucial to build capacity for in the near future.
Collapse
Affiliation(s)
- Pria N. Ghosh
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
- Unit for Environmental Sciences and Management, North-West University, Potchefstroom 2520, South Africa
| | - Lola M. Brookes
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
- Institute of Zoology, Zoological Society of London, London NW1 4RY, UK
- Royal Veterinary College, Hawkshead Lane, North Mymms, Herts AL9 7TA, UK
| | - Hannah M. Edwards
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| | - Matthew C. Fisher
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| | - Phillip Jervis
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
- Institute of Zoology, Zoological Society of London, London NW1 4RY, UK
- Department of Chemistry, University College London, London WC1H 0AJ, UK
| | - Dana Kappel
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| | - Thomas R. Sewell
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| | - Jennifer M.G. Shelton
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
- UK Centre for Ecology & Hydrology, Wallingford OX10 8BB, UK
| | - Emily Skelly
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| | - Johanna L. Rhodes
- Department of Infectious Disease Epidemiology, MRC Centre for Global Infectious Disease Analysis, St Mary’s Campus, Imperial College London, London W2 1PG, UK; (L.M.B.); (H.M.E.); (M.C.F.); (P.J.); (D.K.); (T.R.S.); (J.M.G.S.); (E.S.); (J.L.R.)
| |
Collapse
|
41
|
Fermented food products in the era of globalization: tradition meets biotechnology innovations. Curr Opin Biotechnol 2020; 70:36-41. [PMID: 33232845 DOI: 10.1016/j.copbio.2020.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 09/18/2020] [Accepted: 10/19/2020] [Indexed: 02/06/2023]
Abstract
Omics tools offer the opportunity to characterize and trace traditional and industrial fermented foods. Bioinformatics, through machine learning, and other advanced statistical approaches, are able to disentangle fermentation processes and to predict the evolution and metabolic outcomes of a food microbial ecosystem. By assembling microbial artificial consortia, the biotechnological advances will also be able to enhance the nutritional value and organoleptics characteristics of fermented food, preserving, at the same time, the potential of autochthonous microbial consortia and metabolic pathways, which are difficult to reproduce. Preserving the traditional methods contributes to protecting the hidden value of local biodiversity, and exploits its potential in industrial processes with the final aim of guaranteeing food security and safety, even in developing countries.
Collapse
|
42
|
Cammarota G, Ianiro G, Ahern A, Carbone C, Temko A, Claesson MJ, Gasbarrini A, Tortora G. Gut microbiome, big data and machine learning to promote precision medicine for cancer. Nat Rev Gastroenterol Hepatol 2020; 17:635-648. [PMID: 32647386 DOI: 10.1038/s41575-020-0327-3] [Citation(s) in RCA: 162] [Impact Index Per Article: 32.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/02/2020] [Indexed: 12/13/2022]
Abstract
The gut microbiome has been implicated in cancer in several ways, as specific microbial signatures are known to promote cancer development and influence safety, tolerability and efficacy of therapies. The 'omics' technologies used for microbiome analysis continuously evolve and, although much of the research is still at an early stage, large-scale datasets of ever increasing size and complexity are being produced. However, there are varying levels of difficulty in realizing the full potential of these new tools, which limit our ability to critically analyse much of the available data. In this Perspective, we provide a brief overview on the role of gut microbiome in cancer and focus on the need, role and limitations of a machine learning-driven approach to analyse large amounts of complex health-care information in the era of big data. We also discuss the potential application of microbiome-based big data aimed at promoting precision medicine in cancer.
Collapse
Affiliation(s)
- Giovanni Cammarota
- Gastroenterology Department, Fondazione Policlinico Universitario Agostino Gemelli-IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy.
| | - Gianluca Ianiro
- Gastroenterology Department, Fondazione Policlinico Universitario Agostino Gemelli-IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Anna Ahern
- School of Microbiology and APC Microbiome Ireland, University College Cork, Cork, Ireland
| | - Carmine Carbone
- Oncology Department, Fondazione Policlinico Universitario Agostino Gemelli-IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Andriy Temko
- School of Engineering, University College Cork, Cork, Ireland.,Qualcomm ML R&D, Cork, Ireland
| | - Marcus J Claesson
- School of Microbiology and APC Microbiome Ireland, University College Cork, Cork, Ireland
| | - Antonio Gasbarrini
- Gastroenterology Department, Fondazione Policlinico Universitario Agostino Gemelli-IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Giampaolo Tortora
- Oncology Department, Fondazione Policlinico Universitario Agostino Gemelli-IRCCS, Università Cattolica del Sacro Cuore, Rome, Italy
| |
Collapse
|
43
|
Liu Y, Su A, Li J, Ledesma-Amaro R, Xu P, Du G, Liu L. Towards next-generation model microorganism chassis for biomanufacturing. Appl Microbiol Biotechnol 2020; 104:9095-9108. [DOI: 10.1007/s00253-020-10902-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 09/03/2020] [Accepted: 09/10/2020] [Indexed: 11/29/2022]
|
44
|
Bonnin EA, Rizzoli SO. Novel Secondary Ion Mass Spectrometry Methods for the Examination of Metabolic Effects at the Cellular and Subcellular Levels. Front Behav Neurosci 2020; 14:124. [PMID: 32792922 PMCID: PMC7384447 DOI: 10.3389/fnbeh.2020.00124] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 06/24/2020] [Indexed: 11/13/2022] Open
Abstract
The behavior of an animal has substantial effects on its metabolism. Such effects, including changes in the lipid composition of different organs, or changes in the turnover of the proteins, have typically been observed using liquid mass spectrometry methods, averaging the effect of animal behavior across tissue samples containing multiple cells. These methods have provided the scientific community with valuable information, but have limited resolution, making it difficult if not impossible to examine metabolic effects at the cellular and subcellular levels. Recent advances in the field of secondary ion mass spectrometry (SIMS) have made it possible to examine the metabolic effects of animal behavior with high resolution at the nanoscale, enabling the analysis of the metabolic effects of behavior on individual cells. In this review we summarize and present these emerging methods, beginning with an overview of the SIMS technique. We then discuss the specific application of nanoscale SIMS (NanoSIMS) to examine cell behavior. This often requires the use of isotope labeling to highlight specific sections of the cell for analysis, an approach that is presented at length in this review article. We also present SIMS applications concerning animal and cell behavior, from development and aging to changes in the cellular activity programs. We conclude that the emerging group of SIMS technologies represents an exciting set of tools for the study of animal behavior and of its effects on internal metabolism at the smallest possible scales.
Collapse
Affiliation(s)
- Elisa A. Bonnin
- Department of Neuro- and Sensory Physiology, Excellence Cluster Multiscale Bioimaging, University Medical Center Göttingen, Göttingen, Germany
- Center for Biostructural Imaging of Neurodegeneration (BIN), University Medical Center Göttingen, Göttingen, Germany
| | - Silvio O. Rizzoli
- Department of Neuro- and Sensory Physiology, Excellence Cluster Multiscale Bioimaging, University Medical Center Göttingen, Göttingen, Germany
- Center for Biostructural Imaging of Neurodegeneration (BIN), University Medical Center Göttingen, Göttingen, Germany
| |
Collapse
|
45
|
Liebal UW, Phan ANT, Sudhakar M, Raman K, Blank LM. Machine Learning Applications for Mass Spectrometry-Based Metabolomics. Metabolites 2020; 10:E243. [PMID: 32545768 PMCID: PMC7345470 DOI: 10.3390/metabo10060243] [Citation(s) in RCA: 180] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/09/2020] [Accepted: 06/11/2020] [Indexed: 12/20/2022] Open
Abstract
The metabolome of an organism depends on environmental factors and intracellular regulation and provides information about the physiological conditions. Metabolomics helps to understand disease progression in clinical settings or estimate metabolite overproduction for metabolic engineering. The most popular analytical metabolomics platform is mass spectrometry (MS). However, MS metabolome data analysis is complicated, since metabolites interact nonlinearly, and the data structures themselves are complex. Machine learning methods have become immensely popular for statistical analysis due to the inherent nonlinear data representation and the ability to process large and heterogeneous data rapidly. In this review, we address recent developments in using machine learning for processing MS spectra and show how machine learning generates new biological insights. In particular, supervised machine learning has great potential in metabolomics research because of the ability to supply quantitative predictions. We review here commonly used tools, such as random forest, support vector machines, artificial neural networks, and genetic algorithms. During processing steps, the supervised machine learning methods help peak picking, normalization, and missing data imputation. For knowledge-driven analysis, machine learning contributes to biomarker detection, classification and regression, biochemical pathway identification, and carbon flux determination. Of important relevance is the combination of different omics data to identify the contributions of the various regulatory levels. Our overview of the recent publications also highlights that data quality determines analysis quality, but also adds to the challenge of choosing the right model for the data. Machine learning methods applied to MS-based metabolomics ease data analysis and can support clinical decisions, guide metabolic engineering, and stimulate fundamental biological discoveries.
Collapse
Affiliation(s)
- Ulf W. Liebal
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| | - An N. T. Phan
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| | - Malvika Sudhakar
- Department of Biotechnology, Bhupat and Juoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai 600 036, India; (M.S.); (K.R.)
- Initiative for Biological Systems Engineering, IIT Madras, Chennai 600 036, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai 600 036, India
| | - Karthik Raman
- Department of Biotechnology, Bhupat and Juoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai 600 036, India; (M.S.); (K.R.)
- Initiative for Biological Systems Engineering, IIT Madras, Chennai 600 036, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai 600 036, India
| | - Lars M. Blank
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| |
Collapse
|
46
|
Branching out: what omics can tell us about primate evolution. Curr Opin Genet Dev 2020; 62:65-71. [DOI: 10.1016/j.gde.2020.06.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 06/02/2020] [Accepted: 06/04/2020] [Indexed: 12/25/2022]
|
47
|
Li S, Tian Y, Jiang P, Lin Y, Liu X, Yang H. Recent advances in the application of metabolomics for food safety control and food quality analyses. Crit Rev Food Sci Nutr 2020; 61:1448-1469. [PMID: 32441547 DOI: 10.1080/10408398.2020.1761287] [Citation(s) in RCA: 170] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
As one of the omics fields, metabolomics has unique advantages in facilitating the understanding of physiological and pathological activities in biology, physiology, pathology, and food science. In this review, based on developments in analytical chemistry tools, cheminformatics, and bioinformatics methods, we highlight the current applications of metabolomics in food safety, food authenticity and quality, and food traceability. Additionally, the combined use of metabolomics with other omics techniques for "foodomics" is comprehensively described. Finally, the latest developments and advances, practical challenges and limitations, and requirements related to the application of metabolomics are critically discussed, providing new insight into the application of metabolomics in food analysis.
Collapse
Affiliation(s)
- Shubo Li
- College of Light Industry and Food Engineering, Guangxi University, Nanning, China
| | - Yufeng Tian
- College of Light Industry and Food Engineering, Guangxi University, Nanning, China
| | - Pingyingzi Jiang
- College of Light Industry and Food Engineering, Guangxi University, Nanning, China
| | - Ying Lin
- College of Light Industry and Food Engineering, Guangxi University, Nanning, China
| | - Xiaoling Liu
- College of Light Industry and Food Engineering, Guangxi University, Nanning, China
| | - Hongshun Yang
- Department of Food Science & Technology, National University of Singapore, Singapore, Singapore
| |
Collapse
|
48
|
Integrating molecular markers into metabolic models improves genomic selection for Arabidopsis growth. Nat Commun 2020; 11:2410. [PMID: 32415110 PMCID: PMC7229213 DOI: 10.1038/s41467-020-16279-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 04/21/2020] [Indexed: 02/05/2023] Open
Abstract
The current trends of crop yield improvements are not expected to meet the projected rise in demand. Genomic selection uses molecular markers and machine learning to identify superior genotypes with improved traits, such as growth. Plant growth directly depends on rates of metabolic reactions which transform nutrients into the building blocks of biomass. Here, we predict growth of Arabidopsis thaliana accessions by employing genomic prediction of reaction rates estimated from accession-specific metabolic models. We demonstrate that, comparing to classical genomic selection on the available data sets for 67 accessions, our approach improves the prediction accuracy for growth within and across nitrogen environments by 32.6% and 51.4%, respectively, and from optimal nitrogen to low carbon environment by 50.4%. Therefore, integration of molecular markers into metabolic models offers an approach to predict traits directly related to metabolism, and its usefulness in breeding can be examined by gathering matching datasets in crops. An increase in genomic selection (GS) accuracy can accelerate genetic gain by shortening the breeding cycles. Here, the authors introduce a network-based GS method that uses metabolic models and improves the prediction accuracy of Arabidopsis growth within and across environments.
Collapse
|
49
|
Fenizia C, Saulle I, Clerici M, Biasin M. Genetic and epigenetic regulation of natural resistance to HIV-1 infection: new approaches to unveil the HESN secret. Expert Rev Clin Immunol 2020; 16:429-445. [PMID: 32085689 DOI: 10.1080/1744666x.2020.1732820] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Introduction: Since the identification of HIV, several studies reported the unusual case of small groups of subjects showing natural resistance to HIV infection. These subjects are referred to as HIV-1-exposed seronegative (HESN) individuals and include people located in different areas, with diverse ethnic backgrounds and routes of exposure. The mechanism/s responsible for protection from infection in HESN individuals are basically indefinite and most likely are multifactorial.Areas covered: Host factors, including genetic background as well as natural and acquired immunity, have all been associated with this phenomenon. Recently, epigenetic factors have been investigated as possible determinants of reduced susceptibility to HIV infection. With the advent of the OMICS era, the availability of techniques such as GWAS, RNAseq, and exome-sequencing in both bulk cell populations and single cells will likely lead to great strides in the understanding of the HESN mystery.Expert opinion: The employment of increasingly sophisticated techniques is allowing the gathering of enormous amounts of data. The integration of such information will provide important hints that could lead to the identification of viral and host correlates of protection against HIV infection, allowing the development of more effective preventative and therapeutic regimens.
Collapse
Affiliation(s)
- Claudio Fenizia
- Department of Physiopathology and Transplantation, University of Milan, Milan, Italy
| | - Irma Saulle
- Department of Biomedical and Clinical Sciences "L. Sacco", University of Milan, Milan, Italy
| | - Mario Clerici
- Department of Physiopathology and Transplantation, University of Milan, Milan, Italy.,Don C. Gnocchi Foundation ONLUS, IRCCS, Milan, Italy
| | - Mara Biasin
- Department of Biomedical and Clinical Sciences "L. Sacco", University of Milan, Milan, Italy
| |
Collapse
|
50
|
Abstract
Over the past decade, it has become exceedingly clear that the microbiome is a critical factor in human health and disease and thus should be investigated to develop innovative treatment strategies. The field of metagenomics has come a long way in leveraging the advances of next-generation sequencing technologies resulting in the capability to identify and quantify all microorganisms present in human specimens. However, the field of metagenomics is still in its infancy, specifically in regard to the limitations in computational analysis, statistical assessments, standardization, and validation due to vast variability in the cohorts themselves, experimental design, and bioinformatic workflows. This review summarizes the methods, technologies, computational tools, and model systems for characterizing and studying the microbiome. We also discuss important considerations investigators must make when interrogating the involvement of the microbiome in health and disease in order to establish robust results and mechanistic insights before moving into therapeutic design and intervention.
Collapse
|