1
|
Sattari A. Machine learning in biofluid mechanics: A review of recent developments. Comput Biol Med 2025; 193:110410. [PMID: 40413894 DOI: 10.1016/j.compbiomed.2025.110410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 05/11/2025] [Accepted: 05/17/2025] [Indexed: 05/27/2025]
Abstract
This review paper comprehensively examines recent advancements in machine learning (ML) applications within biofluid mechanics, with a targeted focus on enabling clinically actionable diagnostics and simulations. It demonstrates how ML, and in particular physics-informed ML methods, are used to enhance the analysis and understanding of intricate biofluid dynamics. The review systematically analyzes various ML techniques, detailing their strengths and limitations in modeling biofluid behaviors. By integrating physics-informed ML methods, such as Physics-Informed Neural Networks (PINNs), this work addresses critical challenges in translating complex biofluid dynamics into practical clinical tools. Differentiating itself from previous literature, this review not only summarizes current methods but also proposes potential solutions-including data augmentation, transfer learning, and hybrid modeling approaches (e.g., PINNs)-to overcome challenges related to limited datasets and the integration of complex physics. The review emphasizes ML's ability to enhance diagnostic accuracy, enable personalized treatment strategies, and accelerate computational simulations for applications like cardiovascular disease detection and respiratory disorder diagnosis, with findings showing that ML-driven approaches can reduce diagnostic errors by up to 30 % in cardiovascular applications and improve early detection rates in metabolomics-based diagnostics. Findings indicate that while ML techniques have significantly improved predictive capabilities in biofluid dynamics, challenges such as data scarcity and multi-scale physical integration remain critical. By outlining strategies to bridge the gap between ML advancements and clinical implementation, this review provides a robust framework for future research aimed at integrating ML with biofluid mechanics to revolutionize healthcare delivery. The paper concludes by identifying future research directions aimed at further integrating ML with domain-specific physical insights to achieve more reliable and accurate biofluid models.
Collapse
Affiliation(s)
- Amirmohammad Sattari
- Department of Mechanical Engineering, K. N. Toosi University of Technology, Tehran, Iran.
| |
Collapse
|
2
|
Vigo F, Tozzi A, Lombardo FC, Eugster M, Kavvadias V, Brogle R, Rigert J, Heinzelmann-Schwarz V, Kavvadias T. Binary classification of gynecological cancers based on ATR-FTIR spectroscopy and machine learning using urine samples. Clin Exp Med 2025; 25:143. [PMID: 40343586 PMCID: PMC12064457 DOI: 10.1007/s10238-025-01684-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2025] [Accepted: 04/13/2025] [Indexed: 05/11/2025]
Abstract
Making an early diagnosis of cancer still in the early stages, when completely asymptomatic, is the challenge modern medicine has been setting for several decades. In gynecology, no effective screening has yet been found and approved for endometrial and ovarian cancer. Mammography is an effective screening method for Breast Cancer, as well as Pap Test for Cervical Cancer, but they are underused in third world countries because of their expensive and specific instrumentation. Previous studies showed how "machine learning analysis methods" of the spectral information obtained from dried urine samples could provide good accuracy in differentiation between healthy and ovarian or endometrial cancer. In this study, we also apply ATR-FTIR spectrometry's practical, fast, and relatively inexpensive principles to liquid urine analysis from 309 patients undergoing surgical treatment for benign or malignant diseases (endometrium, breast, cervix, vulvar and ovarian cancer). The data obtained from those liquid samples were then analyzed to train a machine learning model to classify healthy VS cancer patients. We obtained an accuracy of > 91%, and we also identified discriminant wavelengths (2093, 1774 cm-1). These frequencies are close to already reported ones in other studies, indicating a possible association with tumor presence and/or progression.
Collapse
Affiliation(s)
- Francesco Vigo
- Department of Biomedicine, University of Basel, Basel, Switzerland
- Department of Gynecology, Clinic for Gynecology and Gynecologic Oncology, University Hospital of Basel, Spitalstrasse 21, 4055, Basel, Switzerland
| | - Alessandra Tozzi
- Department of Biomedicine, University of Basel, Basel, Switzerland
- Department of Gynecology, Clinic for Gynecology and Gynecologic Oncology, University Hospital of Basel, Spitalstrasse 21, 4055, Basel, Switzerland
| | - Flavio C Lombardo
- Department of Biomedicine, University of Basel, Basel, Switzerland
- Department of Gynecology, Clinic for Gynecology and Gynecologic Oncology, University Hospital of Basel, Spitalstrasse 21, 4055, Basel, Switzerland
| | - Muriel Eugster
- Department of Biomedicine, University of Basel, Basel, Switzerland
| | | | - Rahel Brogle
- Medicine, University of Basel, Basel, Switzerland
| | - Julia Rigert
- Medicine, University of Basel, Basel, Switzerland
| | - Viola Heinzelmann-Schwarz
- Department of Biomedicine, University of Basel, Basel, Switzerland
- Department of Gynecology, Clinic for Gynecology and Gynecologic Oncology, University Hospital of Basel, Spitalstrasse 21, 4055, Basel, Switzerland
| | - Tilemachos Kavvadias
- Department of Gynecology, Clinic for Gynecology and Gynecologic Oncology, University Hospital of Basel, Spitalstrasse 21, 4055, Basel, Switzerland.
| |
Collapse
|
3
|
Petrova B, Guler AT. Recent Developments in Single-Cell Metabolomics by Mass Spectrometry─A Perspective. J Proteome Res 2025; 24:1493-1518. [PMID: 39437423 PMCID: PMC11976873 DOI: 10.1021/acs.jproteome.4c00646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 10/07/2024] [Accepted: 10/15/2024] [Indexed: 10/25/2024]
Abstract
Recent advancements in single-cell (sc) resolution analyses, particularly in sc transcriptomics and sc proteomics, have revolutionized our ability to probe and understand cellular heterogeneity. The study of metabolism through small molecules, metabolomics, provides an additional level of information otherwise unattainable by transcriptomics or proteomics by shedding light on the metabolic pathways that translate gene expression into functional outcomes. Metabolic heterogeneity, critical in health and disease, impacts developmental outcomes, disease progression, and treatment responses. However, dedicated approaches probing the sc metabolome have not reached the maturity of other sc omics technologies. Over the past decade, innovations in sc metabolomics have addressed some of the practical limitations, including cell isolation, signal sensitivity, and throughput. To fully exploit their potential in biological research, however, remaining challenges must be thoroughly addressed. Additionally, integrating sc metabolomics with orthogonal sc techniques will be required to validate relevant results and gain systems-level understanding. This perspective offers a broad-stroke overview of recent mass spectrometry (MS)-based sc metabolomics advancements, focusing on ongoing challenges from a biologist's viewpoint, aimed at addressing pertinent and innovative biological questions. Additionally, we emphasize the use of orthogonal approaches and showcase biological systems that these sophisticated methodologies are apt to explore.
Collapse
Affiliation(s)
- Boryana Petrova
- Medical
University of Vienna, Vienna 1090, Austria
- Department
of Pathology, Boston Children’s Hospital, Boston, Massachusetts 02115, United States
| | - Arzu Tugce Guler
- Department
of Pathology, Boston Children’s Hospital, Boston, Massachusetts 02115, United States
- Institute
for Experiential AI, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
4
|
Gan Q, Jiang T, Li C, Gong X, Zhang J, Desai BK, Yan Y. De novo biosynthesis of 4,6-dihydroxycoumarin in Escherichia coli. GREEN CHEMISTRY : AN INTERNATIONAL JOURNAL AND GREEN CHEMISTRY RESOURCE : GC 2025; 27:3064-3076. [PMID: 40013057 PMCID: PMC11848710 DOI: 10.1039/d4gc05694a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Accepted: 02/12/2025] [Indexed: 02/28/2025]
Abstract
Coumarins and their derivatives possess crucial biochemical and pharmaceutical properties. However, the exploration of the coumarin biosynthesis pathways remains limited, restricting their microbial biosynthesis, especially for hydroxycoumarins. In this work, we designed and verified novel artificial pathways to produce a valuable compound 4,6-dihydroxycoumarin (4,6-DHC) in Escherichia coli. Based on the retrosynthesis analysis, multiple routes were designed and verified by extending the shikimate pathway, screening the potential enzymes, and characterizing the enzymes involved. Rare codon optimization and protein engineering strategies were applied to optimize the rate-limiting steps. De novo biosynthesis of 4,6-DHC was achieved using the cheap carbon source glycerol, and the titer can reach 18.3 ± 0.7 mg L-1. Ultimately, inducible regulation of critical pathway genes with a tetracycline-inducible controller yielded a significant boost in 4,6-DHC production, achieving a titer of 56.7 ± 2.1 mg L-1. This research successfully created a microbial platform for 4,6-dihydroxycoumarin production and demonstrated a generalizable strategy for synthesizing valuable compounds.
Collapse
Affiliation(s)
- Qi Gan
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Tian Jiang
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Chenyi Li
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Xinyu Gong
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Jianli Zhang
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Bhaven K Desai
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| | - Yajun Yan
- School of Chemical, Materials and Biomedical Engineering, College of Engineering, University of Georgia Athens GA 30602 USA
| |
Collapse
|
5
|
Lateef Junaid MA. Artificial intelligence driven innovations in biochemistry: A review of emerging research frontiers. BIOMOLECULES & BIOMEDICINE 2025; 25:739-750. [PMID: 39819459 PMCID: PMC11959397 DOI: 10.17305/bb.2024.11537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Revised: 12/15/2024] [Accepted: 12/15/2024] [Indexed: 01/19/2025]
Abstract
Artificial intelligence (AI) has become a powerful tool in biochemistry, greatly enhancing research capabilities by enabling the analysis of complex datasets, predicting molecular interactions, and accelerating drug discovery. As AI continues to evolve, its applications in biochemistry are poised to expand, revolutionizing both theoretical and applied research. This review explores current and potential AI applications in biochemistry, with a focus on data analysis, molecular modeling, enzyme engineering, and metabolic pathway studies. Key AI techniques-such as machine learning algorithms, natural language processing, and AI-based molecular modeling-are discussed. The review also highlights emerging research areas benefiting from AI, including personalized medicine and synthetic biology. The methodology involves an extensive analysis of existing literature, particularly peer-reviewed studies on AI applications in biochemistry. AI-driven tools like AlphaFold, which have significantly advanced protein structure prediction, are evaluated alongside AI's role in expediting drug discovery. The review also addresses challenges such as data quality, model interpretability, and ethical considerations. Results indicate that AI has expanded the scope of biochemical research by facilitating large-scale data analysis, enhancing molecular simulations, and opening new avenues of inquiry. However, challenges remain, particularly in data handling and ethical concerns. In conclusion, AI is transforming biochemistry by driving innovation and expanding research possibilities. Future advancements in AI algorithms, interdisciplinary collaboration, and integration with automated techniques will be crucial to fully unlocking AI's potential in advancing biochemical research.
Collapse
Affiliation(s)
- Mohammed Abdul Lateef Junaid
- Department of Basic Medical Sciences, College of Medicine, Majmaah University, Al Majmaah, Kingdom of Saudi Arabia
| |
Collapse
|
6
|
Kavoni H, Savizi ISP, Lewis NE, Shojaosadati SA. Recent advances in culture medium design for enhanced production of monoclonal antibodies in CHO cells: A comparative study of machine learning and systems biology approaches. Biotechnol Adv 2025; 78:108480. [PMID: 39571767 DOI: 10.1016/j.biotechadv.2024.108480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 09/24/2024] [Accepted: 11/14/2024] [Indexed: 11/26/2024]
Abstract
The production of monoclonal antibodies (mAbs) using Chinese Hamster Ovary (CHO) cells has revolutionized the treatment of numerous diseases, solidifying their position as a cornerstone of the biopharmaceutical industry. However, achieving maximum mAb production while upholding strict product quality standards remains a significant hurdle. Optimizing cell culture media emerges as a critical factor in this endeavor, requiring a nuanced understanding of the complex interplay of nutrients, growth factors, and other components that profoundly influence cellular growth, productivity, and product quality. Significant strides have been made in media optimization, including techniques such as media blending, one factor at a time, and statistical design of experiments approaches. The present review provides a comprehensive analysis of the recent advancements in culture media design strategies, focusing on the comparative application of systems biology (SB) and machine learning (ML) approaches. The applications of SB and ML in optimizing CHO cell culture medium and successful examples of their use are summarized. Finally, we highlight the immense potential of integrating SB and ML, emphasizing the development of hybrid models that leverage the strengths of both approaches for robust, efficient, and scalable optimization of mAb production in CHO cells. This review provides a roadmap for researchers and industry professionals to navigate the complex landscape of mAb production optimization, paving the way for developing next-generation CHO cell culture media that drive significant improvements in yield and productivity.
Collapse
Affiliation(s)
- Hossein Kavoni
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Iman Shahidi Pour Savizi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Nathan E Lewis
- Department of Bioengineering, University of California, San Diego, CA, USA; Department of Pediatrics, University of California, San Diego, CA, USA
| | - Seyed Abbas Shojaosadati
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran.
| |
Collapse
|
7
|
Josyula JVN, JeanPierre AR, Jorvekar SB, Adla D, Mariappan V, Pulimamidi SS, Green SR, Pillai AB, Borkar RM, Mutheneni SR. Metabolomic profiling of dengue infection: unraveling molecular signatures by LC-MS/MS and machine learning models. Metabolomics 2024; 20:104. [PMID: 39305446 DOI: 10.1007/s11306-024-02169-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 09/02/2024] [Indexed: 10/22/2024]
Abstract
BACKGROUND & OBJECTIVE The progression of dengue fever to severe dengue (SD) is a major public health concern that impairs the capacity of the medical system to predict and treat dengue patients. Hence, the present study used a metabolomic approach integrated with machine models to identify differentially expressed metabolites in patients with SD compared to nonsevere patients and healthy controls. METHODS Comprehensively, the plasma was collected at different clinical phases during dengue without warning signs (DWOW, N = 10), dengue with warning signs (DWW, N = 10), and SD (N = 10) at different stages [i.e., day of admission (DOA), day of defervescence (DOD), and day of convalescent (DOC)] in comparison to healthy control (HC). The samples were subjected to LC‒ESI‒MS/MS to identify metabolites. Statistical and machine learning analyses were performed using R and Python language. Further, biomarker, pathway and correlation analysis was performed to identify potential predictors of dengue. RESULTS & CONCLUSION A total of 423 metabolites were identified in all the study groups. Paired and unpaired t-tests revealed 14 highly differentially expressed metabolites between and across the dengue groups, with four metabolites (shikimic acid, ureidosuccinic acid, propionyl carnitine, and alpha-tocopherol) showing significant differences compared to HC. Furthermore, biomarker (ROC) analysis revealed 11 potential molecules with a significant AUC value of 1 that could serve as potential biomarkers for identifying different dengue clinical stages that are beneficial for predicting dengue disease outcomes. The logistic regression model revealed that S-adenosylhomocysteine, hypotaurine, and shikimic acid metabolites could be beneficial indicators for predicting severe dengue, with an accuracy and AUC of 0.75. The data showed that dengue infection is related to lipid metabolism, oxidative stress, inflammation, metabolomic adaptation, and virus manipulation. Moreover, the biomarkers had a significant correlation with biochemical parameters like platelet count, and hematocrit. These results shed some light on host-derived small-molecule biomarkers that are associated with dengue severity and novel insights into metabolomics mechanisms interlinked with disease severity.
Collapse
Affiliation(s)
- Jhansi Venkata Nagamani Josyula
- Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, Telangana, 500007, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Aashika Raagavi JeanPierre
- Mahatma Gandhi Medical Advanced Research Institute (MGMARI), Sri Balaji Vidyapeeth (Deemed to be University), Puducherry, 607 402, India
| | - Sachin B Jorvekar
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, Assam, 781101, India
| | - Deepthi Adla
- Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, Telangana, 500007, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India
| | - Vignesh Mariappan
- Mahatma Gandhi Medical Advanced Research Institute (MGMARI), Sri Balaji Vidyapeeth (Deemed to be University), Puducherry, 607 402, India
| | - Sai Sharanya Pulimamidi
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, Assam, 781101, India
| | - Siva Ranganathan Green
- Mahatma Gandhi Medical College and Research Institute (MGMCRI), Sri Balaji Vidyapeeth (Deemed to be University), Puducherry, 607 402, India
| | - Agieshkumar Balakrishna Pillai
- Mahatma Gandhi Medical Advanced Research Institute (MGMARI), Sri Balaji Vidyapeeth (Deemed to be University), Puducherry, 607 402, India.
| | - Roshan M Borkar
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, Assam, 781101, India
| | - Srinivasa Rao Mutheneni
- Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, Telangana, 500007, India.
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
8
|
Chi J, Shu J, Li M, Mudappathi R, Jin Y, Lewis F, Boon A, Qin X, Liu L, Gu H. Artificial Intelligence in Metabolomics: A Current Review. Trends Analyt Chem 2024; 178:117852. [PMID: 39071116 PMCID: PMC11271759 DOI: 10.1016/j.trac.2024.117852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Metabolomics and artificial intelligence (AI) form a synergistic partnership. Metabolomics generates large datasets comprising hundreds to thousands of metabolites with complex relationships. AI, aiming to mimic human intelligence through computational modeling, possesses extraordinary capabilities for big data analysis. In this review, we provide a recent overview of the methodologies and applications of AI in metabolomics studies in the context of systems biology and human health. We first introduce the AI concept, history, and key algorithms for machine learning and deep learning, summarizing their strengths and weaknesses. We then discuss studies that have successfully used AI across different aspects of metabolomic analysis, including analytical detection, data preprocessing, biomarker discovery, predictive modeling, and multi-omics data integration. Lastly, we discuss the existing challenges and future perspectives in this rapidly evolving field. Despite limitations and challenges, the combination of metabolomics and AI holds great promises for revolutionary advancements in enhancing human health.
Collapse
Affiliation(s)
- Jinhua Chi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Jingmin Shu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Ming Li
- Phoenix VA Health Care System, Phoenix, AZ 85012, USA
- University of Arizona College of Medicine, Phoenix, AZ 85004, USA
| | - Rekha Mudappathi
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Yan Jin
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Freeman Lewis
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Alexandria Boon
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| | - Xiaoyan Qin
- College of Liberal Arts and Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Li Liu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, AZ 85281, USA
| | - Haiwei Gu
- College of Health Solutions, Arizona State University, Phoenix, AZ 85004, USA
- Center for Translational Science, Florida International University, Port St. Lucie, FL 34987, USA
| |
Collapse
|
9
|
Ahmad Z, Shareen, Ganie IB, Firdaus F, Ramakrishnan M, Shahzad A, Ding Y. Enhancing Withanolide Production in the Withania Species: Advances in In Vitro Culture and Synthetic Biology Approaches. PLANTS (BASEL, SWITZERLAND) 2024; 13:2171. [PMID: 39124289 PMCID: PMC11313931 DOI: 10.3390/plants13152171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 07/30/2024] [Accepted: 08/02/2024] [Indexed: 08/12/2024]
Abstract
Withanolides are naturally occurring steroidal lactones found in certain species of the Withania genus, especially Withania somnifera (commonly known as Ashwagandha). These compounds have gained considerable attention due to their wide range of therapeutic properties and potential applications in modern medicine. To meet the rapidly growing demand for withanolides, innovative approaches such as in vitro culture techniques and synthetic biology offer promising solutions. In recent years, synthetic biology has enabled the production of engineered withanolides using heterologous systems, such as yeast and bacteria. Additionally, in vitro methods like cell suspension culture and hairy root culture have been employed to enhance withanolide production. Nevertheless, one of the primary obstacles to increasing the production of withanolides using these techniques has been the intricacy of the biosynthetic pathways for withanolides. The present article examines new developments in withanolide production through in vitro culture. A comprehensive summary of viable traditional methods for producing withanolide is also provided. The development of withanolide production in heterologous systems is examined and emphasized. The use of machine learning as a potent tool to model and improve the bioprocesses involved in the generation of withanolide is then discussed. In addition, the control and modification of the withanolide biosynthesis pathway by metabolic engineering mediated by CRISPR are discussed.
Collapse
Affiliation(s)
- Zishan Ahmad
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Centre for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, School of Life Sciences, Nanjing Forestry University, Nanjing 210037, China; (Z.A.); (M.R.)
| | - Shareen
- Department of Environmental Engineering, College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China;
| | - Irfan Bashir Ganie
- Department of Botany, Aligarh Muslim University, Aligarh 202002, India; (I.B.G.); (A.S.)
| | - Fatima Firdaus
- Chemistry Department, Lucknow University, Lucknow 226007, India;
| | - Muthusamy Ramakrishnan
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Centre for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, School of Life Sciences, Nanjing Forestry University, Nanjing 210037, China; (Z.A.); (M.R.)
| | - Anwar Shahzad
- Department of Botany, Aligarh Muslim University, Aligarh 202002, India; (I.B.G.); (A.S.)
| | - Yulong Ding
- State Key Laboratory of Tree Genetics and Breeding, Co-Innovation Centre for Sustainable Forestry in Southern China, Bamboo Research Institute, Key Laboratory of National Forestry and Grassland Administration on Subtropical Forest Biodiversity Conservation, School of Life Sciences, Nanjing Forestry University, Nanjing 210037, China; (Z.A.); (M.R.)
| |
Collapse
|
10
|
Helmy M, Elhalis H, Rashid MM, Selvarajoo K. Can digital twin efforts shape microorganism-based alternative food? Curr Opin Biotechnol 2024; 87:103115. [PMID: 38547588 DOI: 10.1016/j.copbio.2024.103115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 06/09/2024]
Abstract
With the continuous increment in global population growth, compounded by post-pandemic food security challenges due to labor shortages, effects of climate change, political conflicts, limited land for agriculture, and carbon emissions control, addressing food production in a sustainable manner for future generations is critical. Microorganisms are potential alternative food sources that can help close the gap in food production. For the development of more efficient and yield-enhancing products, it is necessary to have a better understanding on the underlying regulatory molecular pathways of microbial growth. Nevertheless, as microbes are regulated at multiomics scales, current research focusing on single omics (genomics, proteomics, or metabolomics) independently is inadequate for optimizing growth and product output. Here, we discuss digital twin (DT) approaches that integrate systems biology and artificial intelligence in analyzing multiomics datasets to yield a microbial replica model for in silico testing before production. DT models can thus provide a holistic understanding of microbial growth, metabolite biosynthesis mechanisms, as well as identifying crucial production bottlenecks. Our argument, therefore, is to support the development of novel DT models that can potentially revolutionize microorganism-based alternative food production efficiency.
Collapse
Affiliation(s)
- Mohamed Helmy
- Vaccine and Infectious Disease Organization (VIDO), University of Saskatchewan, SK, Canada; Department of Computer Science, Lakehead University, ON, Canada; Department of Computer Science, College of Science and Engineering, Idaho State University, ID, USA; Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore
| | - Hosam Elhalis
- Research School of Biology, Australian National University, Canberra, Australia
| | - Md Mamunur Rashid
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore
| | - Kumar Selvarajoo
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore; Synthetic Biology Translational Research Program and SynCTI, Yong Loo Lin School of Medicine, National University of Singapore (NUS), Singapore 117456, Singapore; School of Biological Sciences, Nanyang Technological University (NTU), Singapore 637551, Singapore.
| |
Collapse
|
11
|
Siddharth T, Lewis NE. Predicting pathways for old and new metabolites through clustering. J Theor Biol 2024; 578:111684. [PMID: 38048983 PMCID: PMC11139542 DOI: 10.1016/j.jtbi.2023.111684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 12/06/2023]
Abstract
The diverse metabolic pathways are fundamental to all living organisms, as they harvest energy, synthesize biomass components, produce molecules to interact with the microenvironment, and neutralize toxins. While the discovery of new metabolites and pathways continues, the prediction of pathways for new metabolites can be challenging. It can take vast amounts of time to elucidate pathways for new metabolites; thus, according to HMDB (Human Metabolome Database), only 60% of metabolites get assigned to pathways. Here, we present an approach to identify pathways based on metabolite structure. We extracted 201 features from SMILES annotations and identified new metabolites from PubMed abstracts and HMDB. After applying clustering algorithms to both groups of features, we quantified correlations between metabolites, and found the clusters accurately linked 92% of known metabolites to their respective pathways. Thus, this approach could be valuable for predicting metabolic pathways for new metabolites.
Collapse
Affiliation(s)
- Thiru Siddharth
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Bhopal, MP 462003, India
| | - Nathan E Lewis
- Department of Pediatrics and Bioengineering, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
12
|
de Vries R, Farzan N, Fabius T, De Jongh FHC, Jak PMC, Haarman EG, Snoey E, In 't Veen JCCM, Dagelet YWF, Maitland-Van Der Zee AH, Lucas A, Van Den Heuvel MM, Wolf-Lansdorf M, Muller M, Baas P, Sterk PJ. Prospective Detection of Early Lung Cancer in Patients With COPD in Regular Care by Electronic Nose Analysis of Exhaled Breath. Chest 2023; 164:1315-1324. [PMID: 37209772 PMCID: PMC10635840 DOI: 10.1016/j.chest.2023.04.050] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 04/18/2023] [Accepted: 04/25/2023] [Indexed: 05/22/2023] Open
Abstract
BACKGROUND Patients with COPD are at high risk of lung cancer developing, but no validated predictive biomarkers have been reported to identify these patients. Molecular profiling of exhaled breath by electronic nose (eNose) technology may qualify for early detection of lung cancer in patients with COPD. RESEARCH QUESTION Can eNose technology be used for prospective detection of early lung cancer in patients with COPD? STUDY DESIGN AND METHODS BreathCloud is a real-world multicenter prospective follow-up study using diagnostic and monitoring visits in day-to-day clinical care of patients with a standardized diagnosis of asthma, COPD, or lung cancer. Breath profiles were collected at inclusion in duplicate by a metal-oxide semiconductor eNose positioned at the rear end of a pneumotachograph (SpiroNose; Breathomix). All patients with COPD were managed according to standard clinical care, and the incidence of clinically diagnosed lung cancer was prospectively monitored for 2 years. Data analysis involved advanced signal processing, ambient air correction, and statistics based on principal component (PC) analysis, linear discriminant analysis, and receiver operating characteristic analysis. RESULTS Exhaled breath data from 682 patients with COPD and 211 patients with lung cancer were available. Thirty-seven patients with COPD (5.4%) demonstrated clinically manifest lung cancer within 2 years after inclusion. Principal components 1, 2, and 3 were significantly different between patients with COPD and those with lung cancer in both training and validation sets with areas under the receiver operating characteristic curve of 0.89 (95% CI, 0.83-0.95) and 0.86 (95% CI, 0.81-0.89). The same three PCs showed significant differences (P < .01) at baseline between patients with COPD who did and did not subsequently demonstrate lung cancer within 2 years, with a cross-validation value of 87% and an area under the receiver operating characteristic curve of 0.90 (95% CI, 0.84-0.95). INTERPRETATION Exhaled breath analysis by eNose identified patients with COPD in whom lung cancer became clinically manifest within 2 years after inclusion. These results show that eNose assessment may detect early stages of lung cancer in patients with COPD.
Collapse
Affiliation(s)
- Rianne de Vries
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands; Breathomix B.V, Leiden, The Netherlands.
| | | | - Timon Fabius
- Medisch Spectrum Twente, Enschede, The Netherlands
| | | | - Patrick M C Jak
- Emma Children's Hospital, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Eric G Haarman
- Emma Children's Hospital, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Erik Snoey
- Franciscus Gasthuis & Vlietland, Rotterdam, The Netherlands
| | | | | | - Anke-Hilse Maitland-Van Der Zee
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | | | | | | | - Mirte Muller
- The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Paul Baas
- The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Peter J Sterk
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
13
|
Ateia M, Sigmund G, Bentel MJ, Washington JW, Lai A, Merrill NH, Wang Z. Integrated data-driven cross-disciplinary framework to prevent chemical water pollution. ONE EARTH (CAMBRIDGE, MASS.) 2023; 6:10.1016/j.oneear.2023.07.001. [PMID: 38264630 PMCID: PMC10802893 DOI: 10.1016/j.oneear.2023.07.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Abstract
Access to a clean and healthy environment is a human right and a prerequisite for maintaining a sustainable ecosystem. Experts across domains along the chemical life cycle have traditionally operated in isolation, leading to limited connectivity between upstream chemical innovation to downstream development of water-treatment technologies. This fragmented and historically reactive approach to managing emerging contaminants has resulted in significant externalized societal costs. Herein, we propose an integrated data-driven framework to foster proactive action across domains to effectively address chemical water pollution. By implementing this integrated framework, it will not only enhance the capabilities of experts in their respective fields but also create opportunities for novel approaches that yield co-benefits across multiple domains. To successfully operationalize the integrated framework, several concerted efforts are warranted, including adopting open and FAIR (findable, accessible, interoperable, and reusable) data practices, developing common knowledge bases/platforms, and staying vigilant against new substance "properties" of concern.
Collapse
Affiliation(s)
- Mohamed Ateia
- United States Environmental Protection Agency, Center for Environmental Solutions & Emergency Response, Cincinnati, OH 45220, USA
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, TX, USA
| | - Gabriel Sigmund
- Environmental Geosciences, Centre for Microbiology and Environmental Systems Science, University of Vienna, Josef-Holaubeck-Platz 2, 1090 Vienna, Austria
- Environmental Technology, Wageningen University & Research, P.O. Box 17, 6700 AA Wageningen, the Netherlands
| | - Michael J. Bentel
- Department of Environmental Engineering and Earth Sciences, Clemson University, Clemson, SC 29634, USA
| | - John W. Washington
- United States Environmental Protection Agency, Center for Environmental Measurement and Modeling, Athens, GA 30605, USA
| | - Adelene Lai
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367 Belvaux, Luxembourg
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University, 07743 Jena, Germany
| | - Nathaniel H. Merrill
- United States Environmental Protection Agency, Center for Environmental Measurement and Modeling, Narragansett, RI, USA
| | - Zhanyun Wang
- Empa Swiss – Federal Laboratories for Materials Science and Technology, Technology and Society Laboratory, 9014 St. Gallen, Switzerland
| |
Collapse
|
14
|
Shah HA, Liu J, Yang Z, Yang F, Zhang Q, Feng J. DeepRT: Predicting compounds presence in pathway modules and classifying into module classes using deep neural networks based on molecular properties. J Bioinform Comput Biol 2023; 21:2350017. [PMID: 37632195 DOI: 10.1142/s0219720023500178] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/27/2023]
Abstract
Metabolic pathways play a crucial role in understanding the biochemistry of organisms. In metabolic pathways, modules refer to clusters of interconnected reactions or sub-networks representing specific functional units or biological processes within the overall pathway. In pathway modules, compounds are major elements and refer to the various molecules that participate in the biochemical reactions within the pathway modules. These molecules can include substrates, intermediates and final products. Determining the presence relation of compounds and pathway modules is essential for synthesizing new molecules and predicting hidden reactions. To date, several computational methods have been proposed to address this problem. However, all methods only predict the metabolic pathways and their types, not the pathway modules. To address this issue, we proposed a novel deep learning model, DeepRT that integrates message passing neural networks (MPNNs) and transformer encoder. This combination allows DeepRT to effectively extract global and local structure information from the molecular graph. The model is designed to perform two tasks: first, determining the present relation of the compound with the pathway module, and second, predicting the relation of query compound and module classes. The proposed DeepRT model evaluated on a dataset comprising compounds and pathway modules, and it outperforms existing approaches.
Collapse
Affiliation(s)
- Hayat Ali Shah
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Juan Liu
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Zhihui Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Feng Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Qiang Zhang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Jing Feng
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| |
Collapse
|
15
|
Sen P, Orešič M. Integrating Omics Data in Genome-Scale Metabolic Modeling: A Methodological Perspective for Precision Medicine. Metabolites 2023; 13:855. [PMID: 37512562 PMCID: PMC10383060 DOI: 10.3390/metabo13070855] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/11/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023] Open
Abstract
Recent advancements in omics technologies have generated a wealth of biological data. Integrating these data within mathematical models is essential to fully leverage their potential. Genome-scale metabolic models (GEMs) provide a robust framework for studying complex biological systems. GEMs have significantly contributed to our understanding of human metabolism, including the intrinsic relationship between the gut microbiome and the host metabolism. In this review, we highlight the contributions of GEMs and discuss the critical challenges that must be overcome to ensure their reproducibility and enhance their prediction accuracy, particularly in the context of precision medicine. We also explore the role of machine learning in addressing these challenges within GEMs. The integration of omics data with GEMs has the potential to lead to new insights, and to advance our understanding of molecular mechanisms in human health and disease.
Collapse
Affiliation(s)
- Partho Sen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| | - Matej Orešič
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| |
Collapse
|
16
|
Hassan S, Ganai BA. Deciphering the recent trends in pesticide bioremediation using genome editing and multi-omics approaches: a review. World J Microbiol Biotechnol 2023; 39:151. [PMID: 37029313 DOI: 10.1007/s11274-023-03603-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 04/03/2023] [Indexed: 04/09/2023]
Abstract
Pesticide pollution in recent times has emerged as a grave environmental problem contaminating both aquatic and terrestrial ecosystems owing to their widespread use. Bioremediation using gene editing and system biology could be developed as an eco-friendly and proficient tool to remediate pesticide-contaminated sites due to its advantages and greater public acceptance over the physical and chemical methods. However, it is indispensable to understand the different aspects associated with microbial metabolism and their physiology for efficient pesticide remediation. Therefore, this review paper analyses the different gene editing tools and multi-omics methods in microbes to produce relevant evidence regarding genes, proteins and metabolites associated with pesticide remediation and the approaches to contend against pesticide-induced stress. We systematically discussed and analyzed the recent reports (2015-2022) on multi-omics methods for pesticide degradation to elucidate the mechanisms and the recent advances associated with the behaviour of microbes under diverse environmental conditions. This study envisages that CRISPR-Cas, ZFN and TALEN as gene editing tools utilizing Pseudomonas, Escherichia coli and Achromobacter sp. can be employed for remediation of chlorpyrifos, parathion-methyl, carbaryl, triphenyltin and triazophos by creating gRNA for expressing specific genes for the bioremediation. Similarly, systems biology accompanying multi-omics tactics revealed that microbial strains from Paenibacillus, Pseudomonas putida, Burkholderia cenocepacia, Rhodococcus sp. and Pencillium oxalicum are capable of degrading deltamethrin, p-nitrophenol, chlorimuron-ethyl and nicosulfuron. This review lends notable insights into the research gaps and provides potential solutions for pesticide remediation by using different microbe-assisted technologies. The inferences drawn from the current study will help researchers, ecologists, and decision-makers gain comprehensive knowledge of value and application of systems biology and gene editing in bioremediation assessments.
Collapse
Affiliation(s)
- Shahnawaz Hassan
- Department of Environmental Science, University of Kashmir, Srinagar, 190006, India.
| | - Bashir Ahmad Ganai
- Centre of Research for Development, University of Kashmir, Srinagar, 190006, India.
| |
Collapse
|
17
|
Miller WM, Ziegler KM, Yilmaz A, Saiyed N, Ustun I, Akyol S, Idler J, Sims MD, Maddens ME, Graham SF. Association of Metabolomic Biomarkers with Sleeve Gastrectomy Weight Loss Outcomes. Metabolites 2023; 13:metabo13040506. [PMID: 37110164 PMCID: PMC10145663 DOI: 10.3390/metabo13040506] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 03/29/2023] [Accepted: 03/29/2023] [Indexed: 04/03/2023] Open
Abstract
This prospective observational study aimed to evaluate the association of metabolomic alterations with weight loss outcomes following sleeve gastrectomy (SG). We evaluated the metabolomic profile of serum and feces prior to SG and three months post-SG, along with weight loss outcomes in 45 adults with obesity. The percent total weight loss for the highest versus the lowest weight loss tertiles (T3 vs. T1) was 17.0 ± 1.3% and 11.1 ± 0.8%, p < 0.001. Serum metabolite alterations specific to T3 at three months included a decrease in methionine sulfoxide concentration as well as alterations to tryptophan and methionine metabolism (p < 0.03). Fecal metabolite changes specific to T3 included a decrease in taurine concentration and perturbations to arachidonic acid metabolism, and taurine and hypotaurine metabolism (p < 0.002). Preoperative metabolites were found to be highly predictive of weight loss outcomes in machine learning algorithms, with an average area under the curve of 94.6% for serum and 93.4% for feces. This comprehensive metabolomics analysis of weight loss outcome differences post-SG highlights specific metabolic alterations as well as machine learning algorithms predictive of weight loss. These findings could contribute to the development of novel therapeutic targets to enhance weight loss outcomes after SG.
Collapse
Affiliation(s)
- Wendy M. Miller
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Kathryn M. Ziegler
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Ali Yilmaz
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Nazia Saiyed
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Ilyas Ustun
- DePaul University Jarvis College of Computing and Digital Media, 243 S Wabash Ave, Chicago, IL 60604, USA
| | - Sumeyya Akyol
- NX Prenatal Inc. Laboratory, 4800 Fournace Place, Suite BW28, Bellaire, TX 77401, USA
| | - Jay Idler
- Allegheny Health Network, West Penn Hospital, 4815 Liberty Ave, Suite GR50, Pittsburgh, PA 15224, USA
- Drexel University College of Medicine, 2900 W Queen Ln, Philadelphia, PA 19129, USA
| | - Matthew D. Sims
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Michael E. Maddens
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Stewart F. Graham
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| |
Collapse
|
18
|
Zhang X, Su Y, Lane AN, Stromberg AJ, Fan TWM, Wang C. Bayesian kinetic modeling for tracer-based metabolomic data. BMC Bioinformatics 2023; 24:108. [PMID: 36949395 PMCID: PMC10035190 DOI: 10.1186/s12859-023-05211-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 02/24/2023] [Indexed: 03/24/2023] Open
Abstract
BACKGROUND Stable Isotope Resolved Metabolomics (SIRM) is a new biological approach that uses stable isotope tracers such as uniformly [Formula: see text]-enriched glucose ([Formula: see text]-Glc) to trace metabolic pathways or networks at the atomic level in complex biological systems. Non-steady-state kinetic modeling based on SIRM data uses sets of simultaneous ordinary differential equations (ODEs) to quantitatively characterize the dynamic behavior of metabolic networks. It has been increasingly used to understand the regulation of normal metabolism and dysregulation in the development of diseases. However, fitting a kinetic model is challenging because there are usually multiple sets of parameter values that fit the data equally well, especially for large-scale kinetic models. In addition, there is a lack of statistically rigorous methods to compare kinetic model parameters between different experimental groups. RESULTS We propose a new Bayesian statistical framework to enhance parameter estimation and hypothesis testing for non-steady-state kinetic modeling of SIRM data. For estimating kinetic model parameters, we leverage the prior distribution not only to allow incorporation of experts' knowledge but also to provide robust parameter estimation. We also introduce a shrinkage approach for borrowing information across the ensemble of metabolites to stably estimate the variance of an individual isotopomer. In addition, we use a component-wise adaptive Metropolis algorithm with delayed rejection to perform efficient Monte Carlo sampling of the posterior distribution over high-dimensional parameter space. For comparing kinetic model parameters between experimental groups, we propose a new reparameterization method that converts the complex hypothesis testing problem into a more tractable parameter estimation problem. We also propose an inference procedure based on credible interval and credible value. Our method is freely available for academic use at https://github.com/xuzhang0131/MCMCFlux . CONCLUSIONS Our new Bayesian framework provides robust estimation of kinetic model parameters and enables rigorous comparison of model parameters between experimental groups. Simulation studies and application to a lung cancer study demonstrate that our framework performs well for non-steady-state kinetic modeling of SIRM data.
Collapse
Affiliation(s)
- Xu Zhang
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA.
| | - Ya Su
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, 23220, USA
| | - Andrew N Lane
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, 40536, USA
| | - Arnold J Stromberg
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA
| | - Teresa W M Fan
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, 40536, USA
| | - Chi Wang
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA.
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA.
- Division of Cancer Biostatistics, Department of Internal Medicine, University of Kentucky, Lexington, 40536, USA.
| |
Collapse
|
19
|
Liu W, Zhang L, Bao L, Shen G, Feng J. Accurate Classification and Prediction of Acute Myocardial Infarction through an ARMD Procedure. J Proteome Res 2023; 22:758-767. [PMID: 36710647 DOI: 10.1021/acs.jproteome.2c00488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The risk stratification of acute myocardial infarction (AMI) patients is of prime importance for clinical management and prognosis assessment. Thus, we propose an ensemble machine learning analysis procedure named ADASYN-RFECV-MDA-DNN (ARMD) to address sample-unbalanced problems and enable stratification and prediction of AMI outcomes. The ARMD analysis procedure was applied to the NMR data of sera from 534 AMI-related subjects in four categories with an extremely imbalanced sample proportion. Firstly, the adaptive synthetic sampling (ADASYN) algorithm was used to address the issue of the original sample imbalance. Secondly, the recursive feature elimination with cross-validation (RFECV) processing and random forest mean decrease accuracy (RF-MDA) algorithm was performed to identify the differential metabolites corresponding to each AMI outcome. Finally, the deep neural network (DNN) was employed to classify and predict AMI events, and its performance was evaluated by comparing the four traditional machine learning methods. Compared with the other four machine learning models, DNN presented consistent superiority in almost all of the model parameters including precision, f1-score, sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and classification accuracy, highlighting the potential of deep learning in classification and stratification of clinical diseases. The ARMD analysis procedure was a practical analysis tool for supervised classification and regression modeling of clinical diseases.
Collapse
Affiliation(s)
- Wuping Liu
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Lirong Zhang
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Lijun Bao
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Guiping Shen
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Jianghua Feng
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| |
Collapse
|
20
|
Jala A, Dutta R, Josyula JVN, Mutheneni SR, Borkar RM. Environmental phenol exposure associates with urine metabolome alteration in young Northeast Indian females. CHEMOSPHERE 2023; 317:137830. [PMID: 36640981 DOI: 10.1016/j.chemosphere.2023.137830] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 12/12/2022] [Accepted: 01/10/2023] [Indexed: 06/17/2023]
Abstract
Urinary biomonitoring delivers the most accurate environmental phenols exposure assessment. However, environmental phenol exposure-related biomarkers are required to improve risk assessment to understand the internal processes perturbed, which may link exposure to specific health outcomes. This study aimed to investigate the association between environmental phenols exposure and the metabolome of young adult females from India. Urinary metabolomics was performed using liquid chromatography-mass spectrometry. Environmental phenols-related metabolic biomarkers were investigated by comparing the low and high exposure of environmental phenols. Seven potential biomarkers, namely histidine, cysteine-s-sulfate, 12-KETE, malonic acid, p-hydroxybenzoic acid, PE (36:2), and PS (36:0), were identified, revealing that environmental phenol exposure altered the metabolic pathways such as histidine metabolism, beta-Alanine metabolism, glycerophospholipid metabolism, and other pathways. This study also conceived an innovative strategy for the early prediction of diseases by combining urinary metabolomics with machine learning (ML) algorithms. The differential metabolites predictive accuracy by ML models was >80%. This is the first mass spectrometry-based metabolomics study on young adult females from India with environmental phenols exposure. The study is valuable in demonstrating multiple urine metabolic changes linked to environmental phenol exposure and a better understanding of the mechanisms behind environmental phenol-induced effects in young female adults.
Collapse
Affiliation(s)
- Aishwarya Jala
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, 781101, Assam, India
| | - Ratul Dutta
- Down Town Hospital, Guwahati, Assam, 781106, India
| | | | - Srinivasa Rao Mutheneni
- Applied Biology Division, CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, 500007, Telangana, India
| | - Roshan M Borkar
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, 781101, Assam, India.
| |
Collapse
|
21
|
Wijbenga N, Hoek RAS, Mathot BJ, Seghers L, Moor CC, Aerts JGJV, Bos D, Manintveld OC, Hellemons ME. Diagnostic performance of electronic nose technology in chronic lung allograft dysfunction. J Heart Lung Transplant 2023; 42:236-245. [PMID: 36283951 DOI: 10.1016/j.healun.2022.09.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 08/22/2022] [Accepted: 09/12/2022] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND There is a need for reliable biomarkers for the diagnosis of chronic lung allograft dysfunction (CLAD). In this light, we investigated the diagnostic value of exhaled breath analysis using an electronic nose (eNose) for CLAD, CLAD phenotype, and CLAD stage in lung transplant recipients (LTR). METHODS We performed eNose measurements in LTR with and without CLAD, visiting the outpatient clinic. Through supervised machine learning, the diagnostic value of eNose for CLAD was assessed in a random training and validation set. Next, we investigated the diagnostic value of the eNose measurements combined with known risk factors for CLAD. Model performance was evaluated using ROC-analysis. RESULTS We included 152 LTR (median age 60 years, 49% females), of whom 38 with CLAD. eNose-based classification of patients with and without CLAD provided an AUC of 0.86 in the training set, and 0.82 in the validation set. After adding established risk factors for CLAD (age, gender, type of transplantation, time after transplantation and prior occurrence of acute cellular rejection) to a model with the eNose data, the discriminative ability of the model improved to an AUC of 0.94 (p = 0.02) in the training set and 0.94 (p = 0.04) in the validation set. Discrimination between BOS and RAS was good (AUC 0.95). Discriminative ability for other phenotypes (AUCs ranging 0.50-0.92) or CLAD stages (AUC 0.56) was limited. CONCLUSION Exhaled breath analysis using eNose is a promising novel biomarker for enabling diagnosis and phenotyping CLAD. eNose technology could be a valuable addition to the diagnostic armamentarium for suspected graft failure in LTR.
Collapse
Affiliation(s)
- Nynke Wijbenga
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Rogier A S Hoek
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Bas J Mathot
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Leonard Seghers
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Catharina C Moor
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Joachim G J V Aerts
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Daniel Bos
- Department of Radiology & Nuclear Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Department of Epidemiology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Olivier C Manintveld
- Department of Cardiology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Merel E Hellemons
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands.
| |
Collapse
|
22
|
Aida H, Uchida K, Nagai M, Hashizume T, Masuo S, Takaya N, Ying BW. Machine learning-assisted medium optimization revealed the discriminated strategies for improved production of the foreign and native metabolites. Comput Struct Biotechnol J 2023; 21:2654-2663. [PMID: 37138901 PMCID: PMC10149329 DOI: 10.1016/j.csbj.2023.04.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 04/18/2023] [Accepted: 04/19/2023] [Indexed: 05/05/2023] Open
Abstract
The composition of medium components is crucial for achieving the best performance of synthetic construction in genetically engineered cells. Which and how medium components determine the performance, e.g., productivity, remain poorly investigated. To address the questions, a comparative survey with two genetically engineered Escherichia coli strains was performed. As a case study, the strains carried the synthetic pathways for producing the aromatic compounds of 4-aminophenylalanine (4APhe) or tyrosine (Tyr), common in the upstream but differentiated in the downstream metabolism. Bacterial growth and compound production were examined in hundreds of medium combinations that comprised 48 pure chemicals. The resultant data sets linking the medium composition to bacterial growth and production were subjected to machine learning for improved production. Intriguingly, the primary medium components determining the production of 4PheA and Tyr were differentiated, which were the initial resource (glucose) of the synthetic pathway and the inducer (IPTG) of the synthetic construction, respectively. Fine-tuning of the primary component significantly increased the yields of 4APhe and Tyr, indicating that a single component could be crucial for the performance of synthetic construction. Transcriptome analysis observed the local and global changes in gene expression for improved production of 4APhe and Tyr, respectively, revealing divergent metabolic strategies for producing the foreign and native metabolites. The study demonstrated that ML-assisted medium optimization could provide a novel point of view on how to make the synthetic construction meet the designed working principle and achieve the expected biological function.
Collapse
Affiliation(s)
- Honoka Aida
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Keisuke Uchida
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Motoki Nagai
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Takamasa Hashizume
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Shunsuke Masuo
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Microbiology Research Center for Sustainability, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Naoki Takaya
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Microbiology Research Center for Sustainability, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Corresponding author at: School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan.
| | - Bei-Wen Ying
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Corresponding author.
| |
Collapse
|
23
|
Cuperlovic-Culf M, Nguyen-Tran T, Bennett SAL. Machine Learning and Hybrid Methods for Metabolic Pathway Modeling. Methods Mol Biol 2023; 2553:417-439. [PMID: 36227553 DOI: 10.1007/978-1-0716-2617-7_18] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Computational cell metabolism models seek to provide metabolic explanations of cell behavior under different conditions or following genetic alterations, help in the optimization of in vitro cell growth environments, or predict cellular behavior in vivo and in vitro. In the extremes, mechanistic models can include highly detailed descriptions of a small number of metabolic reactions or an approximate representation of an entire metabolic network. To date, all mechanistic models have required details of individual metabolic reactions, either kinetic parameters or metabolic flux, as well as information about extracellular and intracellular metabolite concentrations. Despite the extensive efforts and the increasing availability of high-quality data, required in vivo data are not available for the majority of known metabolic reactions; thus, mechanistic models are based primarily on ex vivo kinetic measurements and limited flux information. Machine learning approaches provide an alternative for derivation of functional dependencies from existing data. The increasing availability of metabolomic and lipidomic data, with growing feature coverage as well as sample set size, is expected to provide new data options needed for derivation of machine learning models of cell metabolic processes. Moreover, machine learning analysis of longitudinal data can lead to predictive models of cell behaviors over time. Conversely, machine learning models trained on steady-state data can provide descriptive models for the comparison of metabolic states in different environments or disease conditions. Additionally, inclusion of metabolic network knowledge in these analyses can further help in the development of models with limited data.This chapter will explore the application of machine learning to the modeling of cell metabolism. We first provide a theoretical explanation of several machine learning and hybrid mechanistic machine learning methods currently being explored to model metabolism. Next, we introduce several avenues for improving these models with machine learning. Finally, we provide protocols for specific examples of the utilization of machine learning in the development of predictive cell metabolism models using metabolomic data. We describe data preprocessing, approaches for training of machine learning models for both descriptive and predictive models, and the utilization of these models in synthetic and systems biology. Detailed protocols provide a list of software tools and libraries used for these applications, step-by-step modeling protocols, troubleshooting, as well as an overview of existing limitations to these approaches.
Collapse
Affiliation(s)
- Miroslava Cuperlovic-Culf
- Digital Technologies Research Centre, National Research Council of Canada, Ottawa, ON, Canada.
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada.
| | - Thao Nguyen-Tran
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| | - Steffany A L Bennett
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
24
|
Peralbo-Molina Á, Solà-Santos P, Perera-Lluna A, Chicano-Gálvez E. Data Processing and Analysis in Mass Spectrometry-Based Metabolomics. Methods Mol Biol 2023; 2571:207-239. [PMID: 36152164 DOI: 10.1007/978-1-0716-2699-3_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Metabolomics is the latest of the omics sciences. It attempts to measure and characterize metabolites-small chemical compounds <1500 Da-on cells, tissue, or biofluids, which are usually products of biological reactions. As metabolic reactions are closer to the phenotype, metabolomics has emerged as an attractive science for various areas of research, including personalized medicine. However, due to the complexity of data obtained and the absence of curated databases for metabolite identification, data processing is the major bottleneck in this area since most technicians lack the required bioinformatics expertise to process datasets in a reliable and fast manner. The aim of this chapter is to describe the available tools for data processing that makes an inexperienced researcher capable of obtaining reliable results without having to undergo through huge parametrization steps.
Collapse
Affiliation(s)
- Ángela Peralbo-Molina
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain.
| | - Pol Solà-Santos
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Eduardo Chicano-Gálvez
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain
| |
Collapse
|
25
|
Metabolomics and modelling approaches for systems metabolic engineering. Metab Eng Commun 2022; 15:e00209. [PMID: 36281261 PMCID: PMC9587336 DOI: 10.1016/j.mec.2022.e00209] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/21/2022] Open
Abstract
Metabolic engineering involves the manipulation of microbes to produce desirable compounds through genetic engineering or synthetic biology approaches. Metabolomics involves the quantitation of intracellular and extracellular metabolites, where mass spectrometry and nuclear magnetic resonance based analytical instrumentation are often used. Here, the experimental designs, sample preparations, metabolite quenching and extraction are essential to the quantitative metabolomics workflow. The resultant metabolomics data can then be used with computational modelling approaches, such as kinetic and constraint-based modelling, to better understand underlying mechanisms and bottlenecks in the synthesis of desired compounds, thereby accelerating research through systems metabolic engineering. Constraint-based models, such as genome scale models, have been used successfully to enhance the yield of desired compounds from engineered microbes, however, unlike kinetic or dynamic models, constraint-based models do not incorporate regulatory effects. Nevertheless, the lack of time-series metabolomic data generation has hindered the usefulness of dynamic models till today. In this review, we show that improvements in automation, dynamic real-time analysis and high throughput workflows can drive the generation of more quality data for dynamic models through time-series metabolomics data generation. Spatial metabolomics also has the potential to be used as a complementary approach to conventional metabolomics, as it provides information on the localization of metabolites. However, more effort must be undertaken to identify metabolites from spatial metabolomics data derived through imaging mass spectrometry, where machine learning approaches could prove useful. On the other hand, single-cell metabolomics has also seen rapid growth, where understanding cell-cell heterogeneity can provide more insights into efficient metabolic engineering of microbes. Moving forward, with potential improvements in automation, dynamic real-time analysis, high throughput workflows, and spatial metabolomics, more data can be produced and studied using machine learning algorithms, in conjunction with dynamic models, to generate qualitative and quantitative predictions to advance metabolic engineering efforts.
Collapse
|
26
|
Antony S, Antony S, Rebello S, George S, Biju DT, R R, Madhavan A, Binod P, Pandey A, Sindhu R, Awasthi MK. Bioremediation of Endocrine Disrupting Chemicals- Advancements and Challenges. ENVIRONMENTAL RESEARCH 2022; 213:113509. [PMID: 35660566 DOI: 10.1016/j.envres.2022.113509] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Revised: 05/08/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]
Abstract
Endocrine Disrupting Chemicals (EDCs), major group of recalcitrant compounds, poses a serious threat to the health and future of millions of human beings, and other flora and fauna for years to come. A close analysis of various xenobiotics undermines the fact that EDC is structurally diverse chemical compounds generated as a part of anthropogenic advancements as well as part of their degradation. Regardless of such structural diversity, EDC is common in their ultimate drastic effect of impeding the proper functioning of the endocrinal system, basic physiologic systems, resulting in deregulated growth, malformations, and cancerous outcomes in animals as well as humans. The current review outlines an overview of various EDCs, their toxic effects on the ecosystem and its inhabitants. Conventional remediation methods such as physico-chemical methods and enzymatic approaches have been put into action as some form of mitigation measures. However, the last decade has seen the hunt for newer technologies and methodologies at an accelerated pace. Genetically engineered microbial degradation, gene editing strategies, metabolic and protein engineering, and in-silico predictive approaches - modern day's additions to our armamentarium in combating the EDCs are addressed. These additions have greater acceptance socially with lesser dissonance owing to reduced toxic by-products, lower health trepidations, better degradation, and ultimately the prevention of bioaccumulation. The positive impact of such new approaches on controlling the menace of EDCs has been outlaid. This review will shed light on sources of EDCs, their impact, significance, and the different remediation and bioremediation approaches, with a special emphasis on the recent trends and perspectives in using sustainable approaches for bioremediation of EDCs. Strict regulations to prevent the release of estrogenic chemicals to the ecosystem, adoption of combinatorial methods to remove EDC and prevalent use of bioremediation techniques should be followed in all future endeavors to combat EDC pollution. Moreover, the proper development, growth and functioning of future living forms relies on their non-exposure to EDCs, thus remediation of such chemicals present even in nano-concentrations should be addressed gravely.
Collapse
Affiliation(s)
- Sherly Antony
- Department of Microbiology, Pushpagiri Institute of Medical Sciences and Research Centre, Thiruvalla, 689 101, Kerala, India
| | - Sham Antony
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Sharrel Rebello
- School of Food Science & Technology, Mahatma Gandhi University, Kottayam, India
| | - Sandhra George
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Devika T Biju
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Reshmy R
- Department of Science and Humanities, Providence College of Engineering, Chengannur, 689 122, Kerala, India
| | - Aravind Madhavan
- Rajiv Gandhi Centre for Biotechnology, Jagathy, Trivandrum, 695 014, India
| | - Parameswaran Binod
- Microbial Processes and Technology Division, CSIR-National Institute for Interdisciplinary Science and Technology (CSIR-NIIST), Thiruvananthapuram, 695 019, Kerala, India
| | - Ashok Pandey
- Center for Innovation and Translational Research, CSIR-Indian Institute of Toxicology Research, Lucknow, 226 001, India; Centre for Energy and Environmental Sustainability, Lucknow, 226 029, Uttar Pradesh, India
| | - Raveendran Sindhu
- Department of Food Technology, T K M Institute of Technology, Kollam, 691 505, Kerala, India.
| | - Mukesh Kumar Awasthi
- College of Natural Resources and Environment, Northwest A&F University, Yangling, Shaanxi Province, 712100, China.
| |
Collapse
|
27
|
Lao Z, Matsui Y, Ijichi S, Ying BW. Global coordination of the mutation and growth rates across the genetic and nutritional variety in Escherichia coli. Front Microbiol 2022; 13:990969. [PMID: 36204613 PMCID: PMC9530902 DOI: 10.3389/fmicb.2022.990969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/30/2022] [Indexed: 11/24/2022] Open
Abstract
Fitness and mutability are the primary traits of living organisms for adaptation and evolution. However, their quantitative linkage remained largely deficient. Whether there is any general relationship between the two features and how genetic and environmental variables influence them remained unclear and were addressed here. The mutation and growth rates of an assortment of Escherichia coli strain collections, including the wild-type strains and the genetically disturbed strains of either reduced genomes or deletion of the genes involved in the DNA replication fidelity, were evaluated in various media. The contribution of media to the mutation and growth rates was differentiated depending on the types of genetic disturbance. Nevertheless, the negative correlation between the mutation and growth rates was observed across the genotypes and was common in all media. It indicated the comprehensive association of the correlated mutation and growth rates with the genetic and medium variation. Multiple linear regression and support vector machine successfully predicted the mutation and growth rates and the categories of genotypes and media, respectively. Taken together, the study provided a quantitative dataset linking the mutation and growth rates, genotype, and medium and presented a simple and successful example of predicting bacterial growth and mutability by data-driven approaches.
Collapse
|
28
|
Chen H, Gao Y. Behavior Data Analysis of English Learners Based on Discrete Dynamic System Modeling Method. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5409571. [PMID: 36093503 PMCID: PMC9462973 DOI: 10.1155/2022/5409571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/11/2022] [Accepted: 08/17/2022] [Indexed: 11/30/2022]
Abstract
With the rapid development of computer science, there are more and more kinds of discrete dynamic systems. Computer integrated system CIMS, network communication database administrator system, and human behavior analysis system are all discrete dynamic systems. At present, many researchers have studied by adding human behavior data to discrete dynamic systems. This paper aims to study the behavior data of English learners by using the discrete dynamic modeling technology of complex systems and the discrete dynamic system modeling method of Petri nets. By adding the behavior data of learners to the discrete dynamic system of fuzzy Petri nets, the system is diagnosed and optimized. The experimental results show that the complex discrete dynamic system in this paper has achieved good experimental results according to the performance indicators selected in theory. Based on the combination of the above technologies and systems, the fuzzy Petri net discrete dynamic system studied in this paper improves the processing speed of English learners' behavior data.
Collapse
Affiliation(s)
- Hongli Chen
- School of Foreign Languages, Handan University, Handan, Hebei 056006, China
| | - Yuzheng Gao
- School of Foreign Languages, Handan University, Handan, Hebei 056006, China
| |
Collapse
|
29
|
Lo-Thong-Viramoutou O, Charton P, Cadet XF, Grondin-Perez B, Saavedra E, Damour C, Cadet F. Non-linearity of Metabolic Pathways Critically Influences the Choice of Machine Learning Model. Front Artif Intell 2022; 5:744755. [PMID: 35757298 PMCID: PMC9226554 DOI: 10.3389/frai.2022.744755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 04/29/2022] [Indexed: 11/13/2022] Open
Abstract
The use of machine learning (ML) in life sciences has gained wide interest over the past years, as it speeds up the development of high performing models. Important modeling tools in biology have proven their worth for pathway design, such as mechanistic models and metabolic networks, as they allow better understanding of mechanisms involved in the functioning of organisms. However, little has been done on the use of ML to model metabolic pathways, and the degree of non-linearity associated with them is not clear. Here, we report the construction of different metabolic pathways with several linear and non-linear ML models. Different types of data are used; they lead to the prediction of important biological data, such as pathway flux and final product concentration. A comparison reveals that the data features impact model performance and highlight the effectiveness of non-linear models (e.g., QRF: RMSE = 0.021 nmol·min-1 and R2 = 1 vs. Bayesian GLM: RMSE = 1.379 nmol·min-1 R2 = 0.823). It turns out that the greater the degree of non-linearity of the pathway, the better suited a non-linear model will be. Therefore, a decision-making support for pathway modeling is established. These findings generally support the hypothesis that non-linear aspects predominate within the metabolic pathways. This must be taken into account when devising possible applications of these pathways for the identification of biomarkers of diseases (e.g., infections, cancer, neurodegenerative diseases) or the optimization of industrial production processes.
Collapse
Affiliation(s)
- Ophélie Lo-Thong-Viramoutou
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Philippe Charton
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | | | - Brigitte Grondin-Perez
- EnergyLab, EA 4079, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Emma Saavedra
- Departamento de Bioquímica, Instituto Nacional de Cardiología Ignacio Chávez, Mexico City, Mexico
| | - Cédric Damour
- EnergyLab, EA 4079, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Frédéric Cadet
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| |
Collapse
|
30
|
Vanhove K, Derveaux E, Mesotten L, Thomeer M, Criel M, Mariën H, Adriaensens P. Unraveling the Rewired Metabolism in Lung Cancer Using Quantitative NMR Metabolomics. Int J Mol Sci 2022; 23:ijms23105602. [PMID: 35628415 PMCID: PMC9146819 DOI: 10.3390/ijms23105602] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/14/2022] [Accepted: 05/15/2022] [Indexed: 11/16/2022] Open
Abstract
Lung cancer cells are well documented to rewire their metabolism and energy production networks to enable proliferation and survival in a nutrient-poor and hypoxic environment. Although metabolite profiling of blood plasma and tissue is still emerging in omics approaches, several techniques have shown potential in cancer diagnosis. In this paper, the authors describe the alterations in the metabolic phenotype of lung cancer patients. In addition, we focus on the metabolic cooperation between tumor cells and healthy tissue. Furthermore, the authors discuss how metabolomics could improve the management of lung cancer patients.
Collapse
Affiliation(s)
- Karolien Vanhove
- Applied and Analytical Chemistry, Institute for Materials Research, Hasselt University, Agoralaan 1-Building D, B-3590 Diepenbeek, Belgium;
- Department of Respiratory Medicine, AZ Vesalius, Hazelereik 51, B-3700 Tongeren, Belgium
- Correspondence:
| | - Elien Derveaux
- Faculty of Medicine and Life Sciences, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; (E.D.); (H.M.)
| | - Liesbet Mesotten
- Department of Nuclear Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium;
| | - Michiel Thomeer
- Department of Respiratory Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium; (M.T.); (M.C.)
| | - Maarten Criel
- Department of Respiratory Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium; (M.T.); (M.C.)
| | - Hanne Mariën
- Faculty of Medicine and Life Sciences, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; (E.D.); (H.M.)
| | - Peter Adriaensens
- Applied and Analytical Chemistry, Institute for Materials Research, Hasselt University, Agoralaan 1-Building D, B-3590 Diepenbeek, Belgium;
| |
Collapse
|
31
|
Parvandeh S, Donehower LA, Katsonis P, Hsu TK, Asmussen J, Lee K, Lichtarge O. EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants. Nucleic Acids Res 2022; 50:e70. [PMID: 35412634 PMCID: PMC9262594 DOI: 10.1093/nar/gkac215] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 03/17/2022] [Accepted: 03/21/2022] [Indexed: 02/01/2023] Open
Abstract
Discovering rare cancer driver genes is difficult because their mutational frequency is too low for statistical detection by computational methods. EPIMUTESTR is an integrative nearest-neighbor machine learning algorithm that identifies such marginal genes by modeling the fitness of their mutations with the phylogenetic Evolutionary Action (EA) score. Over cohorts of sequenced patients from The Cancer Genome Atlas representing 33 tumor types, EPIMUTESTR detected 214 previously inferred cancer driver genes and 137 new candidates never identified computationally before of which seven genes are supported in the COSMIC Cancer Gene Census. EPIMUTESTR achieved better robustness and specificity than existing methods in a number of benchmark methods and datasets.
Collapse
Affiliation(s)
- Saeid Parvandeh
- To whom correspondence should be addressed. Tel: +1 713 798 7677;
| | - Lawrence A Donehower
- Department of Molecular Virology and Microbiology, Houston, TX 77030, USA,Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Panagiotis Katsonis
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Teng-Kuei Hsu
- Department of Biochemistry & Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Jennifer K Asmussen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Kwanghyuk Lee
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Olivier Lichtarge
- Correspondence may also be addressed to Olivier Lichtarge. Tel: +1 713 798 5646;
| |
Collapse
|
32
|
Sampaio M, Rocha M, Dias O. Exploring synergies between plant metabolic modelling and machine learning. Comput Struct Biotechnol J 2022; 20:1885-1900. [PMID: 35521559 PMCID: PMC9052043 DOI: 10.1016/j.csbj.2022.04.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 11/03/2022] Open
|
33
|
Lavrentev FV, Rumyantsev IS, Ivanov AS, Shilovskikh VV, Orlova OY, Nikolaev KG, Andreeva DV, Skorb EV. Soft Hydrogel Actuator for Fast Machine-Learning-Assisted Bacteria Detection. ACS APPLIED MATERIALS & INTERFACES 2022; 14:7321-7328. [PMID: 35080838 DOI: 10.1021/acsami.1c22470] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
We demonstrate that our bio-electrochemical platform facilitates the reduction of detection time from the 3-day period of the existing tests to 15 min. Machine learning and robotized bioanalytical platforms require the principles such as hydrogel-based actuators for fast and easy analysis of bioactive analytes. Bacteria are fragile and environmentally sensitive microorganisms that require a special environment to support their lifecycles during analytical tests. Here, we develop a bio-electrochemical platform based on the soft hydrogel/eutectic gallium-indium alloy interface for the detection of Streptococcus thermophilus and Bacillus coagulans bacteria in various mediums. The soft hydrogel-based device is capable to support bacteria' viability during detection time. Current-voltage data are used for multilayer perceptron algorithm training. The multilayer perceptron model is capable of detecting bacterial concentrations in the 104 to 108 cfu/mL range of the culture medium or in the dairy products with high accuracy (94%). Such a fast and easy biodetection is extremely important for food and agriculture industries and biomedical and environmental science.
Collapse
Affiliation(s)
- Filipp V Lavrentev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Igor S Rumyantsev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Artemii S Ivanov
- Institute for Functional Intelligent Materials, National University of Singapore, Singapore 117544, Singapore
- Department of Materials Science and Engineering, National University of Singapore, Singapore 117575, Singapore
| | - Vladimir V Shilovskikh
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Olga Yu Orlova
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Konstantin G Nikolaev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Daria V Andreeva
- Institute for Functional Intelligent Materials, National University of Singapore, Singapore 117544, Singapore
- Department of Materials Science and Engineering, National University of Singapore, Singapore 117575, Singapore
| | - Ekaterina V Skorb
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| |
Collapse
|
34
|
Verissimo T, Faivre A, Sgardello S, Naesens M, de Seigneux S, Criton G, Legouis D. Estimated Renal Metabolomics at Reperfusion Predicts One-Year Kidney Graft Function. Metabolites 2022; 12:57. [PMID: 35050179 PMCID: PMC8778290 DOI: 10.3390/metabo12010057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 12/26/2021] [Accepted: 01/04/2022] [Indexed: 02/04/2023] Open
Abstract
Renal transplantation is the gold-standard procedure for end-stage renal disease patients, improving quality of life and life expectancy. Despite continuous advancement in the management of post-transplant complications, progress is still needed to increase the graft lifespan. Early identification of patients at risk of rapid graft failure is critical to optimize their management and slow the progression of the disease. In 42 kidney grafts undergoing protocol biopsies at reperfusion, we estimated the renal metabolome from RNAseq data. The estimated metabolites' abundance was further used to predict the renal function within the first year of transplantation through a random forest machine learning algorithm. Using repeated K-fold cross-validation we first built and then tuned our model on a training dataset. The optimal model accurately predicted the one-year eGFR, with an out-of-bag root mean square root error (RMSE) that was 11.8 ± 7.2 mL/min/1.73 m2. The performance was similar in the test dataset, with a RMSE of 12.2 ± 3.2 mL/min/1.73 m2. This model outperformed classic statistical models. Reperfusion renal metabolome may be used to predict renal function one year after allograft kidney recipients.
Collapse
Affiliation(s)
- Thomas Verissimo
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
| | - Anna Faivre
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
| | - Sebastian Sgardello
- Department of Surgery, University Hospital of Geneva, 1205 Geneva, Switzerland;
| | - Maarten Naesens
- Service of Nephrology, University Hospitals of Leuven, 3000 Leuven, Belgium;
| | - Sophie de Seigneux
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
- Service of Nephrology, Department of Internal Medicine Specialties, University Hospital of Geneva, 1205 Geneva, Switzerland
| | - Gilles Criton
- Geneva School of Economics and Management, University of Geneva, 1205 Geneva, Switzerland;
| | - David Legouis
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
- Division of Intensive Care, Department of Acute Medicine, University hospital of Geneva, 1205 Geneva, Switzerland
| |
Collapse
|
35
|
Vijayakumar S, Magazzù G, Moon P, Occhipinti A, Angione C. A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling. Methods Mol Biol 2022; 2399:87-122. [PMID: 35604554 DOI: 10.1007/978-1-0716-1831-8_5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Complex, distributed, and dynamic sets of clinical biomedical data are collectively referred to as multimodal clinical data. In order to accommodate the volume and heterogeneity of such diverse data types and aid in their interpretation when they are combined with a multi-scale predictive model, machine learning is a useful tool that can be wielded to deconstruct biological complexity and extract relevant outputs. Additionally, genome-scale metabolic models (GSMMs) are one of the main frameworks striving to bridge the gap between genotype and phenotype by incorporating prior biological knowledge into mechanistic models. Consequently, the utilization of GSMMs as a foundation for the integration of multi-omic data originating from different domains is a valuable pursuit towards refining predictions. In this chapter, we show how cancer multi-omic data can be analyzed via multimodal machine learning and metabolic modeling. Firstly, we focus on the merits of adopting an integrative systems biology led approach to biomedical data mining. Following this, we propose how constraint-based metabolic models can provide a stable yet adaptable foundation for the integration of multimodal data with machine learning. Finally, we provide a step-by-step tutorial for the combination of machine learning and GSMMs, which includes: (i) tissue-specific constraint-based modeling; (ii) survival analysis using time-to-event prediction for cancer; and (iii) classification and regression approaches for multimodal machine learning. The code associated with the tutorial can be found at https://github.com/Angione-Lab/Tutorials_Combining_ML_and_GSMM .
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Giuseppe Magazzù
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Pradip Moon
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Annalisa Occhipinti
- Computational Systems Biology and Data Analytics Research Group, Middlebrough, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK
| | - Claudio Angione
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK.
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK.
- Healthcare Innovation Centre, Teesside University, Middlesbrough, UK.
| |
Collapse
|
36
|
Vijayakumar S, Angione C. Protocol for hybrid flux balance, statistical, and machine learning analysis of multi-omic data from the cyanobacterium Synechococcus sp. PCC 7002. STAR Protoc 2021; 2:100837. [PMID: 34632416 PMCID: PMC8488602 DOI: 10.1016/j.xpro.2021.100837] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Combining a computational framework for flux balance analysis with machine learning improves the accuracy of predicting metabolic activity across conditions, while enabling mechanistic interpretation. This protocol presents a guide to condition-specific metabolic modeling that integrates regularized flux balance analysis with machine learning approaches to extract key features from transcriptomic and fluxomic data. We demonstrate the protocol as applied to Synechococcus sp. PCC 7002; we also outline how it can be adapted to any species or community with available multi-omic data. For complete details on the use and execution of this protocol, please refer to Vijayakumar et al. (2020).
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
| | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough TS1 3BX, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough TS1 3BX, UK
| |
Collapse
|
37
|
Urine-Based Metabolomics and Machine Learning Reveals Metabolites Associated with Renal Cell Carcinoma Stage. Cancers (Basel) 2021; 13:cancers13246253. [PMID: 34944874 PMCID: PMC8699523 DOI: 10.3390/cancers13246253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 12/08/2021] [Accepted: 12/09/2021] [Indexed: 11/17/2022] Open
Abstract
Urine metabolomics profiling has potential for non-invasive RCC staging, in addition to providing metabolic insights into disease progression. In this study, we utilized liquid chromatography-mass spectrometry (LC-MS), nuclear magnetic resonance (NMR), and machine learning (ML) for the discovery of urine metabolites associated with RCC progression. Two machine learning questions were posed in the study: Binary classification into early RCC (stage I and II) and advanced RCC stages (stage III and IV), and RCC tumor size estimation through regression analysis. A total of 82 RCC patients with known tumor size and metabolomic measurements were used for the regression task, and 70 RCC patients with complete tumor-nodes-metastasis (TNM) staging information were used for the classification tasks under ten-fold cross-validation conditions. A voting ensemble regression model consisting of elastic net, ridge, and support vector regressor predicted RCC tumor size with a R2 value of 0.58. A voting classifier model consisting of random forest, support vector machines, logistic regression, and adaptive boosting yielded an AUC of 0.96 and an accuracy of 87%. Some identified metabolites associated with renal cell carcinoma progression included 4-guanidinobutanoic acid, 7-aminomethyl-7-carbaguanine, 3-hydroxyanthranilic acid, lysyl-glycine, glycine, citrate, and pyruvate. Overall, we identified a urine metabolic phenotype associated with renal cell carcinoma stage, exploring the promise of a urine-based metabolomic assay for staging this disease.
Collapse
|
38
|
Li R, Li L, Xu Y, Yang J. Machine learning meets omics: applications and perspectives. Brief Bioinform 2021; 23:6425809. [PMID: 34791021 DOI: 10.1093/bib/bbab460] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 02/07/2023] Open
Abstract
The innovation of biotechnologies has allowed the accumulation of omics data at an alarming rate, thus introducing the era of 'big data'. Extracting inherent valuable knowledge from various omics data remains a daunting problem in bioinformatics. Better solutions often need some kind of more innovative methods for efficient handlings and effective results. Recent advancements in integrated analysis and computational modeling of multi-omics data helped address such needs in an increasingly harmonious manner. The development and application of machine learning have largely advanced our insights into biology and biomedicine and greatly promoted the development of therapeutic strategies, especially for precision medicine. Here, we propose a comprehensive survey and discussion on what happened, is happening and will happen when machine learning meets omics. Specifically, we describe how artificial intelligence can be applied to omics studies and review recent advancements at the interface between machine learning and the ever-widest range of omics including genomics, transcriptomics, proteomics, metabolomics, radiomics, as well as those at the single-cell resolution. We also discuss and provide a synthesis of ideas, new insights, current challenges and perspectives of machine learning in omics.
Collapse
Affiliation(s)
- Rufeng Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Lixin Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Yungang Xu
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an, 710129, China
| | - Juan Yang
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China.,Key Laboratory of Environment and Genes Related to Diseases (Xi'an Jiaotong University), Ministry of Education of China, Xi'an 710061, P. R. China
| |
Collapse
|
39
|
Mey F, Clauwaert J, Van Huffel K, Waegeman W, De Mey M. Improving the performance of machine learning models for biotechnology: The quest for deus ex machina. Biotechnol Adv 2021; 53:107858. [PMID: 34695560 DOI: 10.1016/j.biotechadv.2021.107858] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/24/2022]
Abstract
Machine learning is becoming an integral part of the Design-Build-Test-Learn cycle in biotechnology. Machine learning models learn from collected datasets such as omics data and predict a defined outcome, which has led to both production improvements and predictive tools in the field. Robust prediction of the behavior of microbial cell factories and production processes not only greatly increases our understanding of the function of such systems, but also provides significant savings of development time. However, many pitfalls when modeling biological data - bad fit, noisy data, model instability, low data quantity and imbalances in the data - cause models to suffer in their performance. Here we provide an accessible, in-depth analysis on the problems created by these pitfalls, as well as means of their detection and mediation, with a focus on supervised learning. Assessing the state of the art, we show that, currently, in-depth analyses of model performance are often absent and must be improved. This review provides a toolbox for the analysis of model robustness and performance, and simultaneously proposes a standard for the community to facilitate future work. It is further accompanied by an interactive online tutorial on the discussed issues.
Collapse
Affiliation(s)
- Friederike Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Jim Clauwaert
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Kirsten Van Huffel
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Marjan De Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium.
| |
Collapse
|
40
|
Seif Y, Palsson BØ. Path to improving the life cycle and quality of genome-scale models of metabolism. Cell Syst 2021; 12:842-859. [PMID: 34555324 PMCID: PMC8480436 DOI: 10.1016/j.cels.2021.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 02/17/2021] [Accepted: 06/23/2021] [Indexed: 11/28/2022]
Abstract
Genome-scale models of metabolism (GEMs) are key computational tools for the systems-level study of metabolic networks. Here, we describe the "GEM life cycle," which we subdivide into four stages: inception, maturation, specialization, and amalgamation. We show how different types of GEM reconstruction workflows fit in each stage and proceed to highlight two fundamental bottlenecks for GEM quality improvement: GEM maturation and content removal. We identify common characteristics contributing to increasing quality of maturing GEMs drawing from past independent GEM maturation efforts. We then shed some much-needed light on the latent and unrecognized but pervasive issue of content removal, demonstrating the substantial effects of model pruning on its solution space. Finally, we propose a novel framework for content removal and associated confidence-level assignment which will help guide future GEM development efforts, reduce duplication of effort across groups, potentially aid automated reconstruction platforms, and boost the reproducibility of model development.
Collapse
Affiliation(s)
- Yara Seif
- Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92093, USA
| | - Bernhard Ørn Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92093, USA.
| |
Collapse
|
41
|
Chen Z, Huang X, Gao Y, Zeng S, Mao W. Plasma-metabolite-based machine learning is a promising diagnostic approach for esophageal squamous cell carcinoma investigation. J Pharm Anal 2021; 11:505-514. [PMID: 34513127 PMCID: PMC8424362 DOI: 10.1016/j.jpha.2020.11.009] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022] Open
Abstract
The aim of this study was to develop a diagnostic strategy for esophageal squamous cell carcinoma (ESCC) that combines plasma metabolomics with machine learning algorithms. Plasma-based untargeted metabolomics analysis was performed with samples derived from 88 ESCC patients and 52 healthy controls. The dataset was split into a training set and a test set. After identification of differential metabolites in training set, single-metabolite-based receiver operating characteristic (ROC) curves and multiple-metabolite-based machine learning models were used to distinguish between ESCC patients and healthy controls. Kaplan-Meier survival analysis and Cox proportional hazards regression analysis were performed to investigate the prognostic significance of the plasma metabolites. Finally, twelve differential plasma metabolites (six up-regulated and six down-regulated) were annotated. The predictive performance of the six most prevalent diagnostic metabolites through the diagnostic models in the test set were as follows: arachidonic acid (accuracy: 0.887), sebacic acid (accuracy: 0.867), indoxyl sulfate (accuracy: 0.850), phosphatidylcholine (PC) (14:0/0:0) (accuracy: 0.825), deoxycholic acid (accuracy: 0.773), and trimethylamine N-oxide (accuracy: 0.653). The prediction accuracies of the machine learning models in the test set were partial least-square (accuracy: 0.947), random forest (accuracy: 0.947), gradient boosting machine (accuracy: 0.960), and support vector machine (accuracy: 0.980). Additionally, survival analysis demonstrated that acetoacetic acid was an unfavorable prognostic factor (hazard ratio (HR): 1.752), while PC (14:0/0:0) (HR: 0.577) was a favorable prognostic factor for ESCC. This study devised an innovative strategy for ESCC diagnosis by combining plasma metabolomics with machine learning algorithms and revealed its potential to become a novel screening test for ESCC. Six most prevalent diagnostic plasma metabolites were identified in ESCC. Plasma-metabolite-based machine learning models (PLS, RF, GBM, and SVM) for ESCC diagnosis. Acetoacetic acid was an unfavorable prognostic factor, while PC (14:0/0:0) was a favorable prognostic factor for ESCC.
Collapse
Affiliation(s)
- Zhongjian Chen
- Laboratory of Pharmaceutical Analysis and Drug Metabolism, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.,The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Xiancong Huang
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Yun Gao
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Su Zeng
- Laboratory of Pharmaceutical Analysis and Drug Metabolism, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Weimin Mao
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| |
Collapse
|
42
|
McGranaghan P, Kirwan JA, Garcia-Rivera MA, Pieske B, Edelmann F, Blaschke F, Appunni S, Saxena A, Rubens M, Veledar E, Trippel TD. Lipid Metabolite Biomarkers in Cardiovascular Disease: Discovery and Biomechanism Translation from Human Studies. Metabolites 2021; 11:621. [PMID: 34564437 PMCID: PMC8470800 DOI: 10.3390/metabo11090621] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 08/30/2021] [Accepted: 09/06/2021] [Indexed: 12/12/2022] Open
Abstract
Lipids represent a valuable target for metabolomic studies since altered lipid metabolism is known to drive the pathological changes in cardiovascular disease (CVD). Metabolomic technologies give us the ability to measure thousands of metabolites providing us with a metabolic fingerprint of individual patients. Metabolomic studies in humans have supported previous findings into the pathomechanisms of CVD, namely atherosclerosis, apoptosis, inflammation, oxidative stress, and insulin resistance. The most widely studied classes of lipid metabolite biomarkers in CVD are phospholipids, sphingolipids/ceramides, glycolipids, cholesterol esters, fatty acids, and acylcarnitines. Technological advancements have enabled novel strategies to discover individual biomarkers or panels that may aid in the diagnosis and prognosis of CVD, with sphingolipids/ceramides as the most promising class of biomarkers thus far. In this review, application of metabolomic profiling for biomarker discovery to aid in the diagnosis and prognosis of CVD as well as metabolic abnormalities in CVD will be discussed with particular emphasis on lipid metabolites.
Collapse
Affiliation(s)
- Peter McGranaghan
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Jennifer A. Kirwan
- Metabolomics Platform, Berlin Institute of Health at Charité Universitätsmedizin Berlin, 13353 Berlin, Germany; (J.A.K.); (M.A.G.-R.)
- Max Delbrück Center for Molecular Research, 13125 Berlin, Germany
- School of Veterinary Medicine and Science, University of Nottingham, Leicestershire LE12 5RD, UK
| | - Mariel A. Garcia-Rivera
- Metabolomics Platform, Berlin Institute of Health at Charité Universitätsmedizin Berlin, 13353 Berlin, Germany; (J.A.K.); (M.A.G.-R.)
- Max Delbrück Center for Molecular Research, 13125 Berlin, Germany
| | - Burkert Pieske
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
- Berlin Institute of Health, 13353 Berlin, Germany
- German Heart Center Berlin, Department of Cardiology, 13353 Berlin, Germany
| | - Frank Edelmann
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
- German Heart Center Berlin, Department of Cardiology, 13353 Berlin, Germany
| | - Florian Blaschke
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
| | - Sandeep Appunni
- Department of Biochemistry, Government Medical College, Kozhikode, Kerala 673008, India;
| | - Anshul Saxena
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Muni Rubens
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Emir Veledar
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
- Department of Biostatistics, Florida International University, Miami, FL 33199, USA
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Tobias Daniel Trippel
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
| |
Collapse
|
43
|
Khaleghi MK, Savizi ISP, Lewis NE, Shojaosadati SA. Synergisms of machine learning and constraint-based modeling of metabolism for analysis and optimization of fermentation parameters. Biotechnol J 2021; 16:e2100212. [PMID: 34390201 DOI: 10.1002/biot.202100212] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/10/2021] [Accepted: 08/11/2021] [Indexed: 11/06/2022]
Abstract
Recent noteworthy advances in the development of high-performing microbial and mammalian strains have enabled the sustainable production of bio-economically valuable substances such as bio-compounds, biofuels, and biopharmaceuticals. However, to obtain an industrially viable mass-production scheme, much time and effort are required. The robust and rational design of fermentation processes requires analysis and optimization of different extracellular conditions and medium components, which have a massive effect on growth and productivity. In this regard, knowledge- and data-driven modeling methods have received much attention. Constraint-based modeling (CBM) is a knowledge-driven mathematical approach that has been widely used in fermentation analysis and optimization due to its capabilities of predicting the cellular phenotype from genotype through high-throughput means. On the other hand, machine learning (ML) is a data-driven statistical method that identifies the data patterns within sophisticated biological systems and processes, where there is inadequate knowledge to represent underlying mechanisms. Furthermore, ML models are becoming a viable complement to constraint-based models in a reciprocal manner when one is used as a pre-step of another. As a result, more predictable model is produced. This review highlights the applications of CBM and ML independently and the combination of these two approaches for analyzing and optimizing fermentation parameters. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Mohammad Karim Khaleghi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Iman Shahidi Pour Savizi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Nathan E Lewis
- Department of Bioengineering, University of California, San Diego, USA.,Department of Pediatrics, University of California, San Diego, USA
| | - Seyed Abbas Shojaosadati
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| |
Collapse
|
44
|
Tinte MM, Chele KH, van der Hooft JJJ, Tugizimana F. Metabolomics-Guided Elucidation of Plant Abiotic Stress Responses in the 4IR Era: An Overview. Metabolites 2021; 11:445. [PMID: 34357339 PMCID: PMC8305945 DOI: 10.3390/metabo11070445] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 06/30/2021] [Accepted: 07/03/2021] [Indexed: 12/27/2022] Open
Abstract
Plants are constantly challenged by changing environmental conditions that include abiotic stresses. These are limiting their development and productivity and are subsequently threatening our food security, especially when considering the pressure of the increasing global population. Thus, there is an urgent need for the next generation of crops with high productivity and resilience to climate change. The dawn of a new era characterized by the emergence of fourth industrial revolution (4IR) technologies has redefined the ideological boundaries of research and applications in plant sciences. Recent technological advances and machine learning (ML)-based computational tools and omics data analysis approaches are allowing scientists to derive comprehensive metabolic descriptions and models for the target plant species under specific conditions. Such accurate metabolic descriptions are imperatively essential for devising a roadmap for the next generation of crops that are resilient to environmental deterioration. By synthesizing the recent literature and collating data on metabolomics studies on plant responses to abiotic stresses, in the context of the 4IR era, we point out the opportunities and challenges offered by omics science, analytical intelligence, computational tools and big data analytics. Specifically, we highlight technological advancements in (plant) metabolomics workflows and the use of machine learning and computational tools to decipher the dynamics in the chemical space that define plant responses to abiotic stress conditions.
Collapse
Affiliation(s)
- Morena M. Tinte
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
| | - Kekeletso H. Chele
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
| | | | - Fidele Tugizimana
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
- International Research and Development Division, Omnia Group, Ltd., Johannesburg 2021, South Africa
| |
Collapse
|
45
|
Shah HA, Liu J, Yang Z, Feng J. Review of Machine Learning Methods for the Prediction and Reconstruction of Metabolic Pathways. Front Mol Biosci 2021; 8:634141. [PMID: 34222327 PMCID: PMC8247443 DOI: 10.3389/fmolb.2021.634141] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 06/01/2021] [Indexed: 11/13/2022] Open
Abstract
Prediction and reconstruction of metabolic pathways play significant roles in many fields such as genetic engineering, metabolic engineering, drug discovery, and are becoming the most active research topics in synthetic biology. With the increase of related data and with the development of machine learning techniques, there have many machine leaning based methods been proposed for prediction or reconstruction of metabolic pathways. Machine learning techniques are showing state-of-the-art performance to handle the rapidly increasing volume of data in synthetic biology. To support researchers in this field, we briefly review the research progress of metabolic pathway reconstruction and prediction based on machine learning. Some challenging issues in the reconstruction of metabolic pathways are also discussed in this paper.
Collapse
Affiliation(s)
- Hayat Ali Shah
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Juan Liu
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Zhihui Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Jing Feng
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| |
Collapse
|
46
|
Licen S, Franzon M, Rodani T, Barbieri P. SOMEnv: An R package for mining environmental monitoring datasets by Self-Organizing Map and k-means algorithms with a graphical user interface. Microchem J 2021. [DOI: 10.1016/j.microc.2021.106181] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
47
|
Sha C, Cuperlovic-Culf M, Hu T. SMILE: systems metabolomics using interpretable learning and evolution. BMC Bioinformatics 2021; 22:284. [PMID: 34049495 PMCID: PMC8161935 DOI: 10.1186/s12859-021-04209-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 05/18/2021] [Indexed: 11/23/2022] Open
Abstract
Background Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have seen increasing adoptions in metabolomics thanks to their powerful prediction abilities. However, the “black-box” nature of many machine learning models remains a major challenge for wide acceptance and utility as it makes the interpretation of decision process difficult. This challenge is particularly predominant in biomedical research where understanding of the underlying decision making mechanism is essential for insuring safety and gaining new knowledge. Results In this article, we proposed a novel computational framework, Systems Metabolomics using Interpretable Learning and Evolution (SMILE), for supervised metabolomics data analysis. Our methodology uses an evolutionary algorithm to learn interpretable predictive models and to identify the most influential metabolites and their interactions in association with disease. Moreover, we have developed a web application with a graphical user interface that can be used for easy analysis, interpretation and visualization of the results. Performance of the method and utilization of the web interface is shown using metabolomics data for Alzheimer’s disease. Conclusions SMILE was able to identify several influential metabolites on AD and to provide interpretable predictive models that can be further used for a better understanding of the metabolic background of AD. SMILE addresses the emerging issue of interpretability and explainability in machine learning, and contributes to more transparent and powerful applications of machine learning in bioinformatics. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04209-1.
Collapse
Affiliation(s)
- Chengyuan Sha
- School of Computing, Queen's University, Kingston, ON, Canada
| | | | - Ting Hu
- School of Computing, Queen's University, Kingston, ON, Canada.
| |
Collapse
|
48
|
Peng GCY, Alber M, Tepole AB, Cannon WR, De S, Dura-Bernal S, Garikipati K, Karniadakis G, Lytton WW, Perdikaris P, Petzold L, Kuhl E. Multiscale modeling meets machine learning: What can we learn? ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2021; 28:1017-1037. [PMID: 34093005 PMCID: PMC8172124 DOI: 10.1007/s11831-020-09405-5] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 02/09/2020] [Indexed: 05/10/2023]
Abstract
Machine learning is increasingly recognized as a promising technology in the biological, biomedical, and behavioral sciences. There can be no argument that this technique is incredibly successful in image recognition with immediate applications in diagnostics including electrophysiology, radiology, or pathology, where we have access to massive amounts of annotated data. However, machine learning often performs poorly in prognosis, especially when dealing with sparse data. This is a field where classical physics-based simulation seems to remain irreplaceable. In this review, we identify areas in the biomedical sciences where machine learning and multiscale modeling can mutually benefit from one another: Machine learning can integrate physics-based knowledge in the form of governing equations, boundary conditions, or constraints to manage ill-posted problems and robustly handle sparse and noisy data; multiscale modeling can integrate machine learning to create surrogate models, identify system dynamics and parameters, analyze sensitivities, and quantify uncertainty to bridge the scales and understand the emergence of function. With a view towards applications in the life sciences, we discuss the state of the art of combining machine learning and multiscale modeling, identify applications and opportunities, raise open questions, and address potential challenges and limitations. We anticipate that it will stimulate discussion within the community of computational mechanics and reach out to other disciplines including mathematics, statistics, computer science, artificial intelligence, biomedicine, systems biology, and precision medicine to join forces towards creating robust and efficient models for biological systems.
Collapse
Affiliation(s)
| | - Mark Alber
- University of California, Riverside, USA
| | | | - William R Cannon
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Suvranu De
- Rensselaer Polytechnic Institute, Troy, New York, USA
| | | | | | | | | | | | - Linda Petzold
- University of California, Santa Barbara, California, USA
| | - Ellen Kuhl
- Stanford University, Stanford, California, USA
| |
Collapse
|
49
|
Cakmak A, Celik MH. Personalized Metabolic Analysis of Diseases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1014-1025. [PMID: 32750887 DOI: 10.1109/tcbb.2020.3008196] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The metabolic wiring of patient cells is altered drastically in many diseases, including cancer. Understanding the nature of such changes may pave the way for new therapeutic opportunities as well as the development of personalized treatment strategies for patients. In this paper, we propose an algorithm called Metabolitics, which allows systems-level analysis of changes in the biochemical network of cells in disease states. It enables the study of a disease at both reaction- and pathway-level granularities for a detailed and summarized view of disease etiology. Metabolitics employs flux variability analysis with a dynamically built objective function based on biofluid metabolomics measurements in a personalized manner. Moreover, Metabolitics builds supervised classification models to discriminate between patients and healthy subjects based on the computed metabolic network changes. The use of Metabolitics is demonstrated for three distinct diseases, namely, breast cancer, Crohn's disease, and colorectal cancer. Our results show that the constructed supervised learning models successfully differentiate patients from healthy individuals by an average f1-score of 88 percent. Besides, in addition to the confirmation of previously reported breast cancer-associated pathways, we discovered that Biotin Metabolism along with Arginine and Proline Metabolism is subject to a significant increase in flux capacity, which have not been reported before.
Collapse
|
50
|
Muzio G, O’Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 PMCID: PMC7986589 DOI: 10.1093/bib/bbaa257] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O’Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|