1
|
Isewon I, Binaansim S, Adegoke F, Emmanuel J, Oyelade J. Machine learning methods for predicting essential metabolic genes from Plasmodium falciparum genome-scale metabolic network. PLoS One 2024; 19:e0315530. [PMID: 39715240 DOI: 10.1371/journal.pone.0315530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Accepted: 11/26/2024] [Indexed: 12/25/2024] Open
Abstract
Essential genes are those whose presence is vital for a cell's survival and growth. Detecting these genes in disease-causing organisms is critical for various biological studies, including understanding microbe metabolism, engineering genetically modified microorganisms, and identifying targets for treatment. When essential genes are expressed, they give rise to essential proteins. Identifying these genes, especially in complex organisms like Plasmodium falciparum, which causes malaria, is challenging due to the cost and time associated with experimental methods. Thus, computational approaches have emerged. Early research in this area prioritised the study of less intricate organisms, inadvertently neglecting the complexities of metabolite transport in metabolic networks. To overcome this, a Network-based Machine Learning framework was proposed. It assessed various network properties in Plasmodium falciparum, using a Genome-Scale Metabolic Model (iAM_Pf480) from the BiGG database and essentiality data from the Ogee database. The proposed approach substantially improved gene essentiality predictions as it considered the weighted and directed nature of metabolic networks and utilised network-based features, achieving a high accuracy rate of 0.85 and an AuROC of 0.7. Furthermore, this study enhanced the understanding of metabolic networks and their role in determining gene essentiality in Plasmodium falciparum. Notably, our model identified 9 genes previously considered non-essential in the Ogee database but now predicted to be essential, with some of them potentially serving as drug targets for malaria treatment, thereby opening exciting research avenues.
Collapse
Affiliation(s)
- Itunuoluwa Isewon
- Department of Computer and Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant Bioinformatics Research (CUBRe), Covenant University, Ota, Nigeria
- Covenant Applied Informatics and Communication, Africa Centre of Excellence (CApIC-ACE), Covenant University, Ota, Ogun State, Nigeria
| | - Stephen Binaansim
- Department of Computer and Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant Applied Informatics and Communication, Africa Centre of Excellence (CApIC-ACE), Covenant University, Ota, Ogun State, Nigeria
| | - Faith Adegoke
- Department of Computer and Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant Applied Informatics and Communication, Africa Centre of Excellence (CApIC-ACE), Covenant University, Ota, Ogun State, Nigeria
| | - Jerry Emmanuel
- Department of Computer and Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant Applied Informatics and Communication, Africa Centre of Excellence (CApIC-ACE), Covenant University, Ota, Ogun State, Nigeria
| | - Jelili Oyelade
- Department of Computer and Information Sciences, Covenant University, Ota, Ogun State, Nigeria
- Covenant Bioinformatics Research (CUBRe), Covenant University, Ota, Nigeria
- Covenant Applied Informatics and Communication, Africa Centre of Excellence (CApIC-ACE), Covenant University, Ota, Ogun State, Nigeria
| |
Collapse
|
2
|
Wang Y, Gao M, Zhu S, Li Z, Zhang T, Jiang Y, Zhu L, Zhan X. Glycerol-driven adaptive evolution for the production of low-molecular-weight Welan gum: Characterization and activity evaluation. Carbohydr Polym 2024; 339:122292. [PMID: 38823937 DOI: 10.1016/j.carbpol.2024.122292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 05/11/2024] [Accepted: 05/16/2024] [Indexed: 06/03/2024]
Abstract
Through adaptive laboratory evolution (ALE) of Sphingomonas sp. ATCC 31555, fermentation for production of low-molecular-weight welan gum (LMW-WG) was performed using glycerol as sole carbon source. During ALE, GPC-MALS analysis revealed a gradual decrease in WG molecular weight with the increase of adaptation cycles, accompanied by changes in solution conformation. LMW-WG was purified and structurally analyzed using GPC-MALS, monosaccharide composition analysis, infrared spectroscopy, NMR analysis, atomic force microscopy, and scanning electron microscopy. Subsequently, LMW-WG obtains hydration, transparency, antioxidant activity, and rheological properties. Finally, an in vitro simulation colon reactor was used to evaluate potential prebiotic properties of LMW-WG as dietary fiber. Compared with WG produced using sucrose as substrate, LMW-WG exhibited a fourfold reduction in molecular weight while maintaining moderate viscosity. Structurally, L-Rha nearly completely replaced L-Man. Furthermore, LMW-WG demonstrated excellent hydration, antioxidant activity, and high transparency. It also exhibited resistance to saliva and gastrointestinal digestion, showcasing a favorable colonization effect on Bifidobacterium, making it a promising symbiotic agent.
Collapse
Affiliation(s)
- Yuying Wang
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Minjie Gao
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Shengyong Zhu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Zhitao Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Tiantian Zhang
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Yun Jiang
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China
| | - Li Zhu
- A & F Biotech. Ltd., Burnaby, BC V5A3P6, Canada
| | - Xiaobei Zhan
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214122, China.
| |
Collapse
|
3
|
Moon JH, Nam S, Jeung K, Noh MH, Jung GY. Biosensor-Assisted Engineering for Diverse Microbial Cellular Physiologies. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:18321-18334. [PMID: 39107094 DOI: 10.1021/acs.jafc.4c04619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2024]
Abstract
Recent advancements in biosensor technology have revolutionized the field of microbial engineering, enabling efficient and precise optimization of strains for the production of valuable chemicals. This review comprehensively explores the innovative integration of biosensors to enhance microbial cell factories, with a particular emphasis on the crucial role of high-throughput biosensor-assisted screening. Biosensor-assisted approaches have enabled the identification of novel transporters, the elucidation of underlying transport mechanisms, and the fine-tuning of metabolic pathways for enhanced production. Furthermore, this review illustrates the utilization of biosensors for manipulating cellular behaviors, including interactions with environmental factors, and the reduction of nongenetic cell-to-cell variations. This review highlights the indispensable role of biosensors in advancing the field of microbial engineering through the modulation and exploitation of diverse cellular physiological processes.
Collapse
Affiliation(s)
- Jo Hyun Moon
- Department of Chemical Engineering, Pohang University of Science and Technology, 77 Cheongam-Ro, Nam-Gu, Pohang, Gyeongbuk 37673, Korea
| | - Sunghyun Nam
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, 77 Cheongam-Ro, Nam-Gu, Pohang, Gyeongbuk 37673, Korea
| | - Kumyoung Jeung
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, 77 Cheongam-Ro, Nam-Gu, Pohang, Gyeongbuk 37673, Korea
| | - Myung Hyun Noh
- Research Center for Bio-based Chemistry, Korea Research Institute of Chemical Technology (KRICT), 406-30, Jongga-ro, Jung-gu, Ulsan 44429, Korea
| | - Gyoo Yeol Jung
- Department of Chemical Engineering, Pohang University of Science and Technology, 77 Cheongam-Ro, Nam-Gu, Pohang, Gyeongbuk 37673, Korea
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, 77 Cheongam-Ro, Nam-Gu, Pohang, Gyeongbuk 37673, Korea
| |
Collapse
|
4
|
Phaneuf PV, Kim SH, Rychel K, Rode C, Beulig F, Palsson BO, Yang L. Meta-analysis Driven Strain Design for Mitigating Oxidative Stresses Important in Biomanufacturing. ACS Synth Biol 2024; 13:2045-2059. [PMID: 38934464 PMCID: PMC11264330 DOI: 10.1021/acssynbio.3c00572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 06/11/2024] [Accepted: 06/11/2024] [Indexed: 06/28/2024]
Abstract
As the availability of data sets increases, meta-analysis leveraging aggregated and interoperable data types is proving valuable. This study leveraged a meta-analysis workflow to identify mutations that could improve robustness to reactive oxygen species (ROS) stresses using an industrially important melatonin production strain as an example. ROS stresses often occur during cultivation and negatively affect strain performance. Cellular response to ROS is also linked to the SOS response and resistance to pH fluctuations, which is important to strain robustness in large-scale biomanufacturing. This work integrated more than 7000 E. coli adaptive laboratory evolution (ALE) mutations across 59 experiments to statistically associate mutated genes to 2 ROS tolerance ALE conditions from 72 unique conditions. Mutant oxyR, fur, iscR, and ygfZ were significantly associated and hypothesized to contribute fitness in ROS stress. Across these genes, 259 total mutations were inspected in conjunction with transcriptomics from 46 iModulon experiments. Ten mutations were chosen for reintroduction based on mutation clustering and coinciding transcriptional changes as evidence of fitness impact. Strains with mutations reintroduced into oxyR, fur, iscR, and ygfZ exhibited increased tolerance to H2O2 and acid stress and reduced SOS response, all of which are related to ROS. Additionally, new evidence was generated toward understanding the function of ygfZ, an uncharacterized gene. This meta-analysis approach utilized aggregated and interoperable multiomics data sets to identify mutations conferring industrially relevant phenotypes with the least drawbacks, describing an approach for data-driven strain engineering to optimize microbial cell factories.
Collapse
Affiliation(s)
- PV Phaneuf
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
| | - SH Kim
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
| | - K Rychel
- Department
of Bioengineering, University of California,
San Diego, La Jolla ,California92093-0412 ,United States
| | - C Rode
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
| | - F Beulig
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
| | - BO Palsson
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
- Department
of Bioengineering, University of California,
San Diego, La Jolla ,California92093-0412 ,United States
- Bioinformatics
and Systems Biology Program, University
of California, San Diego, La Jolla ,California92093-0021, United States
- Department
of Pediatrics, University of California,
San Diego, La Jolla ,California 92093-0412, United States
| | - L Yang
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220. Kongens Lyngby 2800, Denmark
| |
Collapse
|
5
|
Kim K, Choe D, Kang M, Cho SH, Cho S, Jeong KJ, Palsson B, Cho BK. Serial adaptive laboratory evolution enhances mixed carbon metabolic capacity of Escherichia coli. Metab Eng 2024; 83:160-171. [PMID: 38636729 DOI: 10.1016/j.ymben.2024.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/31/2024] [Accepted: 04/14/2024] [Indexed: 04/20/2024]
Abstract
Microbes have inherent capacities for utilizing various carbon sources, however they often exhibit sub-par fitness due to low metabolic efficiency. To test whether a bacterial strain can optimally utilize multiple carbon sources, Escherichia coli was serially evolved in L-lactate and glycerol. This yielded two end-point strains that evolved first in L-lactate then in glycerol, and vice versa. The end-point strains displayed a universal growth advantage on single and a mixture of adaptive carbon sources, enabled by a concerted action of carbon source-specialists and generalist mutants. The combination of just four variants of glpK, ppsA, ydcI, and rph-pyrE, accounted for more than 80% of end-point strain fitness. In addition, machine learning analysis revealed a coordinated activity of transcriptional regulators imparting condition-specific regulation of gene expression. The effectiveness of the serial adaptive laboratory evolution (ALE) scheme in bioproduction applications was assessed under single and mixed-carbon culture conditions, in which serial ALE strain exhibited superior productivity of acetoin compared to ancestral strains. Together, systems-level analysis elucidated the molecular basis of serial evolution, which hold potential utility in bioproduction applications.
Collapse
Affiliation(s)
- Kangsan Kim
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Donghui Choe
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
| | - Minjeong Kang
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Sang-Hyeok Cho
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Suhyung Cho
- KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Ki Jun Jeong
- KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; Graduate School of Engineering Biology, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea
| | - Bernhard Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA; Department of Pediatrics, University of California San Diego, La Jolla, CA, 92093, USA
| | - Byung-Kwan Cho
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; KAIST Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea; Graduate School of Engineering Biology, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea.
| |
Collapse
|
6
|
Catoiu EA, Mih N, Lu M, Palsson B. Establishing comprehensive quaternary structural proteomes from genome sequence. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.24.590993. [PMID: 38712217 PMCID: PMC11071507 DOI: 10.1101/2024.04.24.590993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
A critical body of knowledge has developed through advances in protein microscopy, protein-fold modeling, structural biology software, availability of sequenced bacterial genomes, large-scale mutation databases, and genome-scale models. Based on these recent advances, we develop a computational framework that; i) identifies the oligomeric structural proteome encoded by an organism's genome from available structural resources; ii) maps multi-strain alleleomic variation, resulting in the structural proteome for a species; and iii) calculates the 3D orientation of proteins across subcellular compartments with residue-level precision. Using the platform, we; iv) compute the quaternary E. coli K-12 MG1655 structural proteome; v) use a dataset of 12,000 mutations to build Random Forest classifiers that can predict the severity of mutations; and, in combination with a genome-scale model that computes proteome allocation, vi) obtain the spatial allocation of the E. coli proteome. Thus, in conjunction with relevant datasets and increasingly accurate computational models, we can now annotate quaternary structural proteomes, at genome-scale, to obtain a molecular-level understanding of whole-cell functions. Significance Advancements in experimental and computational methods have revealed the shapes of multi-subunit proteins. The absence of a unified platform that maps actionable datatypes onto these increasingly accurate structures creates a barrier to structural analyses, especially at the genome-scale. Here, we describe QSPACE, a computational annotation platform that evaluates existing resources to identify the best-available structure for each protein in a user's query, maps the 3D location of actionable datatypes ( e.g. , active sites, published mutations) onto the selected structures, and uses third-party APIs to determine the subcellular compartment of all amino acids of a protein. As proof-of-concept, we deployed QSPACE to generate the quaternary structural proteome of E. coli MG1655 and demonstrate two use-cases involving large-scale mutant analysis and genome-scale modelling.
Collapse
|