1
|
Oulia F, Charton P, Lo-Thong-Viramoutou O, Acevedo-Rocha CG, Liu W, Huynh D, Damour C, Wang J, Cadet F. Metabolic Fluxes Using Deep Learning Based on Enzyme Variations: Application to Glycolysis in Entamoeba histolytica. Int J Mol Sci 2024; 25:13390. [PMID: 39769154 PMCID: PMC11676880 DOI: 10.3390/ijms252413390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 12/09/2024] [Accepted: 12/10/2024] [Indexed: 01/11/2025] Open
Abstract
Metabolic pathway modeling, essential for understanding organism metabolism, is pivotal in predicting genetic mutation effects, drug design, and biofuel development. Enhancing these modeling techniques is crucial for achieving greater prediction accuracy and reliability. However, the limited experimental data or the complexity of the pathway makes it challenging for researchers to predict phenotypes. Deep learning (DL) is known to perform better than other Machine Learning (ML) approaches if the right conditions are met (i.e., a large database and good choice of parameters). Here, we use a knowledge-based model to massively generate synthetic data and extend a small initial dataset of experimental values. The main objective is to assess if DL can perform at least as well as other ML approaches in flux prediction, using 68,950 instances. Two processing methods are used to generate DL models: cross-validation and repeated holdout evaluation. DL models predict the metabolic fluxes with high precision and slightly outperform the best-known ML approach (the Cubist model) with a lower RMSE (≤0.01) in both cases. They also outperform the PLS model (RMSE ≥ 30). This study is the first to use DL to predict the overall flux of a metabolic pathway only from variations of enzyme concentrations.
Collapse
Affiliation(s)
- Freddy Oulia
- BIGR, UMR_S1134 Inserm, University of Paris City, 75006 Paris, France; (F.O.); (P.C.); (O.L.-T.-V.)
- Laboratory of Excellence GR-Ex, 75006 Paris, France
- DSIMB, UMR_S1134 BIGR, Inserm, Faculty of Sciences and Technology, University of Reunion, 97744 Saint-Denis, France
| | - Philippe Charton
- BIGR, UMR_S1134 Inserm, University of Paris City, 75006 Paris, France; (F.O.); (P.C.); (O.L.-T.-V.)
- Laboratory of Excellence GR-Ex, 75006 Paris, France
- DSIMB, UMR_S1134 BIGR, Inserm, Faculty of Sciences and Technology, University of Reunion, 97744 Saint-Denis, France
| | - Ophélie Lo-Thong-Viramoutou
- BIGR, UMR_S1134 Inserm, University of Paris City, 75006 Paris, France; (F.O.); (P.C.); (O.L.-T.-V.)
- Laboratory of Excellence GR-Ex, 75006 Paris, France
- DSIMB, UMR_S1134 BIGR, Inserm, Faculty of Sciences and Technology, University of Reunion, 97744 Saint-Denis, France
| | - Carlos G. Acevedo-Rocha
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark;
| | - Wei Liu
- Department of Computer Science and Software Engineering, School of Physics, Mathematics and Computing, The University of Western Australia, Perth 6009, Australia; (W.L.); (D.H.)
| | - Du Huynh
- Department of Computer Science and Software Engineering, School of Physics, Mathematics and Computing, The University of Western Australia, Perth 6009, Australia; (W.L.); (D.H.)
| | - Cédric Damour
- EnergyLab, EA 4079, Faculty of Sciences and Technology, University of Reunion, 97490 Saint-Denis, France;
| | - Jingbo Wang
- Department of Physics, School of Physics, Mathematics and Computing, The University of Western Australia, Perth 6009, Australia;
| | - Frederic Cadet
- BIGR, UMR_S1134 Inserm, University of Paris City, 75006 Paris, France; (F.O.); (P.C.); (O.L.-T.-V.)
- Laboratory of Excellence GR-Ex, 75006 Paris, France
- DSIMB, UMR_S1134 BIGR, Inserm, Faculty of Sciences and Technology, University of Reunion, 97744 Saint-Denis, France
- Artificial Intelligence Department, PEACCEL, 75013 Paris, France
| |
Collapse
|
2
|
Matsuyama C, Seike T, Okahashi N, Niide T, Hara KY, Hirono-Hara Y, Ishii J, Shimizu H, Toya Y, Matsuda F. Metabolome analysis of metabolic burden in Escherichia coli caused by overexpression of green fluorescent protein and delta-rhodopsin. J Biosci Bioeng 2024; 137:187-194. [PMID: 38281859 DOI: 10.1016/j.jbiosc.2023.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 11/18/2023] [Accepted: 12/04/2023] [Indexed: 01/30/2024]
Abstract
Overexpression of proteins by introducing a DNA vector is among the most important tools for the metabolic engineering of microorganisms such as Escherichia coli. Protein overexpression imposes a burden on metabolism because metabolic pathways must supply building blocks for protein and DNA synthesis. Different E. coli strains have distinct metabolic capacities. In this study, two proteins were overexpressed in four E. coli strains (MG1655(DE3), W3110(DE3), BL21star(DE3), and Rosetta(DE3)), and their effects on metabolic burden were investigated. Metabolomic analysis showed that E. coli strains overexpressing green fluorescent protein had decreased levels of several metabolites, with a positive correlation between the number of reduced metabolites and green fluorescent protein expression levels. Moreover, nucleic acid-related metabolites decreased, indicating a metabolic burden in the E. coli strains, and the growth rate and protein expression levels were improved by supplementation with the five nucleosides. In contrast, two strains overexpressing delta rhodopsin, a microbial membrane rhodopsin from Haloterrigena turkmenica, led to a metabolic burden and decrease in the amino acids Ala, Val, Leu, Ile, Thr, Phe, Asp, and Trp, which are the most frequent amino acids in the delta rhodopsin protein sequence. The metabolic burden caused by protein overexpression was influenced by the metabolic capacity of the host strains and the sequences of the overexpressed proteins. Detailed characterization of the effects of protein expression on the metabolic state of engineered cells using metabolomics will provide insights into improving the production of target compounds.
Collapse
Affiliation(s)
- Chinatsu Matsuyama
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
| | - Taisuke Seike
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
| | - Nobuyuki Okahashi
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan; Osaka University Shimadzu Omics Innovation Research Laboratories, Osaka University, Osaka 565-0871, Japan
| | - Teppei Niide
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
| | - Kiyotaka Y Hara
- Department of Environmental and Life Sciences, School of Food and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | | | - Jun Ishii
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe, Hyogo 657-8501, Japan
| | - Hiroshi Shimizu
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
| | - Yoshihiro Toya
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan
| | - Fumio Matsuda
- Graduate School of Information Science and Technology, Osaka University, Osaka 565-0871, Japan; Osaka University Shimadzu Omics Innovation Research Laboratories, Osaka University, Osaka 565-0871, Japan.
| |
Collapse
|
3
|
Koduru L, Lakshmanan M, Lee DY. In silico model-guided identification of transcriptional regulator targets for efficient strain design. Microb Cell Fact 2018; 17:167. [PMID: 30359263 PMCID: PMC6201637 DOI: 10.1186/s12934-018-1015-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 10/20/2018] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Cellular metabolism is tightly regulated by hard-wired multiple layers of biological processes to achieve robust and homeostatic states given the limited resources. As a result, even the most intuitive enzyme-centric metabolic engineering endeavours through the up-/down-regulation of multiple genes in biochemical pathways often deliver insignificant improvements in the product yield. In this regard, targeted engineering of transcriptional regulators (TRs) that control several metabolic functions in modular patterns is an interesting strategy. However, only a handful of in silico model-added techniques are available for identifying the TR manipulation candidates, thus limiting its strain design application. RESULTS We developed hierarchical-Beneficial Regulatory Targeting (h-BeReTa) which employs a genome-scale metabolic model and transcriptional regulatory network (TRN) to identify the relevant TR targets suitable for strain improvement. We then applied this method to industrially relevant metabolites and cell factory hosts, Escherichia coli and Corynebacterium glutamicum. h-BeReTa suggested several promising TR targets, many of which have been validated through literature evidences. h-BeReTa considers the hierarchy of TRs in the TRN and also accounts for alternative metabolic pathways which may divert flux away from the product while identifying suitable metabolic fluxes, thereby performing superior in terms of global TR target identification. CONCLUSIONS In silico model-guided strain design framework, h-BeReTa, was presented for identifying transcriptional regulator targets. Its efficacy and applicability to microbial cell factories were successfully demonstrated via case studies involving two cell factory hosts, as such suggesting several intuitive targets for overproducing various value-added compounds.
Collapse
Affiliation(s)
- Lokanand Koduru
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 4 Engineering Drive 4, Singapore, 117576, Singapore
| | - Meiyappan Lakshmanan
- Bioprocessing Technology Institute, Agency for Science, Technology and Research (A*STAR), 20 Biopolis Way, #06-01, Centros, Singapore, 138668, Singapore
| | - Dong-Yup Lee
- Bioprocessing Technology Institute, Agency for Science, Technology and Research (A*STAR), 20 Biopolis Way, #06-01, Centros, Singapore, 138668, Singapore.
- School of Chemical Engineering, Sungkyunkwan University, 2066, Seobu-ro, Jangan-gu, Suwon, Gyeonggi-do, 16419, Republic of Korea.
| |
Collapse
|
4
|
Martínez JA, Bolívar F, Escalante A. Shikimic Acid Production in Escherichia coli: From Classical Metabolic Engineering Strategies to Omics Applied to Improve Its Production. Front Bioeng Biotechnol 2015; 3:145. [PMID: 26442259 PMCID: PMC4585142 DOI: 10.3389/fbioe.2015.00145] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 09/07/2015] [Indexed: 12/02/2022] Open
Abstract
Shikimic acid (SA) is an intermediate of the SA pathway that is present in bacteria and plants. SA has gained great interest because it is a precursor in the synthesis of the drug oseltamivir phosphate (OSF), an efficient inhibitor of the neuraminidase enzyme of diverse seasonal influenza viruses, the avian influenza virus H5N1, and the human influenza virus H1N1. For the purposes of OSF production, SA is extracted from the pods of Chinese star anise plants (Illicium spp.), yielding up to 17% of SA (dry basis content). The high demand for OSF necessary to manage a major influenza outbreak is not adequately met by industrial production using SA from plants sources. As the SA pathway is present in the model bacteria Escherichia coli, several "intuitive" metabolically engineered strains have been applied for its successful overproduction by biotechnological processes, resulting in strains producing up to 71 g/L of SA, with high conversion yields of up to 0.42 (mol SA/mol Glc), in both batch and fed-batch cultures using complex fermentation broths, including glucose as a carbon source and yeast extract. Global transcriptomic analyses have been performed in SA-producing strains, resulting in the identification of possible key target genes for the design of a rational strain improvement strategy. Because possible target genes are involved in the transport, catabolism, and interconversion of different carbon sources and metabolic intermediates outside the central carbon metabolism and SA pathways, as genes involved in diverse cellular stress responses, the development of rational cellular strain improvement strategies based on omics data constitutes a challenging task to improve SA production in currently overproducing engineered strains. In this review, we discuss the main metabolic engineering strategies that have been applied for the development of efficient SA-producing strains, as the perspective of omics analysis has focused on further strain improvement for the production of this valuable aromatic intermediate.
Collapse
Affiliation(s)
- Juan Andrés Martínez
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Francisco Bolívar
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| | - Adelfo Escalante
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Mexico
| |
Collapse
|
5
|
The transport and mediation mechanisms of the common sugars in Escherichia coli. Biotechnol Adv 2014; 32:905-19. [DOI: 10.1016/j.biotechadv.2014.04.009] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 03/23/2014] [Accepted: 04/18/2014] [Indexed: 11/17/2022]
|
6
|
Vaumourin E, Vourc'h G, Telfer S, Lambin X, Salih D, Seitzer U, Morand S, Charbonnel N, Vayssier-Taussat M, Gasqui P. To be or not to be associated: power study of four statistical modeling approaches to identify parasite associations in cross-sectional studies. Front Cell Infect Microbiol 2014; 4:62. [PMID: 24860791 PMCID: PMC4030204 DOI: 10.3389/fcimb.2014.00062] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2014] [Accepted: 04/23/2014] [Indexed: 01/08/2023] Open
Abstract
A growing number of studies are reporting simultaneous infections by parasites in many different hosts. The detection of whether these parasites are significantly associated is important in medicine and epidemiology. Numerous approaches to detect associations are available, but only a few provide statistical tests. Furthermore, they generally test for an overall detection of association and do not identify which parasite is associated with which other one. Here, we developed a new approach, the association screening approach, to detect the overall and the detail of multi-parasite associations. We studied the power of this new approach and of three other known ones (i.e., the generalized chi-square, the network and the multinomial GLM approaches) to identify parasite associations either due to parasite interactions or to confounding factors. We applied these four approaches to detect associations within two populations of multi-infected hosts: (1) rodents infected with Bartonella sp., Babesia microti and Anaplasma phagocytophilum and (2) bovine population infected with Theileria sp. and Babesia sp. We found that the best power is obtained with the screening model and the generalized chi-square test. The differentiation between associations, which are due to confounding factors and parasite interactions was not possible. The screening approach significantly identified associations between Bartonella doshiae and B. microti, and between T. parva, T. mutans, and T. velifera. Thus, the screening approach was relevant to test the overall presence of parasite associations and identify the parasite combinations that are significantly over- or under-represented. Unraveling whether the associations are due to real biological interactions or confounding factors should be further investigated. Nevertheless, in the age of genomics and the advent of new technologies, it is a considerable asset to speed up researches focusing on the mechanisms driving interactions between parasites.
Collapse
Affiliation(s)
- Elise Vaumourin
- INRA, UR346 Epidémiologie Animale Saint Genès Champanelle, France ; INRA-Anses-ENVA, USC BIPAR Maisons-Alfort, France
| | - Gwenaël Vourc'h
- INRA, UR346 Epidémiologie Animale Saint Genès Champanelle, France
| | - Sandra Telfer
- School of Biological Sciences, University of Aberdeen Aberdeen, UK
| | - Xavier Lambin
- School of Biological Sciences, University of Aberdeen Aberdeen, UK
| | - Diaeldin Salih
- Department of Ticks and Tick-borne Diseases, Veterinary Research Institute Khartoum, Sudan
| | - Ulrike Seitzer
- Division of Veterinary-Infection Biology and Immunology, Research Center Borstel Borstel, Germany
| | - Serge Morand
- Institut des Sciences de l'Evolution (CNRS /IRD / UM2), University of Montpellier 2 Montpellier, France ; Animal et Gestion Intégrée des Risques, CIRAD Montpellier, France
| | - Nathalie Charbonnel
- INRA, UMR CBGP (INRA / IRD / CIRAD / Montpellier SupAgro) Montpellier, France
| | | | - Patrick Gasqui
- INRA, UR346 Epidémiologie Animale Saint Genès Champanelle, France
| |
Collapse
|
7
|
Matsuoka Y, Shimizu K. ¹³C-metabolic flux analysis for Escherichia coli. Methods Mol Biol 2014; 1191:261-289. [PMID: 25178796 DOI: 10.1007/978-1-4939-1170-7_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
(13)C-Metabolic flux analysis ((13)C-MFA) is used here to study the effects of the knockout of such genes as pgi, zwf, gnd, ppc, pck, pyk, and lpdA on the metabolic changes in Escherichia coli cultivated under aerobic condition. The metabolic regulation mechanisms were clarified by integrating such information as fermentation data, gene expression, enzyme activities, and metabolite concentrations as well the result of (13)C-MFA.
Collapse
Affiliation(s)
- Yu Matsuoka
- Kyushu Institute of Technology, 680-4 Kawazu, Iizuka-shi, Fukuoka, 820-8502, Japan
| | | |
Collapse
|