1
|
Ferreira S, Balola A, Sveshnikova A, Hatzimanikatis V, Vilaça P, Maia P, Carreira R, Stoney R, Carbonell P, Souza CS, Correia J, Lousa D, Soares CM, Rocha I. Computer-aided design and implementation of efficient biosynthetic pathways to produce high added-value products derived from tyrosine in Escherichia coli. Front Bioeng Biotechnol 2024; 12:1360740. [PMID: 38978715 PMCID: PMC11228882 DOI: 10.3389/fbioe.2024.1360740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 06/03/2024] [Indexed: 07/10/2024] Open
Abstract
Developing efficient bioprocesses requires selecting the best biosynthetic pathways, which can be challenging and time-consuming due to the vast amount of data available in databases and literature. The extension of the shikimate pathway for the biosynthesis of commercially attractive molecules often involves promiscuous enzymes or lacks well-established routes. To address these challenges, we developed a computational workflow integrating enumeration/retrosynthesis algorithms, a toolbox for pathway analysis, enzyme selection tools, and a gene discovery pipeline, supported by manual curation and literature review. Our focus has been on implementing biosynthetic pathways for tyrosine-derived compounds, specifically L-3,4-dihydroxyphenylalanine (L-DOPA) and dopamine, with significant applications in health and nutrition. We selected one pathway to produce L-DOPA and two different pathways for dopamine-one already described in the literature and a novel pathway. Our goal was either to identify the most suitable gene candidates for expression in Escherichia coli for the known pathways or to discover innovative pathways. Although not all implemented pathways resulted in the accumulation of target compounds, in our shake-flask experiments we achieved a maximum L-DOPA titer of 0.71 g/L and dopamine titers of 0.29 and 0.21 g/L for known and novel pathways, respectively. In the case of L-DOPA, we utilized, for the first time, a mutant version of tyrosinase from Ralstonia solanacearum. Production of dopamine via the known biosynthesis route was accomplished by coupling the L-DOPA pathway with the expression of DOPA decarboxylase from Pseudomonas putida, resulting in a unique biosynthetic pathway never reported in literature before. In the context of the novel pathway, dopamine was produced using tyramine as the intermediate compound. To achieve this, tyrosine was initially converted into tyramine by expressing TDC from Levilactobacillus brevis, which, in turn, was converted into dopamine through the action of the enzyme encoded by ppoMP from Mucuna pruriens. This marks the first time that an alternative biosynthetic pathway for dopamine has been validated in microbes. These findings underscore the effectiveness of our computational workflow in facilitating pathway enumeration and selection, offering the potential to uncover novel biosynthetic routes, thus paving the way for other target compounds of biotechnological interest.
Collapse
Affiliation(s)
- Sofia Ferreira
- Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - Alexandra Balola
- Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - Anastasia Sveshnikova
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne, Switzerland
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne, Switzerland
| | - Paulo Vilaça
- SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
| | - Paulo Maia
- SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
| | - Rafael Carreira
- SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
| | - Ruth Stoney
- Manchester Institute of Biotechnology, School of Chemistry, Faculty of Science and Engineering, University of Manchester, Manchester, United Kingdom
| | - Pablo Carbonell
- Institute of Industrial Control Systems and Computing (AI2), Universitat Politècnica de València (UPV), Valencia, Spain
- Institute for Integrative Systems Biology I2SysBio, Universitat de València-CSIC: Consejo Superior de Investigaciones Científicas, Paterna, Spain
| | - Caio Silva Souza
- Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - João Correia
- Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - Diana Lousa
- Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - Cláudio M Soares
- Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| | - Isabel Rocha
- Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
| |
Collapse
|
2
|
Goshisht MK. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS OMEGA 2024; 9:9921-9945. [PMID: 38463314 PMCID: PMC10918679 DOI: 10.1021/acsomega.3c05913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/19/2024] [Accepted: 01/30/2024] [Indexed: 03/12/2024]
Abstract
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
Collapse
Affiliation(s)
- Manoj Kumar Goshisht
- Department of Chemistry, Natural and
Applied Sciences, University of Wisconsin—Green
Bay, Green
Bay, Wisconsin 54311-7001, United States
| |
Collapse
|
3
|
Datta R. Enzymatic degradation of cellulose in soil: A review. Heliyon 2024; 10:e24022. [PMID: 38234915 PMCID: PMC10792583 DOI: 10.1016/j.heliyon.2024.e24022] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 12/13/2023] [Accepted: 01/02/2024] [Indexed: 01/19/2024] Open
Abstract
Cellulose degradation is a critical process in soil ecosystems, playing a vital role in nutrient cycling and organic matter decomposition. Enzymatic degradation of cellulosic biomass is the most sustainable and green method of producing liquid biofuel. It has gained intensive research interest with future perspective as the majority of terrestrial lignocellulose biomass has a great potential to be used as a source of bioenergy. However, the recalcitrant nature of lignocellulose limits its use as a source of energy. Noteworthy enough, enzymatic conversion of cellulose biomass could be a leading future technology. Fungal enzymes play a central role in cellulose degradation. Our understanding of fungal cellulases has substantially redirected in the past few years with the discovery of a new class of enzymes and Cellulosome. Efforts have been made from time to time to develop an economically viable method of cellulose degradation. This review provides insights into the current state of knowledge regarding cellulose degradation in soil and identifies areas where further research is needed.
Collapse
Affiliation(s)
- Rahul Datta
- Department of Geology and Pedology, Faculty of Forestry and Wood Technology. Mendel University In Brno, Czech Republic
| |
Collapse
|
4
|
Abstract
This chapter outlines the myriad applications of machine learning (ML) in synthetic biology, specifically in engineering cell and protein activity, and metabolic pathways. Though by no means comprehensive, the chapter highlights several prominent computational tools applied in the field and their potential use cases. The examples detailed reinforce how ML algorithms can enhance synthetic biology research by providing data-driven insights into the behavior of living systems, even without detailed knowledge of their underlying mechanisms. By doing so, ML promises to increase the efficiency of research projects by modeling hypotheses in silico that can then be tested through experiments. While challenges related to training dataset generation and computational costs remain, ongoing improvements in ML tools are paving the way for smarter and more streamlined synthetic biology workflows that can be readily employed to address grand challenges across manufacturing, medicine, engineering, agriculture, and beyond.
Collapse
Affiliation(s)
- Brendan Fu-Long Sieow
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- NUS Graduate School for Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, Singapore
| | - Ryan De Sotto
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Zhi Ren Darren Seet
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - In Young Hwang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Matthew Wook Chang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore.
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
5
|
Nakanishi A, Yamamoto N, Sakihama Y, Okino T, Matoba N. Development of Targeted Protein-Displaying Technology with a Novel Carbon Material. BIOTECH (BASEL (SWITZERLAND)) 2022; 12:biotech12010002. [PMID: 36648828 PMCID: PMC9844296 DOI: 10.3390/biotech12010002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 12/15/2022] [Accepted: 12/22/2022] [Indexed: 12/28/2022]
Abstract
This study reports a new carbon material and its specific display of targeted protein. The properties of the carbon materials fabricated with carbon black MOGUL® were analyzed. The carbon materials were spherical structures with 55.421 µm as a median value. The specific surface area, pore volume, average pore diameter, and total of the acidic functional group were 130 m2·g-1, 0.55 cm3·g-1, 17.2 nm, and 0.29 mEq·g-1, respectively. The adsorption-desorption isoform of the carbon materials showed type IV of the hysteresis loop as defined by IUPAC, indicating non-uniform mesoporous structures (2-50 nm). The distribution of the log differential pore volume also indicated non-uniform porous structures because (i) the difference between the average pore size and the most frequent pore size was significant and (ii) the σ value was larger than the average value regarding the pore sizes. However, 10-90% of the integrated values of the log differential pore volume were 57.4% of the total integrated values, and the distribution was similar to the Gauss distribution model. Although the value of the total of the acidic functional group was 2.5-5.4 times lower than the values of the HPLC columns, the carbon materials require good scaffold quality rather than good HPLC quality. Therefore, the amounts could be enough for the scaffold of biotin hydrazide. To demonstrate the property of displaying the targeted proteins, carbon materials displaying biotin hydrazide by covalent bonding were prepared and avidin-labeled horse radish peroxidase (HRP) was bound to the biotin region. The carbon materials were porous structures, so the unspecific adsorption of HRP was estimated. Then, the maintenance ratios of HRP activities were analyzed in the repeated-use-with-wash processes after each evaluation, resulting in the activities of HRP on the carbon materials being treated with biotin hydrazide being significantly maintained compared to that of the ones without biotin hydrazide. The study revealed the properties of the carbon materials and indicated the display of HRP, suggesting that the carbon materials could be a new material for displaying targeted proteins.
Collapse
Affiliation(s)
- Akihito Nakanishi
- Graduate School of Bionics, Tokyo University of Technology, Tokyo 192-0982, Japan
- School of Bioscience and Biotechnology, Tokyo University of Technology, Tokyo 192-0982, Japan
- Correspondence:
| | - Naotaka Yamamoto
- Graduate School of Bionics, Tokyo University of Technology, Tokyo 192-0982, Japan
| | - Yuri Sakihama
- College of Science and Engineering, Aoyama Gakuin University, Sagamihara 252-5258, Japan
| | - Tomoya Okino
- Kansai Coke & Chemicals Co., Ltd., Amagasaki 661-0976, Japan
| | - Naoki Matoba
- Kansai Coke & Chemicals Co., Ltd., Amagasaki 661-0976, Japan
| |
Collapse
|
6
|
Foerster H, Battey JND, Sierro N, Ivanov NV, Mueller LA. Metabolic networks of the Nicotiana genus in the spotlight: content, progress and outlook. Brief Bioinform 2021; 22:bbaa136. [PMID: 32662816 PMCID: PMC8138835 DOI: 10.1093/bib/bbaa136] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 05/19/2020] [Accepted: 06/04/2020] [Indexed: 01/09/2023] Open
Abstract
Manually curated metabolic databases residing at the Sol Genomics Network comprise two taxon-specific databases for the Solanaceae family, i.e. SolanaCyc and the genus Nicotiana, i.e. NicotianaCyc as well as six species-specific databases for Nicotiana tabacum TN90, N. tabacum K326, Nicotiana benthamiana, N. sylvestris, N. tomentosiformis and N. attenuata. New pathways were created through the extraction, examination and verification of related data from the literature and the aid of external database guided by an expert-led curation process. Here we describe the curation progress that has been achieved in these databases since the first release version 1.0 in 2016, the curation flow and the curation process using the example metabolic pathway for cholesterol in plants. The current content of our databases comprises 266 pathways and 36 superpathways in SolanaCyc and 143 pathways plus 21 superpathways in NicotianaCyc, manually curated and validated specifically for the Solanaceae family and Nicotiana genus, respectively. The curated data have been propagated to the respective Nicotiana-specific databases, which resulted in the enrichment and more accurate presentation of their metabolic networks. The quality and coverage in those databases have been compared with related external databases and discussed in terms of literature support and metabolic content.
Collapse
|
7
|
Lawson CE, Martí JM, Radivojevic T, Jonnalagadda SVR, Gentz R, Hillson NJ, Peisert S, Kim J, Simmons BA, Petzold CJ, Singer SW, Mukhopadhyay A, Tanjore D, Dunn JG, Garcia Martin H. Machine learning for metabolic engineering: A review. Metab Eng 2020; 63:34-60. [PMID: 33221420 DOI: 10.1016/j.ymben.2020.10.005] [Citation(s) in RCA: 123] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 10/22/2020] [Accepted: 10/31/2020] [Indexed: 12/14/2022]
Abstract
Machine learning provides researchers a unique opportunity to make metabolic engineering more predictable. In this review, we offer an introduction to this discipline in terms that are relatable to metabolic engineers, as well as providing in-depth illustrative examples leveraging omics data and improving production. We also include practical advice for the practitioner in terms of data management, algorithm libraries, computational resources, and important non-technical issues. A variety of applications ranging from pathway construction and optimization, to genetic editing optimization, cell factory testing, and production scale-up are discussed. Moreover, the promising relationship between machine learning and mechanistic models is thoroughly reviewed. Finally, the future perspectives and most promising directions for this combination of disciplines are examined.
Collapse
Affiliation(s)
- Christopher E Lawson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Jose Manuel Martí
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Tijana Radivojevic
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sai Vamshi R Jonnalagadda
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Reinhard Gentz
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Nathan J Hillson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sean Peisert
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; University of California Davis, Davis, CA, 95616, USA
| | - Joonhoon Kim
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Pacific Northwest National Laboratory, Richland, 99354, WA, USA
| | - Blake A Simmons
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Christopher J Petzold
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Steven W Singer
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Aindrila Mukhopadhyay
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA
| | - Deepti Tanjore
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Advanced Biofuels and Bioproducts Process Development Unit, Emeryville, CA, 94608, USA
| | | | - Hector Garcia Martin
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA; Basque Center for Applied Mathematics, 48009, Bilbao, Spain; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA.
| |
Collapse
|
8
|
Medford AJ, Kunz MR, Ewing SM, Borders T, Fushimi R. Extracting Knowledge from Data through Catalysis Informatics. ACS Catal 2018. [DOI: 10.1021/acscatal.8b01708] [Citation(s) in RCA: 128] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Andrew J. Medford
- School of Chemical and Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia 30318 United States
| | - M. Ross Kunz
- Biological and Chemical Processing Department, Energy and Environmental Science and Technology, Idaho National Laboratory, P.O. Box 1625, Idaho Falls, Idaho 83415, United States
| | - Sarah M. Ewing
- Biological and Chemical Processing Department, Energy and Environmental Science and Technology, Idaho National Laboratory, P.O. Box 1625, Idaho Falls, Idaho 83415, United States
| | - Tammie Borders
- Biological and Chemical Processing Department, Energy and Environmental Science and Technology, Idaho National Laboratory, P.O. Box 1625, Idaho Falls, Idaho 83415, United States
| | - Rebecca Fushimi
- Biological and Chemical Processing Department, Energy and Environmental Science and Technology, Idaho National Laboratory, P.O. Box 1625, Idaho Falls, Idaho 83415, United States
- Center for Advanced Energy Studies, 995 University Boulevard, Idaho Falls, Idaho 83401, United States
| |
Collapse
|
9
|
McDonagh JL, Palmer DS, Mourik TV, Mitchell JBO. Are the Sublimation Thermodynamics of Organic Molecules Predictable? J Chem Inf Model 2016; 56:2162-2179. [PMID: 27749062 DOI: 10.1021/acs.jcim.6b00033] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
We compare a range of computational methods for the prediction of sublimation thermodynamics (enthalpy, entropy, and free energy of sublimation). These include a model from theoretical chemistry that utilizes crystal lattice energy minimization (with the DMACRYS program) and quantitative structure property relationship (QSPR) models generated by both machine learning (random forest and support vector machines) and regression (partial least squares) methods. Using these methods we investigate the predictability of the enthalpy, entropy and free energy of sublimation, with consideration of whether such a method may be able to improve solubility prediction schemes. Previous work has suggested that the major source of error in solubility prediction schemes involving a thermodynamic cycle via the solid state is in the modeling of the free energy change away from the solid state. Yet contrary to this conclusion other work has found that the inclusion of terms such as the enthalpy of sublimation in QSPR methods does not improve the predictions of solubility. We suggest the use of theoretical chemistry terms, detailed explicitly in the Methods section, as descriptors for the prediction of the enthalpy and free energy of sublimation. A data set of 158 molecules with experimental sublimation thermodynamics values and some CSD refcodes has been collected from the literature and is provided with their original source references.
Collapse
Affiliation(s)
- James L McDonagh
- Manchester Institute of Biotechnology, The University of Manchester , 131 Princess Street, Manchester, M1 7DN, U.K.,School of Chemistry, University of St Andrews , North Haugh, St Andrews, Fife, Scotland, United Kingdom , KY16 9ST
| | - David S Palmer
- Department of Pure and Applied Chemistry, University of Strathclyde , Thomas Graham Building, 295 Cathedral Street, Glasgow, Scotland, United Kingdom , G1 1XL
| | - Tanja van Mourik
- School of Chemistry, University of St Andrews , North Haugh, St Andrews, Fife, Scotland, United Kingdom , KY16 9ST
| | - John B O Mitchell
- School of Chemistry, University of St Andrews , North Haugh, St Andrews, Fife, Scotland, United Kingdom , KY16 9ST
| |
Collapse
|
10
|
McDonagh JL, Vincent MA, Popelier PL. Partitioning dynamic electron correlation energy: Viewing Møller-Plesset correlation energies through Interacting Quantum Atom (IQA) energy partitioning. Chem Phys Lett 2016. [DOI: 10.1016/j.cplett.2016.09.019] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
11
|
Supuran CT, Capasso C. New light on bacterial carbonic anhydrases phylogeny based on the analysis of signal peptide sequences. J Enzyme Inhib Med Chem 2016; 31:1254-60. [PMID: 27353388 DOI: 10.1080/14756366.2016.1201479] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
Among protein families, carbonic anhydrases (CAs, EC 4.2.1.1) are metalloenzymes characterized by a common reaction mechanism in all life domains: the carbon dioxide hydration to bicarbonate and protons (CO2+H2O ⇔ HCO3(-)+H(+)). Six genetically distinct CA families are known to date, the α-, β-, γ-, δ-, ζ- and η-CAs. The last CA class was recently discovered analyzing the amino acid sequences of CAs from Plasmodia. Bacteria encode for enzymes belonging to the α-, β-, and γ-CA classes and recently, phylogenetic analysis revealed an interesting relationship regarding the evolution of bacterial CA classes. This result evidenced that the three bacterial CA classes, in spite of the high level of the structural similarity, are evolutionarily distinct, but we noted that the primary structure of some β-CAs identified in the genome of Gram-negative bacteria present a pre-sequence of 18 or more amino acid residues at the N-terminal part. These observations and subsequent phylogenetic data presented here prompted us to propose that the β-CAs found in Gram-negative bacteria with a periplasmic space and characterized by the presence of a signal peptide might have a periplasmic localization and a role similar to that described previously for the α-CAs.
Collapse
Affiliation(s)
- Claudiu T Supuran
- a Dipartimento Neurofarba, Sezione di Scienze Farmaceutiche e Nutraceutiche and.,b Dipartimento di Chimica, Laboratorio di Chimica Bioinorganica, Polo Scientifico, Università degli Studi di Firenze , Sesto Fiorentino , Florence , Italy , and
| | | |
Collapse
|
12
|
Mellor J, Grigoras I, Carbonell P, Faulon JL. Semisupervised Gaussian Process for Automated Enzyme Search. ACS Synth Biol 2016; 5:518-28. [PMID: 27007080 DOI: 10.1021/acssynbio.5b00294] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Synthetic biology is today harnessing the design of novel and greener biosynthesis routes for the production of added-value chemicals and natural products. The design of novel pathways often requires a detailed selection of enzyme sequences to import into the chassis at each of the reaction steps. To address such design requirements in an automated way, we present here a tool for exploring the space of enzymatic reactions. Given a reaction and an enzyme the tool provides a probability estimate that the enzyme catalyzes the reaction. Our tool first considers the similarity of a reaction to known biochemical reactions with respect to signatures around their reaction centers. Signatures are defined based on chemical transformation rules by using extended connectivity fingerprint descriptors. A semisupervised Gaussian process model associated with the similar known reactions then provides the probability estimate. The Gaussian process model uses information about both the reaction and the enzyme in providing the estimate. These estimates were validated experimentally by the application of the Gaussian process model to a newly identified metabolite in Escherichia coli in order to search for the enzymes catalyzing its associated reactions. Furthermore, we show with several pathway design examples how such ability to assign probability estimates to enzymatic reactions provides the potential to assist in bioengineering applications, providing experimental validation to our proposed approach. To the best of our knowledge, the proposed approach is the first application of Gaussian processes dealing with biological sequences and chemicals, the use of a semisupervised Gaussian process framework is also novel in the context of machine learning applied to bioinformatics. However, the ability of an enzyme to catalyze a reaction depends on the affinity between the substrates of the reaction and the enzyme. This affinity is generally quantified by the Michaelis constant KM. Therefore, we also demonstrate using Gaussian process regression to predict KM given a substrate-enzyme pair.
Collapse
Affiliation(s)
- Joseph Mellor
- School
of Chemistry, University of Manchester, Manchester M13 9PL, U.K
- Manchester
Institute of Biotechnology, University of Manchester, Manchester M13 9PL, U.K
| | - Ioana Grigoras
- iSSB,
Institute of Systems and Synthetic Biology, CNRS, University of Évry-Val-d’Essonne, 91000 Évry, France
| | - Pablo Carbonell
- SYNBIOCHEM
Centre, Manchester Institute of Biotechnology, University of Manchester, Manchester M13 9PL, U.K
| | - Jean-Loup Faulon
- School
of Chemistry, University of Manchester, Manchester M13 9PL, U.K
- iSSB,
Institute of Systems and Synthetic Biology, CNRS, University of Évry-Val-d’Essonne, 91000 Évry, France
- SYNBIOCHEM
Centre, Manchester Institute of Biotechnology, University of Manchester, Manchester M13 9PL, U.K
- MICALIS Institute, INRA, 78352 Jouy en Jossas, France
| |
Collapse
|
13
|
García-Guevara F, Avelar M, Ayala M, Segovia L. Computational Tools Applied to Enzyme Design − a review. ACTA ACUST UNITED AC 2016. [DOI: 10.1515/boca-2015-0009] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
AbstractThe protein design toolbox has been greatly improved by the addition of enzyme computational simulations. Not only do they warrant a more ambitious and thorough exploration of sequence space, but a much higher number of variants and protein-ligand systems can be analyzed in silico compared to experimental engineering methods. Modern computational tools are being used to redesign and also for de novo generation of enzymes. These approaches are contingent on a deep understanding of the reaction mechanism and the enzyme’s three-dimensional structure coordinates, but the wealth of information produced by these analyses leads to greatly improved or even totally new types of catalysis.
Collapse
|
14
|
Brunk E, Rothlisberger U. Mixed Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of Biological Systems in Ground and Electronically Excited States. Chem Rev 2015; 115:6217-63. [PMID: 25880693 DOI: 10.1021/cr500628b] [Citation(s) in RCA: 327] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Elizabeth Brunk
- †Laboratory of Computational Chemistry and Biochemistry, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,‡Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, California 94618, United States
| | - Ursula Rothlisberger
- †Laboratory of Computational Chemistry and Biochemistry, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,§National Competence Center of Research (NCCR) MARVEL-Materials' Revolution: Computational Design and Discovery of Novel Materials, 1015 Lausanne, Switzerland
| |
Collapse
|
15
|
Alderson RG, Barker D, Mitchell JBO. One origin for metallo-β-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees. J Mol Evol 2014; 79:117-29. [PMID: 25185655 PMCID: PMC4185109 DOI: 10.1007/s00239-014-9639-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 08/11/2014] [Indexed: 01/04/2023]
Abstract
Bacteria use metallo-β-lactamase enzymes to hydrolyse lactam rings found in many antibiotics, rendering them ineffective. Metallo-β-lactamase activity is thought to be polyphyletic, having arisen on more than one occasion within a single functionally diverse homologous superfamily. Since discovery of multiple origins of enzymatic activity conferring antibiotic resistance has broad implications for the continued clinical use of antibiotics, we test the hypothesis of polyphyly further; if lactamase function has arisen twice independently, the most recent common ancestor (MRCA) is not expected to possess lactam-hydrolysing activity. Two major problems present themselves. Firstly, even with a perfectly known phylogeny, ancestral sequence reconstruction is error prone. Secondly, the phylogeny is not known, and in fact reconstructing a single, unambiguous phylogeny for the superfamily has proven impossible. To obtain a more statistical view of the strength of evidence for or against MRCA lactamase function, we reconstructed a sample of 98 MRCAs of the metallo-β-lactamases, each based on a different tree in a bootstrap sample of reconstructed phylogenies. InterPro sequence signatures and homology modelling were then used to assess our sample of MRCAs for lactamase functionality. Only 5 % of these models conform to our criteria for metallo-β-lactamase functionality, suggesting that the ancestor was unlikely to have been a metallo-β-lactamase. On the other hand, given that ancestral proteins may have had metallo-β-lactamase functionality with variation in sequence and structural properties compared with extant enzymes, our criteria are conservative, estimating a lower bound of evidence for metallo-β-lactamase functionality but not an upper bound.
Collapse
Affiliation(s)
- Rosanna G. Alderson
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, KY16 9ST Scotland, UK
| | - Daniel Barker
- Sir Harold Mitchell Building, School of Biology, University of St Andrews, St Andrews, KY16 9TH Scotland, UK
| | - John B. O. Mitchell
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, KY16 9ST Scotland, UK
| |
Collapse
|
16
|
Duardo-Sánchez A, Munteanu CR, Riera-Fernández P, López-Díaz A, Pazos A, González-Díaz H. Modeling Complex Metabolic Reactions, Ecological Systems, and Financial and Legal Networks with MIANN Models Based on Markov-Wiener Node Descriptors. J Chem Inf Model 2013; 54:16-29. [DOI: 10.1021/ci400280n] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
- Aliuska Duardo-Sánchez
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
- Department of Special Public Law, Financial
and Tributary Law Area, Faculty of Law, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Cristian R. Munteanu
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Pablo Riera-Fernández
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Antonio López-Díaz
- Department of Special Public Law, Financial
and Tributary Law Area, Faculty of Law, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Alejandro Pazos
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Humberto González-Díaz
- Department of Organic Chemistry II, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Bizkaia, Spain
- IKERBASQUE, Basque
Foundation for Science, 48011, Bilbao, Biscay, Spain
| |
Collapse
|