1
|
Blue-Lahom TC, Jones SK, Davis KM. Bioinformatic and biochemical analysis uncovers novel activity in the 2-ER subfamily of OYEs. RSC Chem Biol 2025:d4cb00289j. [PMID: 39867842 PMCID: PMC11759058 DOI: 10.1039/d4cb00289j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2024] [Accepted: 01/18/2025] [Indexed: 01/28/2025] Open
Abstract
Members of the old yellow enzyme (OYE) family utilize a flavin mononucleotide cofactor to catalyze the asymmetric reduction of activated alkenes. The 2-enoate reductase (2-ER) subfamily are of particular industrial relevance as they can reduce α/β alkenes near electron-withdrawing groups. While the broader OYE family is being extensively explored for biocatalytic applications, oxygen sensitivity and poor expression yields associated with the presence of an Fe/S cluster in 2-ERs have hampered their characterization. Herein, we explore the use of pseudo-anaerobic preparation as a route to more widespread study of these enzymes and apply bioinformatics approaches to identify a subset of 2-ERs containing unusual mutations in both a key catalytic residue and the Fe/S cluster-binding motif. Biochemical analysis of a representative member from Burkholderia insecticola (OYEBi) reveals novel N-methyl-proline demethylation activity, which we hypothesize may play a role in osmotic stress regulation based on genomic neighborhood analysis.
Collapse
Affiliation(s)
| | - Stacey K Jones
- Department of Chemistry, Emory University Atlanta GA 30322 USA
| | | |
Collapse
|
2
|
Allert MJ, Kumar S, Wang Y, Beese LS, Hellinga HW. Accurate Identification of Periplasmic Urea-binding Proteins by Structure- and Genome Context-assisted Functional Analysis. J Mol Biol 2024; 436:168780. [PMID: 39241982 DOI: 10.1016/j.jmb.2024.168780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 08/29/2024] [Accepted: 08/31/2024] [Indexed: 09/09/2024]
Abstract
ABC transporters are ancient and ubiquitous nutrient transport systems in bacteria and play a central role in defining lifestyles. Periplasmic solute-binding proteins (SBPs) are components that deliver ligands to their translocation machinery. SBPs have diversified to bind a wide range of ligands with high specificity and affinity. However, accurate assignment of cognate ligands remains a challenging problem in SBPs. Urea metabolism plays an important role in the nitrogen cycle; anthropogenic sources account for more than half of global nitrogen fertilizer. We report identification of urea-binding proteins within a large SBP sequence family that encodes diverse functions. By combining genetic linkage between SBPs, ABC transporter components, enzymes or transcription factors, we accurately identified cognate ligands, as we verified experimentally by biophysical characterization of ligand binding and crystallographic determination of the urea complex of a thermostable urea-binding homolog. Using three-dimensional structure information, these functional assignments were extrapolated to other members in the sequence family lacking genetic linkage information, which revealed that only a fraction bind urea. Using the same combined approaches, we also inferred that other family members bind various short-chain amides, aliphatic amino acids (leucine, isoleucine, valine), γ-aminobutyrate, and as yet unknown ligands. Comparative structural analysis revealed structural adaptations that encode diversification in these SBPs. Systematic assignment of ligands to SBP sequence families is key to understanding bacterial lifestyles, and also provides a rich source of biosensors for clinical and environmental analysis, such as the thermostable urea-binding protein identified here.
Collapse
Affiliation(s)
- Malin J Allert
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA.
| | - Shivesh Kumar
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA; Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, MO 63110, USA.
| | - You Wang
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA.
| | - Lorena S Beese
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA.
| | - Homme W Hellinga
- Department of Biochemistry, Duke University Medical Center, Durham, NC 27710, USA.
| |
Collapse
|
3
|
Price MN, Arkin AP. Interactive tools for functional annotation of bacterial genomes. Database (Oxford) 2024; 2024:baae089. [PMID: 39241109 PMCID: PMC11378808 DOI: 10.1093/database/baae089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 07/29/2024] [Accepted: 08/09/2024] [Indexed: 09/08/2024]
Abstract
Automated annotations of protein functions are error-prone because of our lack of knowledge of protein functions. For example, it is often impossible to predict the correct substrate for an enzyme or a transporter. Furthermore, much of the knowledge that we do have about the functions of proteins is missing from the underlying databases. We discuss how to use interactive tools to quickly find different kinds of information relevant to a protein's function. Many of these tools are available via PaperBLAST (http://papers.genomics.lbl.gov). Combining these tools often allows us to infer a protein's function. Ideally, accurate annotations would allow us to predict a bacterium's capabilities from its genome sequence, but in practice, this remains challenging. We describe interactive tools that infer potential capabilities from a genome sequence or that search a genome to find proteins that might perform a specific function of interest. Database URL: http://papers.genomics.lbl.gov.
Collapse
Affiliation(s)
- Morgan N Price
- Environmental Genomics & Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA 94720, United States
| | - Adam P Arkin
- Environmental Genomics & Systems Biology, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA 94720, United States
| |
Collapse
|
4
|
Wakabayashi T, Oide M, Kato T, Nakasako M. Coenzyme-binding pathway on glutamate dehydrogenase suggested from multiple-binding sites visualized by cryo-electron microscopy. FEBS J 2023; 290:5514-5535. [PMID: 37682540 DOI: 10.1111/febs.16951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 08/10/2023] [Accepted: 09/05/2023] [Indexed: 09/09/2023]
Abstract
The structure of hexameric glutamate dehydrogenase (GDH) in the presence of the coenzyme nicotinamide adenine dinucleotide phosphate (NADP) was visualized using cryogenic transmission electron microscopy to investigate the ligand-binding pathways to the active site of the enzyme. Each subunit of GDH comprises one hexamer-forming core domain and one nucleotide-binding domain (NAD domain), which spontaneously opens and closes the active-site cleft situated between the two domains. In the presence of NADP, the potential map of GDH hexamer, assuming D3 symmetry, was determined at a resolution of 2.4 Å, but the NAD domain was blurred due to the conformational variety. After focused classification with respect to the NAD domain, the potential maps interpreted as NADP molecules appeared at five different sites in the active-site cleft. The subunits associated with NADP molecules were close to one of the four metastable conformations in the unliganded state. Three of the five binding sites suggested a pathway of NADP molecules to approach the active-site cleft for initiating the enzymatic reaction. The other two binding modes may rarely appear in the presence of glutamate, as demonstrated by the reaction kinetics. Based on the visualized structures and the results from the enzymatic kinetics, we discussed the binding modes of NADP to GDH in the absence and presence of glutamate.
Collapse
Grants
- JPMJPR22E2 Japan Science and Technology Agency
- 18J11653 Japan Society for the Promotion of Science
- jp13480214 Japan Society for the Promotion of Science
- jp19204042 Japan Society for the Promotion of Science
- jp21H01050 Japan Society for the Promotion of Science
- jp22244054 Japan Society for the Promotion of Science
- jp26800227 Japan Society for the Promotion of Science
- jp15076210 Ministry of Education, Culture, Sports, Science and Technology
- jp15H01647 Ministry of Education, Culture, Sports, Science and Technology
- jp17H05891 Ministry of Education, Culture, Sports, Science and Technology
- jp20050030 Ministry of Education, Culture, Sports, Science and Technology
- jp22018027 Ministry of Education, Culture, Sports, Science and Technology
- jp23120525 Ministry of Education, Culture, Sports, Science and Technology
- jp25120725 Ministry of Education, Culture, Sports, Science and Technology
- 0436 Japan Agency for Medical Research and Development
Collapse
Affiliation(s)
- Taiki Wakabayashi
- Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
- RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
- RIKEN Cluster for Pioneering Research, Wako, Japan
| | - Mao Oide
- Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
- RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
- RIKEN Cluster for Pioneering Research, Wako, Japan
- PRESTO, Japan Science and Technology Agency, Tokyo, Japan
| | - Takayuki Kato
- Protein Research Institute, Osaka University, Suita, Japan
| | - Masayoshi Nakasako
- Department of Physics, Faculty of Science and Technology, Keio University, Yokohama, Japan
- RIKEN SPring-8 Center, Sayo-gun, Hyogo, Japan
| |
Collapse
|
5
|
Pinotsis N, Krüger A, Tomas N, Chatziefthymiou SD, Litz C, Mortensen SA, Daffé M, Marrakchi H, Antranikian G, Wilmanns M. Discovery of a non-canonical prototype long-chain monoacylglycerol lipase through a structure-based endogenous reaction intermediate complex. Nat Commun 2023; 14:7649. [PMID: 38012138 PMCID: PMC10682391 DOI: 10.1038/s41467-023-43354-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 11/07/2023] [Indexed: 11/29/2023] Open
Abstract
The identification and characterization of enzyme function is largely lacking behind the rapidly increasing availability of large numbers of sequences and associated high-resolution structures. This is often hampered by lack of knowledge on in vivo relevant substrates. Here, we present a case study of a high-resolution structure of an unusual orphan lipase in complex with an endogenous C18 monoacylglycerol ester reaction intermediate from the expression host, which is insoluble under aqueous conditions and thus not accessible for studies in solution. The data allowed its functional characterization as a prototypic long-chain monoacylglycerol lipase, which uses a minimal lid domain to position the substrate through a hydrophobic tunnel directly to the enzyme's active site. Knowledge about the molecular details of the substrate binding site allowed us to modulate the enzymatic activity by adjusting protein/substrate interactions, demonstrating the potential of our findings for future biotechnology applications.
Collapse
Affiliation(s)
- Nikos Pinotsis
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607, Hamburg, Germany
- Department of Chemistry, National and Kapodistrian University of Athens, Zografou, Greece
| | - Anna Krüger
- Hamburg University of Technology, Kasernenstrasse 12, 21073, Hamburg, Germany
| | - Nicolas Tomas
- Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, Université Toulouse III-Paul Sabatier, Toulouse, France
| | | | - Claudia Litz
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607, Hamburg, Germany
| | - Simon Arnold Mortensen
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607, Hamburg, Germany
| | - Mamadou Daffé
- Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, Université Toulouse III-Paul Sabatier, Toulouse, France
| | - Hedia Marrakchi
- Institut de Pharmacologie et de Biologie Structurale, Université de Toulouse, CNRS, Université Toulouse III-Paul Sabatier, Toulouse, France
| | - Garabed Antranikian
- Hamburg University of Technology, Kasernenstrasse 12, 21073, Hamburg, Germany
| | - Matthias Wilmanns
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607, Hamburg, Germany.
- University Hamburg Clinical Center Hamburg-Eppendorf, Martinistrasse 52, 20251, Hamburg, Germany.
| |
Collapse
|
6
|
Mathy CJP, Kortemme T. Emerging maps of allosteric regulation in cellular networks. Curr Opin Struct Biol 2023; 80:102602. [PMID: 37150039 PMCID: PMC10960510 DOI: 10.1016/j.sbi.2023.102602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/24/2023] [Accepted: 04/04/2023] [Indexed: 05/09/2023]
Abstract
Allosteric regulation is classically defined as action at a distance, where a perturbation outside of a protein active site affects function. While this definition has motivated many studies of allosteric mechanisms at the level of protein structure, translating these insights to the allosteric regulation of entire cellular processes - and their crosstalk - has received less attention, despite the broad importance of allostery for cellular regulation foreseen by Jacob and Monod. Here, we revisit an evolutionary model for the widespread emergence of allosteric regulation in colocalized proteins, describe supporting evidence, and discuss emerging advances in mapping allostery in cellular networks that link precise and often allosteric perturbations at the molecular level to functional changes at the pathway and systems levels.
Collapse
Affiliation(s)
- Christopher J P Mathy
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, CA, 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, 94158, USA.
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, CA, 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, 94158, USA; Chan Zuckerberg Biohub, San Francisco, CA, 94158, USA.
| |
Collapse
|
7
|
Yu Y, Wang Z, Wang L, Tian S, Hou T, Sun H. Predicting the mutation effects of protein–ligand interactions via end-point binding free energy calculations: strategies and analyses. J Cheminform 2022; 14:56. [PMID: 35987841 PMCID: PMC9392442 DOI: 10.1186/s13321-022-00639-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 08/08/2022] [Indexed: 12/04/2022] Open
Abstract
Protein mutations occur frequently in biological systems, which may impact, for example, the binding of drugs to their targets through impairing the critical H-bonds, changing the hydrophobic interactions, etc. Thus, accurately predicting the effects of mutations on biological systems is of great interests to various fields. Unfortunately, it is still unavailable to conduct large-scale wet-lab mutation experiments because of the unaffordable experimental time and financial costs. Alternatively, in silico computation can serve as a pioneer to guide the experiments. In fact, numerous pioneering works have been conducted from computationally cheaper machine-learning (ML) methods to the more expensive alchemical methods with the purpose to accurately predict the mutation effects. However, these methods usually either cannot result in a physically understandable model (ML-based methods) or work with huge computational resources (alchemical methods). Thus, compromised methods with good physical characteristics and high computational efficiency are expected. Therefore, here, we conducted a comprehensive investigation on the mutation issues of biological systems with the famous end-point binding free energy calculation methods represented by MM/GBSA and MM/PBSA. Different computational strategies considering different length of MD simulations, different value of dielectric constants and whether to incorporate entropy effects to the predicted total binding affinities were investigated to provide a more accurate way for predicting the energetic change upon protein mutations. Overall, our result shows that a relatively long MD simulation (e.g. 100 ns) benefits the prediction accuracy for both MM/GBSA and MM/PBSA (with the best Pearson correlation coefficient between the predicted ∆∆G and the experimental data of ~ 0.44 for a challenging dataset). Further analyses shows that systems involving large perturbations (e.g. multiple mutations and large number of atoms change in the mutation site) are much easier to be accurately predicted since the algorithm works more sensitively to the large change of the systems. Besides, system-specific investigation reveals that conformational adjustment is needed to refine the micro-environment of the manually mutated systems and thus lead one to understand why longer MD simulation is necessary to improve the predicting result. The proposed strategy is expected to be applied in large-scale mutation effects investigation with interpretation.
Collapse
|
8
|
Escudeiro P, Henry CS, Dias RP. Functional characterization of prokaryotic dark matter: the road so far and what lies ahead. CURRENT RESEARCH IN MICROBIAL SCIENCES 2022; 3:100159. [PMID: 36561390 PMCID: PMC9764257 DOI: 10.1016/j.crmicr.2022.100159] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 07/18/2022] [Accepted: 08/05/2022] [Indexed: 12/25/2022] Open
Abstract
Eight-hundred thousand to one trillion prokaryotic species may inhabit our planet. Yet, fewer than two-hundred thousand prokaryotic species have been described. This uncharted fraction of microbial diversity, and its undisclosed coding potential, is known as the "microbial dark matter" (MDM). Next-generation sequencing has allowed to collect a massive amount of genome sequence data, leading to unprecedented advances in the field of genomics. Still, harnessing new functional information from the genomes of uncultured prokaryotes is often limited by standard classification methods. These methods often rely on sequence similarity searches against reference genomes from cultured species. This hinders the discovery of unique genetic elements that are missing from the cultivated realm. It also contributes to the accumulation of prokaryotic gene products of unknown function among public sequence data repositories, highlighting the need for new approaches for sequencing data analysis and classification. Increasing evidence indicates that these proteins of unknown function might be a treasure trove of biotechnological potential. Here, we outline the challenges, opportunities, and the potential hidden within the functional dark matter (FDM) of prokaryotes. We also discuss the pitfalls surrounding molecular and computational approaches currently used to probe these uncharted waters, and discuss future opportunities for research and applications.
Collapse
Affiliation(s)
- Pedro Escudeiro
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal
| | - Christopher S. Henry
- Argonne National Laboratory, Lemont, Illinois, USA
- University of Chicago, Chicago, Illinois, USA
| | - Ricardo P.M. Dias
- BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal
- iXLab - Innovation for National Biological Resilience, Faculdade de Ciências, Universidade de Lisboa, Lisboa 1749-016, Portugal
| |
Collapse
|
9
|
Xu Z, Mahadevan R. Efficient Enumeration of Branched Novel Biochemical Pathways Using a Probabilistic Technique. Ind Eng Chem Res 2022. [DOI: 10.1021/acs.iecr.1c02211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Zhiqing Xu
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario M5S 3G9, Canada
| |
Collapse
|
10
|
Wang P, Schumacher AM, Shiu SH. Computational prediction of plant metabolic pathways. CURRENT OPINION IN PLANT BIOLOGY 2022; 66:102171. [PMID: 35078130 DOI: 10.1016/j.pbi.2021.102171] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 12/07/2021] [Accepted: 12/18/2021] [Indexed: 06/14/2023]
Abstract
Uncovering genes encoding enzymes responsible for the biosynthesis of diverse plant metabolites is essential for metabolic engineering and production of plant metabolite-derived medicine. With the availability of multi-omics data for an ever-increasing number of plant species and the development of computational approaches, the metabolic pathways of many important plant compounds can be predicted, complementing a more traditional genetic and/or biochemical approach. Here, we summarize recent progress in predicting plant metabolic pathways using genome, transcriptome, proteome, interactome, and/or metabolome data, and the utility of integrating these data with machine learning to further improve metabolic pathway predictions.
Collapse
Affiliation(s)
- Peipei Wang
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA.
| | - Ally M Schumacher
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA; Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, 48824, USA.
| |
Collapse
|
11
|
Raveh B, Sun L, White KL, Sanyal T, Tempkin J, Zheng D, Bharath K, Singla J, Wang C, Zhao J, Li A, Graham NA, Kesselman C, Stevens RC, Sali A. Bayesian metamodeling of complex biological systems across varying representations. Proc Natl Acad Sci U S A 2021; 118:e2104559118. [PMID: 34453000 PMCID: PMC8536362 DOI: 10.1073/pnas.2104559118] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Comprehensive modeling of a whole cell requires an integration of vast amounts of information on various aspects of the cell and its parts. To divide and conquer this task, we introduce Bayesian metamodeling, a general approach to modeling complex systems by integrating a collection of heterogeneous input models. Each input model can in principle be based on any type of data and can describe a different aspect of the modeled system using any mathematical representation, scale, and level of granularity. These input models are 1) converted to a standardized statistical representation relying on probabilistic graphical models, 2) coupled by modeling their mutual relations with the physical world, and 3) finally harmonized with respect to each other. To illustrate Bayesian metamodeling, we provide a proof-of-principle metamodel of glucose-stimulated insulin secretion by human pancreatic β-cells. The input models include a coarse-grained spatiotemporal simulation of insulin vesicle trafficking, docking, and exocytosis; a molecular network model of glucose-stimulated insulin secretion signaling; a network model of insulin metabolism; a structural model of glucagon-like peptide-1 receptor activation; a linear model of a pancreatic cell population; and ordinary differential equations for systemic postprandial insulin response. Metamodeling benefits from decentralized computing, while often producing a more accurate, precise, and complete model that contextualizes input models as well as resolves conflicting information. We anticipate Bayesian metamodeling will facilitate collaborative science by providing a framework for sharing expertise, resources, data, and models, as exemplified by the Pancreatic β-Cell Consortium.
Collapse
Affiliation(s)
- Barak Raveh
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem 9190416, Israel
| | - Liping Sun
- iHuman Institute, ShanghaiTech University, Shanghai 201210, China
| | - Kate L White
- Department of Biological Sciences, Bridge Institute, University of Southern California, Los Angeles, CA 90089
| | - Tanmoy Sanyal
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
| | - Jeremy Tempkin
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
| | - Dongqing Zheng
- Mork Family Department of Chemical Engineering and Materials Science, Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
| | - Kala Bharath
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
| | - Jitin Singla
- Department of Biological Sciences, Bridge Institute, University of Southern California, Los Angeles, CA 90089
- Epstein Department of Industrial and Systems Engineering, The Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
- Information Science Institute, The Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
| | - Chenxi Wang
- iHuman Institute, ShanghaiTech University, Shanghai 201210, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Jihui Zhao
- iHuman Institute, ShanghaiTech University, Shanghai 201210, China
| | - Angdi Li
- iHuman Institute, ShanghaiTech University, Shanghai 201210, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Nicholas A Graham
- Mork Family Department of Chemical Engineering and Materials Science, Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
| | - Carl Kesselman
- Epstein Department of Industrial and Systems Engineering, The Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
- Information Science Institute, The Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089
| | - Raymond C Stevens
- iHuman Institute, ShanghaiTech University, Shanghai 201210, China
- Department of Biological Sciences, Bridge Institute, University of Southern California, Los Angeles, CA 90089
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158;
- Quantitative Biosciences Institute, University of California, San Francisco, CA 94158
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| |
Collapse
|
12
|
Discovery and mining of enzymes from the human gut microbiome. Trends Biotechnol 2021; 40:240-254. [PMID: 34304905 DOI: 10.1016/j.tibtech.2021.06.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 06/24/2021] [Accepted: 06/25/2021] [Indexed: 12/19/2022]
Abstract
Advances in technological and bioinformatics approaches have led to the generation of a plethora of human gut metagenomic datasets. Metabolomics has also provided substantial data regarding the small metabolites produced and modified by the microbiota. Comparatively, the microbial enzymes mediating the transformation of metabolites have not been intensively investigated. Here, we discuss the recent efforts and technologies used for discovering and mining enzymes from the human gut microbiota. The wealth of knowledge on metabolites, reactions, genome sequences, and structures of proteins, may drive the development of strategies for enzyme mining. Ongoing efforts to annotate gut microbiota enzymes will explain catalytic mechanisms that may guide the clinical applications of the gut microbiome for diagnostic and therapeutic purposes.
Collapse
|
13
|
Fenner K, Elsner M, Lueders T, McLachlan MS, Wackett LP, Zimmermann M, Drewes JE. Methodological Advances to Study Contaminant Biotransformation: New Prospects for Understanding and Reducing Environmental Persistence? ACS ES&T WATER 2021; 1:1541-1554. [PMID: 34278380 PMCID: PMC8276273 DOI: 10.1021/acsestwater.1c00025] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 06/11/2021] [Accepted: 06/11/2021] [Indexed: 05/14/2023]
Abstract
Complex microbial communities in environmental systems play a key role in the detoxification of chemical contaminants by transforming them into less active metabolites or by complete mineralization. Biotransformation, i.e., transformation by microbes, is well understood for a number of priority pollutants, but a similar level of understanding is lacking for many emerging contaminants encountered at low concentrations and in complex mixtures across natural and engineered systems. Any advanced approaches aiming to reduce environmental exposure to such contaminants (e.g., novel engineered biological water treatment systems, design of readily degradable chemicals, or improved regulatory assessment strategies to determine contaminant persistence a priori) will depend on understanding the causal links among contaminant removal, the key driving agents of biotransformation at low concentrations (i.e., relevant microbes and their metabolic activities), and how their presence and activity depend on environmental conditions. In this Perspective, we present the current understanding and recent methodological advances that can help to identify such links, even in complex environmental microbiomes and for contaminants present at low concentrations in complex chemical mixtures. We discuss the ensuing insights into contaminant biotransformation across varying environments and conditions and ask how much closer we have come to designing improved approaches to reducing environmental exposure to contaminants.
Collapse
Affiliation(s)
- Kathrin Fenner
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600 Dübendorf, Switzerland
- Institute of Biogeochemistry and Pollutant Dynamics, ETH Zürich, 8092 Zürich, Switzerland
- Department of Chemistry, University of Zürich, 8057 Zürich, Switzerland
| | - Martin Elsner
- Chair of Analytical Chemistry and Water Chemistry, Technical University of Munich, 85748 Garching, Germany
| | - Tillmann Lueders
- Chair of Ecological Microbiology, Bayreuth Center of Ecology and Environmental Research (BayCEER), University of Bayreuth, 95448 Bayreuth, Germany
| | - Michael S McLachlan
- Department of Environmental Science (ACES), Stockholm University, 106 91 Stockholm, Sweden
| | - Lawrence P Wackett
- Biotechnology Institute, University of Minnesota, Saint Paul, Minnesota 55108, United States
| | - Michael Zimmermann
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Jörg E Drewes
- Chair of Urban Water Systems Engineering, Technical University of Munich, 85748 Garching, Germany
| |
Collapse
|
14
|
Sali A. From integrative structural biology to cell biology. J Biol Chem 2021; 296:100743. [PMID: 33957123 PMCID: PMC8203844 DOI: 10.1016/j.jbc.2021.100743] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/09/2021] [Accepted: 04/30/2021] [Indexed: 12/16/2022] Open
Abstract
Integrative modeling is an increasingly important tool in structural biology, providing structures by combining data from varied experimental methods and prior information. As a result, molecular architectures of large, heterogeneous, and dynamic systems, such as the ∼52-MDa Nuclear Pore Complex, can be mapped with useful accuracy, precision, and completeness. Key challenges in improving integrative modeling include expanding model representations, increasing the variety of input data and prior information, quantifying a match between input information and a model in a Bayesian fashion, inventing more efficient structural sampling, as well as developing better model validation, analysis, and visualization. In addition, two community-level challenges in integrative modeling are being addressed under the auspices of the Worldwide Protein Data Bank (wwPDB). First, the impact of integrative structures is maximized by PDB-Development, a prototype wwPDB repository for archiving, validating, visualizing, and disseminating integrative structures. Second, the scope of structural biology is expanded by linking the wwPDB resource for integrative structures with archives of data that have not been generally used for structure determination but are increasingly important for computing integrative structures, such as data from various types of mass spectrometry, spectroscopy, optical microscopy, proteomics, and genetics. To address the largest of modeling problems, a type of integrative modeling called metamodeling is being developed; metamodeling combines different types of input models as opposed to different types of data to compute an output model. Collectively, these developments will facilitate the structural biology mindset in cell biology and underpin spatiotemporal mapping of the entire cell.
Collapse
Affiliation(s)
- Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, the Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California, USA.
| |
Collapse
|
15
|
Stack TMM, Gerlt JA. Discovery of novel pathways for carbohydrate metabolism. Curr Opin Chem Biol 2020; 61:63-70. [PMID: 33197748 DOI: 10.1016/j.cbpa.2020.09.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 09/14/2020] [Accepted: 09/16/2020] [Indexed: 01/09/2023]
Abstract
Closing the gap between the increasing availability of complete genome sequences and the discovery of novel enzymes in novel metabolic pathways is a significant challenge. Here, we review recent examples of assignment of in vitro enzymatic activities and in vivo metabolic functions to uncharacterized proteins, with a focus on enzymes and metabolic pathways involved in the catabolism and biosynthesis of monosaccharides and polysaccharides. The most effective approaches are based on analyses of sequence-function space in protein families that provide clues for the predictions of the functions of the uncharacterized enzymes. As summarized in this Opinion, this approach allows the discovery of the catabolism of new molecules, new pathways for common molecules, and new enzymatic chemistries.
Collapse
Affiliation(s)
- Tyler M M Stack
- Carl. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL 61801, United States
| | - John A Gerlt
- Carl. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL 61801, United States; Departments of Biochemistry and Chemistry, University of Illinois, Urbana, IL 61801, United States.
| |
Collapse
|
16
|
Mast FD, Navare AT, van der Sloot AM, Coulombe-Huntington J, Rout MP, Baliga NS, Kaushansky A, Chait BT, Aderem A, Rice CM, Sali A, Tyers M, Aitchison JD. Crippling life support for SARS-CoV-2 and other viruses through synthetic lethality. J Cell Biol 2020; 219:e202006159. [PMID: 32785687 PMCID: PMC7659715 DOI: 10.1083/jcb.202006159] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 07/28/2020] [Accepted: 07/28/2020] [Indexed: 02/07/2023] Open
Abstract
With the rapid global spread of SARS-CoV-2, we have become acutely aware of the inadequacies of our ability to respond to viral epidemics. Although disrupting the viral life cycle is critical for limiting viral spread and disease, it has proven challenging to develop targeted and selective therapeutics. Synthetic lethality offers a promising but largely unexploited strategy against infectious viral disease; as viruses infect cells, they abnormally alter the cell state, unwittingly exposing new vulnerabilities in the infected cell. Therefore, we propose that effective therapies can be developed to selectively target the virally reconfigured host cell networks that accompany altered cellular states to cripple the host cell that has been converted into a virus factory, thus disrupting the viral life cycle.
Collapse
Affiliation(s)
- Fred D. Mast
- Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA
| | - Arti T. Navare
- Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA
| | - Almer M. van der Sloot
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, Canada
| | | | - Michael P. Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY
| | | | - Alexis Kaushansky
- Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA
- Department of Pediatrics, University of Washington, Seattle, WA
| | - Brian T. Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY
| | - Alan Aderem
- Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA
- Department of Pediatrics, University of Washington, Seattle, WA
| | - Charles M. Rice
- Laboratory of Virology and Infectious Disease, The Rockefeller University, New York, NY
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California, San Francisco, San Francisco, CA
| | - Mike Tyers
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, Canada
| | - John D. Aitchison
- Center for Global Infectious Disease Research, Seattle Children’s Research Institute, Seattle, WA
- Department of Pediatrics, University of Washington, Seattle, WA
- Department of Biochemistry, University of Washington, Seattle, WA
| |
Collapse
|
17
|
Kumari A, Bano N, Chaudhary DR, Jha B. Draft genome sequence of plastic degrading Bacillus sp. AIIW2 isolated from the Arabian ocean. J Basic Microbiol 2020; 61:37-44. [PMID: 33006156 DOI: 10.1002/jobm.202000416] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 08/23/2020] [Accepted: 09/15/2020] [Indexed: 11/09/2022]
Abstract
The endemic spread of plastic in the environment requires urgent need of a sustainable approach. Marine microbes found to have vast bioactivity and play a central role in biogeochemical cycling in the ocean; however, very few of them had been explored for biochemical cycling or plastic degradation. In the present study, we report the draft genome sequence of marine Bacillus sp. AIIW2 which was found to utilize plastic as a carbon source. The Bacillus sonorensis SRCM101395 was used as a reference genome for mapping the reads. The genome size of strain AIIW2 was approximately 4.4 Mb and composed of 4737 coding sequences with 45.7% G + C contents. The whole genome comparison of strain AIIW2 with three closest Bacillus strains showed strain specificity, the 16S ribosomal RNA sequence shows 99.93% similarity with Bacillus paralicheniformis KJ-16T (KY694465). This genome data would provide the genetic basis in developing plastic bioremediation approaches and discover the enzymes pertinent in the biodegradation processes.
Collapse
Affiliation(s)
- Alka Kumari
- Division of Plant Omics, CSIR-Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
| | - Nasreen Bano
- Department of Molecular Biology and Biotechnology, CSIR-National Botanical Research Institute, Lucknow, Uttar Pradesh, India.,CSIR-Academy of Scientific and Innovative Research, Ghaziabad, Uttar Pradesh, India
| | - Doongar R Chaudhary
- Division of Plant Omics, CSIR-Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India.,CSIR-Academy of Scientific and Innovative Research, Ghaziabad, Uttar Pradesh, India
| | - Bhavanath Jha
- Division of Plant Omics, CSIR-Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
| |
Collapse
|
18
|
Garcia DC, Cheng X, Land ML, Standaert RF, Morrell-Falvey JL, Doktycz MJ. Computationally Guided Discovery and Experimental Validation of Indole-3-acetic Acid Synthesis Pathways. ACS Chem Biol 2019; 14:2867-2875. [PMID: 31693336 DOI: 10.1021/acschembio.9b00725] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Elucidating the interaction networks associated with secondary metabolite production in microorganisms is an ongoing challenge made all the more daunting by the rate at which DNA sequencing technology reveals new genes and potential pathways. Developing the culturing methods, expression conditions, and genetic systems needed for validating pathways in newly discovered microorganisms is often not possible. Therefore, new tools and techniques are needed for defining complex metabolic pathways. Here, we describe an in vitro computationally assisted pathway description approach that employs bioinformatic searches of genome databases, protein structural modeling, and protein-ligand-docking simulations to predict the gene products most likely to be involved in a particular secondary metabolite production pathway. This information is then used to direct in vitro reconstructions of the pathway and subsequent confirmation of pathway activity using crude enzyme preparations. As a test system, we elucidated the pathway for biosynthesis of indole-3-acetic acid (IAA) in the plant-associated microbe Pantoea sp. YR343. This organism is capable of metabolizing tryptophan into the plant phytohormone IAA. BLAST analyses identified a likely three-step pathway involving an amino transferase, an indole pyruvate decarboxylase, and a dehydrogenase. However, multiple candidate enzymes were identified at each step, resulting in a large number of potential pathway reconstructions (32 different enzyme combinations). Our approach shows the effectiveness of crude extracts to rapidly elucidate enzymes leading to functional pathways. Results are compared to affinity purified enzymes for select combinations and found to yield similar relative activities. Further, in vitro testing of the pathway reconstructions revealed the "underground" nature of IAA metabolism in Pantoea sp. YR343 and the various mechanisms used to produce IAA. Importantly, our experiments illustrate the scalable integration of computational tools and cell-free enzymatic reactions to identify and validate metabolic pathways in a broadly applicable manner.
Collapse
Affiliation(s)
- David C. Garcia
- Biological and Nanoscale Systems Group, Biosciences Division Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
- Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Tennessee 37996-4519, United States
| | - Xiaolin Cheng
- College of Pharmacy, The Ohio State University, Columbus, Ohio 43210, United States
| | - Miriam L. Land
- Computational Biology and Bioinformatics Group, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Robert F. Standaert
- Biological and Nanoscale Systems Group, Biosciences Division Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
- Department of Chemistry, East Tennessee State University, Johnson City, Tennessee 37604, United States
| | - Jennifer L. Morrell-Falvey
- Biological and Nanoscale Systems Group, Biosciences Division Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Mitchel J. Doktycz
- Biological and Nanoscale Systems Group, Biosciences Division Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
- Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Tennessee 37996-4519, United States
| |
Collapse
|
19
|
Systems biology based metabolic engineering for non-natural chemicals. Biotechnol Adv 2019; 37:107379. [DOI: 10.1016/j.biotechadv.2019.04.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Revised: 02/23/2019] [Accepted: 04/01/2019] [Indexed: 12/17/2022]
|
20
|
Shmakov SA, Faure G, Makarova KS, Wolf YI, Severinov KV, Koonin EV. Systematic prediction of functionally linked genes in bacterial and archaeal genomes. Nat Protoc 2019; 14:3013-3031. [PMID: 31520072 DOI: 10.1038/s41596-019-0211-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Accepted: 06/13/2019] [Indexed: 11/09/2022]
Abstract
Functionally linked genes in bacterial and archaeal genomes are often organized into operons. However, the composition and architecture of operons are highly variable and frequently differ even among closely related genomes. Therefore, to efficiently extract reliable functional predictions for uncharacterized genes from comparative analyses of the rapidly growing genomic databases, dedicated computational approaches are required. We developed a protocol to systematically and automatically identify genes that are likely to be functionally associated with a 'bait' gene or locus by using relevance metrics. Given a set of bait loci and a genomic database defined by the user, this protocol compares the genomic neighborhoods of the baits to identify genes that are likely to be functionally linked to the baits by calculating the abundance of a given gene within and outside the bait neighborhoods and the distance to the bait. We exemplify the performance of the protocol with three test cases, namely, genes linked to CRISPR-Cas systems using the 'CRISPRicity' metric, genes associated with archaeal proviruses and genes linked to Argonaute genes in halobacteria. The protocol can be run by users with basic computational skills. The computational cost depends on the sizes of the genomic dataset and the list of reference loci and can vary from one CPU-hour to hundreds of hours on a supercomputer.
Collapse
Affiliation(s)
- Sergey A Shmakov
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA.,Skolkovo Institute of Science and Technology, Skolkovo, Russia
| | - Guilhem Faure
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| | - Konstantin V Severinov
- Skolkovo Institute of Science and Technology, Skolkovo, Russia.,Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.,Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA.
| |
Collapse
|
21
|
Vallat B, Webb B, Westbrook J, Sali A, Berman HM. Archiving and disseminating integrative structure models. JOURNAL OF BIOMOLECULAR NMR 2019; 73:385-398. [PMID: 31278630 PMCID: PMC6692293 DOI: 10.1007/s10858-019-00264-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 06/25/2019] [Indexed: 05/04/2023]
Abstract
Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format.
Collapse
Affiliation(s)
- Brinda Vallat
- Institute for Quantitative Biomedicine, Piscataway, USA
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA
| | - John Westbrook
- Institute for Quantitative Biomedicine, Piscataway, USA
- RCSB Protein Data Bank, Piscataway, USA
| | - Andrej Sali
- RCSB Protein Data Bank, Piscataway, USA.
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, CA, 94143, USA.
- Lead Contacts, San Francisco, USA.
| | - Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08854, USA.
- Lead Contacts, Piscataway, USA.
| |
Collapse
|
22
|
Abstract
Bacterial natural products display astounding structural diversity, which, in turn, endows them with a remarkable range of biological activities that are of significant value to modern society. Such structural features are generated by biosynthetic enzymes that construct core scaffolds or perform peripheral modifications, and can thus define natural product families, introduce pharmacophores and permit metabolic diversification. Modern genomics approaches have greatly enhanced our ability to access and characterize natural product pathways via sequence-similarity-based bioinformatics discovery strategies. However, many biosynthetic enzymes catalyse exceptional, unprecedented transformations that continue to defy functional prediction and remain hidden from us in bacterial (meta)genomic sequence data. In this Review, we highlight exciting examples of unusual enzymology that have been uncovered recently in the context of natural product biosynthesis. These suggest that much of the natural product diversity, including entire substance classes, awaits discovery. New approaches to lift the veil on the cryptic chemistries of the natural product universe are also discussed.
Collapse
|
23
|
Discovering a new catabolic pathway of D-ribonate in Mycobacterium smegmatis. Biochem Biophys Res Commun 2018; 505:1107-1111. [DOI: 10.1016/j.bbrc.2018.10.033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 10/05/2018] [Indexed: 11/19/2022]
|
24
|
Seitzer P, Jeanniard A, Ma F, Van Etten JL, Facciotti MT, Dunigan DD. Gene Gangs of the Chloroviruses: Conserved Clusters of Collinear Monocistronic Genes. Viruses 2018; 10:E576. [PMID: 30347809 PMCID: PMC6213493 DOI: 10.3390/v10100576] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 10/12/2018] [Accepted: 10/17/2018] [Indexed: 01/16/2023] Open
Abstract
Chloroviruses (family Phycodnaviridae) are dsDNA viruses found throughout the world's inland waters. The open reading frames in the genomes of 41 sequenced chloroviruses (330 ± 40 kbp each) representing three virus types were analyzed for evidence of evolutionarily conserved local genomic "contexts", the organization of biological information into units of a scale larger than a gene. Despite a general loss of synteny between virus types, we informatically detected a highly conserved genomic context defined by groups of three or more genes that we have termed "gene gangs". Unlike previously described local genomic contexts, the definition of gene gangs requires only that member genes be consistently co-localized and are not constrained by strand, regulatory sites, or intervening sequences (and therefore represent a new type of conserved structural genomic element). An analysis of functional annotations and transcriptomic data suggests that some of the gene gangs may organize genes involved in specific biochemical processes, but that this organization does not involve their coordinated expression.
Collapse
Affiliation(s)
- Phillip Seitzer
- Department of Biomedical Engineering, One Shields Ave, University of California, Davis, CA 95616, USA;
- Genome Center, One Shields Ave, University of California, Davis, CA 95616, USA
- Proteome Software, Portland, OR 97219, USA
| | - Adrien Jeanniard
- Nebraska Center for Virology, Morrison Research Center, University of Nebraska, Lincoln, NE 68583-0900, USA; (F.M.); (J.L.V.E.)
| | - Fangrui Ma
- Nebraska Center for Virology, Morrison Research Center, University of Nebraska, Lincoln, NE 68583-0900, USA; (F.M.); (J.L.V.E.)
| | - James L. Van Etten
- Nebraska Center for Virology, Morrison Research Center, University of Nebraska, Lincoln, NE 68583-0900, USA; (F.M.); (J.L.V.E.)
- Department of Plant Pathology, Plant Science Hall, University of Nebraska, Lincoln, NE 68583-0722, USA
| | - Marc T. Facciotti
- Department of Biomedical Engineering, One Shields Ave, University of California, Davis, CA 95616, USA;
- Genome Center, One Shields Ave, University of California, Davis, CA 95616, USA
| | - David D. Dunigan
- Nebraska Center for Virology, Morrison Research Center, University of Nebraska, Lincoln, NE 68583-0900, USA; (F.M.); (J.L.V.E.)
- Department of Plant Pathology, Plant Science Hall, University of Nebraska, Lincoln, NE 68583-0722, USA
| |
Collapse
|
25
|
de Crécy-Lagard V, Haas D, Hanson AD. Newly-discovered enzymes that function in metabolite damage-control. Curr Opin Chem Biol 2018; 47:101-108. [PMID: 30268903 DOI: 10.1016/j.cbpa.2018.09.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 08/19/2018] [Accepted: 09/11/2018] [Indexed: 01/26/2023]
Abstract
Enzymes of unknown function are estimated to make up around 25% of the sequenced proteome. In the past decade, over 20 conserved families have been shown to function in the metabolism of 'damaged' or abnormal metabolites that are wasteful and often toxic. These newly discovered damage-control enzymes either repair or inactivate the offending metabolites, or pre-empt their formation in the first place. Comparative genomics has been of prime importance in predicting the functions of damage-control enzymes and in guiding the biochemical and genetic tests required to validate these functions.
Collapse
Affiliation(s)
- Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, USA; Genetics Institute, University of Florida, Gainesville, FL, USA.
| | - Drago Haas
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, USA
| | - Andrew D Hanson
- Horticultural Sciences Department, University of Florida, Gainesville, FL, USA
| |
Collapse
|
26
|
Copp JN, Akiva E, Babbitt PC, Tokuriki N. Revealing Unexplored Sequence-Function Space Using Sequence Similarity Networks. Biochemistry 2018; 57:4651-4662. [PMID: 30052428 DOI: 10.1021/acs.biochem.8b00473] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The rapidly expanding number of protein sequences found in public databases can improve our understanding of how protein functions evolve. However, our current knowledge of protein function likely represents a small fraction of the diverse repertoire that exists in nature. Integrative computational methods can facilitate the discovery of new protein functions and enzymatic reactions through the observation and investigation of the complex sequence-structure-function relationships within protein superfamilies. Here, we highlight the use of sequence similarity networks (SSNs) to identify previously unexplored sequence and function space. We exemplify this approach using the nitroreductase (NTR) superfamily. We demonstrate that SSN investigations can provide a rapid and effective means to classify groups of proteins, therefore exposing experimentally unexplored sequences that may exhibit novel functionality. Integration of such approaches with systematic experimental characterization will expand our understanding of the functional diversity of enzymes and their associated physiological roles.
Collapse
Affiliation(s)
- Janine N Copp
- Michael Smith Laboratories , University of British Columbia , 2185 East Mall , Vancouver , British Columbia V6T 1Z4 , Canada
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences , University of California , San Francisco , California 94158 , United States.,Quantitative Biosciences Institute , University of California , San Francisco , California 94143 , United States
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences , University of California , San Francisco , California 94158 , United States.,Quantitative Biosciences Institute , University of California , San Francisco , California 94143 , United States
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories , University of British Columbia , 2185 East Mall , Vancouver , British Columbia V6T 1Z4 , Canada
| |
Collapse
|
27
|
Holliday GL, Akiva E, Meng EC, Brown SD, Calhoun S, Pieper U, Sali A, Booker SJ, Babbitt PC. Atlas of the Radical SAM Superfamily: Divergent Evolution of Function Using a "Plug and Play" Domain. Methods Enzymol 2018; 606:1-71. [PMID: 30097089 DOI: 10.1016/bs.mie.2018.06.004] [Citation(s) in RCA: 96] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The radical SAM superfamily contains over 100,000 homologous enzymes that catalyze a remarkably broad range of reactions required for life, including metabolism, nucleic acid modification, and biogenesis of cofactors. While the highly conserved SAM-binding motif responsible for formation of the key 5'-deoxyadenosyl radical intermediate is a key structural feature that simplifies identification of superfamily members, our understanding of their structure-function relationships is complicated by the modular nature of their structures, which exhibit varied and complex domain architectures. To gain new insight about these relationships, we classified the entire set of sequences into similarity-based subgroups that could be visualized using sequence similarity networks. This superfamily-wide analysis reveals important features that had not previously been appreciated from studies focused on one or a few members. Functional information mapped to the networks indicates which members have been experimentally or structurally characterized, their known reaction types, and their phylogenetic distribution. Despite the biological importance of radical SAM chemistry, the vast majority of superfamily members have never been experimentally characterized in any way, suggesting that many new reactions remain to be discovered. In addition to 20 subgroups with at least one known function, we identified additional subgroups made up entirely of sequences of unknown function. Importantly, our results indicate that even general reaction types fail to track well with our sequence similarity-based subgroupings, raising major challenges for function prediction for currently identified and new members that continue to be discovered. Interactive similarity networks and other data from this analysis are available from the Structure-Function Linkage Database.
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States.
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Elaine C Meng
- Resource for Biocomputing, Visualization, and Informatics, Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA, United States
| | - Shoshana D Brown
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Sara Calhoun
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Graduate Program in Biophysics, University of California, San Francisco, CA, United States
| | - Ursula Pieper
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States
| | - Squire J Booker
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, United States; Department of Chemistry, The Pennsylvania State University, University Park, PA, United States; The Howard Hughes Medical Institute, The Pennsylvania State University, University Park, PA, United States
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States.
| |
Collapse
|
28
|
Carter MS, Zhang X, Huang H, Bouvier JT, Francisco BS, Vetting MW, Al-Obaidi N, Bonanno JB, Ghosh A, Zallot RG, Andersen HM, Almo SC, Gerlt JA. Functional assignment of multiple catabolic pathways for D-apiose. Nat Chem Biol 2018; 14:696-705. [PMID: 29867142 DOI: 10.1038/s41589-018-0067-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 03/29/2018] [Indexed: 11/09/2022]
Abstract
Colocation of the genes encoding ABC, TRAP, and TCT transport systems and catabolic pathways for the transported ligand provides a strategy for discovering novel microbial enzymes and pathways. We screened solute-binding proteins (SBPs) for ABC transport systems and identified three that bind D-apiose, a branched pentose in the cell walls of higher plants. Guided by sequence similarity networks (SSNs) and genome neighborhood networks (GNNs), the identities of the SBPs enabled the discovery of four catabolic pathways for D-apiose with eleven previously unknown reactions. The new enzymes include D-apionate oxidoisomerase, which catalyzes hydroxymethyl group migration, as well as 3-oxo-isoapionate-4-phosphate decarboxylase and 3-oxo-isoapionate-4-phosphate transcarboxylase/hydrolase, which are RuBisCO-like proteins (RLPs). The web tools for generating SSNs and GNNs are publicly accessible ( http://efi.igb.illinois.edu/efi-est/ ), so similar 'genomic enzymology' strategies for discovering novel pathways can be used by the community.
Collapse
Affiliation(s)
- Michael S Carter
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Xinshuai Zhang
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Hua Huang
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Jason T Bouvier
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Brian San Francisco
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Matthew W Vetting
- Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Nawar Al-Obaidi
- Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Jeffrey B Bonanno
- Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Agnidipta Ghosh
- Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Rémi G Zallot
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Harvey M Andersen
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Steven C Almo
- Department of Biochemistry, Albert Einstein College of Medicine, Bronx, NY, USA
| | - John A Gerlt
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA. .,Department of Biochemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA. .,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|
29
|
Wohlgemuth R. Horizons of Systems Biocatalysis and Renaissance of Metabolite Synthesis. Biotechnol J 2018; 13:e1700620. [DOI: 10.1002/biot.201700620] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2018] [Revised: 04/26/2018] [Indexed: 12/12/2022]
Affiliation(s)
- Roland Wohlgemuth
- European Federation of Biotechnology; Section on Applied Biocatalysis (ESAB); Theodor-Heuss-Allee 25,Frankfurt am Main 60486 Germany
- Sigma-Aldrich; Member of Merck Group; Industriestrasse 25,Buchs 9470 Switzerland
| |
Collapse
|