1
|
Yu K, Chen L, Tang Y, Ma A, Zhu W, Wang H, Tang X, Li Y, Li J. Enhanced thermostability of nattokinase by rational design of disulfide bond. Microb Cell Fact 2025; 24:51. [PMID: 40033318 PMCID: PMC11877946 DOI: 10.1186/s12934-025-02681-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Accepted: 02/20/2025] [Indexed: 03/05/2025] Open
Abstract
Nattokinase, the thrombolytically active substance in the health food natto, nevertheless, its lower thermostability restricts its use in food and pharmaceutical applications. In this study, two heat-resistant variants of nattokinase, designated 50-109 (M1) and 15-271 (M2), were successfully obtained by introducing a disulfide bonding strategy. Their half-lives at 55℃ were found to be 2.50-fold and 5.17-fold higher, respectively, than that of the wild type. Furthermore, the specific enzyme activities of the variants, M1 and M2, were also increased by 2.37 and 1.66-fold, respectively. Meanwhile, the combination of two mutants increased the thermostability of nattokinase by 8.0-fold. Bioinformatics analyses indicated that the enhanced thermostability of the M1 and M2 variants was due to the increased rigidity and structural contraction of the overall structure. Finally, the fermentation process of mutant M1 was optimized to increase the expression of nattokinase. Study provides substantial molecular and theoretical support for the industrial production and application of nattokinase.
Collapse
Affiliation(s)
- Kongfang Yu
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
| | - Liangqi Chen
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, 830017, China
| | - Yaolei Tang
- The Third People's Hospital of Xinjiang Uygur Autonomous Region, Urumqi, 830000, China
| | - Aixia Ma
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, 830017, China
| | - Wenhui Zhu
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
| | - Hong Wang
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
| | - Xiyu Tang
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, 830017, China
| | - Yuan Li
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China.
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, 830017, China.
| | - Jinyao Li
- Institute of Materia Medica, College of Pharmacy, Xinjiang University, Urumqi, 830017, China.
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life Science and Technology, Xinjiang University, Urumqi, 830017, China.
| |
Collapse
|
2
|
Hornung BVH, Terrapon N. An objective criterion to evaluate sequence-similarity networks helps in dividing the protein family sequence space. PLoS Comput Biol 2023; 19:e1010881. [PMID: 37585436 PMCID: PMC10461819 DOI: 10.1371/journal.pcbi.1010881] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 08/28/2023] [Accepted: 01/18/2023] [Indexed: 08/18/2023] Open
Abstract
The deluge of genomic data raises various challenges for computational protein annotation. The definition of superfamilies, based on conserved folds, or of families, showing more recent homology signatures, allow a first categorization of the sequence space. However, for precise functional annotation or the identification of the unexplored parts within a family, a division into subfamilies is essential. As curators of an expert database, the Carbohydrate Active Enzymes database (CAZy), we began, more than 15 years ago, to manually define subfamilies based on phylogeny reconstruction. However, facing the increasing amount of sequence and functional data, we required more scalable and reproducible methods. The recently popularized sequence similarity networks (SSNs), allows to cope with very large families and computation of many subfamily schemes. Still, the choice of the optimal SSN subfamily scheme only relies on expert knowledge so far, without any data-driven guidance from within the network. In this study, we therefore decided to investigate several network properties to determine a criterion which can be used by curators to evaluate the quality of subfamily assignments. The performance of the closeness centrality criterion, a network property to indicate the connectedness within the network, shows high similarity to the decisions of expert curators from eight distinct protein families. Closeness centrality also suggests that in some cases multiple levels of subfamilies could be possible, depending on the granularity of the research question, while it indicates when no subfamily emerged in some family evolution. We finally used closeness centrality to create subfamilies in four families of the CAZy database, providing a finer functional annotation and highlighting subfamilies without biochemically characterized members for potential future discoveries.
Collapse
Affiliation(s)
| | - Nicolas Terrapon
- Aix Marseille Université, CNRS, UMR 7257 AFMB, Marseille, France
- INRAE, USC 1408 AFMB, Marseille, France
| |
Collapse
|
3
|
Maatouk M, Merhej V, Pontarotti P, Ibrahim A, Rolain JM, Bittar F. Metallo-Beta-Lactamase-like Encoding Genes in Candidate Phyla Radiation: Widespread and Highly Divergent Proteins with Potential Multifunctionality. Microorganisms 2023; 11:1933. [PMID: 37630493 PMCID: PMC10459063 DOI: 10.3390/microorganisms11081933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 07/22/2023] [Accepted: 07/27/2023] [Indexed: 08/27/2023] Open
Abstract
The Candidate Phyla Radiation (CPR) was found to harbor a vast repertoire of genes encoding for enzymes with potential antibiotic resistance activity. Among these, as many as 3349 genes were predicted in silico to contain a metallo-beta-lactamase-like (MBL-like) fold. These proteins were subject to an in silico functional characterization by comparing their protein profiles (presence/absence of conserved protein domains) to other MBLs, including 24 already expressed in vitro, along with those of the beta-lactamase database (BLDB) (n = 761). The sequence similarity network (SSN) was then used to predict the functional clusters of CPR MBL-like sequences. Our findings showed that CPR MBL-like sequences were longer and more diverse than bacterial MBL sequences, with a high content of functional domains. Most CPR MBL-like sequences did not show any SSN connectivity with expressed MBLs, indicating the presence of many potential, yet unidentified, functions in CPR. In conclusion, CPR was shown to have many protein functions and a large sequence variability of MBL-like folds, exceeding all known MBLs. Further experimental and evolutionary studies of this superfamily of hydrolyzing enzymes are necessary to illustrate their functional annotation, origin, and expansion for adaptation or specialization within a given niche or compared to a specific substrate.
Collapse
Affiliation(s)
- Mohamad Maatouk
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
| | - Vicky Merhej
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
| | - Pierre Pontarotti
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
- Centre National de la Recherche Scientifique (CNRS-SNC5039), 13009 Marseille, France
| | - Ahmad Ibrahim
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
| | - Jean-Marc Rolain
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
| | - Fadi Bittar
- Microbes, Evolution, Phylogénie et Infection (MEPHI), Institut de Recherche pour le Développement (IRD), Assistance Publique-Hôpitaux de Marseille (AP-HM), Aix-Marseille University, 13005 Marseille, France; (M.M.); (P.P.); (A.I.); (J.-M.R.)
- Institut Hospitalo-Universitaire (IHU) Méditerranée Infection, 13005 Marseille, France
| |
Collapse
|
4
|
Reed A, Ware T, Li H, Fernando Bazan J, Cravatt BF. TMEM164 is an acyltransferase that forms ferroptotic C20:4 ether phospholipids. Nat Chem Biol 2023; 19:378-388. [PMID: 36782012 PMCID: PMC10362496 DOI: 10.1038/s41589-022-01253-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 12/21/2022] [Indexed: 02/15/2023]
Abstract
Ferroptosis is an iron-dependent form of cell death driven by oxidation of polyunsaturated fatty acid (PUFA) phospholipids. Large-scale genetic screens have uncovered a specialized role for PUFA ether phospholipids (ePLs) in promoting ferroptosis. Understanding of the enzymes involved in PUFA-ePL production, however, remains incomplete. Here we show, using a combination of pathway mining of genetic dependency maps, AlphaFold-guided structure predictions and targeted lipidomics, that the uncharacterized transmembrane protein TMEM164-the genetic ablation of which has been shown to protect cells from ferroptosis-is a cysteine active center enzyme that selectively transfers C20:4 acyl chains from phosphatidylcholine to lyso-ePLs to produce PUFA ePLs. Genetic deletion of TMEM164 across a set of ferroptosis-sensitive cancer cell lines caused selective reductions in C20:4 ePLs with minimal effects on C20:4 diacyl PLs, and this lipid profile produced a variable range of protection from ferroptosis, supportive of an important but contextualized role for C20:4 ePLs in this form of cell death.
Collapse
Affiliation(s)
- Alex Reed
- Department of Chemistry, The Scripps Research Institute, San Diego, CA, USA
| | - Timothy Ware
- Department of Chemistry, The Scripps Research Institute, San Diego, CA, USA
| | - Haoxin Li
- Department of Chemistry, The Scripps Research Institute, San Diego, CA, USA
| | - J Fernando Bazan
- ħ Bioconsulting, LLC, Stillwater, MN, USA.
- Unit for Structural Biology, VIB-UGent Center for Inflammation Research, Ghent, Belgium.
| | - Benjamin F Cravatt
- Department of Chemistry, The Scripps Research Institute, San Diego, CA, USA.
| |
Collapse
|
5
|
Lin HY, Chen X, Dong J, Yang JF, Xiao H, Ye Y, Li LH, Zhan CG, Yang WC, Yang GF. Rational Redesign of Enzyme via the Combination of Quantum Mechanics/Molecular Mechanics, Molecular Dynamics, and Structural Biology Study. J Am Chem Soc 2021; 143:15674-15687. [PMID: 34542283 DOI: 10.1021/jacs.1c06227] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Increasing demands for efficient and versatile chemical reactions have prompted innovations in enzyme engineering. A major challenge in engineering α-ketoglutarate-dependent oxygenases is to develop a rational strategy which can be widely used for directly evolving the desired mutant to generate new products. Herein, we report a strategy for rational redesign of a model enzyme, 4-hydroxyphenylpyruvate dioxygenase (HPPD), based on quantum mechanics/molecular mechanics (QM/MM) calculation and molecular dynamic simulations. This strategy enriched our understanding of the HPPD catalytic reaction pathway and led to the discovery of a series of HPPD mutants producing hydroxyphenylacetate (HPA) as the alternative product other than the native product homogentisate. The predicted HPPD-Fe(IV)═O-HPA intermediate was further confirmed by the crystal structure of Arabidopsis thaliana HPPD/S267W complexed with HPA. These findings not only provide a good understanding of the structure-function relationship of HPPD but also demonstrate a generally applicable platform for the development of biocatalysts.
Collapse
Affiliation(s)
- Hong-Yan Lin
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Xi Chen
- College of Chemistry and Material Science, South-Central University for Nationalities, Wuhan 430074, P.R. China
| | - Jin Dong
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Jing-Fang Yang
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Han Xiao
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Ying Ye
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Lin-Hui Li
- College of Chemistry and Material Science, South-Central University for Nationalities, Wuhan 430074, P.R. China
| | - Chang-Guo Zhan
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky, 789 South Limestone Street, Lexington, Kentucky 40536, United States
| | - Wen-Chao Yang
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| | - Guang-Fu Yang
- Key Laboratory of Pesticide & Chemical Biology of Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, College of Chemistry, Central China Normal University, Wuhan 430079, P.R. China
| |
Collapse
|
6
|
Hecht N, Monteil CL, Perrière G, Vishkautzan M, Gur E. Exploring Protein Space: From Hydrolase to Ligase by Substitution. Mol Biol Evol 2021; 38:761-776. [PMID: 32870983 PMCID: PMC7947786 DOI: 10.1093/molbev/msaa215] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The understanding of how proteins evolve to perform novel functions has long been sought by biologists. In this regard, two homologous bacterial enzymes, PafA and Dop, pose an insightful case study, as both rely on similar mechanistic properties, yet catalyze different reactions. PafA conjugates a small protein tag to target proteins, whereas Dop removes the tag by hydrolysis. Given that both enzymes present a similar fold and high sequence similarity, we sought to identify the differences in the amino acid sequence and folding responsible for each distinct activity. We tackled this question using analysis of sequence–function relationships, and identified a set of uniquely conserved residues in each enzyme. Reciprocal mutagenesis of the hydrolase, Dop, completely abolished the native activity, at the same time yielding a catalytically active ligase. Based on the available Dop and PafA crystal structures, this change of activity required a conformational change of a critical loop at the vicinity of the active site. We identified the conserved positions essential for stabilization of the alternative loop conformation, and tracked alternative mutational pathways that lead to a change in activity. Remarkably, all these pathways were combined in the evolution of PafA and Dop, despite their redundant effect on activity. Overall, we identified the residues and structural elements in PafA and Dop responsible for their activity differences. This analysis delineated, in molecular terms, the changes required for the emergence of a new catalytic function from a preexisting one.
Collapse
Affiliation(s)
- Nir Hecht
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Caroline L Monteil
- Laboratoire de Biométrie et Biologie Evolutive, Université Claude Bernard - Lyon 1, Villeurbanne, France
| | - Guy Perrière
- Laboratoire de Biométrie et Biologie Evolutive, Université Claude Bernard - Lyon 1, Villeurbanne, France
| | - Marina Vishkautzan
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel.,The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - Eyal Gur
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel.,The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| |
Collapse
|
7
|
Das S, Scholes HM, Sen N, Orengo C. CATH functional families predict functional sites in proteins. Bioinformatics 2021; 37:1099-1106. [PMID: 33135053 PMCID: PMC8150129 DOI: 10.1093/bioinformatics/btaa937] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 09/30/2020] [Accepted: 10/27/2020] [Indexed: 01/12/2023] Open
Abstract
MOTIVATION Identification of functional sites in proteins is essential for functional characterization, variant interpretation and drug design. Several methods are available for predicting either a generic functional site, or specific types of functional site. Here, we present FunSite, a machine learning predictor that identifies catalytic, ligand-binding and protein-protein interaction functional sites using features derived from protein sequence and structure, and evolutionary data from CATH functional families (FunFams). RESULTS FunSite's prediction performance was rigorously benchmarked using cross-validation and a holdout dataset. FunSite outperformed other publicly available functional site prediction methods. We show that conserved residues in FunFams are enriched in functional sites. We found FunSite's performance depends greatly on the quality of functional site annotations and the information content of FunFams in the training data. Finally, we analyze which structural and evolutionary features are most predictive for functional sites. AVAILABILITYAND IMPLEMENTATION https://github.com/UCL/cath-funsite-predictor. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sayoni Das
- PrecisionLife Ltd., Long Hanborough, OX29 8LJ Oxford, UK
| | - Harry M Scholes
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| | - Neeladri Sen
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| | - Christine Orengo
- Institute of Structural and Molecular Biology, University College London, WC1E 6BT, London, UK
| |
Collapse
|
8
|
Rayder TM, Bensalah AT, Li B, Byers JA, Tsung CK. Engineering Second Sphere Interactions in a Host–Guest Multicomponent Catalyst System for the Hydrogenation of Carbon Dioxide to Methanol. J Am Chem Soc 2021; 143:1630-1640. [DOI: 10.1021/jacs.0c08957] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Thomas M. Rayder
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Adam T. Bensalah
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Banruo Li
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Jeffery A. Byers
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| | - Chia-Kuang Tsung
- Department of Chemistry, Boston College, Chestnut Hill, Massachusetts 02467, United States
| |
Collapse
|
9
|
Northover DE, Shank SD, Liberles DA. Characterizing lineage-specific evolution and the processes driving genomic diversification in chordates. BMC Evol Biol 2020; 20:24. [PMID: 32046633 PMCID: PMC7011509 DOI: 10.1186/s12862-020-1585-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 01/16/2020] [Indexed: 11/21/2022] Open
Abstract
Background Understanding the origins of genome content has long been a goal of molecular evolution and comparative genomics. By examining genome evolution through the guise of lineage-specific evolution, it is possible to make inferences about the evolutionary events that have given rise to species-specific diversification. Here we characterize the evolutionary trends found in chordate species using The Adaptive Evolution Database (TAED). TAED is a database of phylogenetically indexed gene families designed to detect episodes of directional or diversifying selection across chordates. Gene families within the database have been assessed for lineage-specific estimates of dN/dS and have been reconciled to the chordate species to identify retained duplicates. Gene families have also been mapped to the functional pathways and amino acid changes which occurred on high dN/dS lineages have been mapped to protein structures. Results An analysis of this exhaustive database has enabled a characterization of the processes of lineage-specific diversification in chordates. A pathway level enrichment analysis of TAED determined that pathways most commonly found to have elevated rates of evolution included those involved in metabolism, immunity, and cell signaling. An analysis of protein fold presence on proteins, after normalizing for frequency in the database, found common folds such as Rossmann folds, Jelly Roll folds, and TIM barrels were overrepresented on proteins most likely to undergo directional selection. A set of gene families which experience increased numbers of duplications within short evolutionary times are associated with pathways involved in metabolism, olfactory reception, and signaling. An analysis of protein secondary structure indicated more relaxed constraint in β-sheets and stronger constraint on alpha Helices, amidst a general preference for substitutions at exposed sites. Lastly a detailed analysis of the ornithine decarboxylase gene family, a key enzyme in the pathway for polyamine synthesis, revealed lineage-specific evolution along the lineage leading to Cetacea through rapid sequence evolution in a duplicate gene with amino acid substitutions causing active site rearrangement. Conclusion Episodes of lineage-specific evolution are frequent throughout chordate species. Both duplication and directional selection have played large roles in the evolution of the phylum. TAED is a powerful tool for facilitating this understanding of lineage-specific evolution.
Collapse
Affiliation(s)
- David E Northover
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Stephen D Shank
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA. .,Department of Molecular Biology, University of Wyoming, Laramie, WY, 82071, USA.
| |
Collapse
|
10
|
Holliday GL, Brown SD, Mischel D, Polacco BJ, Babbitt PC. A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function. Database (Oxford) 2020; 2020:baaa034. [PMID: 32449511 PMCID: PMC7246345 DOI: 10.1093/database/baaa034] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/18/2020] [Accepted: 04/27/2020] [Indexed: 12/12/2022]
Abstract
Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by biochemically characterized enzymes, irrespective of their associated sequences. In contrast, functional classification and assignment for the millions of protein sequences of unknown function now available is largely done in two computational steps, first by similarity-based assignment of newly obtained sequences to homologous groups, followed by transferring to them the known functions of similar biochemically characterized homologs. Due to the fundamental differences in their etiologies and practice, `how' these chemistry- and evolution-centric functional classification systems relate to each other has been difficult to explore on a large scale. To investigate this issue in a new way, we integrated two published ontologies that had previously described each of these classification systems independently. The resulting infrastructure was then used to compare the functional assignments obtained from each classification system for the well-studied and functionally diverse enolase superfamily. Mapping these function assignments to protein structure and reaction similarity networks shows a profound and complex disconnect between the homology- and chemistry-based classification systems. This conclusion mirrors previous observations suggesting that except for closely related sequences, facile annotation transfer from small numbers of characterized enzymes to the huge number uncharacterized homologs to which they are related is problematic. Our extension of these comparisons to large enzyme superfamilies in a computationally intelligent manner provides a foundation for new directions in protein function prediction for the huge proportion of sequences of unknown function represented in major databases. Interactive sequence, reaction, substrate and product similarity networks computed for this work for the enolase and two other superfamilies are freely available for download from the Structure Function Linkage Database Archive (http://sfld.rbvi.ucsf.edu).
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
- Present Address: Medicines Discovery Catapult, Mereside, Alderley Park, Alderley Edge SK10 4TG, UK
| | - Shoshana D Brown
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
| | - David Mischel
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
| | - Benjamin J Polacco
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
- Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, 1700 4th Street, CA 94143, USA
| |
Collapse
|
11
|
Chevrette MG, Gutiérrez-García K, Selem-Mojica N, Aguilar-Martínez C, Yañez-Olvera A, Ramos-Aboites HE, Hoskisson PA, Barona-Gómez F. Evolutionary dynamics of natural product biosynthesis in bacteria. Nat Prod Rep 2019; 37:566-599. [PMID: 31822877 DOI: 10.1039/c9np00048h] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Covering: 2008 up to 2019The forces of biochemical adaptive evolution operate at the level of genes, manifesting in complex phenotypes and the global biodiversity of proteins and metabolites. While evolutionary histories have been deciphered for some other complex traits, the origins of natural product biosynthesis largely remain a mystery. This fundamental knowledge gap is surprising given the many decades of research probing the genetic, chemical, and biophysical mechanisms of bacterial natural product biosynthesis. Recently, evolutionary thinking has begun to permeate this otherwise mechanistically dominated field. Natural products are now sometimes referred to as 'specialized' rather than 'secondary' metabolites, reinforcing the importance of their biological and ecological functions. Here, we review known evolutionary mechanisms underlying the overwhelming chemical diversity of bacterial secondary metabolism, focusing on enzyme promiscuity and the evolution of enzymatic domains that enable metabolic traits. We discuss the mechanisms that drive the assembly of natural product biosynthetic gene clusters and propose formal definitions for 'specialized' and 'secondary' metabolism. We further explore how biosynthetic gene clusters evolve to synthesize related molecular species, and in turn how the biological and ecological roles that emerge from metabolic diversity are acted on by selection. Finally, we reconcile chemical, functional, and genetic data into an evolutionary model, the dynamic chemical matrix evolutionary hypothesis, in which the relationships between chemical distance, biomolecular activity, and relative fitness shape adaptive landscapes.
Collapse
Affiliation(s)
- Marc G Chevrette
- Wisconsin Institute for Discovery, Department of Plant Pathology, University of Wisconsin-Madison, Madison, WI, USA.
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Miller SA, Bandarian V. Analysis of Electrochemical Properties of S-Adenosyl-l-methionine and Implications for Its Role in Radical SAM Enzymes. J Am Chem Soc 2019; 141:11019-11026. [PMID: 31283208 PMCID: PMC7059804 DOI: 10.1021/jacs.9b00933] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
![]()
S-Adenosyl-l-methionine (SAM) is the
central cofactor in the radical SAM enzyme superfamily, responsible
for a vast number of transformations in primary and secondary metabolism.
In nearly all of these reactions, the reductive cleavage of SAM is
proposed to produce a reactive species, 5′-deoxyadenosyl radical,
which initiates catalysis. While the mechanistic details in many cases
are well-understood, the reductive cleavage of SAM remains elusive.
In this manuscript, we have measured the solution peak potential of
SAM to be ∼−1.4 V (v SHE) and show that under controlled
potential conditions, it undergoes irreversible fragmentation to the
5′-deoxyadenosyl radical. While the radical intermediate is
not directly observed, its presence as an initial intermediate is
inferred by the formation of 8,5′-cycloadenosine and by H atom
incorporation into 5′-deoxyadenosine from solvent exchangeable
site. Similarly, 2-aminobutyrate is also observed under electrolysis
conditions. The implications of these results in the context of the
reductive cleavage of SAM by radical SAM enzymes are discussed.
Collapse
Affiliation(s)
- Sven A Miller
- Department of Chemistry , University of Utah , 315 South 1400 East , Salt Lake City , Utah 84112 , United States
| | - Vahe Bandarian
- Department of Chemistry , University of Utah , 315 South 1400 East , Salt Lake City , Utah 84112 , United States
| |
Collapse
|
13
|
Sharir-Ivry A, Xia Y. Nature of Long-Range Evolutionary Constraint in Enzymes: Insights from Comparison to Pseudoenzymes with Similar Structures. Mol Biol Evol 2019; 35:2597-2606. [PMID: 30202983 DOI: 10.1093/molbev/msy177] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Enzymes are known to fine-tune their sequences to optimize catalytic function, yet quantitative evolutionary design principles of enzymes remain elusive on the proteomic scale. Recently, it was found that the catalytic site in enzymes induces long-range evolutionary constraint, where even sites distant to the catalytic site are more conserved than expected. Given that protein-fold usage is generally different between enzymes and nonenzymes, it remains an open question to what extent this long-range evolutionary constraint in enzymes is dictated, either directly or indirectly, by the special three-dimensional structure of the enzyme. To investigate this question, we have compared evolutionary properties of enzymes with those of counterpart pseudoenzymes that share the same protein fold but are catalytically inactive. We found that the long-range evolutionary constraint observed in enzymes is significantly reduced in pseudoenzyme counterparts, despite very high structural similarity (∼1.5 Å RMSD on average). Furthermore, this significant reduction in long-range evolutionary constraint is observed even in pseudoenzyme counterparts which retain the ligand-binding ability of enzymes. Finally, the distance between the site that induces the highest gradient of sequence conservation and the pseudocatalytic site in pseudoenzymes is significantly larger than the corresponding distance in enzymes. Taken together, our results suggest that the long-range evolutionary constraint in enzymes is induced mainly by the presence of the catalytic site rather than by the special three-dimensional structure of the enzyme, and that such long-range evolutionary constraint in enzymes depends mainly on the catalytic function of the active site rather than on the ligand-binding ability of the enzyme.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, QC, Canada
| |
Collapse
|
14
|
Sützl L, Foley G, Gillam EMJ, Bodén M, Haltrich D. The GMC superfamily of oxidoreductases revisited: analysis and evolution of fungal GMC oxidoreductases. BIOTECHNOLOGY FOR BIOFUELS 2019; 12:118. [PMID: 31168323 PMCID: PMC6509819 DOI: 10.1186/s13068-019-1457-0] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 05/02/2019] [Indexed: 05/03/2023]
Abstract
BACKGROUND The glucose-methanol-choline (GMC) superfamily is a large and functionally diverse family of oxidoreductases that share a common structural fold. Fungal members of this superfamily that are characterised and relevant for lignocellulose degradation include aryl-alcohol oxidoreductase, alcohol oxidase, cellobiose dehydrogenase, glucose oxidase, glucose dehydrogenase, pyranose dehydrogenase, and pyranose oxidase, which together form family AA3 of the auxiliary activities in the CAZy database of carbohydrate-active enzymes. Overall, little is known about the extant sequence space of these GMC oxidoreductases and their phylogenetic relations. Although some individual forms are well characterised, it is still unclear how they compare in respect of the complete enzyme class and, therefore, also how generalizable are their characteristics. RESULTS To improve the understanding of the GMC superfamily as a whole, we used sequence similarity networks to cluster large numbers of fungal GMC sequences and annotate them according to functionality. Subsequently, different members of the GMC superfamily were analysed in detail with regard to their sequences and phylogeny. This allowed us to define the currently characterised sequence space and show that complete clades of some enzymes have not been studied in any detail to date. Finally, we interpret our results from an evolutionary perspective, where we could show, for example, that pyranose dehydrogenase evolved from aryl-alcohol oxidoreductase after a change in substrate specificity and that the cytochrome domain of cellobiose dehydrogenase was regularly lost during evolution. CONCLUSIONS This study offers new insights into the sequence variation and phylogenetic relationships of fungal GMC/AA3 sequences. Certain clades of these GMC enzymes identified in our phylogenetic analyses are completely uncharacterised to date, and might include enzyme activities of varying specificities and/or activities that are hitherto unstudied.
Collapse
Affiliation(s)
- Leander Sützl
- Food Biotechnology Laboratory, Department of Food Science and Technology, BOKU-University of Natural Resources and Life Sciences Vienna, Vienna, Austria
- Doctoral Programme BioToP-Biomolecular Technology of Proteins, BOKU-University of Natural Resources and Life Sciences Vienna, Vienna, Austria
| | - Gabriel Foley
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Elizabeth M J Gillam
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Mikael Bodén
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Dietmar Haltrich
- Food Biotechnology Laboratory, Department of Food Science and Technology, BOKU-University of Natural Resources and Life Sciences Vienna, Vienna, Austria
- Doctoral Programme BioToP-Biomolecular Technology of Proteins, BOKU-University of Natural Resources and Life Sciences Vienna, Vienna, Austria
| |
Collapse
|
15
|
Exploring the sequence, function, and evolutionary space of protein superfamilies using sequence similarity networks and phylogenetic reconstructions. Methods Enzymol 2019; 620:315-347. [PMID: 31072492 DOI: 10.1016/bs.mie.2019.03.015] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Integrative computational methods can facilitate the discovery of new protein functions and enzymatic reactions by enabling the observation and investigation of complex sequence-structure-function and evolutionary relationships within protein superfamilies. Here, we highlight the use of sequence similarity networks (SSNs) and phylogenetic reconstructions to map the functional divergence and evolutionary history of protein superfamilies. We exemplify this approach using the nitroreductase (NTR) flavoenzyme superfamily, demonstrating that SSN investigations can provide a rapid and effective means to classify groups of proteins, expose sequence similarity relationships across the global scale of a protein superfamily, and efficiently support detailed phylogenetic analyses. Integration of such approaches with systematic experimental characterization will expand our understanding of the functional diversity of enzymes, their evolution, and their associated physiological roles.
Collapse
|
16
|
Benkoulouche M, Fauré R, Remaud-Siméon M, Moulis C, André I. Harnessing glycoenzyme engineering for synthesis of bioactive oligosaccharides. Interface Focus 2019; 9:20180069. [PMID: 30842872 DOI: 10.1098/rsfs.2018.0069] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/20/2018] [Indexed: 12/13/2022] Open
Abstract
Combined with chemical synthesis, the use of glycoenzyme biocatalysts has shown great synthetic potential over recent decades owing to their remarkable versatility in terms of substrates and regio- and stereoselectivity that allow structurally controlled synthesis of carbohydrates and glycoconjugates. Nonetheless, the lack of appropriate enzymatic tools with requisite properties in the natural diversity has hampered extensive exploration of enzyme-based synthetic routes to access relevant bioactive oligosaccharides, such as cell-surface glycans or prebiotics. With the remarkable progress in enzyme engineering, it has become possible to improve catalytic efficiency and physico-chemical properties of enzymes but also considerably extend the repertoire of accessible catalytic reactions and tailor novel substrate specificities. In this review, we intend to give a brief overview of the advantageous use of engineered glycoenzymes, sometimes in combination with chemical steps, for the synthesis of natural bioactive oligosaccharides or their precursors. The focus will be on examples resulting from the three main classes of glycoenzymes specialized in carbohydrate synthesis: glycosyltransferases, glycoside hydrolases and glycoside phosphorylases.
Collapse
Affiliation(s)
- Mounir Benkoulouche
- Laboratoire d'Ingénierie des Systèmes Biologiques et Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, 135, avenue de Rangueil, 31077 Toulouse cedex 04, France
| | - Régis Fauré
- Laboratoire d'Ingénierie des Systèmes Biologiques et Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, 135, avenue de Rangueil, 31077 Toulouse cedex 04, France
| | - Magali Remaud-Siméon
- Laboratoire d'Ingénierie des Systèmes Biologiques et Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, 135, avenue de Rangueil, 31077 Toulouse cedex 04, France
| | - Claire Moulis
- Laboratoire d'Ingénierie des Systèmes Biologiques et Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, 135, avenue de Rangueil, 31077 Toulouse cedex 04, France
| | - Isabelle André
- Laboratoire d'Ingénierie des Systèmes Biologiques et Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, 135, avenue de Rangueil, 31077 Toulouse cedex 04, France
| |
Collapse
|
17
|
Yunes JM, Babbitt PC. Effusion: prediction of protein function from sequence similarity networks. Bioinformatics 2019; 35:442-451. [PMID: 30084920 PMCID: PMC6361244 DOI: 10.1093/bioinformatics/bty672] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Revised: 07/24/2018] [Accepted: 07/30/2018] [Indexed: 12/26/2022] Open
Abstract
Motivation Critical evaluation of methods for protein function prediction shows that data integration improves the performance of methods that predict protein function, but a basic BLAST-based method is still a top contender. We sought to engineer a method that modernizes the classical approach while avoiding pitfalls common to state-of-the-art methods. Results We present a method for predicting protein function, Effusion, which uses a sequence similarity network to add context for homology transfer, a probabilistic model to account for the uncertainty in labels and function propagation, and the structure of the Gene Ontology (GO) to best utilize sparse input labels and make consistent output predictions. Effusion's model makes it practical to integrate rare experimental data and abundant primary sequence and sequence similarity. We demonstrate Effusion's performance using a critical evaluation method and provide an in-depth analysis. We also dissect the design decisions we used to address challenges for predicting protein function. Finally, we propose directions in which the framework of the method can be modified for additional predictive power. Availability and implementation The source code for an implementation of Effusion is freely available at https://github.com/babbittlab/effusion. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jeffrey M Yunes
- UC Berkeley - UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, USA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA
- Quantitative Biosciences Institute, University of California, San Francisco, CA, USA
| |
Collapse
|
18
|
Sillitoe I, Dawson N, Lewis TE, Das S, Lees JG, Ashford P, Tolulope A, Scholes HM, Senatorov I, Bujan A, Ceballos Rodriguez-Conde F, Dowling B, Thornton J, Orengo CA. CATH: expanding the horizons of structure-based functional annotations for genome sequences. Nucleic Acids Res 2019; 47:D280-D284. [PMID: 30398663 PMCID: PMC6323983 DOI: 10.1093/nar/gky1097] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Revised: 10/16/2018] [Accepted: 11/02/2018] [Indexed: 01/20/2023] Open
Abstract
This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains.
Collapse
Affiliation(s)
- Ian Sillitoe
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Natalie Dawson
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Tony E Lewis
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Sayoni Das
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Jonathan G Lees
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Paul Ashford
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Adeyelu Tolulope
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Harry M Scholes
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Ilya Senatorov
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Andra Bujan
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | | | - Benjamin Dowling
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| | - Janet Thornton
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Christine A Orengo
- Structural and Molecular Biology, University College London WC1E 6BT, UK
| |
Collapse
|
19
|
Kulkarni Y, Kamerlin SCL. Computational physical organic chemistry using the empirical valence bond approach. ADVANCES IN PHYSICAL ORGANIC CHEMISTRY 2019. [DOI: 10.1016/bs.apoc.2019.07.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
20
|
Jan Deniau A, Subileau M, Dubreucq E. Characterization and Reshaping of a Large and Hydrophobic Nucleophile Pocket in Lipases/Acyltransferases. Chembiochem 2018; 19:1839-1844. [DOI: 10.1002/cbic.201800279] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Indexed: 11/06/2022]
Affiliation(s)
- Anne‐Hélène Jan Deniau
- UMR IATEMontpellier SupAgro 2 place Pierre Viala Bâtiment 32 34060 Montpellier Cedex 2 France
| | - Maeva Subileau
- UMR IATEMontpellier SupAgro 2 place Pierre Viala Bâtiment 32 34060 Montpellier Cedex 2 France
| | - Eric Dubreucq
- UMR IATEMontpellier SupAgro 2 place Pierre Viala Bâtiment 32 34060 Montpellier Cedex 2 France
| |
Collapse
|
21
|
Copp JN, Akiva E, Babbitt PC, Tokuriki N. Revealing Unexplored Sequence-Function Space Using Sequence Similarity Networks. Biochemistry 2018; 57:4651-4662. [PMID: 30052428 DOI: 10.1021/acs.biochem.8b00473] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The rapidly expanding number of protein sequences found in public databases can improve our understanding of how protein functions evolve. However, our current knowledge of protein function likely represents a small fraction of the diverse repertoire that exists in nature. Integrative computational methods can facilitate the discovery of new protein functions and enzymatic reactions through the observation and investigation of the complex sequence-structure-function relationships within protein superfamilies. Here, we highlight the use of sequence similarity networks (SSNs) to identify previously unexplored sequence and function space. We exemplify this approach using the nitroreductase (NTR) superfamily. We demonstrate that SSN investigations can provide a rapid and effective means to classify groups of proteins, therefore exposing experimentally unexplored sequences that may exhibit novel functionality. Integration of such approaches with systematic experimental characterization will expand our understanding of the functional diversity of enzymes and their associated physiological roles.
Collapse
Affiliation(s)
- Janine N Copp
- Michael Smith Laboratories , University of British Columbia , 2185 East Mall , Vancouver , British Columbia V6T 1Z4 , Canada
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences , University of California , San Francisco , California 94158 , United States.,Quantitative Biosciences Institute , University of California , San Francisco , California 94143 , United States
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences , University of California , San Francisco , California 94158 , United States.,Quantitative Biosciences Institute , University of California , San Francisco , California 94143 , United States
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories , University of British Columbia , 2185 East Mall , Vancouver , British Columbia V6T 1Z4 , Canada
| |
Collapse
|
22
|
Holliday GL, Akiva E, Meng EC, Brown SD, Calhoun S, Pieper U, Sali A, Booker SJ, Babbitt PC. Atlas of the Radical SAM Superfamily: Divergent Evolution of Function Using a "Plug and Play" Domain. Methods Enzymol 2018; 606:1-71. [PMID: 30097089 DOI: 10.1016/bs.mie.2018.06.004] [Citation(s) in RCA: 96] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The radical SAM superfamily contains over 100,000 homologous enzymes that catalyze a remarkably broad range of reactions required for life, including metabolism, nucleic acid modification, and biogenesis of cofactors. While the highly conserved SAM-binding motif responsible for formation of the key 5'-deoxyadenosyl radical intermediate is a key structural feature that simplifies identification of superfamily members, our understanding of their structure-function relationships is complicated by the modular nature of their structures, which exhibit varied and complex domain architectures. To gain new insight about these relationships, we classified the entire set of sequences into similarity-based subgroups that could be visualized using sequence similarity networks. This superfamily-wide analysis reveals important features that had not previously been appreciated from studies focused on one or a few members. Functional information mapped to the networks indicates which members have been experimentally or structurally characterized, their known reaction types, and their phylogenetic distribution. Despite the biological importance of radical SAM chemistry, the vast majority of superfamily members have never been experimentally characterized in any way, suggesting that many new reactions remain to be discovered. In addition to 20 subgroups with at least one known function, we identified additional subgroups made up entirely of sequences of unknown function. Importantly, our results indicate that even general reaction types fail to track well with our sequence similarity-based subgroupings, raising major challenges for function prediction for currently identified and new members that continue to be discovered. Interactive similarity networks and other data from this analysis are available from the Structure-Function Linkage Database.
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States.
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Elaine C Meng
- Resource for Biocomputing, Visualization, and Informatics, Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA, United States
| | - Shoshana D Brown
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Sara Calhoun
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Graduate Program in Biophysics, University of California, San Francisco, CA, United States
| | - Ursula Pieper
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States
| | - Squire J Booker
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, United States; Department of Chemistry, The Pennsylvania State University, University Park, PA, United States; The Howard Hughes Medical Institute, The Pennsylvania State University, University Park, PA, United States
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States.
| |
Collapse
|
23
|
Theoretical Studies on Catalysis Mechanisms of Serum Paraoxonase 1 and Phosphotriesterase Diisopropyl Fluorophosphatase Suggest the Alteration of Substrate Preference from Paraoxonase to DFP. Molecules 2018; 23:molecules23071660. [PMID: 29986514 PMCID: PMC6100192 DOI: 10.3390/molecules23071660] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2018] [Revised: 07/04/2018] [Accepted: 07/05/2018] [Indexed: 12/18/2022] Open
Abstract
The calcium-dependent β-propeller proteins mammalian serum paraoxonase 1 (PON1) and phosphotriesterase diisopropyl fluorophosphatase (DFPase) catalyze the hydrolysis of organophosphorus compounds and enhance hydrolysis of various nerve agents. In the present work, the phosphotriesterase activity development between PON1 and DFPase was investigated by using the hybrid density functional theory method B3LYP. Based on the active-site difference between PON1 and DFPase, both the wild type and the mutant (a water molecule replacing Asn270 in PON1) models were designed. The results indicated that the substitution of a water molecule for Asn270 in PON1 had little effect on the enzyme activity in kinetics, while being more efficient in thermodynamics, which is essential for DFP hydrolysis. Structure comparisons of evolutionarily related enzymes show that the mutation of Asn270 leads to the catalytic Ca2+ ion indirectly connecting the buried structural Ca2+ ion via hydrogen bonds in DFPase. It can reduce the plasticity of enzymatic structure, and possibly change the substrate preference from paraoxon to DFP, which implies an evolutionary transition from mono- to dinuclear catalytic centers. Our studies shed light on the investigation of enzyme catalysis mechanism from an evolutionary perspective.
Collapse
|
24
|
Zhang H, Yang L, Ding W, Ma Y. Theoretical Studies on the Catalytic Cycle of Histidine Acid Phosphatases Revealing an Acid Proof Mechanism. J Phys Chem B 2018; 122:7530-7538. [DOI: 10.1021/acs.jpcb.8b04808] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Hao Zhang
- College of Life Science and Engineering, Northwest Minzu University, Lanzhou 730030, P. R. China
| | - Ling Yang
- MIIT Key Laboratory of Critical Materials Technology for New Energy Conversion and Storage, School of Chemistry and Chemical Engineering, Harbin Institute of Technology, Harbin 150001, P. R. China
| | - Wanjian Ding
- College of Chemistry, Beijing Normal University, Beijing 100086, P. R. China
| | - Yingying Ma
- Institute of Mining Technology, Inner Mongolia University of Technology. Hohhot 010051, P. R. China
- Inner Mongolia Key Laboratory of Theoretical and Computational Chemistry Simulation. Hohhot 010051, P. R. China
| |
Collapse
|
25
|
Danchin A, Ouzounis C, Tokuyasu T, Zucker JD. No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects. Microb Biotechnol 2018; 11:588-605. [PMID: 29806194 PMCID: PMC6011933 DOI: 10.1111/1751-7915.13284] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery-driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from 'the sequence tells the structure tells the function' fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader.
Collapse
Affiliation(s)
- Antoine Danchin
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, Hong Kong University, 21 Sassoon Road, Pokfulam, Hong Kong
| | - Christos Ouzounis
- Biological Computation and Process Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessalonica, 57001, Greece
| | - Taku Tokuyasu
- Shenzhen Institutes of Advanced Technology, Institute of Synthetic Biology, Shenzhen University Town, 1068 Xueyuan Avenue, Shenzhen, China
| | - Jean-Daniel Zucker
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
| |
Collapse
|
26
|
Valasatava Y, Rosato A, Furnham N, Thornton JM, Andreini C. To what extent do structural changes in catalytic metal sites affect enzyme function? J Inorg Biochem 2018; 179:40-53. [PMID: 29161638 PMCID: PMC5760197 DOI: 10.1016/j.jinorgbio.2017.11.002] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Revised: 11/02/2017] [Accepted: 11/04/2017] [Indexed: 01/09/2023]
Abstract
About half of known enzymatic reactions involve metals. Enzymes belonging to the same superfamily often evolve to catalyze different reactions on the same structural scaffold. The work presented here investigates how functional differentiation, within superfamilies that contain metalloenzymes, relates to structural changes at the catalytic metal site. In general, when the catalytic metal site is unchanged across the enzymes of a superfamily, the functional differentiation within the superfamily tends to be low and the mechanism conserved. Conversely, all types of structural changes in the metal binding site are observed for superfamilies with high functional differentiation. Overall, the catalytic role of the metal ions appears to be one of the most conserved features of the enzyme mechanism within metalloenzyme superfamilies. In particular, when the catalytic role of the metal ion does not involve a redox reaction (i.e. there is no exchange of electrons with the substrate), this role is almost always maintained even when the site undergoes significant structural changes. In these enzymes, functional diversification is most often associated with modifications in the surrounding protein matrix, which has changed so much that the enzyme chemistry is significantly altered. On the other hand, in more than 50% of the examples where the metal has a redox role in catalysis, changes at the metal site modify its catalytic role. Further, we find that there are no examples in our dataset where metal sites with a redox role are lost during evolution. SYNOPSIS In this paper we investigate how functional diversity within superfamilies of metalloenzymes relates to structural changes at the catalytic metal site. Evolution tends to strictly conserve the metal site. When changes occur, they do not modify the catalytic role of non-redox metals whereas they affect the role of redox-active metals.
Collapse
Affiliation(s)
- Yana Valasatava
- Magnetic Resonance Center, University of Florence, 50019 Sesto Fiorentino, Italy; Department of Chemistry, University of Florence, 50019 Sesto Fiorentino, Italy
| | - Antonio Rosato
- Magnetic Resonance Center, University of Florence, 50019 Sesto Fiorentino, Italy; Department of Chemistry, University of Florence, 50019 Sesto Fiorentino, Italy
| | - Nicholas Furnham
- Department of Pathogen Molecular Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Janet M Thornton
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Claudia Andreini
- Magnetic Resonance Center, University of Florence, 50019 Sesto Fiorentino, Italy; Department of Chemistry, University of Florence, 50019 Sesto Fiorentino, Italy.
| |
Collapse
|
27
|
Ellens KW, Christian N, Singh C, Satagopam VP, May P, Linster CL. Confronting the catalytic dark matter encoded by sequenced genomes. Nucleic Acids Res 2017; 45:11495-11514. [PMID: 29059321 PMCID: PMC5714238 DOI: 10.1093/nar/gkx937] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 10/03/2017] [Indexed: 01/02/2023] Open
Abstract
The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim at determining how many enzymes of uncertain or unknown function are still present in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a Hidden Markov Model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the ‘unknown enzyme problem’. We extensively review classical biochemical as well as more recent systematic experimental and computational approaches that can be used to support enzyme function discovery research. Finally, we discuss the possible roles of the elusive catalysts in light of recent developments in the fields of enzymology and metabolism as well as the significance of the unknown enzyme problem in the context of metabolic modeling, metabolic engineering and rare disease research.
Collapse
Affiliation(s)
- Kenneth W Ellens
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Nils Christian
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Charandeep Singh
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Venkata P Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Carole L Linster
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| |
Collapse
|
28
|
Holliday GL, Brown SD, Akiva E, Mischel D, Hicks MA, Morris JH, Huang CC, Meng EC, Pegg SCH, Ferrin TE, Babbitt PC. Biocuration in the structure-function linkage database: the anatomy of a superfamily. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:3074783. [PMID: 28365730 PMCID: PMC5467563 DOI: 10.1093/database/bax006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Accepted: 01/23/2017] [Indexed: 12/11/2022]
Abstract
With ever-increasing amounts of sequence data available in both the primary literature and sequence repositories, there is a bottleneck in annotating molecular function to a sequence. This article describes the biocuration process and methods used in the structure-function linkage database (SFLD) to help address some of the challenges. We discuss how the hierarchy within the SFLD allows us to infer detailed functional properties for functionally diverse enzyme superfamilies in which all members are homologous, conserve an aspect of their chemical function and have associated conserved structural features that enable the chemistry. Also presented is the Enzyme Structure-Function Ontology (ESFO), which has been designed to capture the relationships between enzyme sequence, structure and function that underlie the SFLD and is used to guide the biocuration processes within the SFLD. Database URL:http://sfld.rbvi.ucsf.edu/
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Shoshana D Brown
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - David Mischel
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA
| | - Michael A Hicks
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA.,Human Longevity, Inc, San Diego, CA 92121, USA
| | - John H Morris
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | - Conrad C Huang
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | - Elaine C Meng
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA
| | | | - Thomas E Ferrin
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA.,California Institute for Quantitative Biosciences, University of California, San Francisco, CA 94158, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143, USA.,Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA 94143, USA.,California Institute for Quantitative Biosciences, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
29
|
Akiva E, Copp JN, Tokuriki N, Babbitt PC. Evolutionary and molecular foundations of multiple contemporary functions of the nitroreductase superfamily. Proc Natl Acad Sci U S A 2017; 114:E9549-E9558. [PMID: 29078300 PMCID: PMC5692541 DOI: 10.1073/pnas.1706849114] [Citation(s) in RCA: 104] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Insight regarding how diverse enzymatic functions and reactions have evolved from ancestral scaffolds is fundamental to understanding chemical and evolutionary biology, and for the exploitation of enzymes for biotechnology. We undertook an extensive computational analysis using a unique and comprehensive combination of tools that include large-scale phylogenetic reconstruction to determine the sequence, structural, and functional relationships of the functionally diverse flavin mononucleotide-dependent nitroreductase (NTR) superfamily (>24,000 sequences from all domains of life, 54 structures, and >10 enzymatic functions). Our results suggest an evolutionary model in which contemporary subgroups of the superfamily have diverged in a radial manner from a minimal flavin-binding scaffold. We identified the structural design principle for this divergence: Insertions at key positions in the minimal scaffold that, combined with the fixation of key residues, have led to functional specialization. These results will aid future efforts to delineate the emergence of functional diversity in enzyme superfamilies, provide clues for functional inference for superfamily members of unknown function, and facilitate rational redesign of the NTR scaffold.
Collapse
Affiliation(s)
- Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Janine N Copp
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada V6T 1Z4
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada V6T 1Z4;
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158;
- California Institute for Quantitative Biosciences, University of California, San Francisco, CA 94158
| |
Collapse
|
30
|
|
31
|
Tyzack JD, Furnham N, Sillitoe I, Orengo CM, Thornton JM. Understanding enzyme function evolution from a computational perspective. Curr Opin Struct Biol 2017; 47:131-139. [PMID: 28892668 DOI: 10.1016/j.sbi.2017.08.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 07/08/2017] [Accepted: 08/13/2017] [Indexed: 10/18/2022]
Abstract
In this review, we will explore recent computational approaches to understand enzyme evolution from the perspective of protein structure, dynamics and promiscuity. We will present quantitative methods to measure the size of evolutionary steps within a structural domain, allowing the correlation between change in substrate and domain structure to be assessed, and giving insights into the evolvability of different domains in terms of the number, types and sizes of evolutionary steps observed. These approaches will help to understand the evolution of new catalytic and non-catalytic functionality in response to environmental demands, showing potential to guide de novoenzyme design and directed evolution experiments.
Collapse
Affiliation(s)
| | - Nicholas Furnham
- London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
| | - Ian Sillitoe
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | - Christine M Orengo
- Institute of Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | | |
Collapse
|
32
|
Jan AH, Dubreucq E, Drone J, Subileau M. A glimpse into the specialization history of the lipases/acyltransferases family of CpLIP2. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2017. [DOI: 10.1016/j.bbapap.2017.06.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
33
|
Chowdhary J, Löffler FE, Smith JC. Community detection in sequence similarity networks based on attribute clustering. PLoS One 2017; 12:e0178650. [PMID: 28738060 PMCID: PMC5524321 DOI: 10.1371/journal.pone.0178650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 05/16/2017] [Indexed: 11/18/2022] Open
Abstract
Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs, for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the "ground truth" community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from https://cmb.ornl.gov/resources/developments.
Collapse
Affiliation(s)
- Janamejaya Chowdhary
- Center for Molecular Biophysics, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
- University of Tennessee-Oak Ridge National Laboratory, Joint Institute for Biological Sciences and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
| | - Frank E. Löffler
- Department of Microbiology, Department of Civil and Environmental Engineering, University of Tennessee, Knoxville, Tennessee, United States of America
- Center for Environmental Biotechnology, University of Tennessee, Knoxville, Tennessee, United States of America
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Jeremy C. Smith
- Center for Molecular Biophysics, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
- University of Tennessee-Oak Ridge National Laboratory, Joint Institute for Biological Sciences and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee, United States of America
| |
Collapse
|
34
|
Evolution of biosynthetic diversity. Biochem J 2017; 474:2277-2299. [DOI: 10.1042/bcj20160823] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 04/20/2017] [Accepted: 04/24/2017] [Indexed: 12/16/2022]
Abstract
Since the emergence of the last common ancestor from which all extant life evolved, the metabolite repertoire of cells has increased and diversified. Not only has the metabolite cosmos expanded, but the ways in which the same metabolites are made have diversified. Enzymes catalyzing the same reaction have evolved independently from different protein folds; the same protein fold can produce enzymes recognizing different substrates, and enzymes performing different chemistries. Genes encoding useful enzymes can be transferred between organisms and even between the major domains of life. Organisms that live in metabolite-rich environments sometimes lose the pathways that produce those same metabolites. Fusion of different protein domains results in enzymes with novel properties. This review will consider the major evolutionary mechanisms that generate biosynthetic diversity: gene duplication (and gene loss), horizontal and endosymbiotic gene transfer, and gene fusion. It will also discuss mechanisms that lead to convergence as well as divergence. To illustrate these mechanisms, one of the original metabolisms present in the last universal common ancestor will be employed: polyamine metabolism, which is essential for the growth and cell proliferation of archaea and eukaryotes, and many bacteria.
Collapse
|
35
|
Abstract
The Gene Ontology (GO) (Ashburner et al., Nat Genet 25(1):25-29, 2000) is a powerful tool in the informatics arsenal of methods for evaluating annotations in a protein dataset. From identifying the nearest well annotated homologue of a protein of interest to predicting where misannotation has occurred to knowing how confident you can be in the annotations assigned to those proteins is critical. In this chapter we explore what makes an enzyme unique and how we can use GO to infer aspects of protein function based on sequence similarity. These can range from identification of misannotation or other errors in a predicted function to accurate function prediction for an enzyme of entirely unknown function. Although GO annotation applies to any gene products, we focus here a describing our approach for hierarchical classification of enzymes in the Structure-Function Linkage Database (SFLD) (Akiva et al., Nucleic Acids Res 42(Database issue):D521-530, 2014) as a guide for informed utilisation of annotation transfer based on GO terms.
Collapse
Affiliation(s)
- Gemma L Holliday
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1700 4th Street, San Francisco, CA, 94158, USA.
| | - Rebecca Davidson
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1700 4th Street, San Francisco, CA, 94158, USA
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1700 4th Street, San Francisco, CA, 94158, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, 1700 4th Street, San Francisco, CA, 94158, USA
| |
Collapse
|
36
|
Baier F, Copp JN, Tokuriki N. Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence–Function Relationships. Biochemistry 2016; 55:6375-6388. [DOI: 10.1021/acs.biochem.6b00723] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- F. Baier
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - J. N. Copp
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - N. Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| |
Collapse
|
37
|
Zallot R, Harrison KJ, Kolaczkowski B, de Crécy-Lagard V. Functional Annotations of Paralogs: A Blessing and a Curse. Life (Basel) 2016; 6:life6030039. [PMID: 27618105 PMCID: PMC5041015 DOI: 10.3390/life6030039] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 08/29/2016] [Accepted: 09/02/2016] [Indexed: 12/15/2022] Open
Abstract
Gene duplication followed by mutation is a classic mechanism of neofunctionalization, producing gene families with functional diversity. In some cases, a single point mutation is sufficient to change the substrate specificity and/or the chemistry performed by an enzyme, making it difficult to accurately separate enzymes with identical functions from homologs with different functions. Because sequence similarity is often used as a basis for assigning functional annotations to genes, non-isofunctional gene families pose a great challenge for genome annotation pipelines. Here we describe how integrating evolutionary and functional information such as genome context, phylogeny, metabolic reconstruction and signature motifs may be required to correctly annotate multifunctional families. These integrative analyses can also lead to the discovery of novel gene functions, as hints from specific subgroups can guide the functional characterization of other members of the family. We demonstrate how careful manual curation processes using comparative genomics can disambiguate subgroups within large multifunctional families and discover their functions. We present the COG0720 protein family as a case study. We also discuss strategies to automate this process to improve the accuracy of genome functional annotation pipelines.
Collapse
Affiliation(s)
- Rémi Zallot
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA.
| | - Katherine J Harrison
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA.
| | - Bryan Kolaczkowski
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA.
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida, Gainesville, FL 32611, USA.
| |
Collapse
|
38
|
Di Roberto RB, Chang B, Trusina A, Peisajovich SG. Evolution of a G protein-coupled receptor response by mutations in regulatory network interactions. Nat Commun 2016; 7:12344. [PMID: 27487915 PMCID: PMC4976203 DOI: 10.1038/ncomms12344] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 06/24/2016] [Indexed: 12/17/2022] Open
Abstract
All cellular functions depend on the concerted action of multiple proteins organized in complex networks. To understand how selection acts on protein networks, we used the yeast mating receptor Ste2, a pheromone-activated G protein-coupled receptor, as a model system. In Saccharomyces cerevisiae, Ste2 is a hub in a network of interactions controlling both signal transduction and signal suppression. Through laboratory evolution, we obtained 21 mutant receptors sensitive to the pheromone of a related yeast species and investigated the molecular mechanisms behind this newfound sensitivity. While some mutants show enhanced binding affinity to the foreign pheromone, others only display weakened interactions with the network's negative regulators. Importantly, the latter changes have a limited impact on overall pathway regulation, despite their considerable effect on sensitivity. Our results demonstrate that a new receptor–ligand pair can evolve through network-altering mutations independently of receptor–ligand binding, and suggest a potential role for such mutations in disease. Co-evolution of a new receptor-ligand pair will affect the downstream signal transduction network. Here, the authors use experimental evolution of yeast mating receptor Ste2 to show the effect of enhanced binding affinity and weakened interactions with the network's negative regulators on protein evolution.
Collapse
Affiliation(s)
- Raphaël B Di Roberto
- Department of Cell and Systems Biology, University of Toronto, 25 Harbord Street, Toronto, Ontario, Canada M5S 3G5
| | - Belinda Chang
- Department of Cell and Systems Biology, University of Toronto, 25 Harbord Street, Toronto, Ontario, Canada M5S 3G5
| | - Ala Trusina
- Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, Copenhagen Ø 2100, Denmark
| | - Sergio G Peisajovich
- Department of Cell and Systems Biology, University of Toronto, 25 Harbord Street, Toronto, Ontario, Canada M5S 3G5
| |
Collapse
|
39
|
Lees JG, Dawson NL, Sillitoe I, Orengo CA. Functional innovation from changes in protein domains and their combinations. Curr Opin Struct Biol 2016; 38:44-52. [DOI: 10.1016/j.sbi.2016.05.016] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Revised: 05/17/2016] [Accepted: 05/24/2016] [Indexed: 10/21/2022]
|
40
|
Lobb B, Doxey AC. Novel function discovery through sequence and structural data mining. Curr Opin Struct Biol 2016; 38:53-61. [DOI: 10.1016/j.sbi.2016.05.017] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Revised: 05/17/2016] [Accepted: 05/24/2016] [Indexed: 01/30/2023]
|
41
|
Das S, Dawson NL, Orengo CA. Diversity in protein domain superfamilies. Curr Opin Genet Dev 2015; 35:40-9. [PMID: 26451979 PMCID: PMC4686048 DOI: 10.1016/j.gde.2015.09.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2015] [Revised: 09/07/2015] [Accepted: 09/08/2015] [Indexed: 01/25/2023]
Abstract
Whilst ∼93% of domain superfamilies appear to be relatively structurally and functionally conserved based on the available data from the CATH-Gene3D domain classification resource, the remainder are much more diverse. In this review, we consider how domains in some of the most ubiquitous and promiscuous superfamilies have evolved, in particular the plasticity in their functional sites and surfaces which expands the repertoire of molecules they interact with and actions performed on them. To what extent can we identify a core function for these superfamilies which would allow us to develop a ‘domain grammar of function’ whereby a protein's biological role can be proposed from its constituent domains? Clearly the first step is to understand the extent to which these components vary and how changes in their molecular make-up modifies function.
Collapse
Affiliation(s)
- Sayoni Das
- Institute of Structural and Molecular Biology, UCL, 627 Darwin Building, Gower Street, WC1E 6BT, UK
| | - Natalie L Dawson
- Institute of Structural and Molecular Biology, UCL, 627 Darwin Building, Gower Street, WC1E 6BT, UK
| | - Christine A Orengo
- Institute of Structural and Molecular Biology, UCL, 627 Darwin Building, Gower Street, WC1E 6BT, UK.
| |
Collapse
|
42
|
Ahmed FH, Carr PD, Lee BM, Afriat-Jurnou L, Mohamed AE, Hong NS, Flanagan J, Taylor MC, Greening C, Jackson CJ. Sequence-Structure-Function Classification of a Catalytically Diverse Oxidoreductase Superfamily in Mycobacteria. J Mol Biol 2015; 427:3554-3571. [PMID: 26434506 DOI: 10.1016/j.jmb.2015.09.021] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Revised: 09/23/2015] [Accepted: 09/24/2015] [Indexed: 12/11/2022]
Abstract
The deazaflavin cofactor F420 enhances the persistence of mycobacteria during hypoxia, oxidative stress, and antibiotic treatment. However, the identities and functions of the mycobacterial enzymes that utilize F420 under these conditions have yet to be resolved. In this work, we used sequence similarity networks to analyze the distribution of the largest F420-dependent protein family in mycobacteria. We show that these enzymes are part of a larger split β-barrel enzyme superfamily (flavin/deazaflavin oxidoreductases, FDORs) that include previously characterized pyridoxamine/pyridoxine-5'-phosphate oxidases and heme oxygenases. We show that these proteins variously utilize F420, flavin mononucleotide, flavin adenine dinucleotide, and heme cofactors. Functional annotation using phylogenetic, structural, and spectroscopic methods revealed their involvement in heme degradation, biliverdin reduction, fatty acid modification, and quinone reduction. Four novel crystal structures show that plasticity in substrate binding pockets and modifications to cofactor binding motifs enabled FDORs to carry out a variety of functions. This systematic classification and analysis provides a framework for further functional analysis of the roles of FDORs in mycobacterial pathogenesis and persistence.
Collapse
Affiliation(s)
- F Hafna Ahmed
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - Paul D Carr
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - Brendon M Lee
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - Livnat Afriat-Jurnou
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - A Elaaf Mohamed
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - Nan-Sook Hong
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia
| | - Jack Flanagan
- University of Auckland Faculty of Medical and Health Sciences, 85 Park Road, Grafton, Auckland 2013, New Zealand
| | - Matthew C Taylor
- Commonwealth Scientific and Industrial Research Organisation Land and Water Flagship, Clunies Ross Street, Acton, ACT 2060, Australia
| | - Chris Greening
- Commonwealth Scientific and Industrial Research Organisation Land and Water Flagship, Clunies Ross Street, Acton, ACT 2060, Australia
| | - Colin J Jackson
- Australian National University Research School of Chemistry, Sullivans Creek Road, Acton, ACT 2601, Australia.
| |
Collapse
|
43
|
Abstract
Catalytically inactive enzymes (also known as pseudoproteases, protease homologues or paralogues, non-peptidase homologues, non-enzymes and pseudoenzymes) have traditionally been hypothesized to act as regulators of their active homologues. However, those that have been characterized demonstrate that inactive enzymes have an extensive and expanding role in biological processes, including regulation, inhibition and immune modulation. With the emergence of each new genome, more inactive enzymes are being identified, and their abundance and potential as therapeutic targets has been realized. In the light of the growing interest in this emerging field the present review focuses on the classification, structure, function and mechanism of inactive enzymes. Examples of how inactivity is defined, how this is reflected in the structure, functions of inactive enzymes in biological processes and their mode of action are discussed.
Collapse
|
44
|
Baier F, Chen J, Solomonson M, Strynadka NC, Tokuriki N. Distinct Metal Isoforms Underlie Promiscuous Activity Profiles of Metalloenzymes. ACS Chem Biol 2015; 10:1684-93. [PMID: 25856271 DOI: 10.1021/acschembio.5b00068] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Within a superfamily, functionally diverged metalloenzymes often favor different metals as cofactors for catalysis. One hypothesis is that incorporation of alternative metals expands the catalytic repertoire of metalloenzymes and provides evolutionary springboards toward new catalytic functions. However, there is little experimental evidence that incorporation of alternative metals changes the activity profile of metalloenzymes. Here, we systematically investigate how metals alter the activity profiles of five functionally diverged enzymes of the metallo-β-lactamase (MBL) superfamily. Each enzyme was reconstituted in vitro with six different metals, Cd(2+), Co(2+), Fe(2+), Mn(2+), Ni(2+), and Zn(2+), and assayed against eight catalytically distinct hydrolytic reactions (representing native functions of MBL enzymes). We reveal that each enzyme metal isoform has a significantly different activity level for native and promiscuous reactions. Moreover, metal preferences for native versus promiscuous activities are not correlated and, in some cases, are mutually exclusive; only particular metal isoforms disclose cryptic promiscuous activities but often at the expense of the native activity. For example, the L1 B3 β-lactamase displays a 1000-fold catalytic preference for Zn(2+) over Ni(2+) for its native activity but exhibits promiscuous thioester, phosphodiester, phosphotriester, and lactonase activity only with Ni(2+). Furthermore, we find that the five MBL enzymes exist as an ensemble of various metal isoforms in vivo, and this heterogeneity results in an expanded activity profile compared to a single metal isoform. Our study suggests that promiscuous activities of metalloenzymes can stem from an ensemble of metal isoforms in the cell, which could facilitate the functional divergence of metalloenzymes.
Collapse
Affiliation(s)
- Florian Baier
- Michael
Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - John Chen
- Michael
Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Matthew Solomonson
- Center
for Blood Research, University of British Columbia, Vancouver, British Columbia, Canada
- Department
of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Natalie C.J. Strynadka
- Center
for Blood Research, University of British Columbia, Vancouver, British Columbia, Canada
- Department
of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, British Columbia, Canada
| | - Nobuhiko Tokuriki
- Michael
Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
45
|
Martínez Cuesta S, Rahman SA, Furnham N, Thornton JM. The Classification and Evolution of Enzyme Function. Biophys J 2015; 109:1082-6. [PMID: 25986631 DOI: 10.1016/j.bpj.2015.04.020] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 04/16/2015] [Accepted: 04/17/2015] [Indexed: 11/30/2022] Open
Abstract
Enzymes are the proteins responsible for the catalysis of life. Enzymes sharing a common ancestor as defined by sequence and structure similarity are grouped into families and superfamilies. The molecular function of enzymes is defined as their ability to catalyze biochemical reactions; it is manually classified by the Enzyme Commission and robust approaches to quantitatively compare catalytic reactions are just beginning to appear. Here, we present an overview of studies at the interface of the evolution and function of enzymes.
Collapse
Affiliation(s)
- Sergio Martínez Cuesta
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Syed Asad Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Nicholas Furnham
- Department of Pathogen Molecular Biology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.
| |
Collapse
|
46
|
Lozano P, Bernal JM, Nieto S, Gomez C, Garcia-Verdugo E, Luis SV. Active biopolymers in green non-conventional media: a sustainable tool for developing clean chemical processes. Chem Commun (Camb) 2015; 51:17361-74. [DOI: 10.1039/c5cc07600e] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
By understanding structure–function relationships of active biopolymers (e.g. enzymes and nucleic acids) in green non-conventional media, sustainable chemical processes may be developed.
Collapse
Affiliation(s)
- Pedro Lozano
- Departamento de Bioquímica y Biología Molecular “B” e Inmunología
- Facultad de Química
- Campus de Excelencia Internacional Mare Nostrum
- Universidad de Murcia
- Murcia
| | - Juana M. Bernal
- Departamento de Bioquímica y Biología Molecular “B” e Inmunología
- Facultad de Química
- Campus de Excelencia Internacional Mare Nostrum
- Universidad de Murcia
- Murcia
| | - Susana Nieto
- Departamento de Bioquímica y Biología Molecular “B” e Inmunología
- Facultad de Química
- Campus de Excelencia Internacional Mare Nostrum
- Universidad de Murcia
- Murcia
| | - Celia Gomez
- Departamento de Bioquímica y Biología Molecular “B” e Inmunología
- Facultad de Química
- Campus de Excelencia Internacional Mare Nostrum
- Universidad de Murcia
- Murcia
| | | | - Santiago V. Luis
- Departamento de Química Inorgánica y Orgánica
- Universidad Jaume I
- Castellón
- Spain
| |
Collapse
|
47
|
Abstract
In this thematic minireview series, the JBC presents five provocative articles on Enzyme Evolution. The reviews discuss stimulating concepts that include the emergence of primordial catalysts at temperatures that were considerably warmer than present day ones and the impact of the cooling environment on the evolution of catalytic fitness and the preservation of catalysis-promoting conformational dynamics. They also discuss the use of Urzymes or invariant modules in enzyme superfamilies as paradigms for understanding the evolution of catalytic efficiency and specificity, the use of bioinformatics approaches to understand the roles of substrate ambiguity and catalytic promiscuity as drivers of evolution, and the challenges associated with assigning catalytic function as the number of superfamily members grows rapidly.
Collapse
Affiliation(s)
- Ruma Banerjee
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, Michigan 48109-0600.
| |
Collapse
|