1
|
Pérez-Niño JA, Guerra Y, Díaz-Salazar AJ, Costas M, Rodríguez-Romero A, Fernández-Velasco DA. Stable monomers in the ancestral sequence reconstruction of the last opisthokont common ancestor of dimeric triosephosphate isomerase. Protein Sci 2024; 33:e5134. [PMID: 39145435 PMCID: PMC11325190 DOI: 10.1002/pro.5134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 07/01/2024] [Accepted: 07/21/2024] [Indexed: 08/16/2024]
Abstract
Function and structure are strongly coupled in obligated oligomers such as Triosephosphate isomerase (TIM). In animals and fungi, TIM monomers are inactive and unstable. Previously, we used ancestral sequence reconstruction to study TIM evolution and found that before these lineages diverged, the last opisthokonta common ancestor of TIM (LOCATIM) was an obligated oligomer that resembles those of extant TIMs. Notably, calorimetric evidence indicated that ancestral TIM monomers are more structured than extant ones. To further increase confidence about the function, structure, and stability of the LOCATIM, in this work, we applied two different inference methodologies and the worst plausible case scenario for both of them, to infer four sequences of this ancestor and test the robustness of their physicochemical properties. The extensive biophysical characterization of the four reconstructed sequences of LOCATIM showed very similar hydrodynamic and spectroscopic properties, as well as ligand-binding energetics and catalytic parameters. Their 3D structures were also conserved. Although differences were observed in melting temperature, all LOCATIMs showed reversible urea-induced unfolding transitions, and for those that reached equilibrium, high conformational stability was estimated (ΔGTot = 40.6-46.2 kcal/mol). The stability of the inactive monomeric intermediates was also high (ΔGunf = 12.6-18.4 kcal/mol), resembling some protozoan TIMs rather than the unstable monomer observed in extant opisthokonts. A comparative analysis of the 3D structure of ancestral and extant TIMs shows a correlation between the higher stability of the ancestral monomers with the presence of several hydrogen bonds located in the "bottom" part of the barrel.
Collapse
Affiliation(s)
- Jorge Alejandro Pérez-Niño
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| | - Yasel Guerra
- Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
- Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito, Ecuador
| | - A Jessica Díaz-Salazar
- Laboratorio de Biofisicoquímica, Departamento de Fisicoquímica, Facultad de Química, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| | - Miguel Costas
- Laboratorio de Biofisicoquímica, Departamento de Fisicoquímica, Facultad de Química, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| | | | - D Alejandro Fernández-Velasco
- Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| |
Collapse
|
2
|
Thomson RES, Carrera-Pacheco SE, Gillam EMJ. Engineering functional thermostable proteins using ancestral sequence reconstruction. J Biol Chem 2022; 298:102435. [PMID: 36041629 PMCID: PMC9525910 DOI: 10.1016/j.jbc.2022.102435] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 08/23/2022] [Accepted: 08/24/2022] [Indexed: 11/20/2022] Open
Abstract
Natural proteins are often only slightly more stable in the native state than the denatured state, and an increase in environmental temperature can easily shift the balance toward unfolding. Therefore, the engineering of proteins to improve protein stability is an area of intensive research. Thermostable proteins are required to withstand industrial process conditions, for increased shelf-life of protein therapeutics, for developing robust 'biobricks' for synthetic biology applications, and for research purposes (e.g., structure determination). In addition, thermostability buffers the often destabilizing effects of mutations introduced to improve other properties. Rational design approaches to engineering thermostability require structural information, but even with advanced computational methods, it is challenging to predict or parameterize all the relevant structural factors with sufficient precision to anticipate the results of a given mutation. Directed evolution is an alternative when structures are unavailable but requires extensive screening of mutant libraries. Recently, however, bioinspired approaches based on phylogenetic analyses have shown great promise. Leveraging the rapid expansion in sequence data and bioinformatic tools, ancestral sequence reconstruction can generate highly stable folds for novel applications in industrial chemistry, medicine, and synthetic biology. This review provides an overview of the factors important for successful inference of thermostable proteins by ancestral sequence reconstruction and what it can reveal about the determinants of stability in proteins.
Collapse
Affiliation(s)
- Raine E S Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Saskya E Carrera-Pacheco
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Elizabeth M J Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia.
| |
Collapse
|
3
|
Hobbs HT, Shah NH, Shoemaker SR, Amacher JF, Marqusee S, Kuriyan J. Saturation mutagenesis of a predicted ancestral Syk-family kinase. Protein Sci 2022; 31:e4411. [PMID: 36173161 PMCID: PMC9601881 DOI: 10.1002/pro.4411] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 06/27/2022] [Accepted: 07/25/2022] [Indexed: 11/08/2022]
Abstract
Many tyrosine kinases cannot be expressed readily in Escherichia coli, limiting facile production of these proteins for biochemical experiments. We used ancestral sequence reconstruction to generate a spleen tyrosine kinase (Syk) variant that can be expressed in bacteria and purified in soluble form, unlike the human members of this family (Syk and zeta-chain-associated protein kinase of 70 kDa [ZAP-70]). The catalytic activity, substrate specificity, and regulation by phosphorylation of this Syk variant are similar to the corresponding properties of human Syk and ZAP-70. Taking advantage of the ability to express this novel Syk-family kinase in bacteria, we developed a two-hybrid assay that couples the growth of E. coli in the presence of an antibiotic to successful phosphorylation of a bait peptide by the kinase. Using this assay, we screened a site-saturation mutagenesis library of the kinase domain of this reconstructed Syk-family kinase. Sites of loss-of-function mutations identified in the screen correlate well with residues established previously as critical to function and/or structure in protein kinases. We also identified activating mutations in the regulatory hydrophobic spine and activation loop, which are within key motifs involved in kinase regulation. Strikingly, one mutation in an ancestral Syk-family variant increases the soluble expression of the protein by 75-fold. Thus, through ancestral sequence reconstruction followed by deep mutational scanning, we have generated Syk-family kinase variants that can be expressed in bacteria with very high yield.
Collapse
Affiliation(s)
- Helen T. Hobbs
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Biomedical EngineeringUniversity of CaliforniaIrvineCaliforniaUSA
| | - Neel H. Shah
- Department of ChemistryColumbia UniversityNew YorkNew YorkUSA
| | - Sophie R. Shoemaker
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - Jeanine F. Amacher
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Susan Marqusee
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
- California Institute for Quantitative BiosciencesUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - John Kuriyan
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
- California Institute for Quantitative BiosciencesUniversity of CaliforniaBerkeleyCaliforniaUSA
- Howard Hughes Medical InstituteUniversity of CaliforniaBerkeleyCaliforniaUSA
| |
Collapse
|
4
|
Foley G, Mora A, Ross CM, Bottoms S, Sützl L, Lamprecht ML, Zaugg J, Essebier A, Balderson B, Newell R, Thomson RES, Kobe B, Barnard RT, Guddat L, Schenk G, Carsten J, Gumulya Y, Rost B, Haltrich D, Sieber V, Gillam EMJ, Bodén M. Engineering indel and substitution variants of diverse and ancient enzymes using Graphical Representation of Ancestral Sequence Predictions (GRASP). PLoS Comput Biol 2022; 18:e1010633. [PMID: 36279274 PMCID: PMC9632902 DOI: 10.1371/journal.pcbi.1010633] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 11/03/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022] Open
Abstract
Ancestral sequence reconstruction is a technique that is gaining widespread use in molecular evolution studies and protein engineering. Accurate reconstruction requires the ability to handle appropriately large numbers of sequences, as well as insertion and deletion (indel) events, but available approaches exhibit limitations. To address these limitations, we developed Graphical Representation of Ancestral Sequence Predictions (GRASP), which efficiently implements maximum likelihood methods to enable the inference of ancestors of families with more than 10,000 members. GRASP implements partial order graphs (POGs) to represent and infer insertion and deletion events across ancestors, enabling the identification of building blocks for protein engineering. To validate the capacity to engineer novel proteins from realistic data, we predicted ancestor sequences across three distinct enzyme families: glucose-methanol-choline (GMC) oxidoreductases, cytochromes P450, and dihydroxy/sugar acid dehydratases (DHAD). All tested ancestors demonstrated enzymatic activity. Our study demonstrates the ability of GRASP (1) to support large data sets over 10,000 sequences and (2) to employ insertions and deletions to identify building blocks for engineering biologically active ancestors, by exploring variation over evolutionary time.
Collapse
Affiliation(s)
- Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Ariane Mora
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Connie M. Ross
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Scott Bottoms
- Campus Straubing for Biotechnology and Sustainability, Technische Universität München, Straubing, Germany
| | - Leander Sützl
- Institut für Lebensmitteltechnologie, Universität für Bodenkultur Wien, Vienna, Austria
| | - Marnie L. Lamprecht
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Julian Zaugg
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Alexandra Essebier
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Brad Balderson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Rhys Newell
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Raine E. S. Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Bostjan Kobe
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Australia
| | - Ross T. Barnard
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Luke Guddat
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Gerhard Schenk
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Sustainable Minerals Institute, The University of Queensland, Brisbane, Australia
| | - Jörg Carsten
- Zentralinstitut für Katalyseforschung, Technische Universität München, Munich, Germany
| | - Yosephine Gumulya
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Burkhard Rost
- Fakultät für Informatik, Technische Universität München, Munich, Germany
| | - Dietmar Haltrich
- Institut für Lebensmitteltechnologie, Universität für Bodenkultur Wien, Vienna, Austria
| | - Volker Sieber
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Campus Straubing for Biotechnology and Sustainability, Technische Universität München, Straubing, Germany
- Zentralinstitut für Katalyseforschung, Technische Universität München, Munich, Germany
| | - Elizabeth M. J. Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- * E-mail: (MB); (EMJG)
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- * E-mail: (MB); (EMJG)
| |
Collapse
|
5
|
VanAntwerp J, Finneran P, Dolgikh B, Woldring D. Ancestral Sequence Reconstruction and Alternate Amino Acid States Guide Protein Library Design for Directed Evolution. Methods Mol Biol 2022; 2491:75-86. [PMID: 35482185 DOI: 10.1007/978-1-0716-2285-8_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Engineered proteins possess nearly limitless possibilities in medical and industrial applications but finding a precise amino acid sequence for these applications is challenging. A robust approach for discovering protein sequences with a desired functionality uses a library design method in which combinations of mutations are applied to a robust starting point. Determining useful mutations can be tortuous, yet rewarding; in this chapter, we present a novel library design method that uses information provided by ancestral sequence reconstruction (ASR) to create a library likely to have stable proteins with diverse function. ASR computational tools use a multi-sequence alignment of homologous proteins and an evolutionary model to estimate the protein sequences of the numerous common ancestors. For all ancestors, these tools calculate the probability of every amino acid occurring at each position within the sequence alignment. The alternate amino acid states at individual positions corelate to a region of stability in sequence space around the ancestral sequence which can inform site-wise diversification within a combinatorial library. The method presented in this chapter balances the quality of results, the computational resources needed, and ease of use.
Collapse
Affiliation(s)
- James VanAntwerp
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA
| | | | - Benedikt Dolgikh
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Daniel Woldring
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA.
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
6
|
Garcia AK, Fer E, Sephus C, Kacar B. An Integrated Method to Reconstruct Ancient Proteins. Methods Mol Biol 2022; 2569:267-281. [PMID: 36083453 DOI: 10.1007/978-1-0716-2691-7_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Proteins have played a fundamental role throughout life's history on Earth. Despite their biological importance, ancient origin, early function, and evolution of proteins are seldom able to be directly studied because few of these attributes are preserved across geologic timescales. Ancestral sequence reconstruction (ASR) provides a method to infer ancestral amino acid sequences and determine the evolutionary predecessors of modern-day proteins using phylogenetic tools. Laboratory application of ASR allows ancient sequences to be deduced from genetic information available in extant organisms and then experimentally resurrected to elucidate ancestral characteristics. In this article, we provide a generalized, stepwise protocol that considers the major elements of a well-designed ASR study and details potential sources of reconstruction bias that can reduce the relevance of historical inferences. We underscore key stages in our approach so that it may be broadly utilized to reconstruct the evolutionary histories of proteins.
Collapse
Affiliation(s)
- Amanda K Garcia
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| | - Evrim Fer
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Microbiology Doctoral Training Program, University of Wisconsin-Madison, Madison, WI, USA
| | - Cathryn Sephus
- Scripps Institution of Oceanography, University of California at San Diego, La Jolla, CA, USA
| | - Betul Kacar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
7
|
Pinto GP, Corbella M, Demkiv AO, Kamerlin SCL. Exploiting enzyme evolution for computational protein design. Trends Biochem Sci 2021; 47:375-389. [PMID: 34544655 DOI: 10.1016/j.tibs.2021.08.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 11/15/2022]
Abstract
Recent years have seen an explosion of interest in understanding the physicochemical parameters that shape enzyme evolution, as well as substantial advances in computational enzyme design. This review discusses three areas where evolutionary information can be used as part of the design process: (i) using ancestral sequence reconstruction (ASR) to generate new starting points for enzyme design efforts; (ii) learning from how nature uses conformational dynamics in enzyme evolution to mimic this process in silico; and (iii) modular design of enzymes from smaller fragments, again mimicking the process by which nature appears to create new protein folds. Using showcase examples, we highlight the importance of incorporating evolutionary information to continue to push forward the boundaries of enzyme design studies.
Collapse
Affiliation(s)
- Gaspar P Pinto
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Andrey O Demkiv
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | | |
Collapse
|
8
|
Carletti MS, Monzon AM, Garcia-Rios E, Benitez G, Hirsh L, Fornasari MS, Parisi G. Revenant: a database of resurrected proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2020:5828294. [PMID: 32400867 PMCID: PMC7218706 DOI: 10.1093/database/baaa031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 03/06/2020] [Accepted: 03/31/2020] [Indexed: 11/29/2022]
Abstract
Revenant is a database of resurrected proteins coming from extinct organisms. Currently, it contains a manually curated collection of 84 resurrected proteins derived from bibliographic data. Each protein is extensively annotated, including structural, biochemical and biophysical information. Revenant contains a browse capability designed as a timeline from where the different proteins can be accessed. The oldest Revenant entries are between 4200 and 3500 million years ago, while the younger entries are between 8.8 and 6.3 million years ago. These proteins have been resurrected using computational tools called ancestral sequence reconstruction techniques combined with wet-laboratory synthesis and expression. Resurrected proteins are commonly used, with a noticeable increase during the past years, to explore and test different evolutionary hypotheses such as protein stability, to explore the origin of new functions, to get biochemical insights into past metabolisms and to explore specificity and promiscuous behaviour of ancient proteins.
Collapse
Affiliation(s)
- Matias Sebastian Carletti
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina.,Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, Padova, I-35131, Padova, Italy
| | - Emilio Garcia-Rios
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Perú
| | - Guillermo Benitez
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Layla Hirsh
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Perú
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Saenz Peña 182, Bernal, B1876BXD, Buenos Aires, Argentina
| |
Collapse
|
9
|
Copley SD. Setting the stage for evolution of a new enzyme. Curr Opin Struct Biol 2021; 69:41-49. [PMID: 33865035 DOI: 10.1016/j.sbi.2021.03.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Revised: 02/19/2021] [Accepted: 03/03/2021] [Indexed: 12/18/2022]
Abstract
The evolution of novel enzymes has fueled the diversification of life on earth for billions of years. Insights into events that set the stage for the evolution of a new enzyme can be obtained from ancestral reconstruction and laboratory evolution. Ancestral reconstruction can reveal the emergence of a promiscuous activity in a pre-existing protein and the impact of subsequent mutations that enhance a new activity. Laboratory evolution provides a more holistic view by revealing mutations elsewhere in the genome that indirectly enhance the level of a newly important enzymatic activity. This review will highlight recent studies that probe the early stages of the evolution of a new enzyme from these complementary points of view.
Collapse
Affiliation(s)
- Shelley D Copley
- Department of Molecular, Cellular and Developmental Biology, The Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO, 80309, USA.
| |
Collapse
|
10
|
Selberg AGA, Gaucher EA, Liberles DA. Ancestral Sequence Reconstruction: From Chemical Paleogenetics to Maximum Likelihood Algorithms and Beyond. J Mol Evol 2021; 89:157-164. [PMID: 33486547 PMCID: PMC7828096 DOI: 10.1007/s00239-021-09993-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Accepted: 01/04/2021] [Indexed: 12/13/2022]
Abstract
As both a computational and an experimental endeavor, ancestral sequence reconstruction remains a timely and important technique. Modern approaches to conduct ancestral sequence reconstruction for proteins are built upon a conceptual framework from journal founder Emile Zuckerkandl. On top of this, work on maximum likelihood phylogenetics published in Journal of Molecular Evolution in 1996 was one of the first approaches for generating maximum likelihood ancestral sequences of proteins. From its computational history, future model development needs as well as potential applications in areas as diverse as computational systems biology, molecular community ecology, infectious disease therapeutics and other biomedical applications, and biotechnology are discussed. From its past in this journal, there is a bright future for ancestral sequence reconstruction in the field of evolutionary biology.
Collapse
Affiliation(s)
- Avery G A Selberg
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Eric A Gaucher
- Department of Biology, Georgia State University, Atlanta, GA, 30303, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA.
| |
Collapse
|
11
|
Garcia AK, Kaçar B. How to resurrect ancestral proteins as proxies for ancient biogeochemistry. Free Radic Biol Med 2019; 140:260-269. [PMID: 30951835 DOI: 10.1016/j.freeradbiomed.2019.03.033] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 02/11/2019] [Accepted: 03/26/2019] [Indexed: 10/27/2022]
Abstract
Throughout the history of life, enzymes have served as the primary molecular mediators of biogeochemical cycles by catalyzing the metabolic pathways that interact with geochemical substrates. The byproducts of enzymatic activities have been preserved as chemical and isotopic signatures in the geologic record. However, interpretations of these signatures are limited by the assumption that such enzymes have remained functionally conserved over billions of years of molecular evolution. By reconstructing ancient genetic sequences in conjunction with laboratory enzyme resurrection, preserved biogeochemical signatures can instead be related to experimentally constrained, ancestral enzymatic properties. We may thereby investigate instances within molecular evolutionary trajectories potentially tied to significant biogeochemical transitions evidenced in the geologic record. Here, we survey recent enzyme resurrection studies to provide a reasoned assessment of areas of success and common pitfalls relevant to ancient biogeochemical applications. We conclude by considering the Great Oxidation Event, which provides a constructive example of a significant biogeochemical transition that warrants investigation with ancestral enzyme resurrection. This event also serves to highlight the pitfalls of facile interpretation of paleophenotype models and data, as applied to two examples of enzymes that likely both influenced and were influenced by the rise of atmospheric oxygen - RuBisCO and nitrogenase.
Collapse
Affiliation(s)
- Amanda K Garcia
- Department of Molecular and Cell Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Betül Kaçar
- Department of Molecular and Cell Biology, University of Arizona, Tucson, AZ, 85721, USA; Department of Astronomy and Steward Observatory, University of Arizona, Tucson, AZ, 85721, USA.
| |
Collapse
|
12
|
Savory FR, Milner DS, Miles DC, Richards TA. Ancestral Function and Diversification of a Horizontally Acquired Oomycete Carboxylic Acid Transporter. Mol Biol Evol 2019; 35:1887-1900. [PMID: 29701800 PMCID: PMC6063262 DOI: 10.1093/molbev/msy082] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Horizontal gene transfer (HGT) can equip organisms with novel genes, expanding the repertoire of genetic material available for evolutionary innovation and allowing recipient lineages to colonize new environments. However, few studies have characterized the functions of HGT genes experimentally or examined postacquisition functional divergence. Here, we report the use of ancestral sequence reconstruction and heterologous expression in Saccharomyces cerevisiae to examine the evolutionary history of an oomycete transporter gene family that was horizontally acquired from fungi. We demonstrate that the inferred ancestral oomycete HGT transporter proteins and their extant descendants transport dicarboxylic acids which are intermediates of the tricarboxylic acid cycle. The substrate specificity profile of the most ancestral protein has largely been retained throughout the radiation of oomycetes, including in both plant and animal pathogens and in a free-living saprotroph, indicating that the ancestral HGT transporter function has been maintained by selection across a range of different lifestyles. No evidence of neofunctionalization in terms of substrate specificity was detected for different HGT transporter paralogues which have different patterns of temporal expression. However, a striking expansion of substrate range was observed for one plant pathogenic oomycete, with a HGT derived paralogue from Pythium aphanidermatum encoding a protein that enables tricarboxylic acid uptake in addition to dicarboxylic acid uptake. This demonstrates that HGT acquisitions can provide functional additions to the recipient proteome as well as the foundation material for the evolution of expanded protein functions.
Collapse
Affiliation(s)
- Fiona R Savory
- Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
| | - David S Milner
- Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
| | - Daniel C Miles
- Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
| | - Thomas A Richards
- Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
| |
Collapse
|
13
|
Straub K, Linde M, Kropp C, Blanquart S, Babinger P, Merkl R. Sequence selection by FitSS4ASR alleviates ancestral sequence reconstruction as exemplified for geranylgeranylglyceryl phosphate synthase. Biol Chem 2019; 400:367-381. [PMID: 30763032 DOI: 10.1515/hsz-2018-0344] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Accepted: 12/07/2018] [Indexed: 11/15/2022]
Abstract
For evolutionary studies, but also for protein engineering, ancestral sequence reconstruction (ASR) has become an indispensable tool. The first step of every ASR protocol is the preparation of a representative sequence set containing at most a few hundred recent homologs whose composition determines decisively the outcome of a reconstruction. A common approach for sequence selection consists of several rounds of manual recompilation that is driven by embedded phylogenetic analyses of the varied sequence sets. For ASR of a geranylgeranylglyceryl phosphate synthase, we additionally utilized FitSS4ASR, which replaces this time-consuming protocol with an efficient and more rational approach. FitSS4ASR applies orthogonal filters to a set of homologs to eliminate outlier sequences and those bearing only a weak phylogenetic signal. To demonstrate the usefulness of FitSS4ASR, we determined experimentally the oligomerization state of eight predecessors, which is a delicate and taxon-specific property. Corresponding ancestors deduced in a manual approach and by means of FitSS4ASR had the same dimeric or hexameric conformation; this concordance testifies to the efficiency of FitSS4ASR for sequence selection. FitSS4ASR-based results of two other ASR experiments were added to the Supporting Information. Program and documentation are available at https://gitlab.bioinf.ur.de/hek61586/FitSS4ASR.
Collapse
Affiliation(s)
- Kristina Straub
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93040 Regensburg, Germany
| | - Mona Linde
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93040 Regensburg, Germany
| | - Cosimo Kropp
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93040 Regensburg, Germany
| | - Samuel Blanquart
- University of Rennes, Inria, CNRS, IRISA, F-35000 Rennes, France
| | - Patrick Babinger
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93040 Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstraße 31, D-93040 Regensburg, Germany
| |
Collapse
|
14
|
Straub K, Merkl R. Ancestral Sequence Reconstruction as a Tool for the Elucidation of a Stepwise Evolutionary Adaptation. Methods Mol Biol 2019; 1851:171-182. [PMID: 30298397 DOI: 10.1007/978-1-4939-8736-8_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Ancestral sequence reconstruction (ASR) is a powerful tool to infer primordial sequences from contemporary, i.e., extant ones. An essential element of ASR is the computation of a phylogenetic tree whose leaves are the chosen extant sequences. Most often, the reconstructed sequence related to the root of this tree is of greatest interest: It represents the common ancestor (CA) of the sequences under study. If this sequence encodes a protein, one can "resurrect" the CA by means of gene synthesis technology and study biochemical properties of this extinct predecessor with the help of wet-lab experiments.However, ASR deduces also sequences for all internal nodes of the tree, and the well-considered analysis of these "intermediates" can help to elucidate evolutionary processes. Moreover, one can identify key mutations that alter proteins or protein complexes and are responsible for the differing properties of extant proteins. As an illustrative example, we describe the protocol for the rapid identification of hotspots determining the binding of the two subunits within the heteromeric complex imidazole glycerol phosphate synthase.
Collapse
Affiliation(s)
- Kristina Straub
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany.
| |
Collapse
|
15
|
Tanaka S, Schweizer G, Rössel N, Fukada F, Thines M, Kahmann R. Neofunctionalization of the secreted Tin2 effector in the fungal pathogen Ustilago maydis. Nat Microbiol 2018; 4:251-257. [PMID: 30510169 DOI: 10.1038/s41564-018-0304-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Accepted: 10/24/2018] [Indexed: 01/23/2023]
Abstract
Plant-pathogenic fungi hijack their hosts by secreting effector proteins. Effectors serve to suppress plant immune responses and modulate the host metabolism to benefit the pathogen. Smut fungi are biotrophic pathogens that also parasitize important cereals, including maize1. Symptom development is usually restricted to the plant inflorescences. Ustilago maydis is an exception in its ability to cause tumours in both inflorescences and leaves of maize, and in inducing anthocyanin biosynthesis through the secreted Tin2 effector2,3. How the unique lifestyle of U. maydis has evolved remains to be elucidated. Here we show that Tin2 in U. maydis has been neofunctionalized. We functionally compared Tin2 effectors of U. maydis and the related smut Sporisorium reilianum, which results in symptoms only in the inflorescences of maize and fails to induce anthocyanin. We show that Tin2 effectors from both fungi target distinct paralogues of a maize protein kinase, leading to stabilization and inhibition, respectively. An ancestral Tin2 effector functionally replaced the virulence function of S. reilianum Tin2 but failed to induce anthocyanin, and was unable to substitute for Tin2 in U. maydis. This shows that Tin2 in U. maydis has acquired a specialized function, probably connected to the distinct pathogenic lifestyle of this fungus.
Collapse
Affiliation(s)
- Shigeyuki Tanaka
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Gabriel Schweizer
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany.,University of Zurich, Department of Evolutionary Biology and Environmental Studies, Zurich, Switzerland
| | - Nicole Rössel
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Fumi Fukada
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Marco Thines
- Department of Biological Sciences, Institute of Ecology, Evolution and Diversity, Goethe University, Frankfurt am Main, Germany.,Senckenberg Biodiversity and Climate Research Center, Frankfurt am Main, Germany
| | - Regine Kahmann
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany.
| |
Collapse
|
16
|
Kaltenbach M, Burke JR, Dindo M, Pabis A, Munsberg FS, Rabin A, Kamerlin SCL, Noel JP, Tawfik DS. Evolution of chalcone isomerase from a noncatalytic ancestor. Nat Chem Biol 2018; 14:548-555. [PMID: 29686356 DOI: 10.1038/s41589-018-0042-3] [Citation(s) in RCA: 110] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 03/01/2018] [Indexed: 11/09/2022]
Abstract
The emergence of catalysis in a noncatalytic protein scaffold is a rare, unexplored event. Chalcone isomerase (CHI), a key enzyme in plant flavonoid biosynthesis, is presumed to have evolved from a nonenzymatic ancestor related to the widely distributed fatty-acid binding proteins (FAPs) and a plant protein family with no isomerase activity (CHILs). Ancestral inference supported the evolution of CHI from a protein lacking isomerase activity. Further, we identified four alternative founder mutations, i.e., mutations that individually instated activity, including a mutation that is not phylogenetically traceable. Despite strong epistasis in other cases of protein evolution, CHI's laboratory reconstructed mutational trajectory shows weak epistasis. Thus, enantioselective CHI activity could readily emerge despite a catalytically inactive starting point. Accordingly, X-ray crystallography, NMR, and molecular dynamics simulations reveal reshaping of the active site toward a productive substrate-binding mode and repositioning of the catalytic arginine that was inherited from the ancestral fatty-acid binding proteins.
Collapse
Affiliation(s)
- Miriam Kaltenbach
- Department of Biological Chemistry, The Weizmann Institute of Science, Rehovot, Israel
| | - Jason R Burke
- Howard Hughes Medical Institute, Jack H. Skirball Center for Chemical Biology and Proteomics, The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Mirco Dindo
- Department of Biological Chemistry, The Weizmann Institute of Science, Rehovot, Israel.,Department of Neuroscience, Biomedicine and Movement Sciences, Biological Chemistry Section, University of Verona, Verona, Italy
| | - Anna Pabis
- Uppsala Biomedicinsk Centrum, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Fabian S Munsberg
- Uppsala Biomedicinsk Centrum, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Avigayel Rabin
- Department of Biological Chemistry, The Weizmann Institute of Science, Rehovot, Israel.,Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Jerusalem, Israel
| | - Shina C L Kamerlin
- Uppsala Biomedicinsk Centrum, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Joseph P Noel
- Howard Hughes Medical Institute, Jack H. Skirball Center for Chemical Biology and Proteomics, The Salk Institute for Biological Studies, La Jolla, CA, USA.
| | - Dan S Tawfik
- Department of Biological Chemistry, The Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
17
|
Sunden F, AlSadhan I, Lyubimov A, Doukov T, Swan J, Herschlag D. Differential catalytic promiscuity of the alkaline phosphatase superfamily bimetallo core reveals mechanistic features underlying enzyme evolution. J Biol Chem 2017; 292:20960-20974. [PMID: 29070681 DOI: 10.1074/jbc.m117.788240] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Revised: 10/19/2017] [Indexed: 11/06/2022] Open
Abstract
Members of enzyme superfamilies specialize in different reactions but often exhibit catalytic promiscuity for one another's reactions, consistent with catalytic promiscuity as an important driver in the evolution of new enzymes. Wanting to understand how catalytic promiscuity and other factors may influence evolution across a superfamily, we turned to the well-studied alkaline phosphatase (AP) superfamily, comparing three of its members, two evolutionarily distinct phosphatases and a phosphodiesterase. We mutated distinguishing active-site residues to generate enzymes that had a common Zn2+ bimetallo core but little sequence similarity and different auxiliary domains. We then tested the catalytic capabilities of these pruned enzymes with a series of substrates. A substantial rate enhancement of ∼1011-fold for both phosphate mono- and diester hydrolysis by each enzyme indicated that the Zn2+ bimetallo core is an effective mono/di-esterase generalist and that the bimetallo cores were not evolutionarily tuned to prefer their cognate reactions. In contrast, our pruned enzymes were ineffective sulfatases, and this limited promiscuity may have provided a driving force for founding the distinct one-metal-ion branch that contains all known AP superfamily sulfatases. Finally, our pruned enzymes exhibited 107-108-fold phosphotriesterase rate enhancements, despite absence of such enzymes within the AP superfamily. We speculate that the superfamily active-site architecture involved in nucleophile positioning prevents accommodation of the additional triester substituent. Overall, we suggest that catalytic promiscuity, and the ease or difficulty of remodeling and building onto existing protein scaffolds, have greatly influenced the course of enzyme evolution. Uncovering principles and properties of enzyme function, promiscuity, and repurposing provides lessons for engineering new enzymes.
Collapse
Affiliation(s)
- Fanny Sunden
- From the Department of Biochemistry, Beckman Center
| | | | - Artem Lyubimov
- the Departments of Molecular and Cellular Physiology.,Neurology and Neurological Science.,Structural Biology, and.,Photon Science.,Howard Hughes Medical Institute
| | - Tzanko Doukov
- the Macromolecular Crystallographic Group, Stanford Synchrotron Radiation Lightsource, National Accelerator Laboratory, Stanford University, Stanford, California 94309
| | - Jeffrey Swan
- From the Department of Biochemistry, Beckman Center
| | - Daniel Herschlag
- From the Department of Biochemistry, Beckman Center, .,the Departments of Chemical Engineering and Chemistry, and.,Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health), Stanford University, Stanford, California 94305 and
| |
Collapse
|
18
|
Castro-Fernandez V, Herrera-Morande A, Zamora R, Merino F, Gonzalez-Ordenes F, Padilla-Salinas F, Pereira HM, Brandão-Neto J, Garratt RC, Guixe V. Reconstructed ancestral enzymes reveal that negative selection drove the evolution of substrate specificity in ADP-dependent kinases. J Biol Chem 2017; 292:15598-15610. [PMID: 28726643 DOI: 10.1074/jbc.m117.790865] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Revised: 07/05/2017] [Indexed: 11/06/2022] Open
Abstract
One central goal in molecular evolution is to pinpoint the mechanisms and evolutionary forces that cause an enzyme to change its substrate specificity; however, these processes remain largely unexplored. Using the glycolytic ADP-dependent kinases of archaea, including the orders Thermococcales, Methanosarcinales, and Methanococcales, as a model and employing an approach involving paleoenzymology, evolutionary statistics, and protein structural analysis, we could track changes in substrate specificity during ADP-dependent kinase evolution along with the structural determinants of these changes. To do so, we studied five key resurrected ancestral enzymes as well as their extant counterparts. We found that a major shift in function from a bifunctional ancestor that could phosphorylate either glucose or fructose 6-phosphate (fructose-6-P) as a substrate to a fructose 6-P-specific enzyme was started by a single amino acid substitution resulting in negative selection with a ground-state mode against glucose and a subsequent 1,600-fold change in specificity of the ancestral protein. This change rendered the residual phosphorylation of glucose a promiscuous and physiologically irrelevant activity, highlighting how promiscuity may be an evolutionary vestige of ancestral enzyme activities, which have been eliminated over time. We also could reconstruct the evolutionary history of substrate utilization by using an evolutionary model of discrete binary characters, indicating that substrate uses can be discretely lost or acquired during enzyme evolution. These findings exemplify how negative selection and subtle enzyme changes can lead to major evolutionary shifts in function, which can subsequently generate important adaptive advantages, for example, in improving glycolytic efficiency in Thermococcales.
Collapse
Affiliation(s)
- Víctor Castro-Fernandez
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile,
| | - Alejandra Herrera-Morande
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile
| | - Ricardo Zamora
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile
| | - Felipe Merino
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile
| | - Felipe Gonzalez-Ordenes
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile
| | - Felipe Padilla-Salinas
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile
| | - Humberto M Pereira
- the São Carlos Institute of Physics, University of São Paulo, São Carlos, São Paulo 13563-120, Brazil, and
| | - Jose Brandão-Neto
- the Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DF, United Kingdom
| | - Richard C Garratt
- the São Carlos Institute of Physics, University of São Paulo, São Carlos, São Paulo 13563-120, Brazil, and
| | - Victoria Guixe
- From the Departamento de Biología, Facultad de Ciencias, Universidad de Chile, Santiago 800003, Chile,
| |
Collapse
|
19
|
Exploring the past and the future of protein evolution with ancestral sequence reconstruction: the 'retro' approach to protein engineering. Biochem J 2017; 474:1-19. [PMID: 28008088 DOI: 10.1042/bcj20160507] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 11/07/2016] [Accepted: 11/10/2016] [Indexed: 12/22/2022]
Abstract
A central goal in molecular evolution is to understand the ways in which genes and proteins evolve in response to changing environments. In the absence of intact DNA from fossils, ancestral sequence reconstruction (ASR) can be used to infer the evolutionary precursors of extant proteins. To date, ancestral proteins belonging to eubacteria, archaea, yeast and vertebrates have been inferred that have been hypothesized to date from between several million to over 3 billion years ago. ASR has yielded insights into the early history of life on Earth and the evolution of proteins and macromolecular complexes. Recently, however, ASR has developed from a tool for testing hypotheses about protein evolution to a useful means for designing novel proteins. The strength of this approach lies in the ability to infer ancestral sequences encoding proteins that have desirable properties compared with contemporary forms, particularly thermostability and broad substrate range, making them good starting points for laboratory evolution. Developments in technologies for DNA sequencing and synthesis and computational phylogenetic analysis have led to an escalation in the number of ancient proteins resurrected in the last decade and greatly facilitated the use of ASR in the burgeoning field of synthetic biology. However, the primary challenge of ASR remains in accurately inferring ancestral states, despite the uncertainty arising from evolutionary models, incomplete sequences and limited phylogenetic trees. This review will focus, firstly, on the use of ASR to uncover links between sequence and phenotype and, secondly, on the practical application of ASR in protein engineering.
Collapse
|
20
|
Kacar B, Ge X, Sanyal S, Gaucher EA. Experimental Evolution of Escherichia coli Harboring an Ancient Translation Protein. J Mol Evol 2017; 84:69-84. [PMID: 28233029 PMCID: PMC5371648 DOI: 10.1007/s00239-017-9781-0] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Accepted: 01/30/2017] [Indexed: 01/20/2023]
Abstract
The ability to design synthetic genes and engineer biological systems at the genome scale opens new means by which to characterize phenotypic states and the responses of biological systems to perturbations. One emerging method involves inserting artificial genes into bacterial genomes and examining how the genome and its new genes adapt to each other. Here we report the development and implementation of a modified approach to this method, in which phylogenetically inferred genes are inserted into a microbial genome, and laboratory evolution is then used to examine the adaptive potential of the resulting hybrid genome. Specifically, we engineered an approximately 700-million-year-old inferred ancestral variant of tufB, an essential gene encoding elongation factor Tu, and inserted it in a modern Escherichia coli genome in place of the native tufB gene. While the ancient homolog was not lethal to the cell, it did cause a twofold decrease in organismal fitness, mainly due to reduced protein dosage. We subsequently evolved replicate hybrid bacterial populations for 2000 generations in the laboratory and examined the adaptive response via fitness assays, whole genome sequencing, proteomics, and biochemical assays. Hybrid lineages exhibit a general adaptive strategy in which the fitness cost of the ancient gene was ameliorated in part by upregulation of protein production. Our results suggest that an ancient-modern recombinant method may pave the way for the synthesis of organisms that exhibit ancient phenotypes, and that laboratory evolution of these organisms may prove useful in elucidating insights into historical adaptive processes.
Collapse
Affiliation(s)
- Betül Kacar
- NASA Astrobiology Institute, Mountain View, CA, 94035, USA.
- Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA, 02138, USA.
| | - Xueliang Ge
- Department of Cell and Molecular Biology, Uppsala University, BMC, Box-596, 75124, Uppsala, Sweden
| | - Suparna Sanyal
- Department of Cell and Molecular Biology, Uppsala University, BMC, Box-596, 75124, Uppsala, Sweden
| | - Eric A Gaucher
- School of Biology, Georgia Institute of Technology, 950 Atlantic Drive, Atlanta, GA, 30332, USA
- Petit H. Parker Institute for Bioengineering and Bioscience, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| |
Collapse
|
21
|
Eick GN, Bridgham JT, Anderson DP, Harms MJ, Thornton JW. Robustness of Reconstructed Ancestral Protein Functions to Statistical Uncertainty. Mol Biol Evol 2017; 34:247-261. [PMID: 27795231 PMCID: PMC6095102 DOI: 10.1093/molbev/msw223] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Hypotheses about the functions of ancient proteins and the effects of historical mutations on them are often tested using ancestral protein reconstruction (APR)-phylogenetic inference of ancestral sequences followed by synthesis and experimental characterization. Usually, some sequence sites are ambiguously reconstructed, with two or more statistically plausible states. The extent to which the inferred functions and mutational effects are robust to uncertainty about the ancestral sequence has not been studied systematically. To address this issue, we reconstructed ancestral proteins in three domain families that have different functions, architectures, and degrees of uncertainty; we then experimentally characterized the functional robustness of these proteins when uncertainty was incorporated using several approaches, including sampling amino acid states from the posterior distribution at each site and incorporating the alternative amino acid state at every ambiguous site in the sequence into a single "worst plausible case" protein. In every case, qualitative conclusions about the ancestral proteins' functions and the effects of key historical mutations were robust to sequence uncertainty, with similar functions observed even when scores of alternate amino acids were incorporated. There was some variation in quantitative descriptors of function among plausible sequences, suggesting that experimentally characterizing robustness is particularly important when quantitative estimates of ancient biochemical parameters are desired. The worst plausible case method appears to provide an efficient strategy for characterizing the functional robustness of ancestral proteins to large amounts of sequence uncertainty. Sampling from the posterior distribution sometimes produced artifactually nonfunctional proteins for sequences reconstructed with substantial ambiguity.
Collapse
Affiliation(s)
- Geeta N Eick
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Department of Anthropology, University of Oregon, Eugene, OR
| | - Jamie T Bridgham
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
| | - Douglas P Anderson
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Institute of Molecular Biology, University of Oregon, Eugene, OR
| | - Michael J Harms
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Institute of Molecular Biology, University of Oregon, Eugene, OR
| | - Joseph W Thornton
- Department of Ecology & Evolution and Department of Human Genetics, University of Chicago, Chicago, IL
| |
Collapse
|
22
|
Holinski A, Heyn K, Merkl R, Sterner R. Combining ancestral sequence reconstruction with protein design to identify an interface hotspot in a key metabolic enzyme complex. Proteins 2017; 85:312-321. [PMID: 27936490 DOI: 10.1002/prot.25225] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Revised: 11/08/2016] [Accepted: 11/21/2016] [Indexed: 01/20/2023]
Abstract
It is important to identify hotspot residues that determine protein-protein interactions in interfaces of macromolecular complexes. We have applied a combination of ancestral sequence reconstruction and protein design to identify hotspots within imidazole glycerol phosphate synthase (ImGPS). ImGPS is a key metabolic enzyme complex, which links histidine and de novo purine biosynthesis and consists of the cyclase subunit HisF and the glutaminase subunit HisH. Initial fluorescence titration experiments showed that HisH from Zymomonas mobilis (zmHisH) binds with high affinity to the reconstructed HisF from the last universal common ancestor (LUCA-HisF) but not to HisF from Pyrobaculum arsenaticum (paHisF), which differ by 103 residues. Subsequent titration experiments with a reconstructed evolutionary intermediate linking LUCA-HisF and paHisF and inspection of the subunit interface of a contemporary ImGPS allowed us to narrow down the differences crucial for zmHisH binding to nine amino acids of HisF. Homology modeling and in silico mutagenesis studies suggested that at most two of these nine HisF residues are crucial for zmHisH binding. These computational results were verified by experimental site-directed mutagenesis, which finally enabled us to pinpoint a single amino acid residue in HisF that is decisive for high-affinity binding of zmHisH. Our work shows that the identification of protein interface hotspots can be very efficient when reconstructed proteins with different binding properties are included in the analysis. Proteins 2017; 85:312-321. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Alexandra Holinski
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, D-93040, Germany
| | - Kristina Heyn
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, D-93040, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, D-93040, Germany
| | - Reinhard Sterner
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, D-93040, Germany
| |
Collapse
|
23
|
Evolutionary trend toward kinetic stability in the folding trajectory of RNases H. Proc Natl Acad Sci U S A 2016; 113:13045-13050. [PMID: 27799545 DOI: 10.1073/pnas.1611781113] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Proper folding of proteins is critical to producing the biological machinery essential for cellular function. The rates and energetics of a protein's folding process, which is described by its energy landscape, are encoded in the amino acid sequence. Over the course of evolution, this landscape must be maintained such that the protein folds and remains folded over a biologically relevant time scale. How exactly a protein's energy landscape is maintained or altered throughout evolution is unclear. To study how a protein's energy landscape changed over time, we characterized the folding trajectories of ancestral proteins of the ribonuclease H (RNase H) family using ancestral sequence reconstruction to access the evolutionary history between RNases H from mesophilic and thermophilic bacteria. We found that despite large sequence divergence, the overall folding pathway is conserved over billions of years of evolution. There are robust trends in the rates of protein folding and unfolding; both modern RNases H evolved to be more kinetically stable than their most recent common ancestor. Finally, our study demonstrates how a partially folded intermediate provides a readily adaptable folding landscape by allowing the independent tuning of kinetics and thermodynamics.
Collapse
|
24
|
Randall RN, Radford CE, Roof KA, Natarajan DK, Gaucher EA. An experimental phylogeny to benchmark ancestral sequence reconstruction. Nat Commun 2016; 7:12847. [PMID: 27628687 PMCID: PMC5027606 DOI: 10.1038/ncomms12847] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 08/05/2016] [Indexed: 12/15/2022] Open
Abstract
Ancestral sequence reconstruction (ASR) is a still-burgeoning method that has revealed many key mechanisms of molecular evolution. One criticism of the approach is an inability to validate its algorithms within a biological context as opposed to a computer simulation. Here we build an experimental phylogeny using the gene of a single red fluorescent protein to address this criticism. The evolved phylogeny consists of 19 operational taxonomic units (leaves) and 17 ancestral bifurcations (nodes) that display a wide variety of fluorescent phenotypes. The 19 leaves then serve as 'modern' sequences that we subject to ASR analyses using various algorithms and to benchmark against the known ancestral genotypes and ancestral phenotypes. We confirm computer simulations that show all algorithms infer ancient sequences with high accuracy, yet we also reveal wide variation in the phenotypes encoded by incorrectly inferred sequences. Specifically, Bayesian methods incorporating rate variation significantly outperform the maximum parsimony criterion in phenotypic accuracy. Subsampling of extant sequences had minor effect on the inference of ancestral sequences.
Collapse
Affiliation(s)
- Ryan N. Randall
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Caelan E. Radford
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Kelsey A. Roof
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Divya K. Natarajan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Eric A. Gaucher
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
- Institute for Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| |
Collapse
|
25
|
Trudeau DL, Kaltenbach M, Tawfik DS. On the Potential Origins of the High Stability of Reconstructed Ancestral Proteins. Mol Biol Evol 2016; 33:2633-41. [PMID: 27413048 DOI: 10.1093/molbev/msw138] [Citation(s) in RCA: 96] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Ancestral reconstruction provides instrumental insights regarding the biochemical and biophysical characteristics of past proteins. A striking observation relates to the remarkably high thermostability of reconstructed ancestors. The latter has been linked to high environmental temperatures in the Precambrian era, the era relating to most reconstructed proteins. We found that inferred ancestors of the serum paraoxonase (PON) enzyme family, including the mammalian ancestor, exhibit dramatically increased thermostabilities compared with the extant, human enzyme (up to 30 °C higher melting temperature). However, the environmental temperature at the time of emergence of mammals is presumed to be similar to the present one. Additionally, the mammalian PON ancestor has superior folding properties (kinetic stability)-unlike the extant mammalian PONs, it expresses in E. coli in a soluble and functional form, and at a high yield. We discuss two potential origins of this unexpectedly high stability. First, ancestral stability may be overestimated by a "consensus effect," whereby replacing amino acids that are rare in contemporary sequences with the amino acid most common in the family increases protein stability. Comparison to other reconstructed ancestors indicates that the consensus effect may bias some but not all reconstructions. Second, we note that high stability may relate to factors other than high environmental temperature such as oxidative stress or high radiation levels. Foremost, intrinsic factors such as high rates of genetic mutations and/or of transcriptional and translational errors, and less efficient protein quality control systems, may underlie the high kinetic and thermodynamic stability of past proteins.
Collapse
Affiliation(s)
- Devin L Trudeau
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Miriam Kaltenbach
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
26
|
Wheeler LC, Lim SA, Marqusee S, Harms MJ. The thermostability and specificity of ancient proteins. Curr Opin Struct Biol 2016; 38:37-43. [PMID: 27288744 DOI: 10.1016/j.sbi.2016.05.015] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Revised: 05/18/2016] [Accepted: 05/24/2016] [Indexed: 11/16/2022]
Abstract
Were ancient proteins systematically different than modern proteins? The answer to this question is profoundly important, shaping how we understand the origins of protein biochemical, biophysical, and functional properties. Ancestral sequence reconstruction (ASR), a phylogenetic approach to infer the sequences of ancestral proteins, may reveal such trends. We discuss two proposed trends: a transition from higher to lower thermostability and a tendency for proteins to acquire higher specificity over time. We review the evidence for elevated ancestral thermostability and discuss its possible origins in a changing environmental temperature and/or reconstruction bias. We also conclude that there is, as yet, insufficient data to support a trend from promiscuity to specificity. Finally, we propose future work to understand these proposed evolutionary trends.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States.
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States.
| |
Collapse
|
27
|
Busch F, Rajendran C, Heyn K, Schlee S, Merkl R, Sterner R. Ancestral Tryptophan Synthase Reveals Functional Sophistication of Primordial Enzyme Complexes. Cell Chem Biol 2016; 23:709-15. [DOI: 10.1016/j.chembiol.2016.05.009] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Revised: 04/26/2016] [Accepted: 05/05/2016] [Indexed: 11/29/2022]
|
28
|
Smock RG, Yadid I, Dym O, Clarke J, Tawfik DS. De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints. Cell 2016; 164:476-86. [PMID: 26806127 PMCID: PMC4735018 DOI: 10.1016/j.cell.2015.12.024] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2015] [Revised: 10/05/2015] [Accepted: 12/07/2015] [Indexed: 01/02/2023]
Abstract
Molecular evolution has focused on the divergence of molecular functions, yet we know little about how structurally distinct protein folds emerge de novo. We characterized the evolutionary trajectories and selection forces underlying emergence of β-propeller proteins, a globular and symmetric fold group with diverse functions. The identification of short propeller-like motifs (<50 amino acids) in natural genomes indicated that they expanded via tandem duplications to form extant propellers. We phylogenetically reconstructed 47-residue ancestral motifs that form five-bladed lectin propellers via oligomeric assembly. We demonstrate a functional trajectory of tandem duplications of these motifs leading to monomeric lectins. Foldability, i.e., higher efficiency of folding, was the main parameter leading to improved functionality along the entire evolutionary trajectory. However, folding constraints changed along the trajectory: initially, conflicts between monomer folding and oligomer assembly dominated, whereas subsequently, upon tandem duplication, tradeoffs between monomer stability and foldability took precedence. Inferred 47-aminoacid ancestral motifs fold into functional β-propeller assemblies Motif duplication, fusion, and diversification yield functional monomeric propellers Folding efficiency was the key parameter optimized throughout propeller emergence Single-motif precursors in extant genomes support the reconstructed emergence pathway
Collapse
Affiliation(s)
- Robert G Smock
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Itamar Yadid
- Metabolic Pathways and Enzyme Evolution Laboratory, Migal Galilee Research Institute, Kiryat Shmona 11016, Israel
| | - Orly Dym
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Jane Clarke
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK
| | - Dan S Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|