1
|
Prakinee K, Phaisan S, Kongjaroon S, Chaiyen P. Ancestral Sequence Reconstruction for Designing Biocatalysts and Investigating their Functional Mechanisms. JACS AU 2024; 4:4571-4591. [PMID: 39735918 PMCID: PMC11672134 DOI: 10.1021/jacsau.4c00653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 10/09/2024] [Accepted: 10/09/2024] [Indexed: 12/31/2024]
Abstract
Biocatalysis has emerged as a green approach for efficient and sustainable production in various industries. In recent decades, numerous advancements in computational and predictive approaches, including ancestral sequence reconstruction (ASR) have sparked a new wave for protein engineers to improve and expand biocatalyst capabilities. ASR is an evolution-based strategy that uses phylogenetic relationships among homologous extant sequences to probabilistically infer the most likely ancestral sequences. It has proven to be a powerful tool with applications ranging from creating highly stable enzymes for direct applications to preparing moderately active robust protein scaffolds for further enzyme engineering. Intriguingly, it can also provide insights into fundamental aspects that are challenging to study with extant (current) enzymes. This Perspective discusses a practical strategy for guiding enzyme engineers on how to embrace ASR as a practical or associated protocol for protein engineering and highlights recent examples of using ASR in various applications, including increasing thermostability, expanding promiscuity, fine-tuning selectivity and function, and investigating mechanistic and evolution aspects. We believe that the use of the ASR approach will continue to contribute to the ongoing development of the biocatalysis field. We have been in a "golden era" for biocatalysis in which numerous useful enzymes have been developed through many waves of enzyme engineering via advancements in computational methodologies.
Collapse
Affiliation(s)
- Kridsadakorn Prakinee
- School of Biomolecular Science and
Engineering, Vidyasirimedhi Institute of
Science and Technology (VISTEC), Wangchan Valley, Rayong 21210, Thailand
| | - Suppalak Phaisan
- School of Biomolecular Science and
Engineering, Vidyasirimedhi Institute of
Science and Technology (VISTEC), Wangchan Valley, Rayong 21210, Thailand
| | - Sirus Kongjaroon
- School of Biomolecular Science and
Engineering, Vidyasirimedhi Institute of
Science and Technology (VISTEC), Wangchan Valley, Rayong 21210, Thailand
| | - Pimchai Chaiyen
- School of Biomolecular Science and
Engineering, Vidyasirimedhi Institute of
Science and Technology (VISTEC), Wangchan Valley, Rayong 21210, Thailand
| |
Collapse
|
2
|
Barkman TJ. Applications of ancestral sequence reconstruction for understanding the evolution of plant specialized metabolism. Philos Trans R Soc Lond B Biol Sci 2024; 379:20230348. [PMID: 39343033 PMCID: PMC11439504 DOI: 10.1098/rstb.2023.0348] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 04/10/2024] [Accepted: 04/15/2024] [Indexed: 10/01/2024] Open
Abstract
Studies of enzymes in modern-day plants have documented the diversity of metabolic activities retained by species today but only provide limited insight into how those properties evolved. Ancestral sequence reconstruction (ASR) is an approach that provides statistical estimates of ancient plant enzyme sequences which can then be resurrected to test hypotheses about the evolution of catalytic activities and pathway assembly. Here, I review the insights that have been obtained using ASR to study plant metabolism and highlight important methodological aspects. Overall, studies of resurrected plant enzymes show that (i) exaptation is widespread such that even low or undetectable levels of ancestral activity with a substrate can later become the apparent primary activity of descendant enzymes, (ii) intramolecular epistasis may or may not limit evolutionary paths towards catalytic or substrate preference switches, and (iii) ancient pathway flux often differs from modern-day metabolic networks. These and other insights gained from ASR would not have been possible using only modern-day sequences. Future ASR studies characterizing entire ancestral metabolic networks as well as those that link ancient structures with enzymatic properties should continue to provide novel insights into how the chemical diversity of plants evolved. This article is part of the theme issue 'The evolution of plant metabolism'.
Collapse
Affiliation(s)
- Todd J. Barkman
- Department of Biological Sciences, Western Michigan University, Kalamazoo, MI49008, USA
| |
Collapse
|
3
|
Redelings BD, Holmes I, Lunter G, Pupko T, Anisimova M. Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications. Mol Biol Evol 2024; 41:msae177. [PMID: 39172750 PMCID: PMC11385596 DOI: 10.1093/molbev/msae177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/02/2024] [Accepted: 07/09/2024] [Indexed: 08/24/2024] Open
Abstract
Insertions and deletions constitute the second most important source of natural genomic variation. Insertions and deletions make up to 25% of genomic variants in humans and are involved in complex evolutionary processes including genomic rearrangements, adaptation, and speciation. Recent advances in long-read sequencing technologies allow detailed inference of insertions and deletion variation in species and populations. Yet, despite their importance, evolutionary studies have traditionally ignored or mishandled insertions and deletions due to a lack of comprehensive methodologies and statistical models of insertions and deletion dynamics. Here, we discuss methods for describing insertions and deletion variation and modeling insertions and deletions over evolutionary time. We provide practical advice for tackling insertions and deletions in genomic sequences and illustrate our discussion with examples of insertions and deletion-induced effects in human and other natural populations and their contribution to evolutionary processes. We outline promising directions for future developments in statistical methodologies that would allow researchers to analyze insertions and deletion variation and their effects in large genomic data sets and to incorporate insertions and deletions in evolutionary inference.
Collapse
Affiliation(s)
| | - Ian Holmes
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
- Calico Life Sciences LLC, South San Francisco, CA 94080, USA
| | - Gerton Lunter
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen 9713 GZ, The Netherlands
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Maria Anisimova
- Institute of Computational Life Sciences, Zurich University of Applied Sciences, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
4
|
Carrera-Pacheco SE, Mueller A, Puente-Pineda JA, Zúñiga-Miranda J, Guamán LP. Designing cytochrome P450 enzymes for use in cancer gene therapy. Front Bioeng Biotechnol 2024; 12:1405466. [PMID: 38860140 PMCID: PMC11164052 DOI: 10.3389/fbioe.2024.1405466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 04/30/2024] [Indexed: 06/12/2024] Open
Abstract
Cancer is a significant global socioeconomic burden, as millions of new cases and deaths occur annually. In 2020, almost 10 million cancer deaths were recorded worldwide. Advancements in cancer gene therapy have revolutionized the landscape of cancer treatment. An approach with promising potential for cancer gene therapy is introducing genes to cancer cells that encode for chemotherapy prodrug metabolizing enzymes, such as Cytochrome P450 (CYP) enzymes, which can contribute to the effective elimination of cancer cells. This can be achieved through gene-directed enzyme prodrug therapy (GDEPT). CYP enzymes can be genetically engineered to improve anticancer prodrug conversion to its active metabolites and to minimize chemotherapy side effects by reducing the prodrug dosage. Rational design, directed evolution, and phylogenetic methods are some approaches to developing tailored CYP enzymes for cancer therapy. Here, we provide a compilation of genetic modifications performed on CYP enzymes aiming to build highly efficient therapeutic genes capable of bio-activating different chemotherapeutic prodrugs. Additionally, this review summarizes promising preclinical and clinical trials highlighting engineered CYP enzymes' potential in GDEPT. Finally, the challenges, limitations, and future directions of using CYP enzymes for GDEPT in cancer gene therapy are discussed.
Collapse
Affiliation(s)
- Saskya E. Carrera-Pacheco
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | | | | | | | | |
Collapse
|
5
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Collapse
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
6
|
Kinateder T, Drexler L, Straub K, Merkl R, Sterner R. Experimental and computational analysis of the ancestry of an evolutionary young enzyme from histidine biosynthesis. Protein Sci 2023; 32:e4536. [PMID: 36502290 PMCID: PMC9798254 DOI: 10.1002/pro.4536] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 12/01/2022] [Accepted: 12/03/2022] [Indexed: 12/14/2022]
Abstract
The conservation of fold and chemistry of the enzymes associated with histidine biosynthesis suggests that this pathway evolved prior to the diversification of Bacteria, Archaea, and Eukaryotes. The only exception is the histidinol phosphate phosphatase (HolPase). So far, non-homologous HolPases that possess distinct folds and belong to three different protein superfamilies have been identified in various phylogenetic clades. However, their evolution has remained unknown to date. Here, we analyzed the evolutionary history of the HolPase from γ-Proteobacteria (HisB-N). It has been argued that HisB-N and its closest homologue d-glycero-d-manno-heptose-1,7-bisphosphate 7-phosphatase (GmhB) have emerged from the same promiscuous ancestral phosphatase. GmhB variants catalyze the hydrolysis of the anomeric d-glycero-d-manno-heptose-1,7-bisphosphate (αHBP or βHBP) with a strong preference for one anomer (αGmhB or βGmhB). We found that HisB-N from Escherichia coli shows promiscuous activity for βHBP but not αHBP, while βGmhB from Crassaminicella sp. shows promiscuous activity for HolP. Accordingly, a combined phylogenetic tree of αGmhBs, βGmhBs, and HisB-N sequences revealed that HisB-Ns form a compact subcluster derived from βGmhBs. Ancestral sequence reconstruction and in vitro analysis revealed a promiscuous HolPase activity in the resurrected enzymes prior to functional divergence of the successors. The following increase in catalytic efficiency of the HolP turnover is reflected in the shape and electrostatics of the active site predicted by AlphaFold. An analysis of the phylogenetic tree led to a revised evolutionary model that proposes the horizontal gene transfer of a promiscuous βGmhB from δ- to γ-Proteobacteria where it evolved to the modern HisB-N.
Collapse
Affiliation(s)
- Thomas Kinateder
- Institute of Biophysics and Physical Biochemistry and Regensburg Center for Biochemistry, University of RegensburgRegensburgGermany
| | - Lukas Drexler
- Institute of Biophysics and Physical Biochemistry and Regensburg Center for Biochemistry, University of RegensburgRegensburgGermany
| | - Kristina Straub
- Institute of Biophysics and Physical Biochemistry and Regensburg Center for Biochemistry, University of RegensburgRegensburgGermany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry and Regensburg Center for Biochemistry, University of RegensburgRegensburgGermany
| | - Reinhard Sterner
- Institute of Biophysics and Physical Biochemistry and Regensburg Center for Biochemistry, University of RegensburgRegensburgGermany
| |
Collapse
|
7
|
Ayuso-Fernández I, Molpeceres G, Camarero S, Ruiz-Dueñas FJ, Martínez AT. Ancestral sequence reconstruction as a tool to study the evolution of wood decaying fungi. FRONTIERS IN FUNGAL BIOLOGY 2022; 3:1003489. [PMID: 37746217 PMCID: PMC10512382 DOI: 10.3389/ffunb.2022.1003489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 09/22/2022] [Indexed: 09/26/2023]
Abstract
The study of evolution is limited by the techniques available to do so. Aside from the use of the fossil record, molecular phylogenetics can provide a detailed characterization of evolutionary histories using genes, genomes and proteins. However, these tools provide scarce biochemical information of the organisms and systems of interest and are therefore very limited when they come to explain protein evolution. In the past decade, this limitation has been overcome by the development of ancestral sequence reconstruction (ASR) methods. ASR allows the subsequent resurrection in the laboratory of inferred proteins from now extinct organisms, becoming an outstanding tool to study enzyme evolution. Here we review the recent advances in ASR methods and their application to study fungal evolution, with special focus on wood-decay fungi as essential organisms in the global carbon cycling.
Collapse
Affiliation(s)
- Iván Ayuso-Fernández
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Gonzalo Molpeceres
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| | - Susana Camarero
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| | | | - Angel T. Martínez
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| |
Collapse
|
8
|
Thomson RES, Carrera-Pacheco SE, Gillam EMJ. Engineering functional thermostable proteins using ancestral sequence reconstruction. J Biol Chem 2022; 298:102435. [PMID: 36041629 PMCID: PMC9525910 DOI: 10.1016/j.jbc.2022.102435] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 08/23/2022] [Accepted: 08/24/2022] [Indexed: 11/20/2022] Open
Abstract
Natural proteins are often only slightly more stable in the native state than the denatured state, and an increase in environmental temperature can easily shift the balance toward unfolding. Therefore, the engineering of proteins to improve protein stability is an area of intensive research. Thermostable proteins are required to withstand industrial process conditions, for increased shelf-life of protein therapeutics, for developing robust 'biobricks' for synthetic biology applications, and for research purposes (e.g., structure determination). In addition, thermostability buffers the often destabilizing effects of mutations introduced to improve other properties. Rational design approaches to engineering thermostability require structural information, but even with advanced computational methods, it is challenging to predict or parameterize all the relevant structural factors with sufficient precision to anticipate the results of a given mutation. Directed evolution is an alternative when structures are unavailable but requires extensive screening of mutant libraries. Recently, however, bioinspired approaches based on phylogenetic analyses have shown great promise. Leveraging the rapid expansion in sequence data and bioinformatic tools, ancestral sequence reconstruction can generate highly stable folds for novel applications in industrial chemistry, medicine, and synthetic biology. This review provides an overview of the factors important for successful inference of thermostable proteins by ancestral sequence reconstruction and what it can reveal about the determinants of stability in proteins.
Collapse
Affiliation(s)
- Raine E S Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Saskya E Carrera-Pacheco
- Centro de Investigación Biomédica (CENBIO), Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito, Ecuador
| | - Elizabeth M J Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia.
| |
Collapse
|
9
|
Foley G, Mora A, Ross CM, Bottoms S, Sützl L, Lamprecht ML, Zaugg J, Essebier A, Balderson B, Newell R, Thomson RES, Kobe B, Barnard RT, Guddat L, Schenk G, Carsten J, Gumulya Y, Rost B, Haltrich D, Sieber V, Gillam EMJ, Bodén M. Engineering indel and substitution variants of diverse and ancient enzymes using Graphical Representation of Ancestral Sequence Predictions (GRASP). PLoS Comput Biol 2022; 18:e1010633. [PMID: 36279274 PMCID: PMC9632902 DOI: 10.1371/journal.pcbi.1010633] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 11/03/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022] Open
Abstract
Ancestral sequence reconstruction is a technique that is gaining widespread use in molecular evolution studies and protein engineering. Accurate reconstruction requires the ability to handle appropriately large numbers of sequences, as well as insertion and deletion (indel) events, but available approaches exhibit limitations. To address these limitations, we developed Graphical Representation of Ancestral Sequence Predictions (GRASP), which efficiently implements maximum likelihood methods to enable the inference of ancestors of families with more than 10,000 members. GRASP implements partial order graphs (POGs) to represent and infer insertion and deletion events across ancestors, enabling the identification of building blocks for protein engineering. To validate the capacity to engineer novel proteins from realistic data, we predicted ancestor sequences across three distinct enzyme families: glucose-methanol-choline (GMC) oxidoreductases, cytochromes P450, and dihydroxy/sugar acid dehydratases (DHAD). All tested ancestors demonstrated enzymatic activity. Our study demonstrates the ability of GRASP (1) to support large data sets over 10,000 sequences and (2) to employ insertions and deletions to identify building blocks for engineering biologically active ancestors, by exploring variation over evolutionary time.
Collapse
Affiliation(s)
- Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Ariane Mora
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Connie M. Ross
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Scott Bottoms
- Campus Straubing for Biotechnology and Sustainability, Technische Universität München, Straubing, Germany
| | - Leander Sützl
- Institut für Lebensmitteltechnologie, Universität für Bodenkultur Wien, Vienna, Austria
| | - Marnie L. Lamprecht
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Julian Zaugg
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Alexandra Essebier
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Brad Balderson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Rhys Newell
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Raine E. S. Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Bostjan Kobe
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Institute for Molecular Bioscience and Australian Infectious Diseases Research Centre, The University of Queensland, Brisbane, Australia
| | - Ross T. Barnard
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Luke Guddat
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Gerhard Schenk
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Sustainable Minerals Institute, The University of Queensland, Brisbane, Australia
| | - Jörg Carsten
- Zentralinstitut für Katalyseforschung, Technische Universität München, Munich, Germany
| | - Yosephine Gumulya
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Burkhard Rost
- Fakultät für Informatik, Technische Universität München, Munich, Germany
| | - Dietmar Haltrich
- Institut für Lebensmitteltechnologie, Universität für Bodenkultur Wien, Vienna, Austria
| | - Volker Sieber
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Campus Straubing for Biotechnology and Sustainability, Technische Universität München, Straubing, Germany
- Zentralinstitut für Katalyseforschung, Technische Universität München, Munich, Germany
| | - Elizabeth M. J. Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- * E-mail: (MB); (EMJG)
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- * E-mail: (MB); (EMJG)
| |
Collapse
|
10
|
Mascotti ML. Resurrecting Enzymes by Ancestral Sequence Reconstruction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2397:111-136. [PMID: 34813062 DOI: 10.1007/978-1-0716-1826-4_7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Ancestral Sequence Reconstruction (ASR) allows one to infer the sequences of extinct proteins using the phylogeny of extant proteins. It consists of disclosing the evolutionary history-i.e., the phylogeny-of a protein family of interest and then inferring the sequences of its ancestors-i.e., the nodes in the phylogeny. Assisted by gene synthesis, the selected ancestors can be resurrected in the lab and experimentally characterized. The crucial step to succeed with ASR is starting from a reliable phylogeny. At the same time, it is of the utmost importance to have a clear idea on the evolutionary history of the family under study and the events that influenced it. This allows us to implement ASR with well-defined hypotheses and to apply the appropriate experimental methods. In the last years, ASR has become popular to test hypotheses about the origin of functionalities, changes in activities, understanding physicochemical properties of proteins, among others. In this context, the aim of this chapter is to present the ASR approach applied to the reconstruction of enzymes-i.e., proteins with catalytic roles. The spirit of this contribution is to provide a basic, hands-to-work guide for biochemists and biologists who are unfamiliar with molecular phylogenetics.
Collapse
Affiliation(s)
- Maria Laura Mascotti
- Molecular Enzymology group, University of Groningen, Groningen, The Netherlands. .,IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, Argentina.
| |
Collapse
|
11
|
Lichman BR. Ancestral Sequence Reconstruction for Exploring Alkaloid Evolution. Methods Mol Biol 2022; 2505:165-179. [PMID: 35732944 DOI: 10.1007/978-1-0716-2349-7_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The complex and bioactive monoterpene indole alkaloids (MIAs) found in Catharanthus roseus and related species are the products of many millions of years of evolution through mutation and natural selection. Ancestral sequence reconstruction (ASR) is a method that combines phylogenetic analysis and experimental biochemistry to infer details about past events in protein evolution. Here, I propose that ASR could be leveraged to understand how enzymes catalyzing the formation of complex alkaloids arose over evolutionary time. I discuss the steps of ASR, including sequence selection, multiple sequence alignment, tree inference, and the generation and characterization of inferred ancestral enzymes.
Collapse
Affiliation(s)
- Benjamin R Lichman
- Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK.
| |
Collapse
|
12
|
Jia H, Aadland K, Kolaczkowski O, Kolaczkowski B. Direct molecular evidence for an ancient, conserved developmental toolkit controlling post-transcriptional gene regulation in land plants. Mol Biol Evol 2021; 38:4765-4777. [PMID: 34196710 PMCID: PMC8557471 DOI: 10.1093/molbev/msab201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In plants, miRNA production is orchestrated by a suite of proteins that control transcription of the pri-miRNA gene, post-transcriptional processing and nuclear export of the mature miRNA. Post-transcriptional processing of miRNAs is controlled by a pair of physically interacting proteins, hyponastic leaves 1 (HYL1) and Dicer-like 1 (DCL1). However, the evolutionary history and structural basis of the HYL1–DCL1 interaction is unknown. Here we use ancestral sequence reconstruction and functional characterization of ancestral HYL1 in vitro and in Arabidopsis thaliana to better understand the origin and evolution of the HYL1–DCL1 interaction and its impact on miRNA production and plant development. We found the ancestral plant HYL1 evolved high affinity for both double-stranded RNA (dsRNA) and its DCL1 partner before the divergence of mosses from seed plants (∼500 Ma), and these high-affinity interactions remained largely conserved throughout plant evolutionary history. Structural modeling and molecular binding experiments suggest that the second of two dsRNA-binding motifs (DSRMs) in HYL1 may interact tightly with the first of two C-terminal DCL1 DSRMs to mediate the HYL1–DCL1 physical interaction necessary for efficient miRNA production. Transgenic expression of the nearly 200 Ma-old ancestral flowering-plant HYL1 in A. thaliana was sufficient to rescue many key aspects of plant development disrupted by HYL1− knockout and restored near-native miRNA production, suggesting that the functional partnership of HYL1–DCL1 originated very early in and was strongly conserved throughout the evolutionary history of terrestrial plants. Overall, our results are consistent with a model in which miRNA-based gene regulation evolved as part of a conserved plant “developmental toolkit.”
Collapse
Affiliation(s)
- Haiyan Jia
- Department of Biology, University of North Carolina, Chapel Hill, NC
| | - Kelsey Aadland
- Department of Microbiology and Plant Pathology, University of California, Riverside, CA
| | - Oralia Kolaczkowski
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL
| | - Bryan Kolaczkowski
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL
| |
Collapse
|
13
|
Spence MA, Kaczmarski JA, Saunders JW, Jackson CJ. Ancestral sequence reconstruction for protein engineers. Curr Opin Struct Biol 2021; 69:131-141. [PMID: 34023793 DOI: 10.1016/j.sbi.2021.04.001] [Citation(s) in RCA: 78] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 03/22/2021] [Accepted: 04/07/2021] [Indexed: 12/11/2022]
Abstract
In addition to its value in the study of molecular evolution, ancestral sequence reconstruction (ASR) has emerged as a useful methodology for engineering proteins with enhanced properties. Proteins generated by ASR often exhibit unique or improved activity, stability, and/or promiscuity, all of which are properties that are valued by protein engineers. Comparison between extant proteins and evolutionary intermediates generated by ASR also allows protein engineers to identify substitutions that have contributed to functional innovation or diversification within protein families. As ASR becomes more widely adopted as a protein engineering approach, it is important to understand the applications, limitations, and recent developments of this technique. This review highlights recent exemplifications of ASR, as well as technical aspects of the reconstruction process that are relevant to protein engineering.
Collapse
Affiliation(s)
- Matthew A Spence
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Joe A Kaczmarski
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Jake W Saunders
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Peptide & Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Synthetic Biology, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia.
| |
Collapse
|
14
|
Copley SD. Setting the stage for evolution of a new enzyme. Curr Opin Struct Biol 2021; 69:41-49. [PMID: 33865035 DOI: 10.1016/j.sbi.2021.03.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Revised: 02/19/2021] [Accepted: 03/03/2021] [Indexed: 12/18/2022]
Abstract
The evolution of novel enzymes has fueled the diversification of life on earth for billions of years. Insights into events that set the stage for the evolution of a new enzyme can be obtained from ancestral reconstruction and laboratory evolution. Ancestral reconstruction can reveal the emergence of a promiscuous activity in a pre-existing protein and the impact of subsequent mutations that enhance a new activity. Laboratory evolution provides a more holistic view by revealing mutations elsewhere in the genome that indirectly enhance the level of a newly important enzymatic activity. This review will highlight recent studies that probe the early stages of the evolution of a new enzyme from these complementary points of view.
Collapse
Affiliation(s)
- Shelley D Copley
- Department of Molecular, Cellular and Developmental Biology, The Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, CO, 80309, USA.
| |
Collapse
|
15
|
Scossa F, Fernie AR. Ancestral sequence reconstruction - An underused approach to understand the evolution of gene function in plants? Comput Struct Biotechnol J 2021; 19:1579-1594. [PMID: 33868595 PMCID: PMC8039532 DOI: 10.1016/j.csbj.2021.03.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2021] [Revised: 03/04/2021] [Accepted: 03/06/2021] [Indexed: 02/06/2023] Open
Abstract
Whilst substantial research effort has been placed on understanding the interactions of plant proteins with their molecular partners, relatively few studies in plants - by contrast to work in other organisms - address how these interactions evolve. It is thought that ancestral proteins were more promiscuous than modern proteins and that specificity often evolved following gene duplication and subsequent functional refining. However, ancestral protein resurrection studies have found that some modern proteins have evolved de novo from ancestors lacking those functions. Intriguingly, the new interactions evolved as a consequence of just a few mutations and, as such, acquisition of new functions appears to be neither difficult nor rare, however, only a few of them are incorporated into biological processes before they are lost to subsequent mutations. Here, we detail the approach of ancestral sequence reconstruction (ASR), providing a primer to reconstruct the sequence of an ancestral gene. We will present case studies from a range of different eukaryotes before discussing the few instances where ancestral reconstructions have been used in plants. As ASR is used to dig into the remote evolutionary past, we will also present some alternative genetic approaches to investigate molecular evolution on shorter timescales. We argue that the study of plant secondary metabolism is particularly well suited for ancestral reconstruction studies. Indeed, its ancient evolutionary roots and highly diverse landscape provide an ideal context in which to address the focal issue around the emergence of evolutionary novelties and how this affects the chemical diversification of plant metabolism.
Collapse
Key Words
- APR, ancestral protein resurrection
- ASR, ancestral sequence reconstruction
- Ancestral sequence reconstruction
- CDS, coding sequence
- Evolution
- GR, glucocorticoid receptor
- GWAS, genome wide association study
- Genomics
- InDel, insertion/deletion
- MCMC, Markov Chain Monte Carlo
- ML, maximum likelihood
- MP, maximum parsimony
- MR, mineralcorticoid receptor
- MSA, multiple sequence alignment
- Metabolism
- NJ, neighbor-joining
- Phylogenetics
- Plants
- SFS, site frequency spectrum
Collapse
Affiliation(s)
- Federico Scossa
- Max-Planck-Institute of Molecular Plant Physiology (MPI-MP), 14476 Potsdam-Golm, Germany
- Council for Agricultural Research and Economics (CREA), Research Centre for Genomics and Bioinformatics (CREA-GB), Rome, Italy
| | - Alisdair R. Fernie
- Max-Planck-Institute of Molecular Plant Physiology (MPI-MP), 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology (CPSBB), Plovdiv, Bulgaria
| |
Collapse
|
16
|
Selberg AGA, Gaucher EA, Liberles DA. Ancestral Sequence Reconstruction: From Chemical Paleogenetics to Maximum Likelihood Algorithms and Beyond. J Mol Evol 2021; 89:157-164. [PMID: 33486547 PMCID: PMC7828096 DOI: 10.1007/s00239-021-09993-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Accepted: 01/04/2021] [Indexed: 12/13/2022]
Abstract
As both a computational and an experimental endeavor, ancestral sequence reconstruction remains a timely and important technique. Modern approaches to conduct ancestral sequence reconstruction for proteins are built upon a conceptual framework from journal founder Emile Zuckerkandl. On top of this, work on maximum likelihood phylogenetics published in Journal of Molecular Evolution in 1996 was one of the first approaches for generating maximum likelihood ancestral sequences of proteins. From its computational history, future model development needs as well as potential applications in areas as diverse as computational systems biology, molecular community ecology, infectious disease therapeutics and other biomedical applications, and biotechnology are discussed. From its past in this journal, there is a bright future for ancestral sequence reconstruction in the field of evolutionary biology.
Collapse
Affiliation(s)
- Avery G A Selberg
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Eric A Gaucher
- Department of Biology, Georgia State University, Atlanta, GA, 30303, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA.
| |
Collapse
|