1
Doherty DZ, De Voss JJ, Bruning JB, Bell SG. Evolutionary insights into the selectivity of sterol oxidising cytochrome P450 enzymes based on ancestral sequence reconstruction. Chem Sci 2025:d5sc01863c. [PMID: 40417289 PMCID: PMC12100521 DOI: 10.1039/d5sc01863c] [Received: 03/09/2025] [Accepted: 05/12/2025] [Indexed: 05/27/2025] Open
Abstract
The cytochrome P450 (CYP) enzyme CYP125A1 is a crucial enzyme for the long-term survival and pathogenicity of Mycobacterium tuberculosis. CYP125 genes are found not only in pathogenic mycobacteria but are also widely dispersed within the Actinobacteria phylum, with many species possessing multiple copies of CYP125 encoding genes. Their primary function is the catalytic hydroxylation of the terminal methyl group of cholesterol and phytosterols. We have previously shown that CYP125 enzymes from distinct mycobacteria have substrate selectivity preferences for animal versus plant steroid oxidation. An evolutionary understanding of this selectivity is not known. Here, we use Ancestral Sequence Reconstruction (ASR), to support the hypothesis that some CYP125 enzymes evolved in a manner reflective of their adaptation to a pathogenic niche. We constructed a maximum-likelihood, most-recent common ancestor of the CYP125 clade (CYP125MRCA). We were then able to produce and characterise this enzyme both functionally and structurally. We found that CYP125MRCA was able to catalyse the terminal hydroxylation of cholesterol, phytosterols, and vitamin D3 (cholecalciferol); the latter was hydroxylated at both C-25 and C-26. This is the first example to date of vitamin D3 oxidation by a CYP125 enzyme, thereby demonstrating an increased substrate range of CYP125MRCA relative to its characterised extant relatives. The X-ray crystal structures of CYP125MRCA bound with sitosterol and vitamin D3 were determined, providing important insight into the changes that enable the expanded substrate range.
Affiliation(s)
- Daniel Z Doherty
  - Department of Chemistry, University of Adelaide, Adelaide, South Australia 5005, Australia
- James J De Voss
  - School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland 4072, Australia
- John B Bruning
  - School of Biological Sciences, University of Adelaide, SA 5005, Australia
- Stephen G Bell
  - Department of Chemistry, University of Adelaide, Adelaide, South Australia 5005, Australia
2
De Leonardis M, Pagnani A, Barrat-Charlaix P. Reconstruction of Ancestral Protein Sequences Using Autoregressive Generative Models. Mol Biol Evol 2025; 42:msaf070. [PMID: 40139916 PMCID: PMC12006719 DOI: 10.1093/molbev/msaf070] [Received: 09/16/2024] [Revised: 01/21/2025] [Accepted: 02/14/2025] [Indexed: 03/29/2025] Open
Abstract
Ancestral sequence reconstruction (ASR) is an important tool to understand how protein structure and function changed over the course of evolution. It essentially relies on models of sequence evolution that can quantitatively describe changes in a sequence over time. Such models usually consider that sequence positions evolve independently from each other and neglect epistasis: the context-dependence of the effect of mutations. On the other hand, the last years have seen major developments in the field of generative protein models, which learn constraints associated with structure and function from large ensembles of evolutionarily related proteins. Here, we show that it is possible to extend a specific type of generative model to describe the evolution of sequences in time while taking epistasis into account. We apply the developed technique to the problem of ASR: given a protein family and its evolutionary tree, we try to infer the sequences of extinct ancestors. Using both simulations and data coming from experimental evolution we show that our method outperforms state-of-the-art ones. Moreover, it allows for sampling a greater diversity of potential ancestors, allowing for a less biased characterization of ancestral sequences.
Affiliation(s)
- Matteo De Leonardis
  - DISAT, Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino 10129, Italy
- Andrea Pagnani
  - DISAT, Politecnico di Torino, Corso Duca degli Abruzzi 24, Torino 10129, Italy
  - Italian Institute for Genomic Medicine, IRCCS Candiolo, SP-142, Candiolo 10060, Italy
  - INFN, Sezione di Torino, Via Pietro Giuria 1, Torino 10125, Italy
3
Muñiz-Trejo R, Park Y, Thornton JW. Robustness of Ancestral Sequence Reconstruction to Among-site and Among-lineage Evolutionary Heterogeneity. Mol Biol Evol 2025; 42:msaf084. [PMID: 40203289 PMCID: PMC12046983 DOI: 10.1093/molbev/msaf084] [Received: 12/20/2024] [Revised: 03/17/2025] [Accepted: 03/31/2025] [Indexed: 04/11/2025] Open
Abstract
Ancestral sequence reconstruction is typically performed using homogeneous evolutionary models, which assume that the same substitution propensities affect all sites and lineages. These assumptions are routinely violated: heterogeneous structural and functional constraints favor different amino acids at different sites, and these constraints often change among lineages as epistatic substitutions accrue at other sites. To evaluate how violations of the homogeneity assumption affect ancestral sequence reconstruction under realistic conditions, we developed site-specific substitution models and parameterized them using data from deep mutational scanning experiments on three protein families; we then used these models to perform ancestral sequence reconstruction on the empirical alignments and on alignments simulated under heterogeneous conditions derived from the experiments. Extensive among-site and -lineage heterogeneity is present in these datasets, but the sequences reconstructed from empirical alignments are almost identical when heterogeneous or homogeneous models are used for ancestral sequence reconstruction. Using models fit to deep mutational scanning data from distantly related proteins in which mutational effects are very different also has a minimal impact on ancestral sequence reconstruction. The rare differences occur primarily where phylogenetic signal is weak, namely at fast-evolving sites and nodes connected by long branches. When ancestral sequence reconstruction is performed on simulated data, errors in the reconstructed sequences become more likely as branch lengths increase, but incorporating heterogeneity into the model does not improve accuracy. These data establish that ancestral sequence reconstruction is robust to unincorporated realistic forms of evolutionary heterogeneity, because the primary determinant of ancestral sequence reconstruction is phylogenetic signal, not the substitution model. The best way to improve accuracy is therefore not to develop more elaborate models but to apply ancestral sequence reconstruction to densely sampled alignments that maximize phylogenetic signal at the nodes of interest.
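The link between branch length and phylogenetic signal can be illustrated with a deliberately simple model. The sketch below assumes an equal-rates (Poisson) model with 20 amino acid states, which is not one of the models used in the paper; the function name `prob_unchanged` and its parameters are hypothetical. Under that assumption, the probability that a site shows the same state at both ends of a branch decays toward 1/20 as the branch lengthens, so long branches carry little information about the ancestral state.

```python
import math


def prob_unchanged(branch_length, n_states=20):
    """Probability that a site has the same state at both ends of a branch.

    Assumes an equal-rates (Poisson) substitution model, with branch_length
    measured in expected substitutions per site. Illustrative only.
    """
    k = n_states
    return 1.0 / k + (k - 1) / k * math.exp(-k * branch_length / (k - 1))


# Signal decays from certainty (length 0) toward the 1/20 background level.
for t in (0.0, 0.1, 0.5, 1.0, 3.0):
    print(f"branch length {t:>3}: P(same state) = {prob_unchanged(t):.3f}")
```

Dense taxon sampling shortens the branches around a node of interest, which under this toy model moves each branch toward the high-signal end of the curve.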
Affiliation(s)
- Ricardo Muñiz-Trejo
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
- Yeonwoo Park
  - Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA
  - Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea
- Joseph W Thornton
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
  - Department of Human Genetics, University of Chicago, Chicago, IL, USA
4
Chi H, Wan J, Melin AD, DeCasien AR, Wang S, Zhang Y, Cui Y, Guo X, Zhao L, Williamson J, Zhang T, Li Q, Zhan Y, Li N, Guo J, Xu Z, Hou W, Cao Y, Yuan J, Zheng J, Shao Y, Wang J, Chen W, Song S, Lu X, Qi X, Zhang G, Rossiter SJ, Wu DD, Liu Y, Lu H, Li G. Genomic and phenotypic evidence support visual and olfactory shifts in primate evolution. Nat Ecol Evol 2025; 9:721-733. [PMID: 40021902 DOI: 10.1038/s41559-025-02651-5] [Received: 08/27/2023] [Accepted: 01/31/2025] [Indexed: 03/03/2025]
Abstract
Sensory trade-offs between vision and olfaction in the evolution and radiation of primates have long been debated. However, insights have been limited by a lack of sensory gene sequences and accompanying functional predictions. Here we conduct large-scale functional analyses of visual and olfactory receptors and related brain regions across extant primates. Our results reveal a visual shift from ultraviolet to violet colour sensitivity in early haplorrhine primates, followed by acceleration in the rhodopsin retinal release rates at the origin of anthropoids, both of which are expected to greatly enhance visual acuity under brighter light conditions. Additionally, we find that the sensitivity of olfactory receptors shifted from narrowly to broadly tuned early in anthropoid evolution. In contrast, strepsirrhines appear to have retained sensitive dim-light vision and underwent functional enhancement of narrowly tuned olfactory receptors. Our models indicate that this would have enhanced odorant discrimination and facilitated olfaction-mediated physiology and behaviour. These differences in tuning patterns of olfactory receptors between major primate lineages mirror well-established morphological differences in external anatomy and brain structures, revealing new mechanisms of olfactory adaptation and evolutionary plasticity. Our multisystem analyses reveal patterns of co-evolution in genomic, molecular and neuroanatomical traits that are consistent with a sensory 'reallocation' rather than strict trade-offs.
Affiliation(s)
- Hai Chi
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Jiahui Wan
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Amanda D Melin
  - Department of Anthropology and Archaeology, University of Calgary, Calgary, Alberta, Canada
  - Department of Medical Genetics, University of Calgary, Calgary, Alberta, Canada
  - Alberta Children's Hospital Research Institute, University of Calgary, Calgary, Alberta, Canada
- Alex R DeCasien
  - Computational and Evolutionary Neurogenomics Unit, National Institute on Aging, Bethesda, MD, USA
- Sufang Wang
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Yudan Zhang
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Yimeng Cui
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
  - College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang, China
- Xin Guo
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Le Zhao
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
  - QinLing-Bashan Mountains Bioresources Comprehensive Development C. I. C., School of Bioscience and Engineering, Shaanxi University of Technology, Hanzhong, China
- Joseph Williamson
  - School of Biological and Behavioural Sciences, Queen Mary, University of London, London, UK
- Tianmin Zhang
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Qian Li
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Yue Zhan
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
  - College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang, China
- Na Li
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Jinqu Guo
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Zhe Xu
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Wenhui Hou
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Yumin Cao
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Jiaqing Yuan
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Jiangmin Zheng
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Yong Shao
  - State Key Laboratory of Genetic Evolution & Animal Models, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Jinhong Wang
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Wu Chen
  - Guangzhou Zoo & Guangzhou Wildlife Research Center, Guangzhou, China
- Shengjing Song
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Xiaoli Lu
  - School of Life Sciences, Westlake University, Hangzhou, China
- Xiaoguang Qi
  - Shaanxi Key Laboratory for Animal Conservation, College of Life Sciences, Northwest University, Xi'an, China
- Guojie Zhang
  - State Key Laboratory of Genetic Evolution & Animal Models, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
  - BGI-Shenzhen, Shenzhen, China
  - Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
- Stephen J Rossiter
  - School of Biological and Behavioural Sciences, Queen Mary, University of London, London, UK
- Dong-Dong Wu
  - State Key Laboratory of Genetic Evolution & Animal Models, Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
  - National Resource Center for Non-Human Primates, Kunming Primate Research Center, and National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
  - Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
- Yang Liu
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Huimeng Lu
  - School of Life Sciences, Northwestern Polytechnical University, Xi'an, China
- Gang Li
  - College of Life Sciences, Shaanxi Normal University, Xi'an, China
  - QinLing-Bashan Mountains Bioresources Comprehensive Development C. I. C., School of Bioscience and Engineering, Shaanxi University of Technology, Hanzhong, China
5
Muñiz-Trejo R, Park Y, Thornton JW. Robustness of ancestral sequence reconstruction to among-site evolutionary heterogeneity and epistasis. bioRxiv 2024:2024.12.20.629812. [PMID: 39763774 PMCID: PMC11702759 DOI: 10.1101/2024.12.20.629812] [Indexed: 01/12/2025]
Abstract
Ancestral sequence reconstruction (ASR) is typically performed using homogeneous evolutionary models, which assume that the same substitution propensities affect all sites and lineages. These assumptions are routinely violated: heterogeneous structural and functional constraints favor different amino acid states at different sites, and these constraints often change among lineages as epistatic substitutions accrue at other sites. To evaluate how realistic violations of the homogeneity assumption affect ASR, we developed site-specific substitution models and parameterized them using data from deep mutational scanning experiments on three protein families; we then used these models to perform ASR on the empirical alignments and on alignments simulated under heterogeneous conditions derived from the experiments. Extensive among-site and -lineage heterogeneity is present in these datasets, but the sequences reconstructed from empirical alignments are almost identical, irrespective of whether heterogeneous or homogeneous models are used for ASR. The rare differences occur primarily when phylogenetic signal is weak - at fast-evolving sites and nodes connected by long branches. When ASR is performed on simulated data, errors in the reconstructed sequences become more likely as branch lengths increase, but incorporating heterogeneity into the model does not improve accuracy. These data establish that ASR is robust to unincorporated realistic forms of evolutionary heterogeneity, because the primary determinant of ASR is phylogenetic signal, not the substitution model. The best way to improve accuracy is therefore not to develop more elaborate models but to apply ASR to densely sampled alignments that maximize phylogenetic signal at the nodes of interest.
Affiliation(s)
- Ricardo Muñiz-Trejo
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
- Yeonwoo Park
  - Committee on Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA
  - Center for RNA Research, Institute for Basic Science, Seoul, Republic of Korea
- Joseph W Thornton
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
  - Department of Human Genetics, University of Chicago, Chicago, IL, USA
6
Zhu XX, Zheng WQ, Xia ZW, Chen XR, Jin T, Ding XW, Chen FF, Chen Q, Xu JH, Kong XD, Zheng GW. Evolutionary insights into the stereoselectivity of imine reductases based on ancestral sequence reconstruction. Nat Commun 2024; 15:10330. [PMID: 39609402 PMCID: PMC11605051 DOI: 10.1038/s41467-024-54613-3] [Received: 01/30/2024] [Accepted: 11/14/2024] [Indexed: 11/30/2024] Open
Abstract
The stereoselectivity of enzymes plays a central role in asymmetric biocatalytic reactions, but there remains a dearth of evolution-driven biochemistry studies investigating the evolutionary trajectory of this vital property. Imine reductases (IREDs) are one such enzyme that possesses excellent stereoselectivity, and stereocomplementary members are pervasive in the family. However, the regulatory mechanism behind stereocomplementarity remains cryptic. Herein, we reconstruct a panel of active ancestral IREDs and trace the evolution of stereoselectivity from ancestors to extant IREDs. Combined with coevolution analysis, we reveal six historical mutations capable of recapitulating stereoselectivity evolution. An investigation of the mechanism with X-ray crystallography shows that they collectively reshape the substrate-binding pocket to regulate stereoselectivity inversion. In addition, we construct an empirical fitness landscape and discover that epistasis is prevalent in stereoselectivity evolution. Our findings emphasize the power of ASR in circumventing the time-consuming large-scale mutagenesis library screening for identifying mutations that change functions and support a Darwinian premise from a molecular perspective that the evolution of biological functions is a stepwise process.
Affiliation(s)
- Xin-Xin Zhu
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Wen-Qing Zheng
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Zi-Wei Xia
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Xin-Ru Chen
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Tian Jin
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Xu-Wei Ding
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Fei-Fei Chen
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Qi Chen
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Jian-He Xu
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
- Xu-Dong Kong
  - State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong University, Shanghai, China
- Gao-Wei Zheng
  - State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing, East China University of Science and Technology, Shanghai, China
7
Pérez-Niño JA, Guerra Y, Díaz-Salazar AJ, Costas M, Rodríguez-Romero A, Fernández-Velasco DA. Stable monomers in the ancestral sequence reconstruction of the last opisthokont common ancestor of dimeric triosephosphate isomerase. Protein Sci 2024; 33:e5134. [PMID: 39145435 PMCID: PMC11325190 DOI: 10.1002/pro.5134] [Received: 03/22/2024] [Revised: 07/01/2024] [Accepted: 07/21/2024] [Indexed: 08/16/2024]
Abstract
Function and structure are strongly coupled in obligated oligomers such as Triosephosphate isomerase (TIM). In animals and fungi, TIM monomers are inactive and unstable. Previously, we used ancestral sequence reconstruction to study TIM evolution and found that before these lineages diverged, the last opisthokonta common ancestor of TIM (LOCATIM) was an obligated oligomer that resembles those of extant TIMs. Notably, calorimetric evidence indicated that ancestral TIM monomers are more structured than extant ones. To further increase confidence about the function, structure, and stability of the LOCATIM, in this work, we applied two different inference methodologies and the worst plausible case scenario for both of them, to infer four sequences of this ancestor and test the robustness of their physicochemical properties. The extensive biophysical characterization of the four reconstructed sequences of LOCATIM showed very similar hydrodynamic and spectroscopic properties, as well as ligand-binding energetics and catalytic parameters. Their 3D structures were also conserved. Although differences were observed in melting temperature, all LOCATIMs showed reversible urea-induced unfolding transitions, and for those that reached equilibrium, high conformational stability was estimated (ΔGTot = 40.6-46.2 kcal/mol). The stability of the inactive monomeric intermediates was also high (ΔGunf = 12.6-18.4 kcal/mol), resembling some protozoan TIMs rather than the unstable monomer observed in extant opisthokonts. A comparative analysis of the 3D structure of ancestral and extant TIMs shows a correlation between the higher stability of the ancestral monomers with the presence of several hydrogen bonds located in the "bottom" part of the barrel.
Affiliation(s)
- Jorge Alejandro Pérez-Niño
  - Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
- Yasel Guerra
  - Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias Aplicadas, Universidad de Las Américas, Quito, Ecuador
  - Grupo de Bio-Quimioinformática, Universidad de Las Américas, Quito, Ecuador
- A Jessica Díaz-Salazar
  - Laboratorio de Biofisicoquímica, Departamento de Fisicoquímica, Facultad de Química, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
- Miguel Costas
  - Laboratorio de Biofisicoquímica, Departamento de Fisicoquímica, Facultad de Química, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
- D Alejandro Fernández-Velasco
  - Laboratorio de Fisicoquímica e Ingeniería de Proteínas, Departamento de Bioquímica, Facultad de Medicina, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
8
Reddy KD, Rasool B, Akher FB, Kutlešić N, Pant S, Boudker O. Evolutionary analysis reveals the origin of sodium coupling in glutamate transporters. bioRxiv 2024:2023.12.03.569786. [PMID: 38106174 PMCID: PMC10723334 DOI: 10.1101/2023.12.03.569786] [Indexed: 12/19/2023]
Abstract
Secondary active membrane transporters harness the energy of ion gradients to concentrate their substrates. Homologous transporters evolved to couple transport to different ions in response to changing environments and needs. The bases of such diversification, and thus principles of ion coupling, are unexplored. Employing phylogenetics and ancestral protein reconstruction, we investigated sodium-coupled transport in prokaryotic glutamate transporters, a mechanism ubiquitous across life domains and critical to neurotransmitter recycling in humans. We found that the evolutionary transition from sodium-dependent to independent substrate binding to the transporter preceded changes in the coupling mechanism. Structural and functional experiments suggest that the transition entailed allosteric mutations, making sodium binding dispensable without affecting ion-binding sites. Allosteric tuning of transporters' energy landscapes might be a widespread route of their functional diversification.
Affiliation(s)
- Krishna D. Reddy
  - Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Burha Rasool
  - Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Farideh Badichi Akher
  - Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Nemanja Kutlešić
  - Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Swati Pant
  - Dept. of Biochemistry, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
- Olga Boudker
  - Dept. of Physiology & Biophysics, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
  - Howard Hughes Medical Institute, Weill Cornell Medical College, 1300 York Ave, New York, NY 10021, USA
9
Sennett MA, Theobald DL. Extant Sequence Reconstruction: The Accuracy of Ancestral Sequence Reconstructions Evaluated by Extant Sequence Cross-Validation. J Mol Evol 2024; 92:181-206. [PMID: 38502220 PMCID: PMC10978691 DOI: 10.1007/s00239-024-10162-3] [Received: 07/12/2023] [Accepted: 02/20/2024] [Indexed: 03/21/2024]
Abstract
Ancestral sequence reconstruction (ASR) is a phylogenetic method widely used to analyze the properties of ancient biomolecules and to elucidate mechanisms of molecular evolution. Despite its increasingly widespread application, the accuracy of ASR is currently unknown, as it is generally impossible to compare resurrected proteins to the true ancestors. Which evolutionary models are best for ASR? How accurate are the resulting inferences? Here we answer these questions using a cross-validation method to reconstruct each extant sequence in an alignment with ASR methodology, a method we term "extant sequence reconstruction" (ESR). We thus can evaluate the accuracy of ASR methodology by comparing ESR reconstructions to the corresponding known true sequences. We find that a common measure of the quality of a reconstructed sequence, the average probability, is indeed a good estimate of the fraction of correct amino acids when the evolutionary model is accurate or overparameterized. However, the average probability is a poor measure for comparing reconstructions from different models, because, surprisingly, a more accurate phylogenetic model often results in reconstructions with lower probability. While better (more predictive) models may produce reconstructions with lower sequence identity to the true sequences, better models nevertheless produce reconstructions that are more biophysically similar to true ancestors. In addition, we find that a large fraction of sequences sampled from the reconstruction distribution may have fewer errors than the single most probable (SMP) sequence reconstruction, despite the fact that the SMP has the lowest expected error of all possible sequences. Our results emphasize the importance of model selection for ASR and the usefulness of sampling sequence reconstructions for analyzing ancestral protein properties. 
ESR is a powerful method for validating the evolutionary models used for ASR and can be applied in practice to any phylogenetic analysis of real biological sequences. Most significantly, ESR uses ASR methodology to provide a general method by which the biophysical properties of resurrected proteins can be compared to the properties of the true protein.
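The quantities named in this abstract can be made concrete with a toy example. The sketch below assumes per-site posterior distributions are already available as a list of dictionaries mapping residue to probability; that data layout, the function names, and the three-site `toy_posteriors` are all hypothetical and are not taken from the paper. It shows the single most probable (SMP) sequence, the average probability of the SMP residues, and sampling from the reconstruction distribution.

```python
import random


def smp_sequence(posteriors):
    """Single most probable (SMP) sequence: the top residue at each site."""
    return "".join(max(site, key=site.get) for site in posteriors)


def average_probability(posteriors):
    """Mean posterior probability of the SMP residues.

    If the posteriors are well calibrated, this is the expected fraction
    of sites at which the SMP sequence is correct.
    """
    return sum(max(site.values()) for site in posteriors) / len(posteriors)


def sample_sequences(posteriors, n, seed=0):
    """Draw n sequences by sampling each site independently from its posterior."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        residues = [
            rng.choices(list(site), weights=list(site.values()))[0]
            for site in posteriors
        ]
        samples.append("".join(residues))
    return samples


# Hypothetical three-site posterior, for illustration only.
toy_posteriors = [
    {"A": 0.9, "C": 0.1},
    {"G": 0.6, "D": 0.4},
    {"K": 1.0},
]

print(smp_sequence(toy_posteriors))
print(round(average_probability(toy_posteriors), 3))
print(sample_sequences(toy_posteriors, 5))
```

Sampling site by site treats positions as independent, which matches the standard site-independent reconstruction setting but would not capture epistasis.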
Affiliation(s)
- Michael A Sennett
  - Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA
- Douglas L Theobald
  - Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA
10
Franceus J, Rivas-Fernández JP, Lormans J, Rovira C, Desmet T. Evolution of Phosphorylase Activity in an Ancestral Glycosyltransferase. ACS Catal 2024; 14:3103-3114. [PMID: 38449530 PMCID: PMC10913872 DOI: 10.1021/acscatal.3c05819] [Received: 11/29/2023] [Revised: 01/08/2024] [Accepted: 01/11/2024] [Indexed: 03/08/2024]
Abstract
The reconstruction of ancestral sequences can offer a glimpse into the fascinating process of molecular evolution by exposing the adaptive pathways that shape the proteins found in nature today. Here, we track the evolution of the carbohydrate-active enzymes responsible for the synthesis and turnover of mannogen, a critical carbohydrate reserve in Leishmania parasites. Biochemical characterization of resurrected enzymes demonstrated that mannoside phosphorylase activity emerged in an ancestral bacterial mannosyltransferase, and later disappeared in the process of horizontal gene transfer and gene duplication in Leishmania. By shuffling through plausible historical sequence space in an ancestral mannosyltransferase, we found that mannoside phosphorylase activity could be toggled on through various combinations of mutations at positions outside of the active site. Molecular dynamics simulations showed that such mutations can affect loop rigidity and shield the active site from water molecules that disrupt key interactions, allowing α-mannose 1-phosphate to adopt a catalytically productive conformation. These findings highlight the importance of subtle distal mutations in protein evolution and suggest that the vast collection of natural glycosyltransferases may be a promising source of engineering templates for the design of tailored phosphorylases.
Affiliation(s)
- Jorick Franceus
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| | - José Pablo Rivas-Fernández
- Departament de Química Inorgànica i Orgànica (Secció de Química Orgànica) and Institut de Química Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Martí i Franquès 1, Barcelona 08028, Spain
| | - Jolien Lormans
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| | - Carme Rovira
- Departament de Química Inorgànica i Orgànica (Secció de Química Orgànica) and Institut de Química Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Martí i Franquès 1, Barcelona 08028, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| | - Tom Desmet
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| |
|
11
|
Mihalič F, Arcila D, Pettersson ME, Farkhondehkish P, Andersson E, Andersson L, Betancur-R R, Jemth P. Conservation of Affinity Rather Than Sequence Underlies a Dynamic Evolution of the Motif-Mediated p53/MDM2 Interaction in Ray-Finned Fishes. Mol Biol Evol 2024; 41:msae018. [PMID: 38301272 PMCID: PMC10901556 DOI: 10.1093/molbev/msae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/11/2023] [Revised: 12/12/2023] [Accepted: 01/22/2024] [Indexed: 02/03/2024] Open
Abstract
The transcription factor and cell cycle regulator p53 is marked for degradation by the ubiquitin ligase MDM2. The interaction between these 2 proteins is mediated by a conserved binding motif in the disordered p53 transactivation domain (p53TAD) and the folded SWIB domain in MDM2. The conserved motif in p53TAD from zebrafish displays a 20-fold weaker interaction with MDM2, compared to the interaction in human and chicken. To investigate this apparent difference, we tracked the molecular evolution of the p53TAD/MDM2 interaction among ray-finned fishes (Actinopterygii), the largest vertebrate clade. Intriguingly, phylogenetic analyses, ancestral sequence reconstructions, and binding experiments showed that different loss-of-affinity changes in the canonical binding motif within p53TAD have occurred repeatedly and convergently in different fish lineages, resulting in relatively low extant affinities (KD = 0.5 to 5 μM). However, for 11 different fish p53TAD/MDM2 interactions, nonconserved regions flanking the canonical motif increased the affinity 4- to 73-fold to be on par with the human interaction. Our findings suggest that compensating changes at conserved and nonconserved positions within the motif, as well as in flanking regions of low conservation, underlie a stabilizing selection of "functional affinity" in the p53TAD/MDM2 interaction. Such interplay complicates bioinformatic prediction of binding and calls for experimental validation. Motif-mediated protein-protein interactions involving short binding motifs and folded interaction domains are very common across multicellular life. It is likely that the evolution of affinity in motif-mediated interactions often involves an interplay between specific interactions made by conserved motif residues and nonspecific interactions by nonconserved disordered regions.
Affiliation(s)
- Filip Mihalič
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Dahiana Arcila
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Mats E Pettersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Pouria Farkhondehkish
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77483, USA
| | - Ricardo Betancur-R
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA 92093, USA
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC, Uppsala SE-75123, Sweden
| |
|
12
|
Nicoll CR, Alvigini L, Gottinger A, Cecchini D, Mannucci B, Corana F, Mascotti ML, Mattevi A. In vitro construction of the COQ metabolon unveils the molecular determinants of coenzyme Q biosynthesis. Nat Catal 2024; 7:148-160. [PMID: 38425362 PMCID: PMC7615680 DOI: 10.1038/s41929-023-01087-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/05/2023] [Accepted: 11/20/2023] [Indexed: 03/02/2024]
Abstract
Metabolons are protein assemblies that perform a series of reactions in a metabolic pathway. However, the general importance and aptitude of metabolons for enzyme catalysis remain poorly understood. In animals, biosynthesis of coenzyme Q is currently attributed to ten different proteins, with COQ3, COQ4, COQ5, COQ6, COQ7 and COQ9 forming the iconic COQ metabolon. Yet several reaction steps conducted by the metabolon remain enigmatic. To elucidate the prerequisites for animal coenzyme Q biosynthesis, we sought to construct the entire metabolon in vitro. Here we show that this approach, rooted in ancestral sequence reconstruction, reveals the enzymes responsible for the uncharacterized steps and captures the biosynthetic pathway in vitro. We demonstrate that COQ8, a kinase, increases and streamlines coenzyme Q production. Our findings provide crucial insight into how biocatalytic efficiency is regulated and enhanced by these biosynthetic engines in the context of the cell.
Affiliation(s)
- Callum R. Nicoll
- Department of Biology and Biotechnology ‘Lazzaro Spallanzani’, University of Pavia, Pavia, Italy
| | - Laura Alvigini
- Department of Biology and Biotechnology ‘Lazzaro Spallanzani’, University of Pavia, Pavia, Italy
| | - Andrea Gottinger
- Department of Biology and Biotechnology ‘Lazzaro Spallanzani’, University of Pavia, Pavia, Italy
| | - Domiziana Cecchini
- Department of Biology and Biotechnology ‘Lazzaro Spallanzani’, University of Pavia, Pavia, Italy
| | | | - Federica Corana
- ‘Centro Grandi Strumenti’, University of Pavia, Pavia, Italy
| | - María Laura Mascotti
- Molecular Enzymology Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, the Netherlands
- IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, Argentina
| | - Andrea Mattevi
- Department of Biology and Biotechnology ‘Lazzaro Spallanzani’, University of Pavia, Pavia, Italy
| |
|
13
|
Karaś P, Kochanowicz K, Pitek M, Domanski P, Obuchowski I, Tomiczek B, Liberek K. Evolution towards simplicity in bacterial small heat shock protein system. eLife 2023; 12:RP89813. [PMID: 38063373 PMCID: PMC10708888 DOI: 10.7554/elife.89813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/18/2023] Open
Abstract
Evolution can tinker with multi-protein machines and replace them with simpler single-protein systems performing equivalent functions in an equally efficient manner. It is unclear how, on a molecular level, such simplification can arise. With ancestral reconstruction and biochemical analysis, we have traced the evolution of bacterial small heat shock proteins (sHsp), which help to refold proteins from aggregates using either two proteins with different functions (IbpA and IbpB) or a secondarily single sHsp that performs both functions in an equally efficient way. Secondarily single sHsp evolved from IbpA, an ancestor specialized in strong substrate binding. Evolution of an intermolecular binding site drove the alteration of substrate binding properties, as well as the formation of higher-order oligomers. Upon two mutations in the α-crystallin domain, secondarily single sHsp interacts with aggregated substrates less tightly. Paradoxically, less efficient binding positively influences the ability of sHsp to stimulate substrate refolding, since the dissociation of sHsp from aggregates is required to initiate Hsp70-Hsp100-dependent substrate refolding. After the loss of a partner, IbpA took over its role in facilitating the sHsp dissociation from an aggregate by weakening the interaction with the substrate, which became beneficial for the refolding process. We show that the same two amino acids introduced in modern-day systems define whether the IbpA acts as a single sHsp or obligatorily cooperates with an IbpB partner. Our discoveries illuminate how one sequence has evolved to encode functions previously performed by two distinct proteins.
Affiliation(s)
- Piotr Karaś
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Klaudia Kochanowicz
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Marcin Pitek
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Przemyslaw Domanski
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Igor Obuchowski
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Barlomiej Tomiczek
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| | - Krzysztof Liberek
- Intercollegiate Faculty of Biotechnology UG-MUG, University of Gdansk, Gdańsk, Poland
| |
|
14
|
Evolution of enzyme functionality in the flavin-containing monooxygenases. Nat Commun 2023; 14:1042. [PMID: 36823138 PMCID: PMC9950137 DOI: 10.1038/s41467-023-36756-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Received: 09/14/2022] [Accepted: 02/15/2023] [Indexed: 02/25/2023] Open
Abstract
Among the molecular mechanisms of adaptation in biology, enzyme functional diversification is indispensable. By expanding their catalytic repertoires and adopting fundamentally different chemistries, animals can harness or eliminate new-found substances and xenobiotics that they are exposed to in new environments. Here, we explore the flavin-containing monooxygenases (FMOs) that are essential for xenobiotic detoxification. Employing a paleobiochemistry approach in combination with enzymology techniques, we disclose the set of historical substitutions responsible for the family's functional diversification in tetrapods. Remarkably, a few amino acid replacements differentiate an ancestral multi-tasking FMO into a more specialized monooxygenase by modulating the oxygenating flavin intermediate. Our findings substantiate an ongoing premise that enzymatic function hinges on a subset of residues that is not limited to the active site core.
|
15
|
Orlandi KN, Phillips SR, Sailer ZR, Harman JL, Harms MJ. Topiary: Pruning the manual labor from ancestral sequence reconstruction. Protein Sci 2023; 32:e4551. [PMID: 36565302 PMCID: PMC9847077 DOI: 10.1002/pro.4551] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/22/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 12/25/2022]
Abstract
Ancestral sequence reconstruction (ASR) is a powerful tool to study the evolution of proteins and thus gain deep insight into the relationships among protein sequence, structure, and function. A major barrier to its broad use is the complexity of the task: it requires multiple software packages, complex file manipulations, and expert phylogenetic knowledge. Here we introduce topiary, a software pipeline that aims to overcome this barrier. To use topiary, users prepare a spreadsheet with a handful of sequences. Topiary then: (1) Infers the taxonomic scope for the ASR study and finds relevant sequences by BLAST; (2) Does taxonomically informed sequence quality control and redundancy reduction; (3) Constructs a multiple sequence alignment; (4) Generates a maximum-likelihood gene tree; (5) Reconciles the gene tree to the species tree; (6) Reconstructs ancestral amino acid sequences; and (7) Determines branch supports. The pipeline returns annotated evolutionary trees, spreadsheets with sequences, and graphical summaries of ancestor quality. This is achieved by integrating modern phylogenetics software (Muscle5, RAxML-NG, GeneRax, and PastML) with online databases (NCBI and the Open Tree of Life). In this paper, we introduce non-expert readers to the steps required for ASR, describe the specific design choices made in topiary, provide a detailed protocol for users, and then validate the pipeline using datasets from a broad collection of protein families. Topiary is freely available for download: https://github.com/harmslab/topiary.
Affiliation(s)
- Kona N. Orlandi
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Sophia R. Phillips
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| | - Zachary R. Sailer
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| | - Joseph L. Harman
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| | - Michael J. Harms
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| |
|
16
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
|
17
|
Schulz L, Guo Z, Zarzycki J, Steinchen W, Schuller JM, Heimerl T, Prinz S, Mueller-Cajar O, Erb TJ, Hochberg GKA. Evolution of increased complexity and specificity at the dawn of form I Rubiscos. Science 2022; 378:155-160. [PMID: 36227987 DOI: 10.1126/science.abq1416] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Indexed: 11/02/2022]
Abstract
The evolution of ribulose-1,5-bisphosphate carboxylase/oxygenases (Rubiscos) that discriminate strongly between their substrate carbon dioxide and the undesired side substrate dioxygen was an important event for photosynthetic organisms adapting to an oxygenated environment. We use ancestral sequence reconstruction to recapitulate this event. We show that Rubisco increased its specificity and carboxylation efficiency through the gain of an accessory subunit before atmospheric oxygen was present. Using structural and biochemical approaches, we retrace how this subunit was gained and became essential. Our work illuminates the emergence of an adaptation to rising ambient oxygen levels, provides a template for investigating the function of interactions that have remained elusive because of their essentiality, and sheds light on the determinants of specificity in Rubisco.
Affiliation(s)
- Luca Schulz
- Department of Biochemistry and Synthetic Metabolism, Max Planck Institute for Terrestrial Microbiology, 35043 Marburg, Germany
| | - Zhijun Guo
- School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore
| | - Jan Zarzycki
- Department of Biochemistry and Synthetic Metabolism, Max Planck Institute for Terrestrial Microbiology, 35043 Marburg, Germany
| | - Wieland Steinchen
- Center for Synthetic Microbiology (SYNMIKRO), Philipps University Marburg, 35043 Marburg, Germany; Department of Chemistry, Philipps University Marburg, 35043 Marburg, Germany
| | - Jan M Schuller
- Center for Synthetic Microbiology (SYNMIKRO), Philipps University Marburg, 35043 Marburg, Germany; Department of Chemistry, Philipps University Marburg, 35043 Marburg, Germany
| | - Thomas Heimerl
- Center for Synthetic Microbiology (SYNMIKRO), Philipps University Marburg, 35043 Marburg, Germany; Department of Biology, Philipps University Marburg, 35043 Marburg, Germany
| | - Simone Prinz
- Central Electron Microscopy Facility, Max Planck Institute of Biophysics, 60438 Frankfurt am Main, Germany
| | - Oliver Mueller-Cajar
- School of Biological Sciences, Nanyang Technological University, Singapore 637551, Singapore
| | - Tobias J Erb
- Department of Biochemistry and Synthetic Metabolism, Max Planck Institute for Terrestrial Microbiology, 35043 Marburg, Germany; Center for Synthetic Microbiology (SYNMIKRO), Philipps University Marburg, 35043 Marburg, Germany
| | - Georg K A Hochberg
- Center for Synthetic Microbiology (SYNMIKRO), Philipps University Marburg, 35043 Marburg, Germany; Department of Chemistry, Philipps University Marburg, 35043 Marburg, Germany; Evolutionary Biochemistry Group, Max Planck Institute for Terrestrial Microbiology, 35043 Marburg, Germany
| |
|
18
|
Fer E, McGrath KM, Guy L, Hockenberry AJ, Kaçar B. Early divergence of translation initiation and elongation factors. Protein Sci 2022; 31:e4393. [PMID: 36250475 PMCID: PMC9601768 DOI: 10.1002/pro.4393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/11/2022] [Revised: 07/05/2022] [Accepted: 07/11/2022] [Indexed: 11/18/2022]
Abstract
Protein translation is a foundational attribute of all living cells. The translation function carried out by the ribosome critically depends on an assortment of protein interaction partners, collectively referred to as the translation machinery. Various studies suggest that the diversification of the translation machinery occurred prior to the last universal common ancestor, yet it is unclear whether the predecessors of the extant translation machinery factors were functionally distinct from their modern counterparts. Here we reconstructed the shared ancestral trajectory and subsequent evolution of essential translation factor GTPases, elongation factor EF-Tu (aEF-1A/eEF-1A), and initiation factor IF2 (aIF5B/eIF5B). Based upon their similar functions and structural homologies, it has been proposed that EF-Tu and IF2 emerged from an ancient common ancestor. We generated the phylogenetic tree of IF2 and EF-Tu proteins and reconstructed ancestral sequences corresponding to the deepest nodes in their shared evolutionary history, including the last common IF2 and EF-Tu ancestor. By identifying the residue and domain substitutions, as well as structural changes along the phylogenetic history, we developed an evolutionary scenario for the origins, divergence and functional refinement of EF-Tu and IF2 proteins. Our analyses suggest that the common ancestor of IF2 and EF-Tu was an IF2-like GTPase protein. Given the central importance of the translation machinery to all cellular life, its earliest evolutionary constraints and trajectories are key to characterizing the universal constraints and capabilities of cellular evolution.
Affiliation(s)
- Evrim Fer
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Microbiology Doctoral Training Program, University of Wisconsin-Madison, Madison, Wisconsin, USA
- NASA Center for Early Life and Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Kaitlyn M. McGrath
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin, USA
- NASA Center for Early Life and Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, Arizona, USA
| | - Lionel Guy
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Adam J. Hockenberry
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, USA
| | - Betül Kaçar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin, USA
- NASA Center for Early Life and Evolution, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
|
19
|
Nocedal I, Laub MT. Ancestral reconstruction of duplicated signaling proteins reveals the evolution of signaling specificity. eLife 2022; 11:e77346. [PMID: 35686729 PMCID: PMC9208753 DOI: 10.7554/elife.77346] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 01/25/2022] [Accepted: 05/27/2022] [Indexed: 01/30/2023] Open
Abstract
Gene duplication is crucial to generating novel signaling pathways during evolution. However, it remains unclear how the redundant proteins produced by gene duplication ultimately acquire new interaction specificities to establish insulated paralogous signaling pathways. Here, we used ancestral sequence reconstruction to resurrect and characterize a bacterial two-component signaling system that duplicated in α-proteobacteria. We determined the interaction specificities of the signaling proteins that existed before and immediately after this duplication event and then identified key mutations responsible for establishing specificity in the two systems. Just three mutations, in only two of the four interacting proteins, were sufficient to establish specificity of the extant systems. Some of these mutations weakened interactions between paralogous systems to limit crosstalk. However, others strengthened interactions within a system, indicating that the ancestral interaction, although functional, had the potential to be strengthened. Our work suggests that protein-protein interactions with such latent potential may be highly amenable to duplication and divergence.
Affiliation(s)
- Isabel Nocedal
- Department of Biology, Massachusetts Institute of Technology, Cambridge, United States
| | - Michael T Laub
- Department of Biology, Massachusetts Institute of Technology, Cambridge, United States
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, United States
| |
|
20
|
Karlsson E, Sorgenfrei FA, Andersson E, Dogan J, Jemth P, Chi CN. The dynamic properties of a nuclear coactivator binding domain are evolutionarily conserved. Commun Biol 2022; 5:286. [PMID: 35354917 PMCID: PMC8967867 DOI: 10.1038/s42003-022-03217-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 09/08/2021] [Accepted: 03/02/2022] [Indexed: 12/21/2022] Open
Abstract
Evolution of proteins is constrained by their structure and function. While there is a consensus that the plasticity of intrinsically disordered proteins relaxes the structural constraints on evolution, there is a paucity of data on the molecular details of these processes. The Nuclear Coactivator Binding Domain (NCBD) from CREB-binding protein is a protein interaction domain that contains a hydrophobic core but does not behave as a typical globular domain, and has been described as 'molten-globule like'. The highly dynamic properties of NCBD make it an interesting model system for evolutionary structure-function investigation of intrinsically disordered proteins. Here, we have compared the structure and biophysical properties of an ancient version of NCBD, present in a bilaterian animal ancestor living around 600 million years ago, with extant human NCBD. Using a combination of NMR spectroscopy, circular dichroism and kinetics, we show that although NCBD has increased its thermodynamic stability, it has retained its dynamic biophysical properties in the ligand-free state in the evolutionary lineage leading from the last common bilaterian ancestor to humans. Our findings suggest that the dynamic properties of NCBD have been maintained by purifying selection and thus are important for its function, which includes mediating several distinct protein-protein interactions.
Affiliation(s)
- Elin Karlsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Frieda A Sorgenfrei
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden; acib GmbH, Krenngasse 37, 8010 Graz c/o University of Graz, Institute of Chemistry, NAWI Graz, BioTechMed Graz, Heinrichstrasse 28, 8010, Graz, Austria
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Jakob Dogan
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Celestine N Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden; Department of Pharmaceutical Biosciences, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| |
|
21
|
Mascotti ML. Resurrecting Enzymes by Ancestral Sequence Reconstruction. Methods Mol Biol 2022; 2397:111-136. [PMID: 34813062 DOI: 10.1007/978-1-0716-1826-4_7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Indexed: 12/15/2022]
Abstract
Ancestral Sequence Reconstruction (ASR) allows one to infer the sequences of extinct proteins using the phylogeny of extant proteins. It consists of disclosing the evolutionary history (i.e., the phylogeny) of a protein family of interest and then inferring the sequences of its ancestors (i.e., the nodes in the phylogeny). Assisted by gene synthesis, the selected ancestors can be resurrected in the lab and experimentally characterized. The crucial step to succeed with ASR is starting from a reliable phylogeny. At the same time, it is of the utmost importance to have a clear idea of the evolutionary history of the family under study and the events that influenced it. This allows us to implement ASR with well-defined hypotheses and to apply the appropriate experimental methods. In recent years, ASR has become popular for testing hypotheses about the origin of functionalities and changes in activities, and for understanding the physicochemical properties of proteins, among others. In this context, the aim of this chapter is to present the ASR approach applied to the reconstruction of enzymes, i.e., proteins with catalytic roles. The spirit of this contribution is to provide a basic, hands-on guide for biochemists and biologists who are unfamiliar with molecular phylogenetics.
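To make the idea of "inferring the sequences of the nodes in a phylogeny" concrete, the following is a minimal toy sketch and not the chapter's protocol. It swaps in Fitch parsimony, a much simpler technique than the likelihood-based inference normally used for ASR, and reconstructs only the root of a fixed tree. The tree, the alignment, the function names, and the alphabetical tie-break are all hypothetical choices made for illustration.

```python
from typing import Dict, Set, Tuple, Union

# A tree is either a leaf name or a pair (left subtree, right subtree).
Tree = Union[str, Tuple["Tree", "Tree"]]


def root_state_set(tree: Tree, column: Dict[str, str]) -> Set[str]:
    """Fitch bottom-up pass for one alignment column.

    Returns the set of residues the node at the top of `tree` can take
    in a most-parsimonious reconstruction.
    """
    if isinstance(tree, str):
        return {column[tree]}
    left = root_state_set(tree[0], column)
    right = root_state_set(tree[1], column)
    common = left & right
    # Intersection if the children agree on something, otherwise union.
    return common if common else left | right


def reconstruct_root(tree: Tree, alignment: Dict[str, str]) -> str:
    """Reconstruct a root sequence column by column (ungapped alignment)."""
    length = len(next(iter(alignment.values())))
    residues = []
    for i in range(length):
        column = {name: seq[i] for name, seq in alignment.items()}
        # Assumption: ties are broken alphabetically, purely for determinism.
        residues.append(min(root_state_set(tree, column)))
    return "".join(residues)


# Hypothetical four-taxon example.
toy_tree: Tree = (("A", "B"), ("C", "D"))
toy_alignment = {"A": "MKVL", "B": "MKIL", "C": "MRVL", "D": "MRVF"}

print(reconstruct_root(toy_tree, toy_alignment))
```

At the second position the two clades disagree (K versus R), so both residues are equally parsimonious at the root. Ambiguity of this kind is why likelihood-based methods, which report a probability for each residue, are generally preferred in practice.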
Affiliation(s)
- Maria Laura Mascotti
- Molecular Enzymology group, University of Groningen, Groningen, The Netherlands; IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, Argentina
| |
|
22
|
VanAntwerp J, Finneran P, Dolgikh B, Woldring D. Ancestral Sequence Reconstruction and Alternate Amino Acid States Guide Protein Library Design for Directed Evolution. Methods Mol Biol 2022; 2491:75-86. [PMID: 35482185 DOI: 10.1007/978-1-0716-2285-8_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/14/2023]
Abstract
Engineered proteins possess nearly limitless possibilities in medical and industrial applications, but finding a precise amino acid sequence for these applications is challenging. A robust approach for discovering protein sequences with a desired functionality uses a library design method in which combinations of mutations are applied to a robust starting point. Determining useful mutations can be tortuous, yet rewarding; in this chapter, we present a novel library design method that uses information provided by ancestral sequence reconstruction (ASR) to create a library likely to have stable proteins with diverse function. ASR computational tools use a multi-sequence alignment of homologous proteins and an evolutionary model to estimate the protein sequences of the numerous common ancestors. For all ancestors, these tools calculate the probability of every amino acid occurring at each position within the sequence alignment. The alternate amino acid states at individual positions correlate to a region of stability in sequence space around the ancestral sequence, which can inform site-wise diversification within a combinatorial library. The method presented in this chapter balances the quality of results, the computational resources needed, and ease of use.
Affiliation(s)
- James VanAntwerp
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA
| | | | - Benedikt Dolgikh
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA
| | - Daniel Woldring
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA.
- Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI, USA.
| |
Collapse
|
23
|
Garcia AK, Fer E, Sephus C, Kacar B. An Integrated Method to Reconstruct Ancient Proteins. Methods Mol Biol 2022; 2569:267-281. [PMID: 36083453 DOI: 10.1007/978-1-0716-2691-7_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/24/2023]
Abstract
Proteins have played a fundamental role throughout life's history on Earth. Despite their biological importance, ancient origin, early function, and evolution of proteins are seldom able to be directly studied because few of these attributes are preserved across geologic timescales. Ancestral sequence reconstruction (ASR) provides a method to infer ancestral amino acid sequences and determine the evolutionary predecessors of modern-day proteins using phylogenetic tools. Laboratory application of ASR allows ancient sequences to be deduced from genetic information available in extant organisms and then experimentally resurrected to elucidate ancestral characteristics. In this article, we provide a generalized, stepwise protocol that considers the major elements of a well-designed ASR study and details potential sources of reconstruction bias that can reduce the relevance of historical inferences. We underscore key stages in our approach so that it may be broadly utilized to reconstruct the evolutionary histories of proteins.
Affiliation(s)
- Amanda K Garcia
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Evrim Fer
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Microbiology Doctoral Training Program, University of Wisconsin-Madison, Madison, WI, USA
- Cathryn Sephus
- Scripps Institution of Oceanography, University of California at San Diego, La Jolla, CA, USA
- Betul Kacar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
24
Buchholz PCF, van Loo B, Eenink BDG, Bornberg-Bauer E, Pleiss J. Ancestral sequences of a large promiscuous enzyme family correspond to bridges in sequence space in a network representation. J R Soc Interface 2021; 18:20210389. [PMID: 34727710 DOI: 10.1098/rsif.2021.0389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022] Open
Abstract
Evolutionary relationships of protein families can be characterized either by networks or by trees. Whereas trees allow for hierarchical grouping and reconstruction of the most likely ancestral sequences, networks lack a time axis but allow for thresholds of pairwise sequence identity to be chosen and, therefore, the clustering of family members with presumably more similar functions. Here, we use the large family of arylsulfatases and phosphonate monoester hydrolases to investigate similarities, strengths and weaknesses in tree and network representations. For varying thresholds of pairwise sequence identity, values of betweenness centrality and clustering coefficients were derived for nodes of the reconstructed ancestors to measure the propensity to act as a bridge in a network. Based on these properties, ancestral protein sequences emerge as bridges in protein sequence networks. Interestingly, many ancestral protein sequences appear close to extant sequences. Therefore, reconstructed ancestor sequences might also be interpreted as yet-to-be-identified homologues. The concept of ancestor reconstruction is compared to consensus sequences, too. It was found that hub sequences in a network, e.g. reconstructed ancestral sequences that are connected to many neighbouring sequences, share closer similarity with derived consensus sequences. Therefore, some reconstructed ancestor sequences can also be interpreted as consensus sequences.
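The notion of a sequence acting as a bridge can be made concrete with a small Python sketch that builds a network from a pairwise-identity threshold and scores each node by betweenness centrality. The sequences, node names, and the 0.5 cutoff below are hypothetical toy values, and the code uses a plain implementation of Brandes' algorithm for unweighted graphs; it is not the authors' pipeline.

```python
from collections import deque
from itertools import combinations


def identity(a, b):
    """Fraction of identical positions in two equal-length aligned sequences."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def threshold_network(seqs, cutoff):
    """Undirected graph with an edge wherever pairwise identity >= cutoff."""
    graph = {name: set() for name in seqs}
    for u, v in combinations(seqs, 2):
        if identity(seqs[u], seqs[v]) >= cutoff:
            graph[u].add(v)
            graph[v].add(u)
    return graph


def betweenness(graph):
    """Unnormalised betweenness centrality (Brandes' algorithm, unweighted)."""
    score = dict.fromkeys(graph, 0.0)
    for s in graph:
        dist = {s: 0}
        sigma = dict.fromkeys(graph, 0)   # number of shortest paths from s
        sigma[s] = 1
        preds = {v: [] for v in graph}    # predecessors on shortest paths
        order = []
        queue = deque([s])
        while queue:                      # breadth-first search from s
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(graph, 0.0)
        for w in reversed(order):         # accumulate dependencies backwards
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: c / 2 for v, c in score.items()}


# Toy aligned sequences: two extant groups and one hypothetical ancestor.
seqs = {
    "extant_a1": "AAAAAAAAAA",
    "extant_a2": "AAAAAAAAAG",
    "extant_b1": "CCCCCCCCCC",
    "ancestor":  "AAAAACCCCC",
}
net = threshold_network(seqs, cutoff=0.5)
centrality = betweenness(net)
print(centrality)
```

In this toy network the ancestor is the only node linking the two extant groups, so it alone carries non-zero betweenness; repeating the calculation over a range of cutoffs would show how that bridging role depends on the chosen identity threshold.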
Affiliation(s)
- Patrick C F Buchholz
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, Stuttgart 70569, Germany
- Bert van Loo
- Department of Applied Sciences, Northumbria University, Newcastle-upon-Tyne NE1 8ST, UK; Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, Münster 48149, Germany
- Bernard D G Eenink
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, Münster 48149, Germany
- Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, Münster 48149, Germany; Department of Protein Evolution, Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, Tübingen 72076, Germany
- Jürgen Pleiss
- Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, Stuttgart 70569, Germany
25
Abstract
Some have hypothesized that ancestral proteins were, on average, less specific than their descendants. If true, this would provide a universal axis along which to organize protein evolution and suggests that reconstructed ancestral proteins may be uniquely powerful tools for protein engineering. Ancestral sequence reconstruction studies are one line of evidence used to support this hypothesis. Previously, we performed such a study, investigating the evolution of peptide-binding specificity for the paralogs S100A5 and S100A6. The modern proteins appeared more specific than their last common ancestor (ancA5/A6), as each paralog bound a subset of the peptides bound by ancA5/A6. In this study, we revisit this transition, using quantitative phage display to measure the interactions of 30,533 random peptides with human S100A5, S100A6, and ancA5/A6. This unbiased screen reveals a different picture. While S100A5 and S100A6 do indeed bind to a subset of the peptides recognized by ancA5/A6, they also acquired new peptide partners outside of the set recognized by ancA5/A6. Our previous work showed that ancA5/A6 had lower specificity than its descendants when measured against biological targets; our new work shows that ancA5/A6 has similar specificity to the modern proteins when measured against a random set of peptide targets. This demonstrates that altered biological specificity does not necessarily indicate altered intrinsic specificity, and sounds a cautionary note for using ancestral reconstruction studies with biological targets as a means to infer global evolutionary trends in specificity.
Affiliation(s)
- Lucas C Wheeler
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA; Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA; Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA
- Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA; Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
26
Aadland K, Kolaczkowski B. Alignment-Integrated Reconstruction of Ancestral Sequences Improves Accuracy. Genome Biol Evol 2021; 12:1549-1565. [PMID: 32785673 PMCID: PMC7523730 DOI: 10.1093/gbe/evaa164] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Accepted: 08/03/2020] [Indexed: 12/31/2022] Open
Abstract
Ancestral sequence reconstruction (ASR) uses an alignment of extant protein sequences, a phylogeny describing the history of the protein family and a model of the molecular-evolutionary process to infer the sequences of ancient proteins, allowing researchers to directly investigate the impact of sequence evolution on protein structure and function. Like all statistical inferences, ASR can be sensitive to violations of its underlying assumptions. Previous studies have shown that, whereas phylogenetic uncertainty has only a very weak impact on ASR accuracy, uncertainty in the protein sequence alignment can more strongly affect inferred ancestral sequences. Here, we show that errors in sequence alignment can produce errors in ASR across a range of realistic and simplified evolutionary scenarios. Importantly, sequence reconstruction errors can lead to errors in estimates of structural and functional properties of ancestral proteins, potentially undermining the reliability of analyses relying on ASR. We introduce an alignment-integrated ASR approach that combines information from many different sequence alignments. We show that integrating alignment uncertainty improves ASR accuracy and the accuracy of downstream structural and functional inferences, often performing as well as highly accurate structure-guided alignment. Given the growing evidence that sequence alignment errors can impact the reliability of ASR studies, we recommend that future studies incorporate approaches to mitigate the impact of alignment uncertainty. Probabilistic modeling of insertion and deletion events has the potential to radically improve ASR accuracy when the model reflects the true underlying evolutionary history, but further studies are required to thoroughly evaluate the reliability of these approaches under realistic conditions.
Affiliation(s)
- Kelsey Aadland
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida
- Bryan Kolaczkowski
- Department of Microbiology and Cell Science, Institute of Food and Agricultural Sciences, University of Florida
27
Scossa F, Fernie AR. Ancestral sequence reconstruction - An underused approach to understand the evolution of gene function in plants? Comput Struct Biotechnol J 2021; 19:1579-1594. [PMID: 33868595 PMCID: PMC8039532 DOI: 10.1016/j.csbj.2021.03.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 01/03/2021] [Revised: 03/04/2021] [Accepted: 03/06/2021] [Indexed: 02/06/2023] Open
Abstract
Whilst substantial research effort has been placed on understanding the interactions of plant proteins with their molecular partners, relatively few studies in plants - by contrast to work in other organisms - address how these interactions evolve. It is thought that ancestral proteins were more promiscuous than modern proteins and that specificity often evolved following gene duplication and subsequent functional refining. However, ancestral protein resurrection studies have found that some modern proteins have evolved de novo from ancestors lacking those functions. Intriguingly, the new interactions evolved as a consequence of just a few mutations and, as such, acquisition of new functions appears to be neither difficult nor rare, however, only a few of them are incorporated into biological processes before they are lost to subsequent mutations. Here, we detail the approach of ancestral sequence reconstruction (ASR), providing a primer to reconstruct the sequence of an ancestral gene. We will present case studies from a range of different eukaryotes before discussing the few instances where ancestral reconstructions have been used in plants. As ASR is used to dig into the remote evolutionary past, we will also present some alternative genetic approaches to investigate molecular evolution on shorter timescales. We argue that the study of plant secondary metabolism is particularly well suited for ancestral reconstruction studies. Indeed, its ancient evolutionary roots and highly diverse landscape provide an ideal context in which to address the focal issue around the emergence of evolutionary novelties and how this affects the chemical diversification of plant metabolism.
Key Words
- APR, ancestral protein resurrection
- ASR, ancestral sequence reconstruction
- Ancestral sequence reconstruction
- CDS, coding sequence
- Evolution
- GR, glucocorticoid receptor
- GWAS, genome wide association study
- Genomics
- InDel, insertion/deletion
- MCMC, Markov Chain Monte Carlo
- ML, maximum likelihood
- MP, maximum parsimony
- MR, mineralocorticoid receptor
- MSA, multiple sequence alignment
- Metabolism
- NJ, neighbor-joining
- Phylogenetics
- Plants
- SFS, site frequency spectrum
Affiliation(s)
- Federico Scossa
- Max-Planck-Institute of Molecular Plant Physiology (MPI-MP), 14476 Potsdam-Golm, Germany
- Council for Agricultural Research and Economics (CREA), Research Centre for Genomics and Bioinformatics (CREA-GB), Rome, Italy
- Alisdair R. Fernie
- Max-Planck-Institute of Molecular Plant Physiology (MPI-MP), 14476 Potsdam-Golm, Germany
- Center of Plant Systems Biology and Biotechnology (CPSBB), Plovdiv, Bulgaria
28
Zhu X, Boulet A, Buckley KM, Phillips CB, Gammon MG, Oldfather LE, Moore SA, Leary SC, Cobine PA. Mitochondrial copper and phosphate transporter specificity was defined early in the evolution of eukaryotes. eLife 2021; 10:64690. [PMID: 33591272 PMCID: PMC7924939 DOI: 10.7554/elife.64690] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 11/07/2020] [Accepted: 02/15/2021] [Indexed: 12/21/2022] Open
Abstract
The mitochondrial carrier family protein SLC25A3 transports both copper and phosphate in mammals, yet in Saccharomyces cerevisiae the transport of these substrates is partitioned across two paralogs: PIC2 and MIR1. To understand the ancestral state of copper and phosphate transport in mitochondria, we explored the evolutionary relationships of PIC2 and MIR1 orthologs across the eukaryotic tree of life. Phylogenetic analyses revealed that PIC2-like and MIR1-like orthologs are present in all major eukaryotic supergroups, indicating an ancient gene duplication created these paralogs. To link this phylogenetic signal to protein function, we used structural modeling and site-directed mutagenesis to identify residues involved in copper and phosphate transport. Based on these analyses, we generated an L175A variant of mouse SLC25A3 that retains the ability to transport copper but not phosphate. This work highlights the utility of using an evolutionary framework to uncover amino acids involved in substrate recognition by mitochondrial carrier family proteins.
Affiliation(s)
- Xinyu Zhu
- Department of Biological Sciences, Auburn University, Auburn, United States
- Aren Boulet
- Department of Biochemistry, Microbiology and Immunology, University of Saskatchewan, Saskatoon, Canada
- Casey B Phillips
- Department of Biological Sciences, Auburn University, Auburn, United States
- Micah G Gammon
- Department of Biological Sciences, Auburn University, Auburn, United States
- Laura E Oldfather
- Department of Biological Sciences, Auburn University, Auburn, United States
- Stanley A Moore
- Department of Biochemistry, Microbiology and Immunology, University of Saskatchewan, Saskatoon, Canada
- Scot C Leary
- Department of Biochemistry, Microbiology and Immunology, University of Saskatchewan, Saskatoon, Canada
- Paul A Cobine
- Department of Biological Sciences, Auburn University, Auburn, United States
29
Laursen L, Čalyševa J, Gibson TJ, Jemth P. Divergent Evolution of a Protein-Protein Interaction Revealed through Ancestral Sequence Reconstruction and Resurrection. Mol Biol Evol 2021; 38:152-167. [PMID: 32750125 PMCID: PMC7782867 DOI: 10.1093/molbev/msaa198] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 12/23/2022] Open
Abstract
The postsynaptic density extends across the postsynaptic dendritic spine with discs large (DLG) as the most abundant scaffolding protein. DLG dynamically alters the structure of the postsynaptic density, thus controlling the function and distribution of specific receptors at the synapse. DLG contains three PDZ domains and one important interaction governing postsynaptic architecture is that between the PDZ3 domain from DLG and a protein called cysteine-rich interactor of PDZ3 (CRIPT). However, little is known regarding functional evolution of the PDZ3:CRIPT interaction. Here, we subjected PDZ3 and CRIPT to ancestral sequence reconstruction, resurrection, and biophysical experiments. We show that the PDZ3:CRIPT interaction is an ancient interaction, which was likely present in the last common ancestor of Eukaryotes, and that high affinity is maintained in most extant animal phyla. However, affinity is low in nematodes and insects, raising questions about the physiological function of the interaction in species from these animal groups. Our findings demonstrate how an apparently established protein-protein interaction involved in cellular scaffolding in bilaterians can suddenly be subject to dynamic evolution including possible loss of function.
Affiliation(s)
- Louise Laursen
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Jelena Čalyševa
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Faculty of Biosciences, Collaboration for Joint PhD Degree between EMBL and Heidelberg University
- Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
30
Dishman AF, Tyler RC, Fox JC, Kleist AB, Prehoda KE, Babu MM, Peterson FC, Volkman BF. Evolution of fold switching in a metamorphic protein. Science 2021; 371:86-90. [PMID: 33384377 PMCID: PMC8017559 DOI: 10.1126/science.abd8700] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Received: 07/20/2020] [Accepted: 11/11/2020] [Indexed: 12/14/2022]
Abstract
Metamorphic proteins switch between different folds, defying the protein folding paradigm. It is unclear how fold switching arises during evolution. With ancestral reconstruction and nuclear magnetic resonance, we studied the evolution of the metamorphic human protein XCL1, which has two distinct folds with different functions, making it an unusual member of the chemokine family, whose members generally adopt one conserved fold. XCL1 evolved from an ancestor with the chemokine fold. Evolution of a dimer interface, changes in structural constraints and molecular strain, and alteration of intramolecular protein contacts drove the evolution of metamorphosis. Then, XCL1 likely evolved to preferentially populate the noncanonical fold before reaching its modern-day near-equal population of folds. These discoveries illuminate how one sequence has evolved to encode multiple structures, revealing principles for protein design and engineering.
Affiliation(s)
- Acacia F Dishman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA
- Robert C Tyler
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Jamie C Fox
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Andrew B Kleist
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA
- Kenneth E Prehoda
- Institute of Molecular Biology, Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
- M Madan Babu
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Department of Structural Biology and Center for Data Driven Discovery, St. Jude Children's Research Hospital, Memphis, TN, USA
- Francis C Peterson
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
31
Kropp C, Straub K, Linde M, Babinger P. Hexamerization and thermostability emerged very early during geranylgeranylglyceryl phosphate synthase evolution. Protein Sci 2020; 30:583-596. [PMID: 33342010 PMCID: PMC7888582 DOI: 10.1002/pro.4016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 10/15/2020] [Revised: 12/09/2020] [Accepted: 12/11/2020] [Indexed: 12/12/2022]
Abstract
A large number of archaea live in hyperthermophilic environments. In consequence, their proteins need to adapt to these harsh conditions, including the enzymes that catalyze the synthesis of their membrane ether lipids. The enzyme that catalyzes the formation of the first ether bond in these lipids, geranylgeranylglyceryl phosphate synthase (GGGPS), exists as a hexamer in many hyperthermophilic archaea, and a recent study suggested that hexamerization serves for a fine‐tuning of the flexibility – stability trade‐off under hyperthermophilic conditions. We have recently reconstructed the sequences of ancestral group II GGGPS enzymes and now present a detailed biochemical characterization of nine of these predecessors, which allowed us to trace back the evolution of hexameric GGGPS and to draw conclusions about the properties of extant GGGPS branches that were not accessible to experiments up to now. Almost all ancestral GGGPS proteins formed hexamers, which demonstrates that hexamerization is even more widespread among the GGGPS family than previously assumed. Furthermore, all experimentally studied ancestral proteins showed high thermostability. Our results indicate that the hexameric oligomerization state and thermostability were present very early during the evolution of group II GGGPS, while the fine tuning of the flexibility – stability trade‐off developed very late, independent of the emergence of hexamerization.
Affiliation(s)
- Cosimo Kropp
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, Regensburg, Germany
- Kristina Straub
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, Regensburg, Germany
- Mona Linde
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, Regensburg, Germany
- Patrick Babinger
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, Regensburg, Germany
32
Longo LM, Despotović D, Weil-Ktorza O, Walker MJ, Jabłońska J, Fridmann-Sirkis Y, Varani G, Metanis N, Tawfik DS. Primordial emergence of a nucleic acid-binding protein via phase separation and statistical ornithine-to-arginine conversion. Proc Natl Acad Sci U S A 2020; 117:15731-15739. [PMID: 32561643 PMCID: PMC7355028 DOI: 10.1073/pnas.2001989117] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Indexed: 12/11/2022] Open
Abstract
De novo emergence demands a transition from disordered polypeptides into structured proteins with well-defined functions. However, can polypeptides confer functions of evolutionary relevance, and how might such polypeptides evolve into modern proteins? The earliest proteins present an even greater challenge, as they were likely based on abiotic, spontaneously synthesized amino acids. Here we asked whether a primordial function, such as nucleic acid binding, could emerge with ornithine, a basic amino acid that forms abiotically yet is absent in modern-day proteins. We combined ancestral sequence reconstruction and empiric deconstruction to unravel a gradual evolutionary trajectory leading from a polypeptide to a ubiquitous nucleic acid-binding protein. Intermediates along this trajectory comprise sequence-duplicated functional proteins built from 10 amino acid types, with ornithine as the only basic amino acid. Ornithine side chains were further modified into arginine by an abiotic chemical reaction, improving both structure and function. Along this trajectory, function evolved from phase separation with RNA (coacervates) to avid and specific double-stranded DNA binding. Our results suggest that phase-separating polypeptides may have been an evolutionary resource for the emergence of early proteins, and that ornithine, together with its postsynthesis modification to arginine, could have been the earliest basic amino acids.
Affiliation(s)
- Liam M Longo
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
- Dragana Despotović
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
- Orit Weil-Ktorza
- Institute of Chemistry, Hebrew University of Jerusalem, Jerusalem 9190401, Israel
- Matthew J Walker
- Department of Chemistry, University of Washington, Seattle, WA 98195
- Jagoda Jabłońska
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
- Yael Fridmann-Sirkis
- Life Sciences Core Facility, Weizmann Institute of Science, Rehovot 7610001, Israel
- Gabriele Varani
- Department of Chemistry, University of Washington, Seattle, WA 98195
- Norman Metanis
- Institute of Chemistry, Hebrew University of Jerusalem, Jerusalem 9190401, Israel
- Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
33
Resurrection of ancestral effector caspases identifies novel networks for evolution of substrate specificity. Biochem J 2020; 476:3475-3492. [PMID: 31675069 PMCID: PMC6874516 DOI: 10.1042/bcj20190625] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 08/27/2019] [Revised: 10/30/2019] [Accepted: 11/01/2019] [Indexed: 12/18/2022]
Abstract
Apoptotic caspases evolved with metazoans more than 950 million years ago (MYA), and a series of gene duplications resulted in two subfamilies consisting of initiator and effector caspases. The effector caspase genes (caspases-3, -6, and -7) were subsequently fixed into the Chordata phylum more than 650 MYA when the gene for a common ancestor (CA) duplicated, and the three effector caspases have persisted throughout mammalian evolution. All caspases prefer an aspartate residue at the P1 position of substrates, so each caspase evolved discrete cellular roles through changes in substrate recognition at the P4 position combined with allosteric regulation. We examined the evolution of substrate specificity in caspase-6, which prefers valine at the P4 residue, compared with caspases-3 and -7, which prefer aspartate, by reconstructing the CA of effector caspases (AncCP-Ef1) and the CA of caspase-6 (AncCP-6An). We show that AncCP-Ef1 is a promiscuous enzyme with little distinction between Asp, Val, or Leu at P4. The specificity of caspase-6 was defined early in its evolution, where AncCP-6An demonstrates a preference for Val over Asp at P4. Structures of AncCP-Ef1 and of AncCP-6An show a network of charged amino acids near the S4 pocket that, when combined with repositioning a flexible active site loop, resulted in a more hydrophobic binding pocket in AncCP-6An. The ancestral protein reconstructions show that the caspase-hemoglobinase fold has been conserved for over 650 million years and that only three substitutions in the scaffold are necessary to shift substrate selection toward Val over Asp.
34
Non-conservation of folding rates in the thioredoxin family reveals degradation of ancestral unassisted-folding. Biochem J 2020; 476:3631-3647. [PMID: 31750876 PMCID: PMC6906118 DOI: 10.1042/bcj20190739] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 10/04/2019] [Revised: 11/19/2019] [Accepted: 11/21/2019] [Indexed: 01/04/2023]
Abstract
Evolution involves not only adaptation, but also the degradation of superfluous features. Many examples of degradation at the morphological level are known (vestigial organs, for instance). However, the impact of degradation on molecular evolution has been rarely addressed. Thioredoxins serve as general oxidoreductases in all cells. Here, we report extensive mutational analyses on the folding of modern and resurrected ancestral bacterial thioredoxins. Contrary to claims from recent literature, in vitro folding rates in the thioredoxin family are not evolutionarily conserved, but span at least a ∼100-fold range. Furthermore, modern thioredoxin folding is often substantially slower than ancestral thioredoxin folding. Unassisted folding, as probed in vitro, thus emerges as an ancestral vestigial feature that underwent degradation, plausibly upon the evolutionary emergence of efficient cellular folding assistance. More generally, our results provide evidence that degradation of ancestral features shapes, not only morphological evolution, but also the evolution of individual proteins.
35
Harman JL, Loes AN, Warren GD, Heaphy MC, Lampi KJ, Harms MJ. Evolution of multifunctionality through a pleiotropic substitution in the innate immune protein S100A9. eLife 2020; 9:e54100. [PMID: 32255429 PMCID: PMC7213983 DOI: 10.7554/elife.54100] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 12/02/2019] [Accepted: 04/03/2020] [Indexed: 12/16/2022] Open
Abstract
Multifunctional proteins are evolutionary puzzles: how do proteins evolve to satisfy multiple functional constraints? S100A9 is one such multifunctional protein. It potently amplifies inflammation via Toll-like receptor four and is antimicrobial as part of a heterocomplex with S100A8. These two functions are seemingly regulated by proteolysis: S100A9 is readily degraded, while S100A8/S100A9 is resistant. We take an evolutionary biochemical approach to show that S100A9 evolved both functions and lost proteolytic resistance from a weakly proinflammatory, proteolytically resistant amniote ancestor. We identify a historical substitution that has pleiotropic effects on S100A9 proinflammatory activity and proteolytic resistance but has little effect on S100A8/S100A9 antimicrobial activity. We thus propose that mammals evolved S100A8/S100A9 antimicrobial and S100A9 proinflammatory activities concomitantly with a proteolytic 'timer' to selectively regulate S100A9. This highlights how the same mutation can have pleiotropic effects on one functional state of a protein but not another, thus facilitating the evolution of multifunctionality.
Affiliation(s)
- Joseph L Harman
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, United States
- Andrea N Loes
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, United States
- Gus D Warren
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, United States
- Maureen C Heaphy
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, United States
- Michael J Harms
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, United States
36
Scossa F, Fernie AR. The evolution of metabolism: How to test evolutionary hypotheses at the genomic level. Comput Struct Biotechnol J 2020; 18:482-500. [PMID: 32180906 PMCID: PMC7063335 DOI: 10.1016/j.csbj.2020.02.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Received: 09/01/2019] [Revised: 02/12/2020] [Accepted: 02/13/2020] [Indexed: 01/21/2023] Open
Abstract
The origin of primordial metabolism and its expansion to form the metabolic networks extant today represent excellent systems to study the impact of natural selection and the potential adaptive role of novel compounds. Here we present the current hypotheses made on the origin of life and ancestral metabolism and present the theories and mechanisms by which the large chemical diversity of plants might have emerged along evolution. In particular, we provide a survey of statistical methods that can be used to detect signatures of selection at the gene and population level, and discuss potential and limits of these methods for investigating patterns of molecular adaptation in plant metabolism.
Affiliation(s)
- Federico Scossa
  - Max-Planck-Institut für Molekulare Pflanzenphysiologie, 14476 Potsdam-Golm, Germany
  - Council for Agricultural Research and Economics (CREA), Research Centre for Genomics and Bioinformatics (CREA-GB), Via Ardeatina 546, 00178 Rome, Italy
- Alisdair R. Fernie
  - Max-Planck-Institut für Molekulare Pflanzenphysiologie, 14476 Potsdam-Golm, Germany
  - Center of Plant Systems Biology and Biotechnology (CPSBB), Plovdiv, Bulgaria
37
Chandler PG, Broendum SS, Riley BT, Spence MA, Jackson CJ, McGowan S, Buckle AM. Strategies for Increasing Protein Stability. Methods Mol Biol 2020; 2073:163-181. [PMID: 31612442 DOI: 10.1007/978-1-4939-9869-2_10] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Indexed: 01/20/2023]
Abstract
The stability of wild-type proteins is often a hurdle to their practical use in research, industry, and medicine. The route to engineering stability of a protein of interest lies largely with the available data. Where high-resolution structural data is available, rational design, based on fundamental principles of protein chemistry, can improve protein stability. Recent advances in computational biology and the use of nonnatural amino acids have also provided novel rational methods for improving protein stability. Likewise, the explosion of sequence and structural data available in public databases, in combination with improvements in freely available computational tools, has produced accessible phylogenetic approaches. Trawling modern sequence databases can identify the thermostable homologs of a target protein, and evolutionary data can be quickly generated using available phylogenetic tools. Grafting features from those thermostable homologs or ancestors provides stability improvement through a semi-rational approach. Further, molecular techniques such as directed evolution have shown great promise in delivering designer proteins. These strategies are well documented and newly accessible to the molecular biologist, allowing for rapid enhancements of protein stability.
Affiliation(s)
- Peter G Chandler
  - Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- Sebastian S Broendum
  - Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- Blake T Riley
  - Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- Matthew A Spence
  - Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- Colin J Jackson
  - Research School of Chemistry, Australian National University, Canberra, ACT, Australia
- Sheena McGowan
  - Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- Ashley M Buckle
  - Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
38
Sloutsky R, Naegle KM. ASPEN, a methodology for reconstructing protein evolution with improved accuracy using ensemble models. eLife 2019; 8:e47676. [PMID: 31621582 PMCID: PMC6797483 DOI: 10.7554/elife.47676] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 04/13/2019] [Accepted: 09/19/2019] [Indexed: 12/27/2022] Open
Abstract
Evolutionary reconstruction algorithms produce models of the evolutionary history of proteins or species. Such algorithms are highly sensitive to their inputs: the sequences used and their alignments. Here, we asked whether the variance introduced by selecting different input sequences could be used to better identify accurate evolutionary models. We subsampled from available ortholog sequences and measured the distribution of observed relationships between paralogs produced across hundreds of models inferred from the subsamples. We observed two important phenomena. First, the reproducibility of an all-sequence, single-alignment reconstruction, measured by comparing topologies inferred from 90% subsamples, directly correlates with the accuracy of that single-alignment reconstruction, producing a measurable value for something that has been traditionally unknowable. Second, topologies that are most consistent with the observations made in the ensemble are more accurate and we present a meta algorithm that exploits this property to improve model accuracy.
Affiliation(s)
- Roman Sloutsky
  - Program in Computational and Systems Biology, Washington University, St. Louis, United States
  - Department for Biomedical Engineering, Washington University, St. Louis, United States
  - Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, United States
  - Center for Biological Systems Engineering, Washington University, St. Louis, United States
- Kristen M Naegle
  - Department for Biomedical Engineering, Washington University, St. Louis, United States
  - Center for Biological Systems Engineering, Washington University, St. Louis, United States
  - Department of Biomedical Engineering, University of Virginia, Charlottesville, United States
  - Center for Public Health Genomics, University of Virginia, Charlottesville, United States
39
Savory FR, Milner DS, Miles DC, Richards TA. Ancestral Function and Diversification of a Horizontally Acquired Oomycete Carboxylic Acid Transporter. Mol Biol Evol 2019; 35:1887-1900. [PMID: 29701800 PMCID: PMC6063262 DOI: 10.1093/molbev/msy082] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Indexed: 12/14/2022] Open
Abstract
Horizontal gene transfer (HGT) can equip organisms with novel genes, expanding the repertoire of genetic material available for evolutionary innovation and allowing recipient lineages to colonize new environments. However, few studies have characterized the functions of HGT genes experimentally or examined postacquisition functional divergence. Here, we report the use of ancestral sequence reconstruction and heterologous expression in Saccharomyces cerevisiae to examine the evolutionary history of an oomycete transporter gene family that was horizontally acquired from fungi. We demonstrate that the inferred ancestral oomycete HGT transporter proteins and their extant descendants transport dicarboxylic acids which are intermediates of the tricarboxylic acid cycle. The substrate specificity profile of the most ancestral protein has largely been retained throughout the radiation of oomycetes, including in both plant and animal pathogens and in a free-living saprotroph, indicating that the ancestral HGT transporter function has been maintained by selection across a range of different lifestyles. No evidence of neofunctionalization in terms of substrate specificity was detected for different HGT transporter paralogues which have different patterns of temporal expression. However, a striking expansion of substrate range was observed for one plant pathogenic oomycete, with a HGT derived paralogue from Pythium aphanidermatum encoding a protein that enables tricarboxylic acid uptake in addition to dicarboxylic acid uptake. This demonstrates that HGT acquisitions can provide functional additions to the recipient proteome as well as the foundation material for the evolution of expanded protein functions.
Affiliation(s)
- Fiona R Savory
  - Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
- David S Milner
  - Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
- Daniel C Miles
  - Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
- Thomas A Richards
  - Living Systems Institute, School of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
40
Doroshkov AV, Konstantinov DK, Afonnikov DA, Gunbin KV. The evolution of gene regulatory networks controlling Arabidopsis thaliana L. trichome development. BMC Plant Biol 2019; 19:53. [PMID: 30813891 PMCID: PMC6393967 DOI: 10.1186/s12870-019-1640-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Indexed: 05/09/2023]
Abstract
BACKGROUND: The variation in structure and function of gene regulatory networks (GRNs) participating in organism development is key to understanding species-specific evolutionary strategies. Even the tiniest modification of a developmental GRN might result in a substantial change to a complex morphogenetic pattern. The great variety of trichomes and their accessibility make them a useful model for studying the molecular processes of cell fate determination, cell cycle control and cellular morphogenesis. A large number of genes regulating the morphogenesis of A. thaliana trichomes have now been described. Here we aimed to study the evolution of the GRN that defines trichome formation and to evaluate its importance in other developmental processes. RESULTS: To study the evolution of the trichome formation GRN, we combined classical phylogenetic analysis with information on the GRN topology and composition in major plant taxa. This approach allowed us to estimate both the times of evolutionary emergence of the GRN components, which are mainly proteins, and the relative rates of their molecular evolution. Various simplifications of protein structure (based on the position of amino acid residues in the protein globule, secondary structure type, and structural disorder) allowed us to demonstrate evolutionary associations between changes in protein globules and speciation/duplication events. We discuss their potential involvement in protein-protein interactions and GRN function. CONCLUSIONS: We hypothesize that the divergence and/or specialization of the trichome-forming GRN is linked to the emergence of plant taxa. Information about the structural targets of protein evolution in the GRN may predict switching points in the functioning of gene networks in the course of evolution. We also propose a list of candidate genes responsible for the development of trichomes in a wide range of plant species.
Affiliation(s)
- Alexey V. Doroshkov
  - The Siberian Branch of the Russian Academy of Sciences (IC&G SB RAS), The Institute of Cytology and Genetics, Novosibirsk, Russia
  - Novosibirsk State University (NSU), Novosibirsk, Russia
- Dmitrii K. Konstantinov
  - The Siberian Branch of the Russian Academy of Sciences (IC&G SB RAS), The Institute of Cytology and Genetics, Novosibirsk, Russia
  - Novosibirsk State University (NSU), Novosibirsk, Russia
- Dmitrij A. Afonnikov
  - The Siberian Branch of the Russian Academy of Sciences (IC&G SB RAS), The Institute of Cytology and Genetics, Novosibirsk, Russia
  - Novosibirsk State University (NSU), Novosibirsk, Russia
- Konstantin V. Gunbin
  - Novosibirsk State University (NSU), Novosibirsk, Russia
  - School of Life Science, Immanuel Kant Federal Baltic University, Kaliningrad, Russia
  - Center of Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
41
Musil M, Konegger H, Hon J, Bednar D, Damborsky J. Computational Design of Stable and Soluble Biocatalysts. ACS Catal 2018. [DOI: 10.1021/acscatal.8b03613] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Indexed: 01/04/2023]
Affiliation(s)
- Milos Musil
  - Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
  - IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 612 66 Brno, Czech Republic
  - International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
- Hannes Konegger
  - Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
  - International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
- Jiri Hon
  - Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
  - IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 612 66 Brno, Czech Republic
  - International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
- David Bednar
  - Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
  - International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
- Jiri Damborsky
  - Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
  - International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
42
Reconstructing the evolutionary history of F420-dependent dehydrogenases. Sci Rep 2018; 8:17571. [PMID: 30514849 PMCID: PMC6279831 DOI: 10.1038/s41598-018-35590-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Received: 08/15/2018] [Accepted: 08/24/2018] [Indexed: 12/20/2022] Open
Abstract
During the last decade the number of characterized F420-dependent enzymes has significantly increased. Many of these deazaflavoproteins share a TIM-barrel fold and are structurally related to FMN-dependent luciferases and monooxygenases. In this work, we traced the origin and evolutionary history of the F420-dependent enzymes within the luciferase-like superfamily. By a thorough phylogenetic analysis we inferred that the F420-dependent enzymes emerged from a FMN-dependent common ancestor. Furthermore, the data show that during evolution, the family of deazaflavoproteins split into two well-defined groups of enzymes: the F420-dependent dehydrogenases and the F420-dependent reductases. By such event, the dehydrogenases specialized in generating the reduced deazaflavin cofactor, while the reductases employ the reduced F420 for catalysis. Particularly, we focused on investigating the dehydrogenase subfamily and demonstrated that this group diversified into three types of dehydrogenases: the already known F420-dependent glucose-6-phosphate dehydrogenases, the F420-dependent alcohol dehydrogenases, and the sugar-6-phosphate dehydrogenases that were identified in this study. By reconstructing and experimentally characterizing ancestral and extant representatives of F420-dependent dehydrogenases, their biochemical properties were investigated and compared. We propose an evolutionary path for the emergence and diversification of the TIM-barrel fold F420-dependent dehydrogenases subfamily.
43
Liu Q, Onal P, Datta RR, Rogers JM, Schmidt-Ott U, Bulyk ML, Small S, Thornton JW. Ancient mechanisms for the evolution of the bicoid homeodomain's function in fly development. eLife 2018; 7:e34594. [PMID: 30298815 PMCID: PMC6177261 DOI: 10.7554/elife.34594] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 12/22/2017] [Accepted: 07/28/2018] [Indexed: 12/14/2022] Open
Abstract
The ancient mechanisms that caused developmental gene regulatory networks to diversify among distantly related taxa are not well understood. Here we use ancestral protein reconstruction, biochemical experiments, and developmental assays of transgenic animals carrying reconstructed ancestral genes to investigate how the transcription factor Bicoid (Bcd) evolved its central role in anterior-posterior patterning in flies. We show that most of Bcd's derived functions are attributable to evolutionary changes within its homeodomain (HD) during a phylogenetic interval >140 million years ago. A single substitution from this period (Q50K) accounts almost entirely for the evolution of Bcd's derived DNA specificity in vitro. In transgenic embryos expressing the reconstructed ancestral HD, however, Q50K confers activation of only a few of Bcd's transcriptional targets and yields a very partial rescue of anterior development. Adding a second historical substitution (M54R) confers regulation of additional Bcd targets and further rescues anterior development. These results indicate that two epistatically interacting mutations played a major role in the evolution of Bcd's controlling regulatory role in early development. They also show how ancestral sequence reconstruction can be combined with in vivo characterization of transgenic animals to illuminate the historical mechanisms of developmental evolution.
Affiliation(s)
- Qinwen Liu
  - Department of Ecology and Evolution, University of Chicago, Chicago, United States
- Pinar Onal
  - Department of Biology, New York University, New York, United States
- Rhea R Datta
  - Department of Biology, New York University, New York, United States
- Julia M Rogers
  - Committee on Higher Degrees in Biophysics, Harvard University, Cambridge, United States
  - Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, United States
- Urs Schmidt-Ott
  - Department of Organismal Biology and Anatomy, University of Chicago, Chicago, United States
- Martha L Bulyk
  - Committee on Higher Degrees in Biophysics, Harvard University, Cambridge, United States
  - Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, United States
  - Department of Pathology, Brigham and Women’s Hospital and Harvard Medical School, Boston, United States
- Stephen Small
  - Department of Biology, New York University, New York, United States
- Joseph W Thornton
  - Department of Ecology and Evolution, University of Chicago, Chicago, United States
  - Department of Human Genetics, University of Chicago, Chicago, United States
44
Grinshpon RD, Williford A, Titus-McQuillan J, Clark AC. The CaspBase: a curated database for evolutionary biochemical studies of caspase functional divergence and ancestral sequence inference. Protein Sci 2018; 27:1857-1870. [PMID: 30076665 PMCID: PMC6199153 DOI: 10.1002/pro.3494] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 06/20/2018] [Revised: 07/31/2018] [Accepted: 08/01/2018] [Indexed: 12/22/2022]
Abstract
Sequence databases are powerful tools for the contemporary scientists' toolkit. However, most functional annotations in public databases are determined computationally and are not verified by a human expert. While hypotheses generated from computational studies are now amenable to experimentation, the quality of the results relies on the quality of input data. We developed the CaspBase to expedite high-quality dataset compilation of annotated caspase sequences, to maximize phylogenetic signal, and to reduce the noise contributed from public databanks. We describe our methods of curation for the CaspBase and how researchers can acquire sequences from CaspBase.org. Our immediate goal for developing the CaspBase was to optimize the ancestral protein reconstruction (APR) of caspases, and we demonstrate the utility of the CaspBase in APR studies. We also developed the Common Position (CP) system for comparing human caspase family paralogs and suggest the CP system as an update to current reporting methods of caspase amino acid positions. We present a standardized multiple sequence alignment (MSA) for the CP system and show the advantage of using large databases such as the CaspBase in defining structural positions in proteins. Although the results described here pertain to caspase evolution and structure-function studies, the methods can be adapted to any gene family.
Affiliation(s)
- Robert D. Grinshpon
  - Department of Molecular and Structural Biochemistry, NC State University, Raleigh, North Carolina 27608
- Anna Williford
  - Department of Biology, University of Texas at Arlington, Arlington, Texas 76019
- A. Clay Clark
  - Department of Biology, University of Texas at Arlington, Arlington, Texas 76019
45
Lim SA, Marqusee S. The burst-phase folding intermediate of ribonuclease H changes conformation over evolutionary history. Biopolymers 2018; 109:e23086. [PMID: 29152711 PMCID: PMC6047922 DOI: 10.1002/bip.23086] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Received: 08/03/2017] [Revised: 10/23/2017] [Accepted: 10/30/2017] [Indexed: 11/06/2022]
Abstract
The amino acid sequence encodes the energy landscape of a protein. Therefore, we expect evolutionary mutations to change features of the protein energy landscape, including the conformations adopted by a polypeptide as it folds to its native state. Ribonucleases H (RNase H) from Escherichia coli and Thermus thermophilus both fold via a partially folded intermediate in which the core region of the protein (helices A-D and strands 4-5) is structured. Strand 1, however, uniquely contributes to the T. thermophilus RNase H folding intermediate (Icore+1), but not the E. coli RNase H intermediate (Icore) (Rosen & Marqusee, PLoS One 2015). We explore the origin of this difference by characterizing the folding intermediate of seven ancestral RNases H spanning the evolutionary history of these two homologs. Using fragment models with or without strand 1 and FRET probes to characterize the folding intermediate of each ancestor, we find a distinct evolutionary trend across the family: the involvement of strand 1 in the folding intermediate is an ancestral feature that is maintained in the thermophilic lineage and is gradually lost in the mesophilic lineage. Evolutionary sequence changes indeed modulate the conformations present on the folding landscape and altered the folding trajectory of RNase H.
Affiliation(s)
- Shion An Lim
  - Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
  - Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
- Susan Marqusee
  - Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
  - Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
46
Wheeler LC, Anderson JA, Morrison AJ, Wong CE, Harms MJ. Conservation of Specificity in Two Low-Specificity Proteins. Biochemistry 2017; 57:684-695. [PMID: 29240404 DOI: 10.1021/acs.biochem.7b01086] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Indexed: 01/25/2023]
Abstract
Many regulatory proteins bind peptide regions of target proteins and modulate their activity. Such regulatory proteins can often interact with highly diverse target peptides. In many instances, it is not known if the peptide-binding interface discriminates targets in a biological context, or whether biological specificity is achieved exclusively through external factors such as subcellular localization. We used an evolutionary biochemical approach to distinguish these possibilities for two such low-specificity proteins: S100A5 and S100A6. We used isothermal titration calorimetry to study the binding of peptides with diverse sequence and biochemistry to human S100A5 and S100A6. These proteins bound distinct, but overlapping, sets of peptide targets. We then studied the peptide binding properties of orthologs sampled from across five amniote species. Binding specificity was conserved along all lineages, for the last 320 million years, despite the low specificity of each protein. We used ancestral sequence reconstruction to determine the binding specificity of the last common ancestor of the paralogs. The ancestor bound the entire set of peptides bound by modern S100A5 and S100A6 proteins, suggesting that paralog specificity evolved via subfunctionalization. To rule out the possibility that specificity is conserved because it is difficult to modify, we identified a single historical mutation that, when reverted in human S100A5, gave it the ability to bind an S100A6-specific peptide. These results reveal strong evolutionary constraints on peptide binding specificity. Despite being able to bind a large number of targets, the specificity of S100 peptide interfaces is likely important for the biology of these proteins.
Affiliation(s)
- Lucas C Wheeler
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States
- Jeremy A Anderson
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States
- Anneliese J Morrison
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States
- Caitlyn E Wong
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States
- Michael J Harms
  - Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
  - Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States
47
Kacar B, Guy L, Smith E, Baross J. Resurrecting ancestral genes in bacteria to interpret ancient biosignatures. Philos Trans A Math Phys Eng Sci 2017; 375:20160352. [PMID: 29133450 PMCID: PMC5686408 DOI: 10.1098/rsta.2016.0352] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Accepted: 04/17/2017] [Indexed: 05/04/2023]
Abstract
Two datasets, the geologic record and the genetic content of extant organisms, provide complementary insights into the history of how key molecular components have shaped or driven global environmental and macroevolutionary trends. Changes in global physico-chemical modes over time are thought to be a consistent feature of this relationship between Earth and life, as life is thought to have been optimizing protein functions for the entirety of its approximately 3.8 billion years of history on the Earth. Organismal survival depends on how well critical genetic and metabolic components can adapt to their environments, reflecting an ability to optimize efficiently to changing conditions. The geologic record provides an array of biologically independent indicators of macroscale atmospheric and oceanic composition, but provides little in the way of the exact behaviour of the molecular components that influenced the compositions of these reservoirs. By reconstructing sequences of proteins that might have been present in ancient organisms, we can downselect to a subset of possible sequences that may have been optimized to these ancient environmental conditions. How can one use modern life to reconstruct ancestral behaviours? Configurations of ancient sequences can be inferred from the diversity of extant sequences, and then resurrected in the laboratory to ascertain their biochemical attributes. One way to augment sequence-based, single-gene methods to obtain a richer and more reliable picture of the deep past, is to resurrect inferred ancestral protein sequences in living organisms, where their phenotypes can be exposed in a complex molecular-systems context, and then to link consequences of those phenotypes to biosignatures that were preserved in the independent historical repository of the geological record. As a first step beyond single-molecule reconstruction to the study of functional molecular systems, we present here the ancestral sequence reconstruction of the beta-carbonic anhydrase protein. We assess how carbonic anhydrase proteins meet our selection criteria for reconstructing ancient biosignatures in the laboratory, which we term palaeophenotype reconstruction. This article is part of the themed issue 'Reconceptualizing the origins of life'.
Affiliation(s)
- Betul Kacar
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Lionel Guy
- Department of Medical Biochemistry and Microbiology, Uppsala University, 75123 Uppsala, Sweden
- Eric Smith
- Earth-Life Science Institute, Meguro-ku, Tokyo 152-8550, Japan
- Santa Fe Institute, Santa Fe, NM 87501, USA
- John Baross
- The School of Oceanography, University of Washington, Seattle, WA 98105, USA
|
48
|
Starr TN, Picton LK, Thornton JW. Alternative evolutionary histories in the sequence space of an ancient protein. Nature 2017; 549:409-413. [PMID: 28902834 PMCID: PMC6214350 DOI: 10.1038/nature23902] [Citation(s) in RCA: 130] [Impact Index Per Article: 16.3] [Received: 03/23/2017] [Accepted: 08/08/2017] [Indexed: 12/28/2022]
Abstract
To understand why molecular evolution turned out as it did, we must characterize not only the path that evolution followed across the space of possible molecular sequences but also the many alternative trajectories that could have been taken but were not. A large-scale comparison of real and possible histories would establish whether the outcome of evolution represents an optimal state driven by natural selection or the contingent product of historical chance events; it would also reveal how the underlying distribution of functions across sequence space shaped historical evolution. Here we combine ancestral protein reconstruction with deep mutational scanning to characterize alternative histories in the sequence space around an ancient transcription factor, which evolved a novel biological function through well-characterized mechanisms. We find hundreds of alternative protein sequences that use diverse biochemical mechanisms to perform the derived function at least as well as the historical outcome. These alternatives all require prior permissive substitutions that do not enhance the derived function, but not all require the same permissive changes that occurred during history. We find that if evolution had begun from a different starting point within the network of sequences encoding the ancestral function, outcomes with different genetic and biochemical forms would probably have resulted; this contingency arises from the distribution of functional variants in sequence space and epistasis between residues. Our results illuminate the topology of the vast space of possibilities from which history sampled one path, highlighting how the outcome of evolution depends on a serial chain of compounding chance events.
Affiliation(s)
- Tyler N Starr
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, USA
- Lora K Picton
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
- Joseph W Thornton
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
|
49
|
Tessier CJG, Emlaw JR, Cao ZQ, Pérez-Areales FJ, Salameh JPJ, Prinston JE, McNulty MS, daCosta CJB. Back to the future: Rational maps for exploring acetylcholine receptor space and time. Biochimica et Biophysica Acta - Proteins and Proteomics 2017; 1865:1522-1528. [PMID: 28844740 DOI: 10.1016/j.bbapap.2017.08.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Received: 06/16/2017] [Revised: 08/09/2017] [Accepted: 08/11/2017] [Indexed: 12/27/2022]
Abstract
Global functions of nicotinic acetylcholine receptors, such as subunit cooperativity and compatibility, likely emerge from a network of amino acid residues distributed across the entire pentameric complex. Identification of such networks has stymied traditional approaches to acetylcholine receptor structure and function, likely due to the cryptic interdependency of their underlying amino acid residues. An emerging evolutionary biochemistry approach, which traces the evolutionary history of acetylcholine receptor subunits, allows for rational mapping of acetylcholine receptor sequence space, and offers new hope for uncovering the amino acid origins of these enigmatic properties.
Affiliation(s)
- Christian J G Tessier
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Johnathon R Emlaw
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Zhuo Qian Cao
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- F Javier Pérez-Areales
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Jean-Paul J Salameh
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Jethro E Prinston
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Melissa S McNulty
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
- Corrie J B daCosta
- Department of Chemistry and Biomolecular Sciences, Centre for Chemical and Synthetic Biology, University of Ottawa, 10 Marie-Curie, Ottawa, Ontario K1N 6N5, Canada
|
50
|
Akanuma S. Characterization of Reconstructed Ancestral Proteins Suggests a Change in Temperature of the Ancient Biosphere. Life (Basel) 2017; 7:life7030033. [PMID: 28783077 PMCID: PMC5617958 DOI: 10.3390/life7030033] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Received: 07/19/2017] [Revised: 08/02/2017] [Accepted: 08/03/2017] [Indexed: 01/02/2023] Open
Abstract
Understanding the evolution of ancestral life, and especially the ability of some organisms to flourish in the variable environments experienced in Earth’s early biosphere, requires knowledge of the characteristics and the environment of these ancestral organisms. Information about early life and environmental conditions has been obtained from fossil records and geological surveys. Recent advances in phylogenetic analysis, and an increasing number of protein sequences available in public databases, have made it possible to infer ancestral protein sequences possessed by ancient organisms. However, the in silico studies that assess the ancestral base content of ribosomal RNAs, the frequency of each amino acid in ancestral proteins, and estimate the environmental temperatures of ancient organisms, show conflicting results. The characterization of ancestral proteins reconstructed in vitro suggests that ancient organisms had very thermally stable proteins, and therefore were thermophilic or hyperthermophilic. Experimental data supports the idea that only thermophilic ancestors survived the catastrophic increase in temperature of the biosphere that was likely associated with meteorite impacts during the early history of Earth. In addition, by expanding the timescale and including more ancestral proteins for reconstruction, it appears as though the Earth’s surface temperature gradually decreased over time, from Archean to present.
Affiliation(s)
- Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama 359-1192, Japan.
|