1
|
Tran TTQ, Narayanan C, Loes AN, Click TH, Pham NTH, Létourneau M, Harms MJ, Calmettes C, Agarwal PK, Doucet N. Ancestral sequence reconstruction dissects structural and functional differences among eosinophil ribonucleases. J Biol Chem 2024; 300:107280. [PMID: 38588810 DOI: 10.1016/j.jbc.2024.107280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 03/30/2024] [Accepted: 04/03/2024] [Indexed: 04/10/2024] Open
Abstract
Evolutionarily conserved structural folds can give rise to diverse biological functions, yet predicting atomic-scale interactions that contribute to the emergence of novel activities within such folds remains challenging. Pancreatic-type ribonucleases illustrate this complexity, sharing a core structure that has evolved to accommodate varied functions. In this study, we used ancestral sequence reconstruction to probe evolutionary and molecular determinants that distinguish biological activities within eosinophil members of the RNase 2/3 subfamily. Our investigation unveils functional, structural, and dynamical behaviors that differentiate the evolved ancestral ribonuclease (AncRNase) from its contemporary eosinophil RNase orthologs. Leveraging the potential of ancestral reconstruction for protein engineering, we used AncRNase predictions to design a minimal 4-residue variant that transforms human RNase 2 into a chimeric enzyme endowed with the antimicrobial and cytotoxic activities of RNase 3 members. This work provides unique insights into mutational and evolutionary pathways governing structure, function, and conformational states within the eosinophil RNase subfamily, offering potential for targeted modulation of RNase-associated functions.
Collapse
Affiliation(s)
- Thi Thanh Quynh Tran
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, Laval, Quebec, Canada
| | - Chitra Narayanan
- Department of Chemistry, York College, City University of New York (CUNY), Jamaica, New York, USA
| | - Andrea N Loes
- Institute of Molecular Biology and Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| | - Timothy H Click
- Chemistry and Biochemistry, University of Mary, Bismarck, North Dakota, USA
| | - N T Hang Pham
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, Laval, Quebec, Canada
| | - Myriam Létourneau
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, Laval, Quebec, Canada
| | - Michael J Harms
- Institute of Molecular Biology and Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA
| | - Charles Calmettes
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, Laval, Quebec, Canada; PROTEO, The Quebec Network for Research on Protein Function, Engineering, and Applications, UQAM, Montréal, Quebec, Canada
| | - Pratul K Agarwal
- Department of Physiological Sciences and High-Performance Computing Center, Oklahoma State University, Stillwater, Oklahoma, USA
| | - Nicolas Doucet
- Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique (INRS), Université du Québec, Laval, Quebec, Canada; PROTEO, The Quebec Network for Research on Protein Function, Engineering, and Applications, UQAM, Montréal, Quebec, Canada.
| |
Collapse
|
2
|
Glenn SJ, Gentry-Lear Z, Shavlik M, Harms MJ, Asaki TJ, Baylink A. Bacterial vampirism mediated through taxis to serum. bioRxiv 2024:2023.07.07.548164. [PMID: 37461633 PMCID: PMC10350070 DOI: 10.1101/2023.07.07.548164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
Bacteria of the family Enterobacteriaceae are associated with gastrointestinal (GI) bleeding and bacteremia and are a leading cause of death, from sepsis, for individuals with inflammatory bowel diseases. The bacterial behaviors and mechanisms underlying why these bacteria are prone to bloodstream entry remains poorly understood. Herein, we report that clinical isolates of non-typhoidal Salmonella enterica serovars, Escherichia coli, and Citrobacter koseri are rapidly attracted toward sources of human serum. To simulate GI bleeding, we utilized a custom injection-based microfluidics device and found that femtoliter volumes of human serum are sufficient to induce the bacterial population to swim toward and aggregate at the serum source. This response is orchestrated through chemotaxis, and a major chemical cue driving chemoattraction is L-serine, an amino acid abundant in serum that is recognized through direct binding by the chemoreceptor Tsr. We report the first crystal structures of Salmonella Typhimurium Tsr in complex with L-serine and identify a conserved amino acid recognition motif for L-serine shared among Tsr orthologues. By mapping the phylogenetic distribution of this chemoreceptor we found Tsr to be widely conserved among Enterobacteriaceae and numerous World Health Organization priority pathogens associated with bloodstream infections. Lastly, we find that Enterobacteriaceae use human serum as a source of nutrients for growth and that chemotaxis and the chemoreceptor Tsr provides a competitive advantage for migration into enterohaemorrhagic lesions. We term this bacterial behavior of taxis toward serum, colonization of hemorrhagic lesions, and the consumption of serum nutrients, as 'bacterial vampirism' which may relate to the proclivity of Enterobacteriaceae for bloodstream infections.
Collapse
|
3
|
Gentry-Lear Z, Franco K, Shavlik M, Harms MJ, Baylink A. Navigating contradictions: Salmonella Typhimurium chemotactic responses to conflicting chemoeffector signals show parity with bacterial growth benefits. bioRxiv 2024:2024.01.18.576330. [PMID: 38293242 PMCID: PMC10827161 DOI: 10.1101/2024.01.18.576330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Many bacteria that colonize the guts of animals use chemotaxis to direct swimming motility and select sites for colonization based on sources of effectors derived from the host, diet, and microbial competitors of the local environ. The complex ecosystem of the gastrointestinal tract contains mixtures of chemoattractants and chemorepellents, but it remains poorly understood how swimming bacteria navigate conflicting signals. The enteric pathogen Salmonella Typhimurium possesses Tsr, a chemoreceptor protein that directs both chemoattraction and chemorepulsion responses, which we employed as a model to study chemotaxis in the presence of conflicting effector stimuli. We investigated how S. Typhimurium responds to human fecal matter, an effector source in the enteric lumen that contains high concentrations of indole, a bacteriostatic chemorepellent produced by the native commensals of the microbiota, and also nutrients such as l-serine, a chemoattractant. The indole concentration in human feces is more than 12-fold the concentration required for half-maximal chemorepulsion, however, we find S. Typhimurium, and various clinical isolates of non-typhoidal S. enterica serovars, are strongly attracted to liquid fecal matter. We further investigated the chemotactic responses of S. Typhimurium to titrations of indole and l-serine and revealed that chemorepulsion to indole is overridden in the presence of excess l-serine. We capture the inversion of these two opposing taxis behaviors in a phenomenon we define as "chemohalation" in which the bacteria organize into a halo around the treatment source with an interior zone of avoidance, which represents a compromise between chemoattraction and chemorepulsion. Growth analyses reveal that the chemotactic responses to these opposing effectors align chemoattraction and chemorepulsion with the relative growth of the bacteria in culture. Hence, our study supports the view that evolution has finely tuned chemotaxis to assess environmental habitability by evaluating the tradeoffs in bacterial growth based on the local combination of effectors.
Collapse
Affiliation(s)
- Zealon Gentry-Lear
- University of Oregon, Department of Chemistry & Biochemistry, Eugene, OR, 97403
| | - Kailie Franco
- Washington State University, Department of Veterinary Microbiology and Pathology, Pullman, WA 99164
| | - Michael Shavlik
- University of Oregon, Department of Chemistry & Biochemistry, Eugene, OR, 97403
| | - Michael J. Harms
- University of Oregon, Department of Chemistry & Biochemistry, Eugene, OR, 97403
- University of Oregon, Institute of Molecular Biology, Eugene, OR, 97403
| | - Arden Baylink
- Washington State University, Department of Veterinary Microbiology and Pathology, Pullman, WA 99164
| |
Collapse
|
4
|
Chisholm LO, Orlandi KN, Phillips SR, Shavlik MJ, Harms MJ. Ancestral Reconstruction and the Evolution of Protein Energy Landscapes. Annu Rev Biophys 2023; 53. [PMID: 38134334 DOI: 10.1146/annurev-biophys-030722-125440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
A protein's sequence determines its conformational energy landscape. This, in turn, determines the protein's function. Understanding the evolution of new protein functions therefore requires understanding how mutations alter the protein energy landscape. Ancestral sequence reconstruction (ASR) has proven a valuable tool for tackling this problem. In ASR, one phylogenetically infers the sequences of ancient proteins, allowing characterization of their properties. When coupled to biophysical, biochemical, and functional characterization, ASR can reveal how historical mutations altered the energy landscape of ancient proteins, allowing the evolution of enzyme activity, altered conformations, binding specificity, oligomerization, and many other protein features. In this article, we review how ASR studies have been used to dissect the evolution of energy landscapes. We also discuss ASR studies that reveal how energy landscapes have shaped protein evolution. Finally, we propose that thinking about evolution from the perspective of an energy landscape can improve how we approach and interpret ASR studies. Expected final online publication date for the Annual Review of Biophysics, Volume 53 is May 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Lauren O Chisholm
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Kona N Orlandi
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Shavlik
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| |
Collapse
|
5
|
Banks OGB, Harms MJ, McKnight JN, McKnight LE. Simultaneous Mapping of DNA Binding and Nucleosome Positioning with SpLiT-ChEC. bioRxiv 2023:2023.07.03.547581. [PMID: 37461563 PMCID: PMC10349973 DOI: 10.1101/2023.07.03.547581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/25/2023]
Abstract
The organization of chromatin - including the positions of nucleosomes and the binding of other proteins to DNA - helps define transcriptional profiles in eukaryotic organisms. While techniques like ChIP-Seq and MNase-Seq can map protein-DNA and nucleosome localization separately, assays designed to simultaneously capture nucleosome positions and protein-DNA interactions can produce a detailed picture of the chromatin landscape. Most assays that monitor chromatin organization and protein binding rely on antibodies, which often exhibit nonspecific binding, and/or the addition of bulky adducts to the DNA-binding protein being studied, which can affect their expression and activity. Here, we describe SpyCatcher Linked Targeting of Chromatin Endogenous Cleavage (SpLiT-ChEC), where a 13-amino acid SpyTag peptide, appended to a protein of interest, serves as a highly-specific targeting moiety for in situ enzymatic digestion. The SpyTag/SpyCatcher system forms a covalent bond, linking the target protein and a co-expressed MNase-SpyCatcher fusion construct. SpyTagged proteins are expressed from endogenous loci, whereas MNase-SpyCatcher expression is induced immediately before harvesting cultures. MNase is activated with high concentrations of calcium, which primarily digests DNA near target protein binding sites. By sequencing the DNA fragments released by targeted MNase digestion, we found that this method recovers information on protein binding and proximal nucleosome positioning. SpLiT-ChEC provides precise temporal control that we anticipate can be used to monitor chromatin under various conditions and at distinct points in the cell cycle.
Collapse
Affiliation(s)
- Orion G. B. Banks
- Institute of Molecular Biology, University of Oregon, Eugene OR 97403, USA
| | - Michael J. Harms
- Institute of Molecular Biology, University of Oregon, Eugene OR 97403, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene OR 97403, USA
| | - Jeffrey. N. McKnight
- Institute of Molecular Biology, University of Oregon, Eugene OR 97403, USA
- Knight Campus for Accelerated Research, University of Oregon, Eugene OR 97403, USA
| | - Laura E. McKnight
- Institute of Molecular Biology, University of Oregon, Eugene OR 97403, USA
- Knight Campus for Accelerated Research, University of Oregon, Eugene OR 97403, USA
| |
Collapse
|
6
|
Wonderlick DR, Widom JR, Harms MJ. Disentangling contact and ensemble epistasis in a riboswitch. Biophys J 2023; 122:1600-1612. [PMID: 36710492 PMCID: PMC10183321 DOI: 10.1016/j.bpj.2023.01.033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 01/09/2023] [Accepted: 01/24/2023] [Indexed: 01/29/2023] Open
Abstract
Mutations introduced into macromolecules often exhibit epistasis, where the effect of one mutation alters the effect of another. Knowing the mechanisms that lead to epistasis is important for understanding how macromolecules work and evolve, as well as for effective macromolecular engineering. Here, we investigate the interplay between "contact epistasis" (epistasis arising from physical interactions between mutated residues) and "ensemble epistasis" (epistasis that occurs when a mutation redistributes the conformational ensemble of a macromolecule, thus changing the effect of the second mutation). We argue that the two mechanisms can be distinguished in allosteric macromolecules by measuring epistasis at differing allosteric effector concentrations. Contact epistasis manifests as nonadditivity in the microscopic equilibrium constants describing the conformational ensemble. This epistatic effect is independent of allosteric effector concentration. Ensemble epistasis manifests as nonadditivity in thermodynamic observables-such as ligand binding-that are determined by the distribution of ensemble conformations. This epistatic effect strongly depends on allosteric effector concentration. Using this framework, we experimentally investigated the origins of epistasis in three pairwise mutant cycles introduced into the adenine riboswitch aptamer domain by measuring ligand binding as a function of allosteric effector concentration. We found evidence for both contact and ensemble epistasis in all cycles. Furthermore, we found that the two mechanisms of epistasis could interact with each other. For example, in one mutant cycle we observed 6 kcal/mol of contact epistasis in a microscopic equilibrium constant. In that same cycle, the maximum epistasis in ligand binding was only 1.5 kcal/mol: shifts in the ensemble masked the contribution of contact epistasis. Finally, our work yields simple heuristics for identifying contact and ensemble epistasis based on measurements of a biochemical observable as a function of allosteric effector concentration.
Collapse
Affiliation(s)
- Daria R Wonderlick
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon
| | - Julia R Widom
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon; Institute for Molecular Biology, University of Oregon, Eugene, Oregon; Oregon Center for Optical, Molecular, & Quantum Science, University of Oregon, Eugene, Oregon
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon; Institute for Molecular Biology, University of Oregon, Eugene, Oregon.
| |
Collapse
|
7
|
Orlandi KN, Phillips SR, Sailer ZR, Harman JL, Harms MJ. Topiary: Pruning the manual labor from ancestral sequence reconstruction. Protein Sci 2023; 32:e4551. [PMID: 36565302 PMCID: PMC9847077 DOI: 10.1002/pro.4551] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 12/14/2022] [Accepted: 12/17/2022] [Indexed: 12/25/2022]
Abstract
Ancestral sequence reconstruction (ASR) is a powerful tool to study the evolution of proteins and thus gain deep insight into the relationships among protein sequence, structure, and function. A major barrier to its broad use is the complexity of the task: it requires multiple software packages, complex file manipulations, and expert phylogenetic knowledge. Here we introduce topiary, a software pipeline that aims to overcome this barrier. To use topiary, users prepare a spreadsheet with a handful of sequences. Topiary then: (1) Infers the taxonomic scope for the ASR study and finds relevant sequences by BLAST; (2) Does taxonomically informed sequence quality control and redundancy reduction; (3) Constructs a multiple sequence alignment; (4) Generates a maximum-likelihood gene tree; (5) Reconciles the gene tree to the species tree; (6) Reconstructs ancestral amino acid sequences; and (7) Determines branch supports. The pipeline returns annotated evolutionary trees, spreadsheets with sequences, and graphical summaries of ancestor quality. This is achieved by integrating modern phylogenetics software (Muscle5, RAxML-NG, GeneRax, and PastML) with online databases (NCBI and the Open Tree of Life). In this paper, we introduce non-expert readers to the steps required for ASR, describe the specific design choices made in topiary, provide a detailed protocol for users, and then validate the pipeline using datasets from a broad collection of protein families. Topiary is freely available for download: https://github.com/harmslab/topiary.
Collapse
Affiliation(s)
- Kona N. Orlandi
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of BiologyUniversity of OregonEugeneOregonUSA
| | - Sophia R. Phillips
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Zachary R. Sailer
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Joseph L. Harman
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Michael J. Harms
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| |
Collapse
|
8
|
Morin MA, Morrison AJ, Harms MJ, Dutton RJ. Higher-order interactions shape microbial interactions as microbial community complexity increases. Sci Rep 2022; 12:22640. [PMID: 36587027 PMCID: PMC9805437 DOI: 10.1038/s41598-022-25303-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Accepted: 11/28/2022] [Indexed: 01/01/2023] Open
Abstract
Non-pairwise interactions, or higher-order interactions (HOIs), in microbial communities have been described as significant drivers of emergent features in microbiomes. Yet, the re-organization of microbial interactions between pairwise cultures and larger communities remains largely unexplored from a molecular perspective but is central to our understanding and further manipulation of microbial communities. Here, we used a bottom-up approach to investigate microbial interaction mechanisms from pairwise cultures up to 4-species communities from a simple microbiome (Hafnia alvei, Geotrichum candidum, Pencillium camemberti and Escherichia coli). Specifically, we characterized the interaction landscape for each species combination involving E. coli by identifying E. coli's interaction-associated mutants using an RB-TnSeq-based interaction assay. We observed a deep reorganization of the interaction-associated mutants, with very few 2-species interactions conserved all the way up to a 4-species community and the emergence of multiple HOIs. We further used a quantitative genetics strategy to decipher how 2-species interactions were quantitatively conserved in higher community compositions. Epistasis-based analysis revealed that, of the interactions that are conserved at all levels of complexity, 82% follow an additive pattern. Altogether, we demonstrate the complex architecture of microbial interactions even within a simple microbiome, and provide a mechanistic and molecular explanation of HOIs.
Collapse
Affiliation(s)
- Manon A. Morin
- grid.266100.30000 0001 2107 4242School of Biological Science, University of California San Diego, San Diego, 92093 USA
| | - Anneliese J. Morrison
- grid.170202.60000 0004 1936 8008Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR USA ,grid.170202.60000 0004 1936 8008Institute of Molecular Biology, University of Oregon, Eugene, OR USA
| | - Michael J. Harms
- grid.170202.60000 0004 1936 8008Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR USA ,grid.170202.60000 0004 1936 8008Institute of Molecular Biology, University of Oregon, Eugene, OR USA
| | - Rachel J. Dutton
- grid.266100.30000 0001 2107 4242School of Biological Science, University of California San Diego, San Diego, 92093 USA
| |
Collapse
|
9
|
Cortez LM, Morrison AJ, Garen CR, Patterson S, Uyesugi T, Petrosyan R, Sekar RV, Harms MJ, Woodside MT, Sim VL. Probing the origin of prion protein misfolding via reconstruction of ancestral proteins. Protein Sci 2022; 31:e4477. [PMID: 36254680 PMCID: PMC9667828 DOI: 10.1002/pro.4477] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 10/11/2022] [Accepted: 10/13/2022] [Indexed: 12/13/2022]
Abstract
Prion diseases are fatal neurodegenerative diseases caused by pathogenic misfolding of the prion protein, PrP. They are transmissible between hosts, and sometimes between different species, as with transmission of bovine spongiform encephalopathy to humans. Although PrP is found in a wide range of vertebrates, prion diseases are seen only in certain mammals, suggesting that infectious misfolding was a recent evolutionary development. To explore when PrP acquired the ability to misfold infectiously, we reconstructed the sequences of ancestral versions of PrP from the last common primate, primate-rodent, artiodactyl, placental, bird, and amniote. Recombinant ancestral PrPs were then tested for their ability to form β-sheet aggregates, either spontaneously or when seeded with infectious prion strains from human, cervid, or rodent species. The ability to aggregate developed after the oldest ancestor (last common amniote), and aggregation capabilities diverged along evolutionary pathways consistent with modern-day susceptibilities. Ancestral bird PrP could not be seeded with modern-day prions, just as modern-day birds are resistant to prion disease. Computational modeling of structures suggested that differences in helix 2 could account for the resistance of ancestral bird PrP to seeding. Interestingly, ancestral primate PrP could be converted by all prion seeds, including both human and cervid prions, raising the possibility that species descended from an ancestral primate have retained the susceptibility to conversion by cervid prions. More generally, the results suggest that susceptibility to prion disease emerged prior to ~100 million years ago, with placental mammals possibly being generally susceptible to disease.
Collapse
Affiliation(s)
- Leonardo M. Cortez
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Division of Neurology, Department of MedicineUniversity of AlbertaEdmontonAlbertaCanada
- Neuroscience and Mental Health InstituteUniversity of AlbertaEdmontonAlbertaCanada
| | - Anneliese J. Morrison
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Craig R. Garen
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
| | - Sawyer Patterson
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
| | - Toshi Uyesugi
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
| | - Rafayel Petrosyan
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
- Present address:
Zaven & Sonia Akian College of Science and EngineeringAmerican University of ArmeniaYerevanArmenia
| | | | - Michael J. Harms
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Michael T. Woodside
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Department of PhysicsUniversity of AlbertaEdmontonAlbertaCanada
- Li Ka Shing Institute of VirologyUniversity of AlbertaEdmontonAlbertaCanada
| | - Valerie L. Sim
- Centre for Prions and Protein Folding DiseasesUniversity of AlbertaEdmontonAlbertaCanada
- Division of Neurology, Department of MedicineUniversity of AlbertaEdmontonAlbertaCanada
- Neuroscience and Mental Health InstituteUniversity of AlbertaEdmontonAlbertaCanada
| |
Collapse
|
10
|
Harman JL, Reardon PN, Costello SM, Warren GD, Phillips SR, Connor PJ, Marqusee S, Harms MJ. Evolution avoids a pathological stabilizing interaction in the immune protein S100A9. Proc Natl Acad Sci U S A 2022; 119:e2208029119. [PMID: 36194634 PMCID: PMC9565474 DOI: 10.1073/pnas.2208029119] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 09/07/2022] [Indexed: 01/03/2023] Open
Abstract
Stability constrains evolution. While much is known about constraints on destabilizing mutations, less is known about the constraints on stabilizing mutations. We recently identified a mutation in the innate immune protein S100A9 that provides insight into such constraints. When introduced into human S100A9, M63F simultaneously increases the stability of the protein and disrupts its natural ability to activate Toll-like receptor 4. Using chemical denaturation, we found that M63F stabilizes a calcium-bound conformation of hS100A9. We then used NMR to solve the structure of the mutant protein, revealing that the mutation distorts the hydrophobic binding surface of hS100A9, explaining its deleterious effect on function. Hydrogen-deuterium exchange (HDX) experiments revealed stabilization of the region around M63F in the structure, notably Phe37. In the structure of the M63F mutant, the Phe37 and Phe63 sidechains are in contact, plausibly forming an edge-face π-stack. Mutating Phe37 to Leu abolished the stabilizing effect of M63F as probed by both chemical denaturation and HDX. It also restored the biological activity of S100A9 disrupted by M63F. These findings reveal that Phe63 creates a molecular staple with Phe37 that stabilizes a nonfunctional conformation of the protein, thus disrupting function. Using a bioinformatic analysis, we found that S100A9 proteins from different organisms rarely have Phe at both positions 37 and 63, suggesting that avoiding a pathological stabilizing interaction indeed constrains S100A9 evolution. This work highlights an important evolutionary constraint on stabilizing mutations, namely, that they must avoid inappropriately stabilizing nonfunctional protein conformations.
Collapse
Affiliation(s)
- Joseph L Harman
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Patrick N Reardon
- College of Science, NMR Facility, Oregon State University, Corvallis, OR 97331
| | - Shawn M Costello
- Biophysics Graduate Program, University of California, Berkeley, Berkeley, CA 94720
| | - Gus D Warren
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Patrick J Connor
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA 94720
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403
| |
Collapse
|
11
|
Abstract
Epistasis-when mutations combine nonadditively-is a profoundly important aspect of biology. It is often difficult to understand its mechanistic origins. Here, we show that epistasis can arise from the thermodynamic ensemble, or the set of interchanging conformations a protein adopts. Ensemble epistasis occurs because mutations can have different effects on different conformations of the same protein, leading to nonadditive effects on its average, observable properties. Using a simple analytical model, we found that ensemble epistasis arises when two conditions are met: (1) a protein populates at least three conformations and (2) mutations have differential effects on at least two conformations. To explore the relative magnitude of ensemble epistasis, we performed a virtual deep-mutational scan of the allosteric Ca2+ signaling protein S100A4. We found that 47% of mutation pairs exhibited ensemble epistasis with a magnitude on the order of thermal fluctuations. We observed many forms of epistasis: magnitude, sign, and reciprocal sign epistasis. The same mutation pair could even exhibit different forms of epistasis under different environmental conditions. The ubiquity of thermodynamic ensembles in biology and the pervasiveness of ensemble epistasis in our dataset suggests that it may be a common mechanism of epistasis in proteins and other macromolecules.
Collapse
Affiliation(s)
- Anneliese J Morrison
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene OR 97403, USA
| | - Daria R Wonderlick
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene OR 97403, USA
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene OR 97403, USA
| |
Collapse
|
12
|
Abstract
Some have hypothesized that ancestral proteins were, on average, less specific than their descendants. If true, this would provide a universal axis along which to organize protein evolution and suggests that reconstructed ancestral proteins may be uniquely powerful tools for protein engineering. Ancestral sequence reconstruction studies are one line of evidence used to support this hypothesis. Previously, we performed such a study, investigating the evolution of peptide-binding specificity for the paralogs S100A5 and S100A6. The modern proteins appeared more specific than their last common ancestor (ancA5/A6), as each paralog bound a subset of the peptides bound by ancA5/A6. In this study, we revisit this transition, using quantitative phage display to measure the interactions of 30,533 random peptides with human S100A5, S100A6, and ancA5/A6. This unbiased screen reveals a different picture. While S100A5 and S100A6 do indeed bind to a subset of the peptides recognized by ancA5/A6, they also acquired new peptide partners outside of the set recognized by ancA5/A6. Our previous work showed that ancA5/A6 had lower specificity than its descendants when measured against biological targets; our new work shows that ancA5/A6 has similar specificity to the modern proteins when measured against a random set of peptide targets. This demonstrates that altered biological specificity does not necessarily indicate altered intrinsic specificity, and sounds a cautionary note for using ancestral reconstruction studies with biological targets as a means to infer global evolutionary trends in specificity.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA.,Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
| |
Collapse
|
13
|
Lupo BE, Chu P, Harms MJ, Morrison EA, Musselman CA. Evolutionary Conservation of Structural and Functional Coupling between the BRM AT-Hook and Bromodomain. J Mol Biol 2021; 433:166845. [PMID: 33539881 PMCID: PMC8184587 DOI: 10.1016/j.jmb.2021.166845] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 01/18/2021] [Accepted: 01/21/2021] [Indexed: 01/13/2023]
Abstract
The BAF chromatin remodeling complex is critical for genome regulation. The central ATPase of BAF is either BRM or BRG1, both of which contain a C-terminal bromodomain, known to associate with acetylated lysines. We have recently demonstrated that in addition to acetyl-lysine binding, the BRG1/BRM bromodomain can associate with DNA through a lysine/arginine rich patch that is adjacent to the acetyl-lysine binding pocket. Flanking the bromodomain is an AT-hook separated by a short, proline-rich linker. We previously found that the AT-hook and bromodomain can associate with DNA in a multivalent manner. Here, we investigate the conservation of this composite module and find that the AT-hook, linker, and lysine/arginine rich bromodomain patch are ancient, conserved over ~1 billion years. We utilize extensive mutagenesis, NMR spectroscopy, and fluorescence anisotropy to dissect the contribution of each of these conserved elements in association of this module with DNA. Our results reveal a structural and functional coupling of the AT-hook and bromodomain mediated by the linker. The lysine/arginine rich patch on the bromodomain and the conserved elements of the AT-hook are critical for robust affinity for DNA, while the conserved elements of the linker are dispensable for overall DNA affinity but critical for maintaining the relative conformation of the AT-hook and bromodomain in binding to DNA. This supports that the coupled action of the AT-hook and bromodomain are important for BAF activity.
Collapse
Affiliation(s)
- Brianna E Lupo
- University of Iowa, Carver College of Medicine, Department of Biochemistry, Iowa City, IA 52242, United States
| | - Peirou Chu
- University of Iowa, Carver College of Medicine, Department of Biochemistry, Iowa City, IA 52242, United States
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403, United States; Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403, United States
| | - Emma A Morrison
- University of Iowa, Carver College of Medicine, Department of Biochemistry, Iowa City, IA 52242, United States; Medical College of Wisconsin, Department of Biochemistry, Milwaukee, WI 53226, United States.
| | - Catherine A Musselman
- University of Iowa, Carver College of Medicine, Department of Biochemistry, Iowa City, IA 52242, United States; University of Colorado Anschutz Medical Campus, Department of Biochemistry and Molecular Genetics, Aurora, CO 80045, United States.
| |
Collapse
|
14
|
Wonderlick DR, Harms MJ. Characterization of Ensemble Epistasis in the Adenine Riboswitch. Biophys J 2021. [DOI: 10.1016/j.bpj.2020.11.1977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
15
|
Loes AN, Hinman MN, Farnsworth DR, Miller AC, Guillemin K, Harms MJ. Identification and Characterization of Zebrafish Tlr4 Coreceptor Md-2. J Immunol 2021; 206:1046-1057. [PMID: 33472906 DOI: 10.4049/jimmunol.1901288] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 12/16/2020] [Indexed: 12/16/2022]
Abstract
The zebrafish (Danio rerio) is a powerful model organism for studies of the innate immune system. One apparent difference between human and zebrafish innate immunity is the cellular machinery for LPS sensing. In amniotes, the protein complex formed by TLR4 and myeloid differentiation factor 2 (Tlr4/Md-2) recognizes the bacterial molecule LPS and triggers an inflammatory response. It is believed that zebrafish have neither Md-2 nor Tlr4; Md-2 has not been identified outside of amniotes, whereas the zebrafish tlr4 genes appear to be paralogs, not orthologs, of amniote TLR4s We revisited these conclusions. We identified a zebrafish gene encoding Md-2, ly96 Using single-cell RNA sequencing, we found that ly96 is transcribed in cells that also transcribe genes diagnostic for innate immune cells, including the zebrafish tlr4-like genes. In larval zebrafish, ly96 is expressed in a small number of macrophage-like cells. In a functional assay, zebrafish Md-2 and Tlr4ba form a complex that activates NF-κB signaling in response to LPS. In larval zebrafish ly96 loss-of-function mutations perturbed LPS-induced cytokine production but gave little protection against LPS toxicity. Finally, by analyzing the genomic context of tlr4 genes in 11 jawed vertebrates, we found that tlr4 arose prior to the divergence of teleosts and tetrapods. Thus, an LPS-sensitive Tlr4/Md-2 complex is likely an ancestral feature shared by mammals and zebrafish, rather than a de novo invention on the tetrapod lineage. We hypothesize that zebrafish retain an ancestral, low-sensitivity Tlr4/Md-2 complex that confers LPS responsiveness to a specific subset of innate immune cells.
Collapse
Affiliation(s)
- Andrea N Loes
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
| | - Melissa N Hinman
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403.,Department of Biology, University of Oregon, Eugene, OR 97403
| | - Dylan R Farnsworth
- Department of Biology, University of Oregon, Eugene, OR 97403.,Institute of Neuroscience, University of Oregon, Eugene, OR 97403; and
| | - Adam C Miller
- Department of Biology, University of Oregon, Eugene, OR 97403.,Institute of Neuroscience, University of Oregon, Eugene, OR 97403; and
| | - Karen Guillemin
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403.,Department of Biology, University of Oregon, Eugene, OR 97403.,Humans and the Microbiome Program, Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, Canada
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR 97403; .,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
| |
Collapse
|
16
|
Nixon CF, Lim SA, Sailer ZR, Zheludev IN, Gee CL, Kelch BA, Harms MJ, Marqusee S. Exploring the Evolutionary History of Kinetic Stability in the α-Lytic Protease Family. Biochemistry 2021; 60:170-181. [PMID: 33433210 DOI: 10.1021/acs.biochem.0c00720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In addition to encoding the tertiary fold and stability, the primary sequence of a protein encodes the folding trajectory and kinetic barriers that determine the speed of folding. How these kinetic barriers are encoded is not well understood. Here, we use evolutionary sequence variation in the α-lytic protease (αLP) protein family to probe the relationship between sequence and energy landscape. αLP has an unusual energy landscape: the native state of αLP is not the most thermodynamically favored conformation and, instead, remains folded due to a large kinetic barrier preventing unfolding. To fold, αLP utilizes an N-terminal pro region similar in size to the protease itself that functions as a folding catalyst. Once folded, the pro region is removed, and the native state does not unfold on a biologically relevant time scale. Without the pro region, αLP folds on the order of millennia. A phylogenetic search uncovers αLP homologs with a wide range of pro region sizes, including some with no pro region at all. In the resulting phylogenetic tree, these homologs cluster by pro region size. By studying homologs naturally lacking a pro region, we demonstrate they can be thermodynamically stable, fold much faster than αLP, yet retain the same fold as αLP. Key amino acids thought to contribute to αLP's extreme kinetic stability are lost in these homologs, supporting their role in kinetic stability. This study highlights how the entire energy landscape plays an important role in determining the evolutionary pressures on the protein sequence.
Collapse
Affiliation(s)
- Charlotte F Nixon
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
| | - Zachary R Sailer
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States.,Department of Chemistry & Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
| | - Ivan N Zheludev
- California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, California 94720, United States
| | - Christine L Gee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States.,California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, California 94720, United States.,Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, California 94720, United States
| | - Brian A Kelch
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts 01655, United States
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403, United States.,Department of Chemistry & Biochemistry, University of Oregon, Eugene, Oregon 97403, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States.,California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, California 94720, United States.,Department of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States.,Chan Zuckerberg Biohub, San Francisco, California 94158, United States
| |
Collapse
|
17
|
Wheeler LC, Perkins A, Wong CE, Harms MJ. Learning peptide recognition rules for a low-specificity protein. Protein Sci 2020; 29:2259-2273. [PMID: 32979254 PMCID: PMC7586891 DOI: 10.1002/pro.3958] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 09/18/2020] [Accepted: 09/18/2020] [Indexed: 12/18/2022]
Abstract
Many proteins interact with short linear regions of target proteins. For some proteins, however, it is difficult to identify a well-defined sequence motif that defines its target peptides. To overcome this difficulty, we used supervised machine learning to train a model that treats each peptide as a collection of easily-calculated biochemical features rather than as an amino acid sequence. As a test case, we dissected the peptide-recognition rules for human S100A5 (hA5), a low-specificity calcium binding protein. We trained a Random Forest model against a recently released, high-throughput phage display dataset collected for hA5. The model identifies hydrophobicity and shape complementarity, rather than polar contacts, as the primary determinants of peptide binding specificity in hA5. We tested this hypothesis by solving a crystal structure of hA5 and through computational docking studies of diverse peptides onto hA5. These structural studies revealed that peptides exhibit multiple binding modes at the hA5 peptide interface-all of which have few polar contacts with hA5. Finally, we used our trained model to predict new, plausible binding targets in the human proteome. This revealed a fragment of the protein α-1-syntrophin that binds to hA5. Our work helps better understand the biochemistry and biology of hA5, as well as demonstrating how high-throughput experiments coupled with machine learning of biochemical features can reveal the determinants of binding specificity in low-specificity proteins.
Collapse
Affiliation(s)
- Lucas C. Wheeler
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
- Department of Ecology and Evolutionary BiologyUniversity of ColoradoBoulderColoradoUSA
| | - Arden Perkins
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Caitlyn E. Wong
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Michael J. Harms
- Institute of Molecular BiologyUniversity of OregonEugeneOregonUSA
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| |
Collapse
|
18
|
Sailer ZR, Shafik SH, Summers RL, Joule A, Patterson-Robert A, Martin RE, Harms MJ. Inferring a complete genotype-phenotype map from a small number of measured phenotypes. PLoS Comput Biol 2020; 16:e1008243. [PMID: 32991585 PMCID: PMC7546491 DOI: 10.1371/journal.pcbi.1008243] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 10/09/2020] [Accepted: 08/13/2020] [Indexed: 01/02/2023] Open
Abstract
Understanding evolution requires detailed knowledge of genotype-phenotype maps; however, it can be a herculean task to measure every phenotype in a combinatorial map. We have developed a computational strategy to predict the missing phenotypes from an incomplete, combinatorial genotype-phenotype map. As a test case, we used an incomplete genotype-phenotype dataset previously generated for the malaria parasite’s ‘chloroquine resistance transporter’ (PfCRT). Wild-type PfCRT (PfCRT3D7) lacks significant chloroquine (CQ) transport activity, but the introduction of the eight mutations present in the ‘Dd2’ isoform of PfCRT (PfCRTDd2) enables the protein to transport CQ away from its site of antimalarial action. This gain of a transport function imparts CQ resistance to the parasite. A combinatorial map between PfCRT3D7 and PfCRTDd2 consists of 256 genotypes, of which only 52 have had their CQ transport activities measured through expression in the Xenopus laevis oocyte. We trained a statistical model with these 52 measurements to infer the CQ transport activity for the remaining 204 combinatorial genotypes between PfCRT3D7 and PfCRTDd2. Our best-performing model incorporated a binary classifier, a nonlinear scale, and additive effects for each mutation. The addition of specific pairwise- and high-order-epistatic coefficients decreased the predictive power of the model. We evaluated our predictions by experimentally measuring the CQ transport activities of 24 additional PfCRT genotypes. The R2 value between our predicted and newly-measured phenotypes was 0.90. We then used the model to probe the accessibility of evolutionary trajectories through the map. Approximately 1% of the possible trajectories between PfCRT3D7 and PfCRTDd2 are accessible; however, none of the trajectories entailed eight successive increases in CQ transport activity. These results demonstrate that phenotypes can be inferred with known uncertainty from a partial genotype-phenotype dataset. We also validated our approach against a collection of previously published genotype-phenotype maps. The model therefore appears general and should be applicable to a large number of genotype-phenotype maps. Biological macromolecules are built from chains of building blocks. The function of a macromolecule depends on the specific chemical properties of the building blocks that make it up. Macromolecules evolve through mutations that swap one building block for another. Understanding how biomolecules work and evolve therefore requires knowledge of the effects of mutations. The effects of mutations can be measured experimentally; however, because there are a vast number of possible combinations of mutations, it is often difficult to make enough measurements to understand biomolecular function and evolution. In this paper, we describe a simple method to predict the effects of mutations on biomolecules from a small number of measurements. This method works by appropriately averaging the effects of mutations seen in different contexts. We test the method by predicting the effects of mutations on a PfCRT—a macromolecule from the malarial parasite that confers drug resistance. We find that our method is fast and effective. Using a small number of measurements, we were able to gain insight into the evolutionary steps by which this macromolecule conferred drug resistance. To make this method accessible to other researchers, we have released it as an open-source software package: https://gpseer.readthedocs.io.
Collapse
Affiliation(s)
- Zachary R. Sailer
- Institute for Molecular Biology, University of Oregon, Eugene, OR, United States of America
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States of America
| | - Sarah H. Shafik
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Robert L. Summers
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Alex Joule
- Research School of Biology, Australian National University, Canberra, ACT, Australia
| | | | - Rowena E. Martin
- Research School of Biology, Australian National University, Canberra, ACT, Australia
- * E-mail: (REM); (MJH)
| | - Michael J. Harms
- Institute for Molecular Biology, University of Oregon, Eugene, OR, United States of America
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States of America
- * E-mail: (REM); (MJH)
| |
Collapse
|
19
|
Harman JL, Loes AN, Warren GD, Heaphy MC, Lampi KJ, Harms MJ. Evolution of multifunctionality through a pleiotropic substitution in the innate immune protein S100A9. eLife 2020; 9:e54100. [PMID: 32255429 PMCID: PMC7213983 DOI: 10.7554/elife.54100] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Accepted: 04/03/2020] [Indexed: 12/16/2022] Open
Abstract
Multifunctional proteins are evolutionary puzzles: how do proteins evolve to satisfy multiple functional constraints? S100A9 is one such multifunctional protein. It potently amplifies inflammation via Toll-like receptor four and is antimicrobial as part of a heterocomplex with S100A8. These two functions are seemingly regulated by proteolysis: S100A9 is readily degraded, while S100A8/S100A9 is resistant. We take an evolutionary biochemical approach to show that S100A9 evolved both functions and lost proteolytic resistance from a weakly proinflammatory, proteolytically resistant amniote ancestor. We identify a historical substitution that has pleiotropic effects on S100A9 proinflammatory activity and proteolytic resistance but has little effect on S100A8/S100A9 antimicrobial activity. We thus propose that mammals evolved S100A8/S100A9 antimicrobial and S100A9 proinflammatory activities concomitantly with a proteolytic 'timer' to selectively regulate S100A9. This highlights how the same mutation can have pleiotropic effects on one functional state of a protein but not another, thus facilitating the evolution of multifunctionality.
Collapse
Affiliation(s)
- Joseph L Harman
- Department of Chemistry and Biochemistry, University of OregonEugeneUnited States
- Institute of Molecular Biology, University of OregonEugeneUnited States
| | - Andrea N Loes
- Department of Chemistry and Biochemistry, University of OregonEugeneUnited States
- Institute of Molecular Biology, University of OregonEugeneUnited States
| | - Gus D Warren
- Department of Chemistry and Biochemistry, University of OregonEugeneUnited States
- Institute of Molecular Biology, University of OregonEugeneUnited States
| | - Maureen C Heaphy
- Department of Chemistry and Biochemistry, University of OregonEugeneUnited States
- Institute of Molecular Biology, University of OregonEugeneUnited States
| | | | - Michael J Harms
- Department of Chemistry and Biochemistry, University of OregonEugeneUnited States
- Institute of Molecular Biology, University of OregonEugeneUnited States
| |
Collapse
|
20
|
Morrison AJ, Harms MJ. Experimental Test of Ensemble-Induced Epistasis in Macromolecules. Biophys J 2020. [DOI: 10.1016/j.bpj.2019.11.1162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
|
21
|
Harman J, Loes A, Anderson JA, Warren G, Heaphy M, Lampi K, Harms MJ. S100A9S Evolved Potent Proinflammatory Activity and Lost Proteolytic Resistance from a Proteolytically Resistant, Weakly Proinflammatory Ancestor. Biophys J 2020. [DOI: 10.1016/j.bpj.2019.11.1882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
|
22
|
Anderson JA, Loes AN, Waddell GL, Harms MJ. Tracing the evolution of novel features of human Toll-like receptor 4. Protein Sci 2019; 28:1350-1358. [PMID: 31075178 DOI: 10.1002/pro.3644] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 05/07/2019] [Accepted: 05/08/2019] [Indexed: 12/13/2022]
Abstract
Toll-like receptor 4 (TLR4) is a critical innate immune protein that activates inflammation in response to extracellular cues. Much of the work to understand how the protein works in humans has been done using mouse models. Although human and mouse TLR4 have many shared features, they have also diverged significantly since their last common ancestor, acquiring 277 sequence differences. Functional differences include the extent of ligand-independent activation, whether lipid IVa acts as an antagonist or agonist, and the relative species cross-compatibility of their MD-2 cofactor. We set out to understand the evolutionary origins for these functional differences between human and mouse TLR4. Using a combination of phylogenetics, ancestral sequence reconstruction, and functional characterization, we found that evolutionary changes to the human TLR4, rather than changes to the mouse TLR4, were largely responsible for these functional changes. Human TLR4 repressed ancestral ligand-independent activity and gained antagonism to lipid IVa. Additionally, mutations to the human TLR4 cofactor MD-2 led to lineage-specific incompatibility between human and opossum TLR4 complex members. These results were surprising, as mouse TLR4 has acquired many more mutations than human TLR4 since their last common ancestor. Our work has polarized this set of transitions and sets up work to study the mechanistic underpinnings for the evolution of new functions in TLR4.
Collapse
Affiliation(s)
- Jeremy A Anderson
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403
| | - Andrea N Loes
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403
| | - Grace L Waddell
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403
| | - Michael J Harms
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403
| |
Collapse
|
23
|
Affiliation(s)
- Michael J Harms
- Institute of Molecular Biology and Chemistry and Biochemistry Department, University of Oregon, Eugene, OR, USA.
| |
Collapse
|
24
|
Abstract
Here we describe pytc, an open-source Python package for global fits of thermodynamic models to multiple isothermal titration calorimetry experiments. Key features include simplicity, the ability to implement new thermodynamic models, a robust maximum likelihood fitter, a fast Bayesian Markov-Chain Monte Carlo sampler, rigorous implementation, extensive documentation, and full cross-platform compatibility. pytc fitting can be done using an application program interface or via a graphical user interface. It is available for download at https://github.com/harmslab/pytc .
Collapse
Affiliation(s)
- Hiranmayi Duvvuri
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA
| | - Lucas C. Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA
| | - Michael J. Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA
| |
Collapse
|
25
|
Abstract
Toll-like receptor 4 (TLR4) induces inflammation in response to both pathogen- and host-derived molecules. Lipopolysaccharide (LPS) recognition by TLR4 has been shown to occur across the amniotes, but endogenous signaling through TLR4 has not been validated outside of placental mammals. To determine whether endogenous danger signaling is also shared across amniotes, we studied the evolution of TLR4-activation by the calgranulin proteins (S100A8, S100A9, and S100A12), a clade of host molecules that potently activate TLR4 in placental mammals. We performed phylogenetic and syntenic analysis and found MRP-126—a gene in birds and reptiles—is likely orthologous to the mammalian calgranulins. We then used an ex vivo TLR4 activation assay to establish that calgranulin pro-inflammatory activity is not specific to placental mammals, but is also exhibited by representative marsupial and sauropsid species. This activity is strongly dependent on the cofactors CD14 and MD-2 for all species studied, suggesting a conserved mode of activation across the amniotes. Ortholog complementation experiments between the calgranulins, TLR4, CD14, and MD-2 revealed extensive lineage specific-coevolution and multi-way interactions between components that are necessary for the activation of NF-κB signaling by calgranulins and LPS. Our work demonstrates that calgranulin activation of TLR4 evolved at least ~320 million years ago and has been conserved in the amniote innate immune system.
Collapse
Affiliation(s)
- Andrea N Loes
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States.,Institute of Molecular Biology, University of Oregon, Eugene, OR, United States
| | - Jamie T Bridgham
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR, United States
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States.,Institute of Molecular Biology, University of Oregon, Eugene, OR, United States
| |
Collapse
|
26
|
Lim SA, Bolin ER, Hart KM, Harms MJ, Marqusee S. An Evolutionary Trend towards Kinetic Stability in the Folding Trajectory of Ribonucleases H. Biophys J 2018. [DOI: 10.1016/j.bpj.2017.11.3158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
27
|
Abstract
Many regulatory proteins bind peptide regions of target proteins and modulate their activity. Such regulatory proteins can often interact with highly diverse target peptides. In many instances, it is not known if the peptide-binding interface discriminates targets in a biological context, or whether biological specificity is achieved exclusively through external factors such as subcellular localization. We used an evolutionary biochemical approach to distinguish these possibilities for two such low-specificity proteins: S100A5 and S100A6. We used isothermal titration calorimetry to study the binding of peptides with diverse sequence and biochemistry to human S100A5 and S100A6. These proteins bound distinct, but overlapping, sets of peptide targets. We then studied the peptide binding properties of orthologs sampled from across five amniote species. Binding specificity was conserved along all lineages, for the last 320 million years, despite the low specificity of each protein. We used ancestral sequence reconstruction to determine the binding specificity of the last common ancestor of the paralogs. The ancestor bound the entire set of peptides bound by modern S100A5 and S100A6 proteins, suggesting that paralog specificity evolved via subfunctionalization. To rule out the possibility that specificity is conserved because it is difficult to modify, we identified a single historical mutation that, when reverted in human S100A5, gave it the ability to bind an S100A6-specific peptide. These results reveal strong evolutionary constraints on peptide binding specificity. Despite being able to bind a large number of targets, the specificity of S100 peptide interfaces is likely important for the biology of these proteins.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Department of Chemistry and Biochemistry, University of Oregon , Eugene, Oregon 97403, United States.,Institute of Molecular Biology, University of Oregon , Eugene, Oregon 97403, United States
| | - Jeremy A Anderson
- Department of Chemistry and Biochemistry, University of Oregon , Eugene, Oregon 97403, United States.,Institute of Molecular Biology, University of Oregon , Eugene, Oregon 97403, United States
| | - Anneliese J Morrison
- Department of Chemistry and Biochemistry, University of Oregon , Eugene, Oregon 97403, United States.,Institute of Molecular Biology, University of Oregon , Eugene, Oregon 97403, United States
| | - Caitlyn E Wong
- Department of Chemistry and Biochemistry, University of Oregon , Eugene, Oregon 97403, United States.,Institute of Molecular Biology, University of Oregon , Eugene, Oregon 97403, United States
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon , Eugene, Oregon 97403, United States.,Institute of Molecular Biology, University of Oregon , Eugene, Oregon 97403, United States
| |
Collapse
|
28
|
Abstract
Background S100A5 is a calcium binding protein found in a small subset of amniote tissues. Little is known about the biological roles of S100A5, but it may be involved in inflammation and olfactory signaling. Previous work indicated that S100A5 displays antagonism between binding of Ca2+ and Cu2+ ions-one of the most commonly cited features of the protein. We set out to characterize the interplay between Ca2+ and Cu2+ binding by S100A5 using isothermal titration calorimetry (ITC), circular dichroism spectroscopy (CD), and analytical ultracentrifugation (AUC). Results We found that human S100A5 is capable of binding both Cu2+ and Ca2+ ions simultaneously. The wildtype protein was extremely aggregation-prone in the presence of Cu2+ and Ca2+. A Cys-free version of S100A5, however, was not prone to precipitation or oligomerization. Mutation of the cysteines does not disrupt the binding of either Ca2+ or Cu2+ to S100A5. In the Cys-free background, we measured Ca2+ and Cu2+ binding in the presence and absence of the other metal using ITC. Saturating concentrations of Ca2+ or Cu2+ do not disrupt the binding of one another. Ca2+ and Cu2+ binding induce structural changes in S100A5, which are measurable using CD spectroscopy. We show via sedimentation velocity AUC that the wildtype protein is prone to the formation of soluble oligomers, which are not present in Cys-free samples. Conclusions S100A5 can bind Ca2+ and Cu2+ ions simultaneously and independently. This observation is in direct contrast to previously-reported antagonism between binding of Cu2+ and Ca2+ ions. The previous result is likely due to metal-dependent aggregation. Little is known about the biology of S100A5, so an accurate understanding of the biochemistry is necessary to make informed biological hypotheses. Our observations suggest the possibility of independent biological functions for Cu2+ and Ca2+ binding by S100A5.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, 97403 OR USA.,Insitute of Molecular Biology, University of Oregon, Eugene, 97403 OR USA
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, 97403 OR USA.,Insitute of Molecular Biology, University of Oregon, Eugene, 97403 OR USA
| |
Collapse
|
29
|
Abstract
High-order epistasis—where the effect of a mutation is determined by interactions with two or more other mutations—makes small, but detectable, contributions to genotype-fitness maps. While epistasis between pairs of mutations is known to be an important determinant of evolutionary trajectories, the evolutionary consequences of high-order epistasis remain poorly understood. To determine the effect of high-order epistasis on evolutionary trajectories, we computationally removed high-order epistasis from experimental genotype-fitness maps containing all binary combinations of five mutations. We then compared trajectories through maps both with and without high-order epistasis. We found that high-order epistasis strongly shapes the accessibility and probability of evolutionary trajectories. A closer analysis revealed that the magnitude of epistasis, not its order, predicts is effects on evolutionary trajectories. We further find that high-order epistasis makes it impossible to predict evolutionary trajectories from the individual and paired effects of mutations. We therefore conclude that high-order epistasis profoundly shapes evolutionary trajectories through genotype-fitness maps. A key goal for evolutionary biologists is understanding why one evolutionary trajectory is taken rather than others. This requires understanding how individual mutations, as well as interactions between them, determine the accessibility of evolutionary pathways. We used a robust statistical analysis to reveal interactions between up to five mutations in published datasets, meaning that the effect of a mutation can depend on the presence or absence of four other mutations. Simulations reveal that these interactions strongly shape evolutionary trajectories. These interactions lead to profound unpredictability in evolution, as one cannot use the effect of a mutation in the ancestor to predict its effect later in the trajectory.
Collapse
Affiliation(s)
- Zachary R. Sailer
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
| | - Michael J. Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
- * E-mail:
| |
Collapse
|
30
|
An Lim S, Hart KM, Harms MJ, Marqusee S. An Evolutionary Trend towards Kinetic Stability in the Folding Trajectory of RNases H. Biophys J 2017. [DOI: 10.1016/j.bpj.2016.11.359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
31
|
Abstract
Hypotheses about the functions of ancient proteins and the effects of historical mutations on them are often tested using ancestral protein reconstruction (APR)-phylogenetic inference of ancestral sequences followed by synthesis and experimental characterization. Usually, some sequence sites are ambiguously reconstructed, with two or more statistically plausible states. The extent to which the inferred functions and mutational effects are robust to uncertainty about the ancestral sequence has not been studied systematically. To address this issue, we reconstructed ancestral proteins in three domain families that have different functions, architectures, and degrees of uncertainty; we then experimentally characterized the functional robustness of these proteins when uncertainty was incorporated using several approaches, including sampling amino acid states from the posterior distribution at each site and incorporating the alternative amino acid state at every ambiguous site in the sequence into a single "worst plausible case" protein. In every case, qualitative conclusions about the ancestral proteins' functions and the effects of key historical mutations were robust to sequence uncertainty, with similar functions observed even when scores of alternate amino acids were incorporated. There was some variation in quantitative descriptors of function among plausible sequences, suggesting that experimentally characterizing robustness is particularly important when quantitative estimates of ancient biochemical parameters are desired. The worst plausible case method appears to provide an efficient strategy for characterizing the functional robustness of ancestral proteins to large amounts of sequence uncertainty. Sampling from the posterior distribution sometimes produced artifactually nonfunctional proteins for sequences reconstructed with substantial ambiguity.
Collapse
Affiliation(s)
- Geeta N Eick
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Department of Anthropology, University of Oregon, Eugene, OR
| | - Jamie T Bridgham
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
| | - Douglas P Anderson
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Institute of Molecular Biology, University of Oregon, Eugene, OR
| | - Michael J Harms
- Institute of Ecology & Evolutionary Biology, University of Oregon, Eugene, OR
- Institute of Molecular Biology, University of Oregon, Eugene, OR
| | - Joseph W Thornton
- Department of Ecology & Evolution and Department of Human Genetics, University of Chicago, Chicago, IL
| |
Collapse
|
32
|
Wheeler LC, Donor MT, Prell JS, Harms MJ. Multiple Evolutionary Origins of Ubiquitous Cu2+ and Zn2+ Binding in the S100 Protein Family. PLoS One 2016; 11:e0164740. [PMID: 27764152 PMCID: PMC5072561 DOI: 10.1371/journal.pone.0164740] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Accepted: 09/29/2016] [Indexed: 12/24/2022] Open
Abstract
The S100 proteins are a large family of signaling proteins that play critical roles in biology and disease. Many S100 proteins bind Zn2+, Cu2+, and/or Mn2+ as part of their biological functions; however, the evolutionary origins of binding remain obscure. One key question is whether divalent transition metal binding is ancestral, or instead arose independently on multiple lineages. To tackle this question, we combined phylogenetics with biophysical characterization of modern S100 proteins. We demonstrate an earlier origin for established S100 subfamilies than previously believed, and reveal that transition metal binding is widely distributed across the tree. Using isothermal titration calorimetry, we found that Cu2+ and Zn2+ binding are common features of the family: the full breadth of human S100 paralogs-as well as two early-branching S100 proteins found in the tunicate Oikopleura dioica-bind these metals with μM affinity and stoichiometries ranging from 1:1 to 3:1 (metal:protein). While binding is consistent across the tree, structural responses to binding are quite variable. Further, mutational analysis and structural modeling revealed that transition metal binding occurs at different sites in different S100 proteins. This is consistent with multiple origins of transition metal binding over the evolution of this protein family. Our work reveals an evolutionary pattern in which the overall phenotype of binding is a constant feature of S100 proteins, even while the site and mechanism of binding is evolutionarily labile.
Collapse
Affiliation(s)
- Lucas C. Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - Micah T. Donor
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - James S. Prell
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
| | - Michael J. Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, 97403, United States of America
- Institute for Molecular Biology, University of Oregon, Eugene, Oregon, 97403, United States of America
| |
Collapse
|
33
|
Wheeler LC, Lim SA, Marqusee S, Harms MJ. The thermostability and specificity of ancient proteins. Curr Opin Struct Biol 2016; 38:37-43. [PMID: 27288744 DOI: 10.1016/j.sbi.2016.05.015] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Revised: 05/18/2016] [Accepted: 05/24/2016] [Indexed: 11/16/2022]
Abstract
Were ancient proteins systematically different than modern proteins? The answer to this question is profoundly important, shaping how we understand the origins of protein biochemical, biophysical, and functional properties. Ancestral sequence reconstruction (ASR), a phylogenetic approach to infer the sequences of ancestral proteins, may reveal such trends. We discuss two proposed trends: a transition from higher to lower thermostability and a tendency for proteins to acquire higher specificity over time. We review the evidence for elevated ancestral thermostability and discuss its possible origins in a changing environmental temperature and/or reconstruction bias. We also conclude that there is, as yet, insufficient data to support a trend from promiscuity to specificity. Finally, we propose future work to understand these proposed evolutionary trends.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States.
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States.
| |
Collapse
|
34
|
Lim SA, Bolin ER, Harms MJ, Hart KM, Thornton JW, Marqusee S. Ancestral Sequence Reconstruction Reveals the Evolutionary History of the Folding Pathway and Landscape of Ribonucleases H. Biophys J 2016. [DOI: 10.1016/j.bpj.2015.11.2111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
35
|
Harms MJ, Thornton JW. Historical contingency and its biophysical basis in glucocorticoid receptor evolution. Nature 2014; 512:203-7. [PMID: 24930765 PMCID: PMC4447330 DOI: 10.1038/nature13410] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2014] [Accepted: 04/28/2014] [Indexed: 02/06/2023]
Abstract
Understanding how chance historical events shape evolutionary processes is a central goal of evolutionary biology1–7. Direct insights into the extent and causes of evolutionary contingency have been limited to experimental systems,7–9 because it is difficult to know what happened in the deep past and to characterize other paths that evolution could have followed. Here we combine ancestral protein reconstruction, directed evolution, and biophysical analysis to explore alternate “might-have-been” trajectories during the ancient evolution of a novel protein function. We previously found that the evolution of cortisol specificity in the ancestral glucocorticoid receptor (GR) was contingent on permissive substitutions, which had no apparent effect on receptor function but were necessary for GR to tolerate the large-effect mutations that caused the shift in specificity.6 Here we show that alternative mutations that could have permitted the historical function-switching substitutions are extremely rare in the ensemble of genotypes accessible to the ancestral GR. In a library of thousands of variants of the ancestral protein, we recovered historical permissive substitutions, but no alternate permissive genotypes. Using biophysical analysis, we found that permissive mutations must satisfy at least three physical requirements—they must stabilize specific local elements of the protein structure, maintain the correct energetic balance between functional conformations, and be compatible with the ancestral and derived structures—thus revealing why permissive mutations are rare. These findings demonstrate that GR evolution depended strongly on improbable, nondeterministic events, and this contingency arose from intrinsic biophysical properties of the protein.
Collapse
Affiliation(s)
- Michael J Harms
- 1] Institute of Molecular Biology and Department of Chemistry &Biochemistry, University of Oregon, Eugene, Oregon 97403, USA [2] Departments of Human Genetics and Ecology &Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | - Joseph W Thornton
- 1] Departments of Human Genetics and Ecology &Evolution, University of Chicago, Chicago, Illinois 60637, USA [2] Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon 97403, USA
| |
Collapse
|
36
|
Abstract
The repertoire of proteins and nucleic acids in the living world is determined by evolution; their properties are determined by the laws of physics and chemistry. Explanations of these two kinds of causality - the purviews of evolutionary biology and biochemistry, respectively - are typically pursued in isolation, but many fundamental questions fall squarely at the interface of fields. Here we articulate the paradigm of evolutionary biochemistry, which aims to dissect the physical mechanisms and evolutionary processes by which biological molecules diversified and to reveal how their physical architecture facilitates and constrains their evolution. We show how an integration of evolution with biochemistry moves us towards a more complete understanding of why biological molecules have the properties that they do.
Collapse
Affiliation(s)
- Michael J Harms
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon 97403, USA
| | | |
Collapse
|
37
|
Abstract
The U1A/U2B″/SNF family of proteins found in the U1 and U2 spliceosomal small nuclear ribonucleoproteins is highly conserved. In spite of the high degree of sequence and structural conservation, modern members of this protein family have unique RNA binding properties. These differences have necessarily resulted from evolutionary processes, and therefore, we reconstructed the protein phylogeny in order to understand how and when divergence occurred and how protein function has been modulated. Contrary to the conventional understanding of an ancient human U1A/U2B″ gene duplication, we show that the last common ancestor of bilaterians contained a single ancestral protein (URB). The gene for URB was synthesized, the protein was overexpressed and purified, and we assessed RNA binding to modern snRNA sequences. We find that URB binds human and Drosophila U1 snRNA SLII and U2 snRNA SLIV with higher affinity than do modern homologs, suggesting that both Drosophila SNF and human U1A/U2B″ have evolved into weaker binders of one RNA or both RNAs.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Base Sequence
- Drosophila
- Drosophila Proteins/chemistry
- Drosophila Proteins/genetics
- Drosophila Proteins/metabolism
- Evolution, Molecular
- Gene Duplication
- Humans
- Inverted Repeat Sequences
- Models, Molecular
- Molecular Sequence Data
- Nucleic Acid Conformation
- Phylogeny
- Protein Binding
- Protein Conformation
- RNA, Small Nuclear/chemistry
- RNA, Small Nuclear/genetics
- RNA, Small Nuclear/metabolism
- Ribonucleoprotein, U1 Small Nuclear/chemistry
- Ribonucleoprotein, U1 Small Nuclear/genetics
- Ribonucleoprotein, U1 Small Nuclear/metabolism
- Ribonucleoproteins, Small Nuclear/chemistry
- Ribonucleoproteins, Small Nuclear/genetics
- Ribonucleoproteins, Small Nuclear/metabolism
- Sequence Alignment
- Spliceosomes/metabolism
- snRNP Core Proteins/chemistry
- snRNP Core Proteins/genetics
- snRNP Core Proteins/metabolism
Collapse
Affiliation(s)
- Sandra G Williams
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO 63108, USA
| | | | | |
Collapse
|
38
|
Eick GN, Colucci JK, Harms MJ, Ortlund EA, Thornton JW. Evolution of minimal specificity and promiscuity in steroid hormone receptors. PLoS Genet 2012; 8:e1003072. [PMID: 23166518 PMCID: PMC3499368 DOI: 10.1371/journal.pgen.1003072] [Citation(s) in RCA: 89] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2012] [Accepted: 09/21/2012] [Indexed: 01/02/2023] Open
Abstract
Most proteins are regulated by physical interactions with other molecules; some are highly specific, but others interact with many partners. Despite much speculation, we know little about how and why specificity/promiscuity evolves in natural proteins. It is widely assumed that specific proteins evolved from more promiscuous ancient forms and that most proteins' specificity has been tuned to an optimal state by selection. Here we use ancestral protein reconstruction to trace the evolutionary history of ligand recognition in the steroid hormone receptors (SRs), a family of hormone-regulated animal transcription factors. We resurrected the deepest ancestral proteins in the SR family and characterized the structure-activity relationships by which they distinguished among ligands. We found that that the most ancient split in SR evolution involved a discrete switch from an ancient receptor for aromatized estrogens—including xenobiotics—to a derived receptor that recognized non-aromatized progestagens and corticosteroids. The family's history, viewed in relation to the evolution of their ligands, suggests that SRs evolved according to a principle of minimal specificity: at each point in time, receptors evolved ligand recognition criteria that were just specific enough to parse the set of endogenous substances to which they were exposed. By studying the atomic structures of resurrected SR proteins, we found that their promiscuity evolved because the ancestral binding cavity was larger than the primary ligand and contained excess hydrogen bonding capacity, allowing adventitious recognition of larger molecules with additional functional groups. Our findings provide an historical explanation for the sensitivity of modern SRs to natural and synthetic ligands—including endocrine-disrupting drugs and pollutants—and show that knowledge of history can contribute to ligand prediction. They suggest that SR promiscuity may reflect the limited power of selection within real biological systems to discriminate between perfect and “good enough.” The functions of most proteins are defined by their interactions with other biological substances, such as DNA, nutrients, hormones, or other proteins. Some proteins are highly specific, but others are more promiscuous and can interact with a variety of natural substances, as well as drugs and pollutants. Understanding molecular interactions is a key goal in pharmacology and toxicology, but there are few general principles to help explain or predict protein specificity. Because every biological entity is the result of evolution, understanding a protein's history might help explain why it interacts with the substances to which it is sensitive. In this paper, we used ancestral protein reconstruction to experimentally trace how specificity evolved in an ancient group of proteins, the steroid hormone receptors (SRs), a family of proteins that regulate reproduction and other biological processes in animals. We show that SRs evolved according to a principle of minimal specificity: at each point in time, these proteins evolved to be specific enough to distinguish among the substances to which they were naturally exposed, but not more so. Our findings provide an historical explanation for modern SRs' diverse sensitivities to natural and man-made substances; they show that knowledge of history can contribute to predicting the ligands to which a modern protein will respond and indicate that promiscuity reflects the limited power of natural selection to discriminate between perfect and “good enough.”
Collapse
Affiliation(s)
- Geeta N. Eick
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
- Howard Hughes Medical Institute, Eugene, Oregon, United States of America
| | - Jennifer K. Colucci
- Biochemistry Department, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Michael J. Harms
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| | - Eric A. Ortlund
- Biochemistry Department, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Joseph W. Thornton
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
- Howard Hughes Medical Institute, Eugene, Oregon, United States of America
- Department of Human Genetics and Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, United States of America
- * E-mail:
| |
Collapse
|
39
|
Fitch CA, Harms MJ, García-Moreno B. Empirical Method For Calculation of Pka Values of Internal Ionizable Groups in Proteins. Biophys J 2011. [DOI: 10.1016/j.bpj.2010.12.1454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|
40
|
Harms MJ, Thornton JW. Analyzing protein structure and function using ancestral gene reconstruction. Curr Opin Struct Biol 2010; 20:360-6. [PMID: 20413295 DOI: 10.1016/j.sbi.2010.03.005] [Citation(s) in RCA: 146] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Accepted: 03/22/2010] [Indexed: 01/06/2023]
Abstract
Protein families with functionally diverse members can illuminate the structural determinants of protein function and the process by which protein structure and function evolve. To identify the key amino acid changes that differentiate one family member from another, most studies have taken a horizontal approach, swapping candidate residues between present-day family members. This approach has often been stymied, however, by the fact that shifts in function often require multiple interacting mutations; chimeric proteins are often nonfunctional, either because one lineage has amassed mutations that are incompatible with key residues that conferred a new function on other lineages, or because it lacks mutations required to support those key residues. These difficulties can be overcome by using a vertical strategy, which reconstructs ancestral genes and uses them as the appropriate background in which to study the effects of historical mutations on functional diversification. In this review, we discuss the advantages of the vertical strategy and highlight several exemplary studies that have used ancestral gene reconstruction to reveal the molecular underpinnings of protein structure, function, and evolution.
Collapse
Affiliation(s)
- Michael J Harms
- Howard Hughes Medical Institute, Center for Ecology and Evolutionary Biology, University of Oregon, Eugene, OR 97403, USA.
| | | |
Collapse
|
41
|
Harms MJ, Castañeda CA, Schlessman JL, Sue GR, Bertrand García-Moreno E. The pK(a) values of acidic and basic residues buried at the same internal location in a protein are governed by different factors. J Mol Biol 2009; 389:34-47. [PMID: 19324049 PMCID: PMC3373015 DOI: 10.1016/j.jmb.2009.03.039] [Citation(s) in RCA: 91] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2008] [Revised: 03/06/2009] [Accepted: 03/11/2009] [Indexed: 10/21/2022]
Abstract
The pK(a) values of internal ionizable groups are usually very different from the normal pK(a) values of ionizable groups in water. To examine the molecular determinants of pK(a) values of internal groups, we compared the properties of Lys, Asp, and Glu at internal position 38 in staphylococcal nuclease. Lys38 titrates with a normal or elevated pK(a), whereas Asp38 and Glu38 titrate with elevated pK(a) values of 7.0 and 7.2, respectively. In the structure of the L38K variant, the buried amino group of the Lys38 side chain makes an ion pair with Glu122, whereas in the structure of the L38E variant, the buried carboxyl group of Glu38 interacts with two backbone amides and has several nearby carboxyl oxygen atoms. Previously, we showed that the pK(a) of Lys38 is normal owing to structural reorganization and water penetration concomitant with ionization of the Lys side chain. In contrast, the pK(a) values of Asp38 and Glu38 are perturbed significantly owing to an imbalance between favorable polar interactions and unfavorable contributions from dehydration and from Coulomb interactions with surface carboxylic groups. Their ionization is also coupled to subtle structural reorganization. These results illustrate the complex interplay between local polarity, Coulomb interactions, and structural reorganization as determinants of pK(a) values of internal groups in proteins. This study suggests that improvements to computational methods for pK(a) calculations will require explicit treatment of the conformational reorganization that can occur when internal groups ionize.
Collapse
Affiliation(s)
- Michael J. Harms
- Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore MD, 21218
| | - Carlos A. Castañeda
- Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore MD, 21218
| | - Jamie L. Schlessman
- Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore MD, 21218
- Department of Chemistry, United States Naval Academy, 572 Holloway Rd. Annapolis, MD 21402
| | - Gloria R. Sue
- Department of Biophysics, Johns Hopkins University, 3400 N Charles St, Baltimore MD, 21218
| | | |
Collapse
|
42
|
Harms MJ, Schlessman JL, Chimenti MS, Sue GR, Damjanović A, García-Moreno B. A buried lysine that titrates with a normal pKa: role of conformational flexibility at the protein-water interface as a determinant of pKa values. Protein Sci 2008; 17:833-45. [PMID: 18369193 DOI: 10.1110/ps.073397708] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Previously we reported that Lys, Asp, and Glu residues at positions 66 and 92 in staphylococcal nuclease (SNase) titrate with pK(a) values shifted by up to 5 pK(a) units in the direction that promotes the neutral state. In contrast, the internal Lys-38 in SNase titrates with a normal pK(a). The crystal structure of the L38K variant shows that the side chain of Lys-38 is buried. The ionizable moiety is approximately 7 A from solvent and ion paired with Glu-122. This suggests that the pK(a) value of Lys-38 is normal because the energetic penalty for dehydration is offset by a favorable Coulomb interaction. However, the pK(a) of Lys-38 was also normal when Glu-122 was replaced with Gln or with Ala. Continuum electrostatics calculations were unable to reproduce the pK(a) of Lys-38 unless the protein was treated with an artificially high dielectric constant, consistent with structural reorganization being responsible for the normal pK(a) value of Lys-38. This reorganization must be local because circular dichroism and NMR spectroscopy indicate that the L38K protein is native-like under all conditions studied. In molecular dynamics simulations, the ion pair between Lys-38 and Glu-122 is unstable. The simulations show that a minor rearrangement of a loop is sufficient to allow penetration of water to the amino moiety of Lys-38. This illustrates both the important roles of local flexibility and water penetration as determinants of pK(a) values of ionizable groups buried near the protein-water interface, and the challenges faced by structure-based pK(a) calculations in reproducing these effects.
Collapse
Affiliation(s)
- Michael J Harms
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, USA
| | | | | | | | | | | |
Collapse
|
43
|
Harms MJ, Wilmarth PA, Kapfer DM, Steel EA, David LL, Bächinger HP, Lampi KJ. Laser light-scattering evidence for an altered association of beta B1-crystallin deamidated in the connecting peptide. Protein Sci 2004; 13:678-86. [PMID: 14978307 PMCID: PMC2286738 DOI: 10.1110/ps.03427504] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2003] [Revised: 11/21/2003] [Accepted: 12/02/2003] [Indexed: 10/26/2022]
Abstract
Deamidation is a prevalent modification of crystallin proteins in the vertebrate lens. The effect of specific sites of deamidation on crystallin stability in vivo is not known. Using mass spectrometry, a previously unreported deamidation in beta B1-crystallin was identified at Gln146. Another deamidation was investigated at Asn157. It was determined that whole soluble beta B1 contained 13%-17% deamidation at Gln146 and Asn157. Static and quasi-elastic laser light scattering, circular dichroism, and heat aggregation studies were used to explore the structure and associative properties of recombinantly expressed wild-type (wt) beta B1 and the deamidated beta B1 mutants, Q146E and N157D. Dimer formation occurred for wt beta B1, Q146E, and N157D in a concentration-dependent manner, but only Q146E showed formation of higher ordered oligomers at the concentrations studied. Deamidation at Gln146, but not Asn157, led to an increased tendency of beta B1 to aggregate upon heating. We conclude that deamidation creates unique effects depending upon where the deamidation is introduced in the crystallin structure.
Collapse
Affiliation(s)
- Michael J Harms
- Oregon Health and Science University, Portland, OR 97239, USA
| | | | | | | | | | | | | |
Collapse
|