1
|
Hernández Berthet AS, Aptekmann AA, Tejero J, Sánchez IE, Noguera ME, Roman EA. Associating protein sequence positions with the modulation of quantitative phenotypes. Arch Biochem Biophys 2024; 755:109979. [PMID: 38583654 DOI: 10.1016/j.abb.2024.109979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/11/2024] [Accepted: 03/27/2024] [Indexed: 04/09/2024]
Abstract
Although protein sequences encode the information for folding and function, understanding their link is not an easy task. Unluckily, the prediction of how specific amino acids contribute to these features is still considerably impaired. Here, we developed a simple algorithm that finds positions in a protein sequence with potential to modulate the studied quantitative phenotypes. From a few hundred protein sequences, we perform multiple sequence alignments, obtain the per-position pairwise differences for both the sequence and the observed phenotypes, and calculate the correlation between these last two quantities. We tested our methodology with four cases: archaeal Adenylate Kinases and the organisms optimal growth temperatures, microbial rhodopsins and their maximal absorption wavelengths, mammalian myoglobins and their muscular concentration, and inhibition of HIV protease clinical isolates by two different molecules. We found from 3 to 10 positions tightly associated with those phenotypes, depending on the studied case. We showed that these correlations appear using individual positions but an improvement is achieved when the most correlated positions are jointly analyzed. Noteworthy, we performed phenotype predictions using a simple linear model that links per-position divergences and differences in the observed phenotypes. Predictions are comparable to the state-of-art methodologies which, in most of the cases, are far more complex. All of the calculations are obtained at a very low information cost since the only input needed is a multiple sequence alignment of protein sequences with their associated quantitative phenotypes. The diversity of the explored systems makes our work a valuable tool to find sequence determinants of biological activity modulation and to predict various functional features for uncharacterized members of a protein family.
Collapse
Affiliation(s)
- Ayelén S Hernández Berthet
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina.
| | - Ariel A Aptekmann
- Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina; Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08873, USA; Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ, 08901, USA.
| | - Jesús Tejero
- Heart, Lung, Blood and Vascular Medicine Institute, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA; Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Ignacio E Sánchez
- Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina.
| | - Martín E Noguera
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina; Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina.
| | - Ernesto A Roman
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina.
| |
Collapse
|
2
|
Dasmeh P, Zheng J, Erdoğan AN, Tokuriki N, Wagner A. Rapid evolutionary change in trait correlations of single proteins. Nat Commun 2024; 15:3327. [PMID: 38637501 PMCID: PMC11026499 DOI: 10.1038/s41467-024-46658-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 03/06/2024] [Indexed: 04/20/2024] Open
Abstract
Many organismal traits are genetically determined and covary in evolving populations. The resulting trait correlations can either help or hinder evolvability - the ability to bring forth new and adaptive phenotypes. The evolution of evolvability requires that trait correlations themselves must be able to evolve, but we know little about this ability. To learn more about it, we here study two evolvable systems, a yellow fluorescent protein and the antibiotic resistance protein VIM-2 metallo beta-lactamase. We consider two traits in the fluorescent protein, namely the ability to emit yellow and green light, and three traits in our enzyme, namely the resistance against ampicillin, cefotaxime, and meropenem. We show that correlations between these traits can evolve rapidly through both mutation and selection on short evolutionary time scales. In addition, we show that these correlations are driven by a protein's ability to fold, because single mutations that alter foldability can dramatically change trait correlations. Since foldability is important for most proteins and their traits, mutations affecting protein folding may alter trait correlations mediated by many other proteins. Thus, mutations that affect protein foldability may also help shape the correlations of complex traits that are affected by hundreds of proteins.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Center for Human Genetics, Marburg University, Marburg, 35043, Germany.
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, 8057, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, 1015, Switzerland.
| | - Jia Zheng
- Zhejiang Key Laboratory of Structural Biology, School of Life Sciences, Westlake University, Hangzhou, 310030, China
- Westlake Laboratory of Life Sciences and Biomedicine, 310030, Hangzhou, China
- Institute of Biology, Westlake Institute for Advanced Study, 310030, Hangzhou, China
| | - Ayşe Nisan Erdoğan
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Andreas Wagner
- Institute for Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, 8057, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, 1015, Switzerland.
- The Santa Fe Institute, Santa Fe, New Mexico, 87501, US.
- Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch, 7600, South Africa.
| |
Collapse
|
3
|
Ramos E, Selleghin-Veiga G, Magpali L, Daros B, Silva F, Picorelli A, Freitas L, Nery MF. Molecular Footprints on Osmoregulation-Related Genes Associated with Freshwater Colonization by Cetaceans and Sirenians. J Mol Evol 2023; 91:865-881. [PMID: 38010516 DOI: 10.1007/s00239-023-10141-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 10/29/2023] [Indexed: 11/29/2023]
Abstract
The genetic basis underlying adaptive physiological mechanisms has been extensively explored in mammals after colonizing the seas. However, independent lineages of aquatic mammals exhibit complex patterns of secondary colonization in freshwater environments. This change in habitat represents new osmotic challenges, and additional changes in key systems, such as the osmoregulatory system, are expected. Here, we studied the selective regime on coding and regulatory regions of 20 genes related to the osmoregulation system in strict aquatic mammals from independent evolutionary lineages, cetaceans, and sirenians, with representatives in marine and freshwater aquatic environments. We identified positive selection signals in genes encoding the protein vasopressin (AVP) in mammalian lineages with secondary colonization in the fluvial environment and in aquaporins for lineages inhabiting the marine and fluvial environments. A greater number of sites with positive selection signals were found for the dolphin species compared to the Amazonian manatee. Only the AQP5 and AVP genes showed selection signals in more than one independent lineage of these mammals. Furthermore, the vasopressin gene tree indicates greater similarity in river dolphin sequences despite the independence of their lineages based on the species tree. Patterns of distribution and enrichment of Transcription Factors in the promoter regions of target genes were analyzed and appear to be phylogenetically conserved among sister species. We found accelerated evolution signs in genes ACE, AQP1, AQP5, AQP7, AVP, NPP4, and NPR1 for the fluvial mammals. Together, these results allow a greater understanding of the molecular bases of the evolution of genes responsible for osmotic control in aquatic mammals.
Collapse
Affiliation(s)
- Elisa Ramos
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Giovanna Selleghin-Veiga
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Letícia Magpali
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Beatriz Daros
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Felipe Silva
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Agnello Picorelli
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Lucas Freitas
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil
| | - Mariana F Nery
- Laboratório de Genômica Evolutiva., Departamento de Genética, Evolução, Microbiologia e Imunologia, Universidade Estadual de Campinas, Cidade Universitária, Campinas, SP, 13083970, Brazil.
| |
Collapse
|
4
|
Thakur S, Planeta Kepp K, Mehra R. Predicting virus Fitness: Towards a structure-based computational model. J Struct Biol 2023; 215:108042. [PMID: 37931730 DOI: 10.1016/j.jsb.2023.108042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 10/12/2023] [Accepted: 11/03/2023] [Indexed: 11/08/2023]
Abstract
Predicting the impact of new emerging virus mutations is of major interest in surveillance and for understanding the evolutionary forces of the pathogens. The SARS-CoV-2 surface spike-protein (S-protein) binds to human ACE2 receptors as a critical step in host cell infection. At the same time, S-protein binding to human antibodies neutralizes the virus and prevents interaction with ACE2. Here we combine these two binding properties in a simple virus fitness model, using structure-based computation of all possible mutation effects averaged over 10 ACE2 complexes and 10 antibody complexes of the S-protein (∼380,000 computed mutations), and validated the approach against diverse experimental binding/escape data of ACE2 and antibodies. The ACE2-antibody selectivity change caused by mutation (i.e., the differential change in binding to ACE2 vs. immunity-inducing antibodies) is proposed to be a key metric of fitness model, enabling systematic error cancelation when evaluated. In this model, new mutations become fixated if they increase the selective binding to ACE2 relative to circulating antibodies, assuming that both are present in the host in a competitive binding situation. We use this model to categorize viral mutations that may best reach ACE2 before being captured by antibodies. Our model may aid the understanding of variant-specific vaccines and molecular mechanisms of viral evolution in the context of a human host.
Collapse
Affiliation(s)
- Shivani Thakur
- Department of Chemistry, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India
| | - Kasper Planeta Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kongens Lyngby, Denmark
| | - Rukmankesh Mehra
- Department of Chemistry, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India; Department of Bioscience and Biomedical Engineering, Indian Institute of Technology Bhilai, Kutelabhata, Durg - 491001, Chhattisgarh, India.
| |
Collapse
|
5
|
Sackerson C, Garcia V, Medina N, Maldonado J, Daly J, Cartwright R. Comparative analysis of the myoglobin gene in whales and humans reveals evolutionary changes in regulatory elements and expression levels. PLoS One 2023; 18:e0284834. [PMID: 37643191 PMCID: PMC10464968 DOI: 10.1371/journal.pone.0284834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 08/15/2023] [Indexed: 08/31/2023] Open
Abstract
Cetacea and other diving mammals have undergone numerous adaptations to their aquatic environment, among them high levels of the oxygen-carrying intracellular hemoprotein myoglobin in skeletal muscles. Hypotheses regarding the mechanisms leading to these high myoglobin levels often invoke the induction of gene expression by exercise, hypoxia, and other physiological gene regulatory pathways. Here we explore an alternative hypothesis: that cetacean myoglobin genes have evolved high levels of transcription driven by the intrinsic developmental mechanisms that drive muscle cell differentiation. We have used luciferase assays in differentiated C2C12 cells to test this hypothesis. Contrary to our hypothesis, we find that the myoglobin gene from the minke whale, Balaenoptera acutorostrata, shows a low level of expression, only about 8% that of humans. This low expression level is broadly shared among cetaceans and artiodactylans. Previous work on regulation of the human gene has identified a core muscle-specific enhancer comprised of two regions, the "AT element" and a C-rich sequence 5' of the AT element termed the "CCAC-box". Analysis of the minke whale gene supports the importance of the AT element, but the minke whale CCAC-box ortholog has little effect. Instead, critical positive input has been identified in a G-rich region 3' of the AT element. Also, a conserved E-box in exon 1 positively affects expression, despite having been assigned a repressive role in the human gene. Last, a novel region 5' of the core enhancer has been identified, which we hypothesize may function as a boundary element. These results illustrate regulatory flexibility during evolution. We discuss the possibility that low transcription levels are actually beneficial, and that evolution of the myoglobin protein toward enhanced stability is a critical factor in the accumulation of high myoglobin levels in adult cetacean muscle tissue.
Collapse
Affiliation(s)
- Charles Sackerson
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
| | - Vivian Garcia
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
| | - Nicole Medina
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
| | - Jessica Maldonado
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
| | - John Daly
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
| | - Rachel Cartwright
- Biology Department, California State University Channel Islands, Camarillo, California, United States of America
- The Keiki Kohola Project, Lahaina, Hawaii, United States of America
| |
Collapse
|
6
|
Stability and expression of SARS-CoV-2 spike-protein mutations. Mol Cell Biochem 2022; 478:1269-1280. [PMID: 36302994 PMCID: PMC9612610 DOI: 10.1007/s11010-022-04588-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 10/12/2022] [Indexed: 12/02/2022]
Abstract
Protein fold stability likely plays a role in SARS-CoV-2 S-protein evolution, together with ACE2 binding and antibody evasion. While few thermodynamic stability data are available for S-protein mutants, many systematic experimental data exist for their expression. In this paper, we explore whether such expression levels relate to the thermodynamic stability of the mutants. We studied mutation-induced SARS-CoV-2 S-protein fold stability, as computed by three very distinct methods and eight different protein structures to account for method- and structure-dependencies. For all methods and structures used (24 comparisons), computed stability changes correlate significantly (99% confidence level) with experimental yeast expression from the literature, such that higher expression is associated with relatively higher fold stability. Also significant, albeit weaker, correlations were seen between stability and ACE2 binding effects. The effect of thermodynamic fold stability may be direct or a correlate of amino acid or site properties, notably the solvent exposure of the site. Correlation between computed stability and experimental expression and ACE2 binding suggests that functional properties of the SARS-CoV-2 S-protein mutant space are largely determined by a few simple features, due to underlying correlations. Our study lends promise to the development of computational tools that may ideally aid in understanding and predicting SARS-CoV-2 S-protein evolution.
Collapse
|
7
|
Abstract
The rediscovery of Mendel’s work showing that the heredity of phenotypes is controlled by discrete genes was followed by the reconciliation of Mendelian genetics with evolution by natural selection in the middle of the last century with the Modern Synthesis. In the past two decades, dramatic advances in genomic methods have facilitated the identification of the loci, genes, and even individual mutations that underlie phenotypic variants that are the putative targets of natural selection. Moreover, these methods have also changed how we can study adaptation by flipping the problem around, allowing us to first examine what loci show evidence of having been under selection, and then connecting these genetic variants to phenotypic variation. As a result, we now have an expanding list of actual genetic changes that underlie potentially adaptive phenotypic variation. Here, we synthesize how considering the effects of these adaptive loci in the context of cellular environments, genomes, organisms, and populations has provided new insights to the genetic architecture of adaptation.
Collapse
|
8
|
Genomic insights into the secondary aquatic transition of penguins. Nat Commun 2022; 13:3912. [PMID: 35853876 PMCID: PMC9296559 DOI: 10.1038/s41467-022-31508-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 06/17/2022] [Indexed: 11/21/2022] Open
Abstract
Penguins lost the ability to fly more than 60 million years ago, subsequently evolving a hyper-specialized marine body plan. Within the framework of a genome-scale, fossil-inclusive phylogeny, we identify key geological events that shaped penguin diversification and genomic signatures consistent with widespread refugia/recolonization during major climate oscillations. We further identify a suite of genes potentially underpinning adaptations related to thermoregulation, oxygenation, diving, vision, diet, immunity and body size, which might have facilitated their remarkable secondary transition to an aquatic ecology. Our analyses indicate that penguins and their sister group (Procellariiformes) have the lowest evolutionary rates yet detected in birds. Together, these findings help improve our understanding of how penguins have transitioned to the marine environment, successfully colonizing some of the most extreme environments on Earth. This study examines the tempo and drivers of penguin diversification by combining genomes from all extant and recently extinct penguin lineages, stratigraphic data from fossil penguins and morphological and biogeographic data from all extant and extinct species. Together, these datasets provide new insights into the genetic basis and evolution of adaptations in penguins.
Collapse
|
9
|
Isogai Y, Imamura H, Sumi T, Shirai T. Improvement of Protein Solubility in Macromolecular Crowding during Myoglobin Evolution. Biochemistry 2022; 61:1543-1547. [PMID: 35674519 DOI: 10.1021/acs.biochem.2c00166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The inside of living cells is crowded by extremely high concentrations of biomolecules, and thus globular proteins should have been developed to increase their solubility under such crowding conditions during organic evolution. The O2-storage protein myoglobin (Mb) is known to be expressed in myocytes of diving mammals in much larger quantities than those of land mammals. We have previously resurrected ancient whale and pinniped Mbs and experimentally demonstrated that the diving animal Mbs have evolved to maintain high solubility under the crowding conditions or to increase their tolerance against macromolecular precipitants, rather than solubility in a dilute buffer solution. However, the detail of chemical mechanisms of the precipitant tolerance remains unclear. Here, we investigated pH dependence of the precipitant tolerance (β, slope of the solubility against precipitant concentration) of extant Mbs and plotted the β values, as well as those of ancestral Mbs, against their surface net charges (ZMb). The results demonstrated that the precipitant tolerance was approximated by the square of ZMb, that is, β = aZMb2 + b, in which a and b are constants. This effect of ZMb against the precipitation is not predicted by a classical excluded volume theory that gives constant β for Mbs but can be explained by electrostatic repulsion between Mb molecules. The present study elucidates how Mb molecules have evolved to increase their in vivo solubility and shows the physiological significance of either neutral or basic isoelectric points (pI) of the natural Mbs, rather than acidic pI.
Collapse
Affiliation(s)
- Yasuhiro Isogai
- Department of Pharmaceutical Engineering, Toyama Prefectural University, Imizu, Toyama 939-0398, Japan
| | - Hiroshi Imamura
- Department of Applied Chemistry, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan
| | - Tomonari Sumi
- Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-Naka, Kita-ku, Okayama 700-8530, Japan
| | - Tsuyoshi Shirai
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga 526-0829, Japan
| |
Collapse
|
10
|
Abstract
The spike protein (S-protein) of SARS-CoV-2, the protein that enables the virus to infect human cells, is the basis for many vaccines and a hotspot of concerning virus evolution. Here, we discuss the outstanding progress in structural characterization of the S-protein and how these structures facilitate analysis of virus function and evolution. We emphasize the differences in reported structures and that analysis of structure-function relationships is sensitive to the structure used. We show that the average residue solvent exposure in nearly complete structures is a good descriptor of open vs closed conformation states. Because of structural heterogeneity of functionally important surface-exposed residues, we recommend using averages of a group of high-quality protein structures rather than a single structure before reaching conclusions on specific structure-function relationships. To illustrate these points, we analyze some significant chemical tendencies of prominent S-protein mutations in the context of the available structures. In the discussion of new variants, we emphasize the selectivity of binding to ACE2 vs prominent antibodies rather than simply the antibody escape or ACE2 affinity separately. We note that larger chemical changes, in particular increased electrostatic charge or side-chain volume of exposed surface residues, are recurring in mutations of concern, plausibly related to adaptation to the negative surface potential of human ACE2. We also find indications that the fixated mutations of the S-protein in the main variants are less destabilizing than would be expected on average, possibly pointing toward a selection pressure on the S-protein. The richness of available structures for all of these situations provides an enormously valuable basis for future research into these structure-function relationships.
Collapse
Affiliation(s)
- Rukmankesh Mehra
- Department of Chemistry, Indian Institute
of Technology Bhilai, Sejbahar, Raipur 492015, Chhattisgarh,
India
| | - Kasper P. Kepp
- DTU Chemistry, Technical University of
Denmark, Building 206, 2800 Kongens Lyngby,
Denmark
| |
Collapse
|
11
|
Peng J, Svetec N, Zhao L. Intermolecular interactions drive protein adaptive and co-adaptive evolution at both species and population levels. Mol Biol Evol 2021; 39:6456312. [PMID: 34878126 PMCID: PMC8789070 DOI: 10.1093/molbev/msab350] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Proteins are the building blocks for almost all the functions in cells. Understanding the molecular evolution of proteins and the forces that shape protein evolution is essential in understanding the basis of function and evolution. Previous studies have shown that adaptation frequently occurs at the protein surface, such as in genes involved in host–pathogen interactions. However, it remains unclear whether adaptive sites are distributed randomly or at regions associated with particular structural or functional characteristics across the genome, since many proteins lack structural or functional annotations. Here, we seek to tackle this question by combining large-scale bioinformatic prediction, structural analysis, phylogenetic inference, and population genomic analysis of Drosophila protein-coding genes. We found that protein sequence adaptation is more relevant to function-related rather than structure-related properties. Interestingly, intermolecular interactions contribute significantly to protein adaptation. We further showed that intermolecular interactions, such as physical interactions, may play a role in the coadaptation of fast-adaptive proteins. We found that strongly differentiated amino acids across geographic regions in protein-coding genes are mostly adaptive, which may contribute to the long-term adaptive evolution. This strongly indicates that a number of adaptive sites tend to be repeatedly mutated and selected throughout evolution in the past, present, and maybe future. Our results highlight the important roles of intermolecular interactions and coadaptation in the adaptive evolution of proteins both at the species and population levels.
Collapse
Affiliation(s)
- Junhui Peng
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| |
Collapse
|
12
|
Isogai Y, Imamura H, Nakae S, Sumi T, Takahashi KI, Shirai T. Common and unique strategies of myoglobin evolution for deep-sea adaptation of diving mammals. iScience 2021; 24:102920. [PMID: 34430810 PMCID: PMC8374505 DOI: 10.1016/j.isci.2021.102920] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 07/04/2021] [Accepted: 07/27/2021] [Indexed: 11/18/2022] Open
Abstract
Myoglobin (Mb) is highly concentrated in the myocytes of diving mammals such as whales and seals, in comparison with land animals, and its molecular evolution has played a crucial role in their deep-sea adaptation. We previously resurrected ancestral whale Mbs and demonstrated the evolutional strategies for higher solubility under macromolecular crowding conditions. Pinnipeds, such as seals and sea lions, are also expert diving mammals with Mb-rich muscles. In the present study, we resurrected ancestral pinniped Mbs and investigated their biochemical and structural properties. Comparisons between pinniped and whale Mbs revealed the common and distinctive strategies for the deep-sea adaptation. The overall evolution processes, gaining precipitant tolerance and improving thermodynamic stability, were commonly observed. However, the strategies for improving the folding stability differed, and the pinniped Mbs exploited the shielding of hydrophobic surfaces more effectively than the whale Mbs.
Collapse
Affiliation(s)
- Yasuhiro Isogai
- Department of Pharmaceutical Engineering, Toyama Prefectural University, Imizu, Toyama 939-0398, Japan
- Corresponding author
| | - Hiroshi Imamura
- Department of Applied Chemistry, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan
| | - Setsu Nakae
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga 526-0829, Japan
| | - Tomonari Sumi
- Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-Naka, Kita-ku, Okayama 700-8530, Japan
| | - Ken-ichi Takahashi
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga 526-0829, Japan
| | - Tsuyoshi Shirai
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga 526-0829, Japan
- Corresponding author
| |
Collapse
|
13
|
Del Amparo R, Branco C, Arenas J, Vicens A, Arenas M. Analysis of selection in protein-coding sequences accounting for common biases. Brief Bioinform 2021; 22:6105943. [PMID: 33479739 DOI: 10.1093/bib/bbaa431] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 12/17/2020] [Accepted: 12/22/2020] [Indexed: 12/16/2022] Open
Abstract
The evolution of protein-coding genes is usually driven by selective processes, which favor some evolutionary trajectories over others, optimizing the subsequent protein stability and activity. The analysis of selection in this type of genetic data is broadly performed with the metric nonsynonymous/synonymous substitution rate ratio (dN/dS). However, most of the well-established methodologies to estimate this metric make crucial assumptions, such as lack of recombination or invariable codon frequencies along genes, which can bias the estimation. Here, we review the most relevant biases in the dN/dS estimation and provide a detailed guide to estimate this metric using state-of-the-art procedures that account for such biases, along with illustrative practical examples and recommendations. We also discuss the traditional interpretation of the estimated dN/dS emphasizing the importance of considering complementary biological information such as the role of the observed substitutions on the stability and function of proteins. This review is oriented to help evolutionary biologists that aim to accurately estimate selection in protein-coding sequences.
Collapse
Affiliation(s)
- Roberto Del Amparo
- CINBIO (Biomedical Research Center), University of Vigo, 36310 Vigo, Spain.,Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| | - Catarina Branco
- CINBIO (Biomedical Research Center), University of Vigo, 36310 Vigo, Spain.,Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| | - Jesús Arenas
- Unit of Microbiology and Immunology, University of Zaragoza, 50013 Zaragoza, Spain
| | - Alberto Vicens
- CINBIO (Biomedical Research Center), University of Vigo, 36310 Vigo, Spain.,Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| | - Miguel Arenas
- CINBIO (Biomedical Research Center), University of Vigo, 36310 Vigo, Spain.,Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain
| |
Collapse
|
14
|
Hindle AG. Diving deep: understanding the genetic components of hypoxia tolerance in marine mammals. J Appl Physiol (1985) 2020; 128:1439-1446. [PMID: 32324472 DOI: 10.1152/japplphysiol.00846.2019] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Marine mammals have highly specialized physiology, exhibited in many species by extreme breath-holding capabilities that allow deep dives and extended submergence. Cardiovascular control and cell-level hypoxia tolerance are key features of this phenotype. Identifying genomic signatures tied to physiology will be valuable in understanding these natural model species, which may generate translational opportunities to human diseases arising from hypoxic stress or tissue injury. Genomic analyses have now been conducted in dolphins, river dolphins, minke whales, bowhead whales, and polar bears, with multispecies studies exploring evolutionary signals across marine mammal lineages, encompassing extinct and extant divers. Single-species genome studies for sirenians do not yet exist. Extant marine mammals arose in three lineages from separate aquatic recolonizations. Their physiological specializations, along with these independent origins create an interesting case to examine convergent evolution. Although molecular mechanisms of hypoxia tolerance are not universally apparent across marine mammal genomic studies, altered evolutionary rates have been identified for genes linked to oxygen binding and transport (e.g., MB, HBA, and HBB), blood pressure control (e.g., endothelin pathway genes), and cell protection in multiple species. Despite convergent phenotypes across clades, instances of identical molecular convergence have been uncommon. Given the inherent logistical and regulatory difficulties associated with functional genetic experiments in marine mammals, several avenues of further investigation are suggested to enable validation of candidate genes for hypoxia tolerance: leveraging phylogeny to better understand convergent phenotypes; ontogenic studies to identify regulation of key genes underlying the elite, adult, hypoxia-tolerant physiology; and cell culture manipulations to understand gene function.
Collapse
Affiliation(s)
- Allyson G Hindle
- School of Life Sciences, University of Nevada, Las Vegas, Nevada
| |
Collapse
|
15
|
Daane JM, Giordano D, Coppola D, di Prisco G, Detrich HW, Verde C. Adaptations to environmental change: Globin superfamily evolution in Antarctic fishes. Mar Genomics 2019; 49:100724. [PMID: 31735579 DOI: 10.1016/j.margen.2019.100724] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2019] [Revised: 10/27/2019] [Accepted: 11/01/2019] [Indexed: 02/08/2023]
Abstract
The ancient origins and functional versatility of globins make them ideal subjects for studying physiological adaptation to environmental change. Our goals in this review are to describe the evolution of the vertebrate globin gene superfamily and to explore the structure/function relationships of hemoglobin, myoglobin, neuroglobin and cytoglobin in teleost fishes. We focus on the globins of Antarctic notothenioids, emphasizing their adaptive features as inferred from comparisons with human proteins. We dedicate this review to Guido di Prisco, our co-author, colleague, friend, and husband of C.V. Ever thoughtful, creative, and enthusiastic, Guido spearheaded study of the structure, function, and evolution of the hemoglobins of polar fishes - this review is testimony to his wide-ranging contributions. Throughout his career, Guido inspired younger scientists to embrace polar biological research, and he challenged researchers of all ages to explore evolutionary adaptation in the context of global climate change. Beyond his scientific contributions, we will miss his warmth, his culture, and his great intellect. Guido has left an outstanding legacy, one that will continue to inspire us and our research.
Collapse
Affiliation(s)
- Jacob M Daane
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA 01908, USA
| | - Daniela Giordano
- Institute of Biosciences and BioResources (IBBR), CNR, Via Pietro Castellino 111, 80131 Napoli, Italy; Department of Marine Biotechnology, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
| | - Daniela Coppola
- Institute of Biosciences and BioResources (IBBR), CNR, Via Pietro Castellino 111, 80131 Napoli, Italy; Department of Marine Biotechnology, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
| | - Guido di Prisco
- Institute of Biosciences and BioResources (IBBR), CNR, Via Pietro Castellino 111, 80131 Napoli, Italy
| | - H William Detrich
- Department of Marine and Environmental Sciences, Northeastern University Marine Science Center, Nahant, MA 01908, USA
| | - Cinzia Verde
- Institute of Biosciences and BioResources (IBBR), CNR, Via Pietro Castellino 111, 80131 Napoli, Italy; Department of Marine Biotechnology, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy.
| |
Collapse
|
16
|
Isogai Y, Imamura H, Nakae S, Sumi T, Takahashi KI, Nakagawa T, Tsuneshige A, Shirai T. Tracing whale myoglobin evolution by resurrecting ancient proteins. Sci Rep 2018; 8:16883. [PMID: 30442991 PMCID: PMC6237822 DOI: 10.1038/s41598-018-34984-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 10/29/2018] [Indexed: 11/24/2022] Open
Abstract
Extant cetaceans, such as sperm whale, acquired the great ability to dive into the ocean depths during the evolution from their terrestrial ancestor that lived about 50 million years ago. Myoglobin (Mb) is highly concentrated in the myocytes of diving animals, in comparison with those of land animals, and is thought to play a crucial role in their adaptation as the molecular aqualung. Here, we resurrected ancestral whale Mbs, which are from the common ancestor between toothed and baleen whales (Basilosaurus), and from a further common quadrupedal ancestor between whale and hippopotamus (Pakicetus). The experimental and theoretical analyses demonstrated that whale Mb adopted two distinguished strategies to increase the protein concentration in vivo along the evolutionary history of deep sea adaptation; gaining precipitant tolerance in the early phase of the evolution, and increase of folding stability in the late phase.
Collapse
Affiliation(s)
- Yasuhiro Isogai
- Department of Pharmaceutical Engineering, Toyama Prefectural University, Imizu, Toyama, 939-0398, Japan.
| | - Hiroshi Imamura
- Department of Applied Chemistry, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga, 525-8577, Japan
| | - Setsu Nakae
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga, 526-0829, Japan
| | - Tomonari Sumi
- Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-Naka, Kita-ku, Okayama, 700-8530, Japan
| | - Ken-Ichi Takahashi
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga, 526-0829, Japan
| | - Taro Nakagawa
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga, 526-0829, Japan
| | - Antonio Tsuneshige
- Department of Frontier Bioscience and Research Center for Micro-Nano Technology, Hosei University, Koganei, Tokyo, Japan
| | - Tsuyoshi Shirai
- Department of Computer Bioscience, Nagahama Institute of Bio-Science and Technology, 1266 Tamura-Cho, Nagahama, Shiga, 526-0829, Japan.
| |
Collapse
|
17
|
Dasmeh P, Serohijos AWR. Estimating the contribution of folding stability to nonspecific epistasis in protein evolution. Proteins 2018; 86:1242-1250. [DOI: 10.1002/prot.25588] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 06/28/2018] [Accepted: 07/18/2018] [Indexed: 12/28/2022]
Affiliation(s)
- Pouria Dasmeh
- Department of BiochemistryUniversity of Montreal Montreal Quebec Canada
- Cedergren Center for Bioinformatics and GenomicsUniversity of Montreal Montreal, Quebec Canada
- Department of Biochemistry and Institute for Data Valorization (IVADO)University of Montreal Montreal, Quebec Canada
| | - Adrian W. R. Serohijos
- Department of BiochemistryUniversity of Montreal Montreal Quebec Canada
- Cedergren Center for Bioinformatics and GenomicsUniversity of Montreal Montreal, Quebec Canada
| |
Collapse
|
18
|
Saarman NP, Kober KM, Simison WB, Pogson GH. Sequence-Based Analysis of Thermal Adaptation and Protein Energy Landscapes in an Invasive Blue Mussel (Mytilus galloprovincialis). Genome Biol Evol 2018; 9:2739-2751. [PMID: 28985307 PMCID: PMC5647807 DOI: 10.1093/gbe/evx190] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/13/2017] [Indexed: 12/12/2022] Open
Abstract
Adaptive responses to thermal stress in poikilotherms plays an important role in determining competitive ability and species distributions. Amino acid substitutions that affect protein stability and modify the thermal optima of orthologous proteins may be particularly important in this context. Here, we examine a set of 2,770 protein-coding genes to determine if proteins in a highly invasive heat tolerant blue mussel (Mytilus galloprovincialis) contain signals of adaptive increases in protein stability relative to orthologs in a more cold tolerant M. trossulus. Such thermal adaptations might help to explain, mechanistically, the success with which the invasive marine mussel M. galloprovincialis has displaced native species in contact zones in the eastern (California) and western (Japan) Pacific. We tested for stabilizing amino acid substitutions in warm tolerant M. galloprovincialis relative to cold tolerant M. trossulus with a generalized linear model that compares in silico estimates of recent changes in protein stability among closely related congeners. Fixed substitutions in M. galloprovincialis were 3,180.0 calories per mol per substitution more stabilizing at genes with both elevated dN/dS ratios and transcriptional responses to heat stress, and 705.8 calories per mol per substitution more stabilizing across all 2,770 loci investigated. Amino acid substitutions concentrated in a small number of genes were more stabilizing in M. galloprovincialis compared with cold tolerant M. trossulus. We also tested for, but did not find, enrichment of a priori GO terms in genes with elevated dN/dS ratios in M. galloprovincialis. This might indicate that selection for thermodynamic stability is generic across all lineages, and suggests that the high change in estimated protein stability that we observed in M. galloprovincialis is driven by selection for extra stabilizing substitutions, rather than by higher incidence of selection in a greater number of genes in this lineage. Nonetheless, our finding of more stabilizing amino acid changes in the warm adapted lineage is important because it suggests that adaption for thermal stability has contributed to M. galloprovincialis’ superior tolerance to heat stress, and that pairing tests for positive selection and tests for transcriptional response to heat stress can identify candidates of protein stability adaptation.
Collapse
Affiliation(s)
- Norah P Saarman
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz.,Department of Ecology and Evolutionary Biology, Yale University
| | - Kord M Kober
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz.,Department of Physiological Nursing, University of California, San Francisco.,Institute for Computational Health Sciences, University of California, San Francisco
| | - W Brian Simison
- Center for Comparative Genomics, California Academy of Sciences, San Francisco, California
| | - Grant H Pogson
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz
| |
Collapse
|
19
|
Dasmeh P, Kepp KP. Superoxide dismutase 1 is positively selected to minimize protein aggregation in great apes. Cell Mol Life Sci 2017; 74:3023-3037. [PMID: 28389720 PMCID: PMC11107616 DOI: 10.1007/s00018-017-2519-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Revised: 03/17/2017] [Accepted: 04/03/2017] [Indexed: 12/14/2022]
Abstract
Positive (adaptive) selection has recently been implied in human superoxide dismutase 1 (SOD1), a highly abundant antioxidant protein with energy signaling and antiaging functions, one of very few examples of direct selection on a human protein product (exon); the molecular drivers of this selection are unknown. We mapped 30 extant SOD1 sequences to the recently established mammalian species tree and inferred ancestors, key substitutions, and signatures of selection during the protein's evolution. We detected elevated substitution rates leading to great apes (Hominidae) at ~1 per 2 million years, significantly higher than in other primates and rodents, although these paradoxically generally evolve much faster. The high evolutionary rate was partly due to relaxation of some selection pressures and partly to distinct positive selection of SOD1 in great apes. We then show that higher stability and net charge and changes at the dimer interface were selectively introduced upon separation from old world monkeys and lesser apes (gibbons). Consequently, human, chimpanzee and gorilla SOD1s have a net charge of -6 at physiological pH, whereas the closely related gibbons and macaques have -3. These features consistently point towards selection against the malicious aggregation effects of elevated SOD1 levels in long-living great apes. The findings mirror the impact of human SOD1 mutations that reduce net charge and/or stability and cause ALS, a motor neuron disease characterized by oxidative stress and SOD1 aggregates and triggered by aging. Our study thus marks an example of direct selection for a particular chemical phenotype (high net charge and stability) in a single human protein with possible implications for the evolution of aging.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
- Department of Biochemistry and Cedergren Center for Bioinformatics and Genomics, Faculty of Medicine, University of Montreal, 2900 Edouard-Montpetit, Montreal, QC, H3T 1J4, Canada
| | - Kasper P Kepp
- Technical University of Denmark, DTU Chemistry, 2800, Kongens Lyngby, Denmark.
| |
Collapse
|
20
|
|
21
|
Suvorov A, Jensen NO, Sharkey CR, Fujimoto MS, Bodily P, Wightman HMC, Ogden TH, Clement MJ, Bybee SM. Opsins have evolved under the permanent heterozygote model: insights from phylotranscriptomics of Odonata. Mol Ecol 2016; 26:1306-1322. [DOI: 10.1111/mec.13884] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Revised: 09/24/2016] [Accepted: 10/04/2016] [Indexed: 02/04/2023]
Affiliation(s)
- Anton Suvorov
- Department of Biology; Brigham Young University; Provo UT 84602 USA
| | | | | | | | - Paul Bodily
- Computer Science Department; Brigham Young University; Provo UT 84602 USA
| | | | - T. Heath Ogden
- Department of Biology; Utah Valley University; Orem UT 84058 USA
| | - Mark J. Clement
- Computer Science Department; Brigham Young University; Provo UT 84602 USA
| | - Seth M. Bybee
- Department of Biology; Brigham Young University; Provo UT 84602 USA
| |
Collapse
|
22
|
Bershtein S, Serohijos AW, Shakhnovich EI. Bridging the physical scales in evolutionary biology: from protein sequence space to fitness of organisms and populations. Curr Opin Struct Biol 2016; 42:31-40. [PMID: 27810574 DOI: 10.1016/j.sbi.2016.10.013] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2016] [Accepted: 10/14/2016] [Indexed: 01/11/2023]
Abstract
Bridging the gap between the molecular properties of proteins and organismal/population fitness is essential for understanding evolutionary processes. This task requires the integration of the several physical scales of biological organization, each defined by a distinct set of mechanisms and constraints, into a single unifying model. The molecular scale is dominated by the constraints imposed by the physico-chemical properties of proteins and their substrates, which give rise to trade-offs and epistatic (non-additive) effects of mutations. At the systems scale, biological networks modulate protein expression and can either buffer or enhance the fitness effects of mutations. The population scale is influenced by the mutational input, selection regimes, and stochastic changes affecting the size and structure of populations, which eventually determine the evolutionary fate of mutations. Here, we summarize the recent advances in theory, computer simulations, and experiments that advance our understanding of the links between various physical scales in biology.
Collapse
Affiliation(s)
- Shimon Bershtein
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer-Sheva 84501, Israel
| | - Adrian Wr Serohijos
- Département de Biochimie, Centre Robert-Cedergren en Bioinformatique & Génomique, Université de Montréal, Montréal, QC H3T 1J4, Canada
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, United States.
| |
Collapse
|
23
|
Machado JP, Philip S, Maldonado E, O'Brien SJ, Johnson WE, Antunes A. Positive Selection Linked with Generation of Novel Mammalian Dentition Patterns. Genome Biol Evol 2016; 8:2748-59. [PMID: 27613398 PMCID: PMC5630915 DOI: 10.1093/gbe/evw200] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
A diverse group of genes are involved in the tooth development of mammals. Several studies, focused mainly on mice and rats, have provided a detailed depiction of the processes coordinating tooth formation and shape. Here we surveyed 236 tooth-associated genes in 39 mammalian genomes and tested for signatures of selection to assess patterns of molecular adaptation in genes regulating mammalian dentition. Of the 236 genes, 31 (∼13.1%) showed strong signatures of positive selection that may be responsible for the phenotypic diversity observed in mammalian dentition. Mammalian-specific tooth-associated genes had accelerated mutation rates compared with older genes found across all vertebrates. More recently evolved genes had fewer interactions (either genetic or physical), were associated with fewer Gene Ontology terms and had faster evolutionary rates compared with older genes. The introns of these positively selected genes also exhibited accelerated evolutionary rates, which may reflect additional adaptive pressure in the intronic regions that are associated with regulatory processes that influence tooth-gene networks. The positively selected genes were mainly involved in processes like mineralization and structural organization of tooth specific tissues such as enamel and dentin. Of the 236 analyzed genes, 12 mammalian-specific genes (younger genes) provided insights on diversification of mammalian teeth as they have higher evolutionary rates and exhibit different expression profiles compared with older genes. Our results suggest that the evolution and development of mammalian dentition occurred in part through positive selection acting on genes that previously had other functions.
Collapse
Affiliation(s)
- João Paulo Machado
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal Abel Salazar Biomedical Sciences Institute (ICBAS), University of Porto, Porto, Portugal
| | - Siby Philip
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Porto, Portugal
| | - Emanuel Maldonado
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - Stephen J O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, St. Petersburg State University, St. Petersburg, Russia Oceanographic Center, Nova Southeastern University, Ft Lauderdale
| | - Warren E Johnson
- Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, Virginia, USA
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal Abel Salazar Biomedical Sciences Institute (ICBAS), University of Porto, Porto, Portugal Department of Biology, Faculty of Sciences, University of Porto, Porto, Portugal
| |
Collapse
|
24
|
Tracking evolution of myoglobin stability in cetaceans using experimentally calibrated computational methods that account for generic protein relaxation. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2016; 1864:825-34. [DOI: 10.1016/j.bbapap.2016.04.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 04/05/2016] [Accepted: 04/07/2016] [Indexed: 11/22/2022]
|
25
|
Wheeler LC, Lim SA, Marqusee S, Harms MJ. The thermostability and specificity of ancient proteins. Curr Opin Struct Biol 2016; 38:37-43. [PMID: 27288744 DOI: 10.1016/j.sbi.2016.05.015] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Revised: 05/18/2016] [Accepted: 05/24/2016] [Indexed: 11/16/2022]
Abstract
Were ancient proteins systematically different than modern proteins? The answer to this question is profoundly important, shaping how we understand the origins of protein biochemical, biophysical, and functional properties. Ancestral sequence reconstruction (ASR), a phylogenetic approach to infer the sequences of ancestral proteins, may reveal such trends. We discuss two proposed trends: a transition from higher to lower thermostability and a tendency for proteins to acquire higher specificity over time. We review the evidence for elevated ancestral thermostability and discuss its possible origins in a changing environmental temperature and/or reconstruction bias. We also conclude that there is, as yet, insufficient data to support a trend from promiscuity to specificity. Finally, we propose future work to understand these proposed evolutionary trends.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States; Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA, United States.
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, United States; Institute of Molecular Biology, University of Oregon, Eugene, OR, United States.
| |
Collapse
|
26
|
Tian R, Wang Z, Niu X, Zhou K, Xu S, Yang G. Evolutionary Genetics of Hypoxia Tolerance in Cetaceans during Diving. Genome Biol Evol 2016; 8:827-39. [PMID: 26912402 PMCID: PMC4824146 DOI: 10.1093/gbe/evw037] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Hypoxia was a major challenge faced by cetaceans during the course of secondary aquatic adaptation. Although physiological traits of hypoxia tolerance in cetaceans have been well characterized, the underlying molecular mechanisms remain unknown. We investigated the sequences of 17 hypoxia-tolerance-related genes in representative cetaceans to provide a comprehensive insight into the genetic basis of hypoxia tolerance in these animals. Genes involved in carrying and transporting oxygen in the blood and muscle (hemoglobin-α and β, myoglobin), and genes involved in the regulation of vasoconstriction (endothelin-1, -2, and -3; endothelin receptor type A and B; adrenergic receptor α-1D; and arginine vasopressin) appear to have undergone adaptive evolution, evidence for positive selection on their particular sites, and radical physiochemical property changes of selected condons. Interestingly, “long-diving” cetaceans had relatively higher ω (dN/dS) values than “short-diving” cetaceans for the hemoglobin β gene, indicating divergent selective pressure presented in cetacean lineages with different diving abilities. Additionally, parallel positive selection or amino acid changes (ADRA1D: P50A, A53G, AVPR1B: I/V270T) among animals exposed to different hypoxia habitats reflect functional convergence or similar genetic mechanisms of hypoxia tolerance. In summary, positive selection, divergent selective pressures, and parallel evolution at the molecular level provided some new insights into the genetic adaptation of hypoxia tolerance.
Collapse
Affiliation(s)
- Ran Tian
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Zhengfei Wang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xu Niu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Kaiya Zhou
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Shixia Xu
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Guang Yang
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| |
Collapse
|
27
|
Diekmann Y, Pereira-Leal JB. Gene Tree Affects Inference of Sites Under Selection by the Branch-Site Test of Positive Selection. Evol Bioinform Online 2016; 11:11-7. [PMID: 26819542 PMCID: PMC4718152 DOI: 10.4137/ebo.s30902] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Revised: 10/27/2015] [Accepted: 11/01/2015] [Indexed: 12/30/2022] Open
Abstract
The branch-site test of positive selection is a standard approach to detect past episodic positive selection in a priori-specified branches of a gene phylogeny. Here, we ask if differences in the topology of the gene tree have any influence on the ability to infer positively selected sites. Using simulated sequences, we compare the results obtained for true and rearranged topologies. We find a strong relationship between “conflicting branch length,” which occurs when the set of sequences that experiences selection for a given topology and foreground is changed, and the ability to predict positively selected sites. Moreover, by reanalyzing a previously published data set, we show that the choice of a gene tree also affects the results obtained for real-world sequences. This is the first study to demonstrate that tree topology has a clear effect on the inference of positive selection. We conclude that the choice of a gene tree is an important factor for the branch-site analysis of positive selection.
Collapse
Affiliation(s)
- Yoan Diekmann
- Instituto Gulbenkian de Ciência, Oeiras, Portugal.; Programa de Doutoramento em Biologia Computational (PDBC), Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | | |
Collapse
|
28
|
Gu X, Tang W. Model parameters of molecular evolution explain genomic correlations. Brief Bioinform 2015; 18:37-42. [PMID: 26628558 DOI: 10.1093/bib/bbv098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 10/01/2015] [Indexed: 11/13/2022] Open
Abstract
One long-standing research focus in evolutionary genomics is trying to resolve how biological variables (expression, essentiality, protein-protein interaction, structural stability, etc.) determine the rate of protein evolution. While these studies have considerably deepened our understanding of molecular evolution, many issues remain unsolved. In this opinion article, after having a brief survey of literatures, we establish relationships between model parameters of molecular evolution and genomic variables, based on which, most-observed genomic correlations and confounds can be explained by model parameter combinations under different conditions, which include the strength of stabilizing selection, mutational variance, expression sufficiency, gene pleiotropy, as well as the effective population size. We suggest that the problem to discern biological variable(s) that may determine the rate of protein evolution can be tackled at two levels. The first level, as discussed here, is to demonstrate how the model of molecular evolution can predict potential genomic correlations under various conditions. And the second level is to estimate genome-wide variations of model parameters (or combinations) that help to identify canonical biological variables that may underlie the rate variation among genes that ranges up to at least three magnitudes.
Collapse
|
29
|
Padhi A, Ma L. A testis-specific gene within a widely expressed gene: Contrasting evolutionary patterns of two differentially expressed mammalian proteins encoded by a single gene, CAMK4. Anim Genet 2015; 46:683-92. [PMID: 26388303 DOI: 10.1111/age.12358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2015] [Indexed: 11/28/2022]
Abstract
Understanding the patterns of genetic variations within fertility-related genes and the evolutionary forces that shape such variations is crucial in predicting the fitness landscapes of subsequent generations. This study reports distinct evolutionary features of two differentially expressed mammalian proteins [CaMKIV (Ca(2+) /calmodulin-dependent protein kinase IV) and CaS (calspermin)] that are encoded by a single gene, CAMK4. The multifunctional CaMKIV, which is expressed in multiple tissues including testis and ovary, is evolving at a relatively low rate (0.46-0.64 × 10(-9) nucleotide substitutions/site/year), whereas the testis-specific CaS gene, which is predominantly expressed in post-meiotic cells, evolves at least three to four times faster (1.48-1.98 × 10(-9) substitutions/site/year). Concomitantly, maximum-likelihood-based selection analyses revealed that the ubiquitously expressed CaMKIV is constrained by intense purifying selection and, therefore, remained functionally highly conserved throughout the mammalian evolution, whereas the testis-specific CaS gene is under strong positive selection. The substitution rates of different mammalian lineages within both genes are positively correlated with GC content, indicating the possible influence of GC-biased gene conversion on the estimated substitution rates. The observation of such unusually high GC content of the CaS gene (≈74%), particularly in the lineage that comprises the bovine species, suggests the possible role of GC-biased gene conversion in the evolution of CaS that mimics positive selection.
Collapse
Affiliation(s)
- Abinash Padhi
- Department of Animal and Avian Sciences, University of Maryland, College Park, 20742, MD, USA
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, 20742, MD, USA
| |
Collapse
|
30
|
Studer RA, Opperdoes FR, Nicolaes GAF, Mulder AB, Mulder R. Understanding the functional difference between growth arrest-specific protein 6 and protein S: an evolutionary approach. Open Biol 2015; 4:rsob.140121. [PMID: 25339693 PMCID: PMC4221892 DOI: 10.1098/rsob.140121] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Although protein S (PROS1) and growth arrest-specific protein 6 (GAS6) proteins are homologous with a high degree of structural similarity, they are functionally different. The objectives of this study were to identify the evolutionary origins from which these functional differences arose. Bioinformatics methods were used to estimate the evolutionary divergence time and to detect the amino acid residues under functional divergence between GAS6 and PROS1. The properties of these residues were analysed in the light of their three-dimensional structures, such as their stability effects, the identification of electrostatic patches and the identification potential protein-protein interaction. The divergence between GAS6 and PROS1 probably occurred during the whole-genome duplications in vertebrates. A total of 78 amino acid sites were identified to be under functional divergence. One of these sites, Asn463, is involved in N-glycosylation in GAS6, but is mutated in PROS1, preventing this post-translational modification. Sites experiencing functional divergence tend to express a greater diversity of stabilizing/destabilizing effects than sites that do not experience such functional divergence. Three electrostatic patches in the LG1/LG2 domains were found to differ between GAS6 and PROS1. Finally, a surface responsible for protein-protein interactions was identified. These results may help researchers to analyse disease-causing mutations in the light of evolutionary and structural constraints, and link genetic pathology to clinical phenotypes.
Collapse
Affiliation(s)
- Romain A Studer
- European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fred R Opperdoes
- Laboratory of Biochemistry, de Duve Institute and Université catholique de Louvain, Brussels 1200, Belgium
| | - Gerry A F Nicolaes
- Department of Biochemistry, Cardiovascular Research Institute Maastricht, Maastricht University, Maastricht, The Netherlands
| | - André B Mulder
- Department of Laboratory Medicine, University Medical Centre Groningen, Groningen, The Netherlands
| | - René Mulder
- Department of Laboratory Medicine, University Medical Centre Groningen, Groningen, The Netherlands
| |
Collapse
|
31
|
Abstract
The world of primate genomics is expanding rapidly in new and exciting ways owing to lowered costs and new technologies in molecular methods and bioinformatics. The primate order is composed of 78 genera and 478 species, including human. Taxonomic inferences are complex and likely a consequence of ongoing hybridization, introgression, and reticulate evolution among closely related taxa. Recently, we applied large-scale sequencing methods and extensive taxon sampling to generate a highly resolved phylogeny that affirms, reforms, and extends previous depictions of primate speciation. The next stage of research uses this phylogeny as a foundation for investigating genome content, structure, and evolution across primates. Ongoing and future applications of a robust primate phylogeny are discussed, highlighting advancements in adaptive evolution of genes and genomes, taxonomy and conservation management of endangered species, next-generation genomic technologies, and biomedicine.
Collapse
Affiliation(s)
- Jill Pecon-Slattery
- Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland 21702; Current Affiliation: Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, Virginia 22630;
| |
Collapse
|
32
|
Kepp KP. Towards a "Golden Standard" for computing globin stability: Stability and structure sensitivity of myoglobin mutants. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2015; 1854:1239-48. [PMID: 26054434 DOI: 10.1016/j.bbapap.2015.06.002] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 05/28/2015] [Accepted: 06/04/2015] [Indexed: 12/26/2022]
Abstract
Fast and accurate computation of protein stability is increasingly important for e.g. protein engineering and protein misfolding diseases, but no consensus methods exist for important proteins such as globins, and performance may depend on the type of structural input given. This paper reports benchmarking of six protein stability calculators (POPMUSIC 2.1, I-Mutant 2.0, I-Mutant 3.0, CUPSAT, SDM, and mCSM) against 134 experimental stability changes for mutations of sperm-whale myoglobin. Six different high-resolution structures were used to test structure sensitivity that may impair protein calculations. The trend accuracy of the methods decreased as I-Mutant 2.0 (R=0.64-0.65), SDM (R=0.57-0.60), POPMUSIC2.1 (R=0.54-0.57), I-Mutant 3.0 (R=0.53-0.55), mCSM (R=0.35-0.47), and CUPSAT (R=0.25-0.48). The mean signed errors increased as SDM<CUPSAT<I-Mutant 2.0<I-Mutant 3.0<POPMUSIC 2.1<mCSM. Mean absolute errors increased as I-Mutant 2.0<I-Mutant 3.0<POPMUSIC 2.1<CUPSAT<SDM<mCSM. Structural sensitivity increased as I-Mutant 3.0 (0.05)<I-Mutant 2.0 (0.09)<POPMUSIC 2.1 (0.12)<SDM (0.18)<mCSM (0.27)<CUPSAT (0.58). Leaving out heterogeneous experimental data did not change conclusions. The distinct performances reveal room for improvement, but I-Mutant 2.0 is proficient for this purpose, as further validated against a data set of related cytochrome c like proteins. The results also emphasize the importance of high-quality crystal structures and reveal structure-dependent effects even in the near-atomic resolution limit.
Collapse
Affiliation(s)
- Kasper P Kepp
- Technical University of Denmark, DTU Chemistry, DK-2800 Kongens Lyngby, Denmark.
| |
Collapse
|
33
|
Wright TJ, Davis RW. Myoglobin oxygen affinity in aquatic and terrestrial birds and mammals. ACTA ACUST UNITED AC 2015; 218:2180-9. [PMID: 25987728 DOI: 10.1242/jeb.119321] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2015] [Accepted: 05/08/2015] [Indexed: 12/30/2022]
Abstract
Myoglobin (Mb) is an oxygen binding protein found in vertebrate skeletal muscle, where it facilitates intracellular transport and storage of oxygen. This protein has evolved to suit unique physiological needs in the muscle of diving vertebrates that express Mb at much greater concentrations than their terrestrial counterparts. In this study, we characterized Mb oxygen affinity (P50) from 25 species of aquatic and terrestrial birds and mammals. Among diving species, we tested for correlations between Mb P50 and routine dive duration. Across all species examined, Mb P50 ranged from 2.40 to 4.85 mmHg. The mean P50 of Mb from terrestrial ungulates was 3.72±0.15 mmHg (range 3.70-3.74 mmHg). The P50 of cetaceans was similar to terrestrial ungulates ranging from 3.54 to 3.82 mmHg, with the exception of the melon-headed whale, which had a significantly higher P50 of 4.85 mmHg. Among pinnipeds, the P50 ranged from 3.23 to 3.81 mmHg and showed a trend for higher oxygen affinity in species with longer dive durations. Among diving birds, the P50 ranged from 2.40 to 3.36 mmHg and also showed a trend of higher affinities in species with longer dive durations. In pinnipeds and birds, low Mb P50 was associated with species whose muscles are metabolically active under hypoxic conditions associated with aerobic dives. Given the broad range of potential globin oxygen affinities, Mb P50 from diverse vertebrate species appears constrained within a relatively narrow range. High Mb oxygen affinity within this range may be adaptive for some vertebrates that make prolonged dives.
Collapse
Affiliation(s)
- Traver J Wright
- Department of Marine Biology, Texas A&M University at Galveston, Galveston, TX 77551, USA Department of Wildlife and Fisheries Sciences, Texas A&M University, College Station, TX 77843, USA
| | - Randall W Davis
- Department of Marine Biology, Texas A&M University at Galveston, Galveston, TX 77551, USA Department of Wildlife and Fisheries Sciences, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|
34
|
Padhi A, Ma L. Time-dependent selection pressure on two arthropod-borne RNA viruses in the same serogroup. INFECTION GENETICS AND EVOLUTION 2015; 32:255-64. [PMID: 25801608 DOI: 10.1016/j.meegid.2015.03.019] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Revised: 03/11/2015] [Accepted: 03/15/2015] [Indexed: 12/20/2022]
Abstract
Understanding the genetic basis of viral adaptation to taxonomically diverse groups of host species inhabiting different eco-climatic zones is crucial for the discovery of factors underpinning the successful establishment of these infectious pathogens in new hosts/environments. To gain insights into the dynamics of nonsynonymous (dN) and synonymous substitutions (dS) and the ratio between the two (ω=dN/dS), we analyzed the complete nucleotide coding sequence data of the M segment, which encodes glycoproteins of two negative-sense RNA viruses, Akabane virus (AKV) and Schmallenberg virus (SBV) that belong to the same serogroup. While AKV is relatively older and has been circulating in ruminant populations since 1970s, SBV was first reported in 2011. The ω was estimated to be 1.67 and 0.09 for SBV and AKV, respectively, and the estimated mutation rate of SBV is at least 25 times higher than that of AKV. Given the different evolutionary stages of the two viruses, most of the slightly deleterious mutations were likely purged out or kept in low frequency in the AKV genome, whereas positive selection together with the accumulation of slightly deleterious mutations might contribute to such an inflated mutation rate of SBV. The evolutionary distance (d) is nonlinearly and negatively correlated with ω, but is positively correlated with dN and dS. Collectively, the different patterns in ω, dN, dS, and d between AKV and SBV identified in this study provide empirical evidence for a time-dependent selection pressure.
Collapse
Affiliation(s)
- Abinash Padhi
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA.
| | - Li Ma
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
35
|
Kim DS, Hahn Y. The acquisition of novel N-glycosylation sites in conserved proteins during human evolution. BMC Bioinformatics 2015; 16:29. [PMID: 25628020 PMCID: PMC4314935 DOI: 10.1186/s12859-015-0468-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 01/15/2015] [Indexed: 12/19/2022] Open
Abstract
Background N-linked protein glycosylation plays an important role in various biological processes, including protein folding and trafficking, and cell adhesion and signaling. The acquisition of a novel N-glycosylation site may have significant effect on protein structure and function, and therefore, on the phenotype. Results We analyzed the human glycoproteome data set (2,534 N-glycosylation sites in 1,027 proteins) and identified 112 novel N-glycosylation sites in 91 proteins that arose in the human lineage since the last common ancestor of Euarchonta (primates and treeshrews). Three of them, Asn-196 in adipocyte plasma membrane-associated protein (APMAP), Asn-91 in cluster of differentiation 166 (CD166/ALCAM), and Asn-76 in thyroglobulin, are human-specific. Molecular evolutionary analysis suggested that these sites were under positive selection during human evolution. Notably, the Asn-76 of thyroglobulin might be involved in the increased production of thyroid hormones in humans, especially thyroxine (T4), because the removal of the glycan moiety from this site was reported to result in a significant decrease in T4 production. Conclusions We propose that the novel N-glycosylation sites described in this study may be useful candidates for functional analyses to identify innovative genetic modifications for beneficial phenotypes acquired in the human lineage. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0468-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Yoonsoo Hahn
- Department of Life Science, Research Center for Biomolecules and Biosystems, Chung-Ang University, 84 Heukseok-ro, Dongjak-gu 156-756, Seoul, Korea.
| |
Collapse
|
36
|
Dasmeh P, Serohijos AWR, Kepp KP, Shakhnovich EI. The influence of selection for protein stability on dN/dS estimations. Genome Biol Evol 2014; 6:2956-67. [PMID: 25355808 PMCID: PMC4224349 DOI: 10.1093/gbe/evu223] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Understanding the relative contributions of various evolutionary processes-purifying selection, neutral drift, and adaptation-is fundamental to evolutionary biology. A common metric to distinguish these processes is the ratio of nonsynonymous to synonymous substitutions (i.e., dN/dS) interpreted from the neutral theory as a null model. However, from biophysical considerations, mutations have non-negligible effects on the biophysical properties of proteins such as folding stability. In this work, we investigated how stability affects the rate of protein evolution in phylogenetic trees by using simulations that combine explicit protein sequences with associated stability changes. We first simulated myoglobin evolution in phylogenetic trees with a biophysically realistic approach that accounts for 3D structural information and estimates of changes in stability upon mutation. We then compared evolutionary rates inferred directly from simulation to those estimated using maximum-likelihood (ML) methods. We found that the dN/dS estimated by ML methods (ωML) is highly predictive of the per gene dN/dS inferred from the simulated phylogenetic trees. This agreement is strong in the regime of high stability where protein evolution is neutral. At low folding stabilities and under mutation-selection balance, we observe deviations from neutrality (per gene dN/dS > 1 and dN/dS < 1). We showed that although per gene dN/dS is robust to these deviations, ML tests for positive selection detect statistically significant per site dN/dS > 1. Altogether, we show how protein biophysics affects the dN/dS estimations and its subsequent interpretation. These results are important for improving the current approaches for detecting positive selection.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University DTU Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark Present address: Max Planck Institute of Immunobiology and Epigenetics, Stübeweg, Freiburg, Germany
| | | | - Kasper P Kepp
- DTU Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
| | | |
Collapse
|
37
|
Merging molecular mechanism and evolution: theory and computation at the interface of biophysics and evolutionary population genetics. Curr Opin Struct Biol 2014; 26:84-91. [PMID: 24952216 DOI: 10.1016/j.sbi.2014.05.005] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Revised: 04/19/2014] [Accepted: 05/16/2014] [Indexed: 11/24/2022]
Abstract
The variation among sequences and structures in nature is both determined by physical laws and by evolutionary history. However, these two factors are traditionally investigated by disciplines with different emphasis and philosophy-molecular biophysics on one hand and evolutionary population genetics in another. Here, we review recent theoretical and computational approaches that address the crucial need to integrate these two disciplines. We first articulate the elements of these approaches. Then, we survey their contribution to our mechanistic understanding of molecular evolution, the polymorphisms in coding region, the distribution of fitness effects (DFE) of mutations, the observed folding stability of proteins in nature, and the distribution of protein folds in genomes.
Collapse
|
38
|
McGowen MR, Gatesy J, Wildman DE. Molecular evolution tracks macroevolutionary transitions in Cetacea. Trends Ecol Evol 2014; 29:336-46. [DOI: 10.1016/j.tree.2014.04.001] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2013] [Revised: 04/02/2014] [Accepted: 04/03/2014] [Indexed: 10/25/2022]
|
39
|
Matsuoka M, Kikuchi T. Sequence analysis on the information of folding initiation segments in ferredoxin-like fold proteins. BMC STRUCTURAL BIOLOGY 2014; 14:15. [PMID: 24884463 PMCID: PMC4055915 DOI: 10.1186/1472-6807-14-15] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Accepted: 05/15/2014] [Indexed: 02/06/2023]
Abstract
BACKGROUND While some studies have shown that the 3D protein structures are more conservative than their amino acid sequences, other experimental studies have shown that even if two proteins share the same topology, they may have different folding pathways. There are many studies investigating this issue with molecular dynamics or Go-like model simulations, however, one should be able to obtain the same information by analyzing the proteins' amino acid sequences, if the sequences contain all the information about the 3D structures. In this study, we use information about protein sequences to predict the location of their folding segments. We focus on proteins with a ferredoxin-like fold, which has a characteristic topology. Some of these proteins have different folding segments. RESULTS Despite the simplicity of our methods, we are able to correctly determine the experimentally identified folding segments by predicting the location of the compact regions considered to play an important role in structural formation. We also apply our sequence analyses to some homologues of each protein and confirm that there are highly conserved folding segments despite the homologues' sequence diversity. These homologues have similar folding segments even though the homology of two proteins' sequences is not so high. CONCLUSION Our analyses have proven useful for investigating the common or different folding features of the proteins studied.
Collapse
Affiliation(s)
| | - Takeshi Kikuchi
- Department of Bioinformatics, College of Life Sciences, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan.
| |
Collapse
|
40
|
Kepp KP, Dasmeh P. A model of proteostatic energy cost and its use in analysis of proteome trends and sequence evolution. PLoS One 2014; 9:e90504. [PMID: 24587382 PMCID: PMC3938754 DOI: 10.1371/journal.pone.0090504] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 02/03/2014] [Indexed: 12/25/2022] Open
Abstract
A model of proteome-associated chemical energetic costs of cells is derived from protein-turnover kinetics and protein folding. Minimization of the proteostatic maintenance cost can explain a range of trends of proteomes and combines both protein function, stability, size, proteostatic cost, temperature, resource availability, and turnover rates in one simple framework. We then explore the ansatz that the chemical energy remaining after proteostatic maintenance is available for reproduction (or cell division) and thus, proportional to organism fitness. Selection for lower proteostatic costs is then shown to be significant vs. typical effective population sizes of yeast. The model explains and quantifies evolutionary conservation of highly abundant proteins as arising both from functional mutations and from changes in other properties such as stability, cost, or turnover rates. We show that typical hypomorphic mutations can be selected against due to increased cost of compensatory protein expression (both in the mutated gene and in related genes, i.e. epistasis) rather than compromised function itself, although this compensation depends on the protein's importance. Such mutations exhibit larger selective disadvantage in abundant, large, synthetically costly, and/or short-lived proteins. Selection against increased turnover costs of less stable proteins rather than misfolding toxicity per se can explain equilibrium protein stability distributions, in agreement with recent findings in E. coli. The proteostatic selection pressure is stronger at low metabolic rates (i.e. scarce environments) and in hot habitats, explaining proteome adaptations towards rough environments as a question of energy. The model may also explain several trade-offs observed in protein evolution and suggests how protein properties can coevolve to maintain low proteostatic cost.
Collapse
Affiliation(s)
- Kasper P. Kepp
- Department of Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
- * E-mail:
| | - Pouria Dasmeh
- Department of Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
| |
Collapse
|
41
|
Kepp KP. Computing stability effects of mutations in human superoxide dismutase 1. J Phys Chem B 2014; 118:1799-812. [PMID: 24472010 DOI: 10.1021/jp4119138] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Protein stability is affected in several diseases and is of substantial interest in efforts to correlate genotypes to phenotypes. Superoxide dismutase 1 (SOD1) is a suitable test case for such correlations due to its abundance, stability, available crystal structures and thermochemical data, and physiological importance. In this work, stability changes of SOD1 mutations were computed with five methods, CUPSAT, I-Mutant2.0, I-Mutant3.0, PoPMuSiC, and SDM, with emphasis on structural sensitivity as a potential issue in structure-based protein calculation. The large correlation between experimental literature data of SOD1 dimers and monomers (r = 0.82) suggests that mutations in separate protein monomers are mostly additive. PoPMuSiC was most accurate (typical MAE ~ 1 kcal/mol, r ~ 0.5). The relative performance of the methods was not very structure-dependent, and the more accurate methods also displayed less structural sensitivity, with the standard deviation from different high-resolution structures down to ~0.2 kcal/mol. Structures of variable resolution and number of protein copies locally affected specific sites, emphasizing the use of state-relevant crystal structures when such sites are of interest, but had little impact on overall batch estimates. Protein-interaction effects (as a mimic of crystal packing) were small for the more accurate methods. Thus, batch computations, relevant to, e.g., comparisons of disease/nondisease mutant sets or different clades in phylogenetic trees, are much more significant than single mutant calculations and may be the only meaningful way to computationally bridge the genotype-phenotype gap of proteomics. Finally, mutations involving glycine were most difficult to model, of relevance to future method improvement. This could be due to structure changes (glycine has a low structural propensity) or water colocalization with glycine.
Collapse
Affiliation(s)
- Kasper P Kepp
- DTU Chemistry, Technical University of Denmark , DK 2800 Kongens Lyngby, Denmark
| |
Collapse
|
42
|
Unfolding simulations of holomyoglobin from four mammals: identification of intermediates and β-sheet formation from partially unfolded states. PLoS One 2013; 8:e80308. [PMID: 24386077 PMCID: PMC3873898 DOI: 10.1371/journal.pone.0080308] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Accepted: 10/11/2013] [Indexed: 01/15/2023] Open
Abstract
Myoglobin (Mb) is a centrally important, widely studied mammalian protein. While much work has investigated multi-step unfolding of apoMb using acid or denaturant, holomyoglobin unfolding is poorly understood despite its biological relevance. We present here the first systematic unfolding simulations of holoMb and the first comparative study of unfolding of protein orthologs from different species (sperm whale, pig, horse, and harbor seal). We also provide new interpretations of experimental mean molecular ellipticities of myoglobin intermediates, notably correcting for random coil and number of helices in intermediates. The simulated holoproteins at 310 K displayed structures and dynamics in agreement with crystal structures (R g ~1.48-1.51 nm, helicity ~75%). At 400 K, heme was not lost, but some helix loss was observed in pig and horse, suggesting that these helices are less stable in terrestrial species. At 500 K, heme was lost within 1.0-3.7 ns. All four proteins displayed exponentially decaying helix structure within 20 ns. The C- and F-helices were lost quickly in all cases. Heme delayed helix loss, and sperm whale myoglobin exhibited highest retention of heme and D/E helices. Persistence of conformation (RMSD), secondary structure, and ellipticity between 2-11 ns was interpreted as intermediates of holoMb unfolding in all four species. The intermediates resemble those of apoMb notably in A and H helices, but differ substantially in the D-, E- and F-helices, which interact with heme. The identified mechanisms cast light on the role of metal/cofactor in poorly understood holoMb unfolding. We also observed β-sheet formation of several myoglobins at 500 K as seen experimentally, occurring after disruption of helices to a partially unfolded, globally disordered state; heme reduced this tendency and sperm-whale did not display any sheet propensity during the simulations.
Collapse
|
43
|
Nery MF, Arroyo JI, Opazo JC. Accelerated Evolutionary Rate of the Myoglobin Gene in Long-Diving Whales. J Mol Evol 2013; 76:380-7. [DOI: 10.1007/s00239-013-9572-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 06/29/2013] [Indexed: 10/26/2022]
|