1
|
Usmanova DR, Plata G, Vitkup D. Functional Optimization in Distinct Tissues and Conditions Constrains the Rate of Protein Evolution. Mol Biol Evol 2024; 41:msae200. [PMID: 39431545 PMCID: PMC11523136 DOI: 10.1093/molbev/msae200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 07/29/2024] [Accepted: 08/05/2024] [Indexed: 10/22/2024] Open
Abstract
Understanding the main determinants of protein evolution is a fundamental challenge in biology. Despite many decades of active research, the molecular and cellular mechanisms underlying the substantial variability of evolutionary rates across cellular proteins are not currently well understood. It also remains unclear how protein molecular function is optimized in the context of multicellular species and why many proteins, such as enzymes, are only moderately efficient on average. Our analysis of genomics and functional datasets reveals in multiple organisms a strong inverse relationship between the optimality of protein molecular function and the rate of protein evolution. Furthermore, we find that highly expressed proteins tend to be substantially more functionally optimized. These results suggest that cellular expression costs lead to more pronounced functional optimization of abundant proteins and that the purifying selection to maintain high levels of functional optimality significantly slows protein evolution. We observe that in multicellular species both the rate of protein evolution and the degree of protein functional efficiency are primarily affected by expression in several distinct cell types and tissues, specifically, in developed neurons with upregulated synaptic processes in animals and in young and fast-growing tissues in plants. Overall, our analysis reveals how various constraints from the molecular, cellular, and species' levels of biological organization jointly affect the rate of protein evolution and the level of protein functional adaptation.
Collapse
Affiliation(s)
- Dinara R Usmanova
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Germán Plata
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- BiomEdit, Fishers, IN 46037, USA
| | - Dennis Vitkup
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
2
|
Potera K, Tomala K. Using yeasts for the studies of nonfunctional factors in protein evolution. Yeast 2024; 41:529-536. [PMID: 38895906 DOI: 10.1002/yea.3970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 05/08/2024] [Accepted: 06/06/2024] [Indexed: 06/21/2024] Open
Abstract
The evolution of protein sequence is driven not only by factors directly related to protein function and shape but also by nonfunctional factors. Such factors in protein evolution might be categorized as those connected to energetic costs, synthesis efficiency, and avoidance of misfolding and toxicity. A common approach to studying them is correlational analysis contrasting them with some characteristics of the protein, like amino acid composition, but these features are interdependent. To avoid possible bias, empirical studies are needed, and not enough work has been done to date. In this review, we describe the role of nonfunctional factors in protein evolution and present an experimental approach using yeast as a suitable model organism. The focus of the proposed approach is on the potential negative impact on the fitness of mutations that change protein properties not related to function and the frequency of mutations that change these properties. Experimental results of testing the misfolding avoidance hypothesis as an explanation for why highly expressed proteins evolve slowly are inconsistent with correlational research results. Therefore, more efforts should be made to empirically test the effects of nonfunctional factors in protein evolution and to contrast these results with the results of the correlational analysis approach.
Collapse
Affiliation(s)
- Katarzyna Potera
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Krakow, Poland
| | - Katarzyna Tomala
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| |
Collapse
|
3
|
Mehlhoff JD, Ostermeier M. Genes Vary Greatly in Their Propensity for Collateral Fitness Effects of Mutations. Mol Biol Evol 2023; 40:7043719. [PMID: 36798991 PMCID: PMC9999109 DOI: 10.1093/molbev/msad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 01/18/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023] Open
Abstract
Mutations can have deleterious fitness effects when they decrease protein specific activity or decrease active protein abundance. Mutations will also be deleterious when they cause misfolding or misinteractions that are toxic to the cell (i.e., independent of whether the mutations affect specific activity and abundance). The extent to which protein evolution is shaped by these and other collateral fitness effects is unclear in part because little is known of their frequency and magnitude. Using deep mutational scanning (DMS), we previously found at least 42% of missense mutations in the TEM-1 β-lactamase antibiotic resistance gene cause deleterious collateral fitness effects. Here, we used DMS to comprehensively determine the collateral fitness effects of missense mutations in three genes encoding the antibiotic resistance proteins New Delhi metallo-β-lactamase (NDM-1), chloramphenicol acetyltransferase I (CAT-I), and 2″-aminoglycoside nucleotidyltransferase (AadB). AadB (20%), CAT-I (0.9%), and NDM-1 (0.2%) were less susceptible to deleterious collateral fitness effects than TEM-1 (42%) indicating that genes have different propensities for these effects. As was observed with TEM-1, all the studied deleterious aadB mutants increased aggregation. However, aggregation did not correlate with collateral fitness effects for many of the deleterious mutants of CAT-I and NDM-1. Select deleterious mutants caused unexpected phenotypes to emerge. The introduction of internal start codons in CAT-1 caused loss of the episome and a mutation in aadB made its cognate antibiotic essential for growth. Our study illustrates how the complexity of the cell provides a rich environment for collateral fitness effects and new phenotypes to emerge.
Collapse
Affiliation(s)
- Jacob D Mehlhoff
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
4
|
Bédard C, Cisneros AF, Jordan D, Landry CR. Correlation between protein abundance and sequence conservation: what do recent experiments say? Curr Opin Genet Dev 2022; 77:101984. [PMID: 36162152 DOI: 10.1016/j.gde.2022.101984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 08/23/2022] [Accepted: 08/26/2022] [Indexed: 01/27/2023]
Abstract
Cells evolve in a space of parameter values set by physical and chemical forces. These constraints create associations among cellular properties. A particularly strong association is the negative correlation between the rate of evolution of proteins and their abundance in the cell. Highly expressed proteins evolve slower than lowly expressed ones. Multiple hypotheses have been put forward to explain this relationship, including, for instance, the requirement for higher mRNA stability, misfolding avoidance, and misinteraction avoidance for highly expressed proteins. Here, we review some of these hypotheses, their predictions, and how they are supported to finally discuss recent experiments that have been performed to test these predictions.
Collapse
Affiliation(s)
- Camille Bédard
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada. https://twitter.com/@CamilleBed17
| | - Angel F Cisneros
- Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada. https://twitter.com/@AngelFCC119
| | - David Jordan
- Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada. https://twitter.com/@DavidJordan1997
| | - Christian R Landry
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada.
| |
Collapse
|
5
|
Palenchar PM. The Influence of Codon Usage, Protein Abundance, and Protein Stability on Protein Evolution Vary by Evolutionary Distance and the Type of Protein. Protein J 2022; 41:216-229. [PMID: 35147896 DOI: 10.1007/s10930-022-10045-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2022] [Indexed: 12/01/2022]
Abstract
In general, the evolutionary rate of proteins is not primarily related to protein and amino acid functions, and factors such as protein abundance, codon usage, and the protein's TM are more important. To better understand the factors that affect protein evolution, E. coli MG1655 orthologs were compared to those in closely related bacteria and to more distantly related prokaryotes, eukaryotes, and archaea. Also, the evolution of different types of proteins was studied. The analyses indicate that the amino acid conservation of enzymes that do not use macromolecules (e.g. DNA, RNA, and proteins) as substrates and that carry out metabolic processes involving small molecules (i.e. small molecule enzymes) is different than other enzymes. For example, the small molecule enzymes have a lower percent identity than other enzymes when sequences from closely related bacteria are compared. Analyses indicate the lower percent identity is not a result of the amino acid or codon usage of the small molecule enzymes. The small molecule enzymes also don't have a significantly lower protein abundance indicating that is also not likely an important factor driving differences in amino acid conservation. Analyses indicate different methods to measure the TM of proteins have different relationships between amino acid conservation over different evolutionary distances. In totality, the results demonstrate that the relationship between the factors thought to affect protein evolution (protein abundance, codon usage, and proteins TMs) and protein evolution are complex and depend on the factor, the organisms, and the type of proteins being analyzed.
Collapse
Affiliation(s)
- Peter M Palenchar
- Department of Chemistry, Villanova University, 800 E. Lancaster Ave, Villanova, PA, 19805, USA.
| |
Collapse
|
6
|
Latrille T, Lartillot N. Quantifying the impact of changes in effective population size and expression level on the rate of coding sequence evolution. Theor Popul Biol 2021; 142:57-66. [PMID: 34563555 DOI: 10.1016/j.tpb.2021.09.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 09/08/2021] [Accepted: 09/11/2021] [Indexed: 02/07/2023]
Abstract
Molecular sequences are shaped by selection, where the strength of selection relative to drift is determined by effective population size (Ne). Populations with high Ne are expected to undergo stronger purifying selection, and consequently to show a lower substitution rate for selected mutations relative to the substitution rate for neutral mutations (ω). However, computational models based on biophysics of protein stability have suggested that ω can also be independent of Ne. Together, the response of ω to changes in Ne depends on the specific mapping from sequence to fitness. Importantly, an increase in protein expression level has been found empirically to result in decrease of ω, an observation predicted by theoretical models assuming selection for protein stability. Here, we derive a theoretical approximation for the response of ω to changes in Ne and expression level, under an explicit genotype-phenotype-fitness map. The method is generally valid for additive traits and log-concave fitness functions. We applied these results to protein undergoing selection for their conformational stability and corroborate out findings with simulations under more complex models. We predict a weak response of ω to changes in either Ne or expression level, which are interchangeable. Based on empirical data, we propose that fitness based on the conformational stability may not be a sufficient mechanism to explain the empirically observed variation in ω across species. Other aspects of protein biophysics might be explored, such as protein-protein interactions, which can lead to a stronger response of ω to changes in Ne.
Collapse
Affiliation(s)
- T Latrille
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR 5558, F-69622 Villeurbanne, France; École Normale Supérieure de Lyon, Université de Lyon, Université Lyon 1, Lyon, France.
| | - N Lartillot
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR 5558, F-69622 Villeurbanne, France
| |
Collapse
|
7
|
Pogoda E, Tutaj H, Pirog A, Tomala K, Korona R. Overexpression of a single ORF can extend chronological lifespan in yeast if retrograde signaling and stress response are stimulated. Biogerontology 2021; 22:415-427. [PMID: 34052951 PMCID: PMC8266792 DOI: 10.1007/s10522-021-09924-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 05/12/2021] [Indexed: 11/30/2022]
Abstract
Systematic collections of single-gene deletions have been invaluable in uncovering determinants of lifespan in yeast. Overexpression of a single gene does not have such a clear outcome as cancellation of its function but it can lead to a variety of imbalances, deregulations and compensations, and some of them could be important for longevity. We report an experiment in which a genome-wide collection of strains overexpressing a single gene was assayed for chronological lifespan (CLS). Only one group of proteins, those locating to the inner membrane and matrix of mitochondria, tended to extend CLS when abundantly overproduced. We selected two such strains—one overexpressing Qcr7 of the respiratory complex III, the other overexpressing Mrps28 of the small mitoribosomal subunit—and analyzed their transcriptomes. The uncovered shifts in RNA abundance in the two strains were nearly identical and highly suggestive. They implied a distortion in the co-translational assembly of respiratory complexes followed by retrograde signaling to the nucleus. The consequent reprogramming of the entire cellular metabolism towards the resistance to stress resulted in an enhanced ability to persist in a non-proliferating state. Our results show that surveillance of the inner mitochondrial membrane integrity is of outstanding importance for the cell. They also demonstrate that overexpression of single genes could be used effectively to elucidate the mitochondrion-nucleus crosstalk.
Collapse
Affiliation(s)
- Elzbieta Pogoda
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Cracow, Poland
| | - Hanna Tutaj
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Cracow, Poland
| | - Adrian Pirog
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Cracow, Poland
| | - Katarzyna Tomala
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Cracow, Poland
| | - Ryszard Korona
- Institute of Environmental Sciences, Jagiellonian University, Gronostajowa 7, 30-387, Cracow, Poland.
| |
Collapse
|
8
|
Wei C, Chen YM, Chen Y, Qian W. The Missing Expression Level-Evolutionary Rate Anticorrelation in Viruses Does Not Support Protein Function as a Main Constraint on Sequence Evolution. Genome Biol Evol 2021; 13:evab049. [PMID: 33713114 PMCID: PMC7989579 DOI: 10.1093/gbe/evab049] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/06/2021] [Indexed: 12/13/2022] Open
Abstract
One of the central goals in molecular evolutionary biology is to determine the sources of variation in the rate of sequence evolution among proteins. Gene expression level is widely accepted as the primary determinant of protein evolutionary rate, because it scales with the extent of selective constraints imposed on a protein, leading to the well-known negative correlation between expression level and protein evolutionary rate (the E-R anticorrelation). Selective constraints have been hypothesized to entail the maintenance of protein function, the avoidance of cytotoxicity caused by protein misfolding or nonspecific protein-protein interactions, or both. However, empirical tests evaluating the relative importance of these hypotheses remain scarce, likely due to the nontrivial difficulties in distinguishing the effect of a deleterious mutation on a protein's function versus its cytotoxicity. We realized that examining the sequence evolution of viral proteins could overcome this hurdle. It is because purifying selection against mutations in a viral protein that result in cytotoxicity per se is likely relaxed, whereas purifying selection against mutations that impair viral protein function persists. Multiple analyses of SARS-CoV-2 and nine other virus species revealed a complete absence of any E-R anticorrelation. As a control, the E-R anticorrelation does exist in human endogenous retroviruses where purifying selection against cytotoxicity is present. Taken together, these observations do not support the maintenance of protein function as the main constraint on protein sequence evolution in cellular organisms.
Collapse
Affiliation(s)
- Changshuo Wei
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yan-Ming Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ying Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Wenfeng Qian
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
9
|
Usmanova DR, Plata G, Vitkup D. The Relationship between the Misfolding Avoidance Hypothesis and Protein Evolutionary Rates in the Light of Empirical Evidence. Genome Biol Evol 2021; 13:6081017. [PMID: 33432359 PMCID: PMC7874998 DOI: 10.1093/gbe/evab006] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2021] [Indexed: 12/14/2022] Open
Abstract
For more than a decade, the misfolding avoidance hypothesis (MAH) and related theories have dominated evolutionary discussions aimed at explaining the variance of the molecular clock across cellular proteins. In this study, we use various experimental data to further investigate the consistency of the MAH predictions with empirical evidence. We also critically discuss experimental results that motivated the MAH development and that are often viewed as evidence of its major contribution to the variability of protein evolutionary rates. We demonstrate, in Escherichia coli and Homo sapiens, the lack of a substantial negative correlation between protein evolutionary rates and Gibbs free energies of unfolding, a direct measure of protein stability. We then analyze multiple new genome-scale data sets characterizing protein aggregation and interaction propensities, the properties that are likely optimized in evolution to alleviate deleterious effects associated with toxic protein misfolding and misinteractions. Our results demonstrate that the propensity of proteins to aggregate, the fraction of charged amino acids, and protein stickiness do correlate with protein abundances. Nevertheless, across multiple organisms and various data sets we do not observe substantial correlations between proteins’ aggregation- and stability-related properties and evolutionary rates. Therefore, diverse empirical data support the conclusion that the MAH and similar hypotheses do not play a major role in mediating a strong negative correlation between protein expression and the molecular clock, and thus in explaining the variability of evolutionary rates across cellular proteins.
Collapse
Affiliation(s)
- Dinara R Usmanova
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Germán Plata
- Department of Systems Biology, Columbia University, New York, NY, USA.,Elanco Animal Health, Greenfield, IN, USA
| | - Dennis Vitkup
- Department of Systems Biology, Columbia University, New York, NY, USA.,Department of Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|