1
|
Wang Y, Xu H, He Q, Wu Z, Han GZ. Natural Transposable Element Insertions Contribute to Host Fitness in Model Yeasts. Genome Biol Evol 2024; 16:evae193. [PMID: 39228319 PMCID: PMC11403283 DOI: 10.1093/gbe/evae193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 08/30/2024] [Accepted: 08/31/2024] [Indexed: 09/05/2024] Open
Abstract
Transposable elements (TEs) are ubiquitous in the eukaryote genomes, but their evolutionary and functional significance remains largely obscure and contentious. Here, we explore the evolution and functional impact of TEs in two model unicellular eukaryotes, the fission yeast Schizosaccharomyces pombe and the budding yeast Saccharomyces cerevisiae, which diverged around 330 to 420 million years ago. We analyze the distribution of LTR retrotransposons (LTR-RTs, the only TE order identified in both species) and their solo-LTR derivatives in 35 strains of S. pombe and 128 strains of S. cerevisiae. We find that natural LTR-RT and solo-LTR insertions exhibit high presence-absence polymorphism among individuals in both species. Population genetics analyses show that solo-LTR insertions experienced functional constraints similar to synonymous sites of host genes in both species, indicating a majority of solo-LTR insertions might have evolved in a neutral manner. When knocking out nine representative solo-LTR insertions separately in the S. pombe strain 972h- and 12 representative solo-LTR insertions separately in the S. cerevisiae strain S288C, we find that one solo-LTR insertion in S. pombe has a significant effect on the fitness and transcriptome of its host. Together, our findings indicate that a fraction of natural TE insertions likely shape their host transcriptomes and thereby contribute to their host fitness, with implications for understanding the functional significance of TEs in eukaryotes.
Collapse
Affiliation(s)
- Yan Wang
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Hao Xu
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Qinliu He
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Zhiwei Wu
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| | - Guan-Zhu Han
- College of Life Sciences, Nanjing Normal University, Nanjing, Jiangsu 210023, China
| |
Collapse
|
2
|
Metzger BPH, Park Y, Starr TN, Thornton JW. Epistasis facilitates functional evolution in an ancient transcription factor. eLife 2024; 12:RP88737. [PMID: 38767330 PMCID: PMC11105156 DOI: 10.7554/elife.88737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024] Open
Abstract
A protein's genetic architecture - the set of causal rules by which its sequence produces its functions - also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest - excluding the vast majority of possible genotypes and evolutionary trajectories - and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor's specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor's capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.
Collapse
Affiliation(s)
- Brian PH Metzger
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
| | - Yeonwoo Park
- Program in Genetics, Genomics, and Systems Biology, University of ChicagoChicagoUnited States
| | - Tyler N Starr
- Department of Biochemistry and Molecular Biophysics, University of ChicagoChicagoUnited States
| | - Joseph W Thornton
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
- Department of Human Genetics, University of ChicagoChicagoUnited States
| |
Collapse
|
3
|
Nkurikiyimfura O, Waheed A, Fang H, Yuan X, Chen L, Wang YP, Lu G, Zhan J, Yang L. Fitness difference between two synonymous mutations of Phytophthora infestans ATP6 gene. BMC Ecol Evol 2024; 24:36. [PMID: 38494489 PMCID: PMC10946160 DOI: 10.1186/s12862-024-02223-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Accepted: 03/11/2024] [Indexed: 03/19/2024] Open
Abstract
BACKGROUND Sequence variation produced by mutation provides the ultimate source of natural selection for species adaptation. Unlike nonsynonymous mutation, synonymous mutations are generally considered to be selectively neutral but accumulating evidence suggests they also contribute to species adaptation by regulating the flow of genetic information and the development of functional traits. In this study, we analysed sequence characteristics of ATP6, a housekeeping gene from 139 Phytophthora infestans isolates, and compared the fitness components including metabolic rate, temperature sensitivity, aggressiveness, and fungicide tolerance among synonymous mutations. RESULTS We found that the housekeeping gene exhibited low genetic variation and was represented by two major synonymous mutants at similar frequency (0.496 and 0.468, respectively). The two synonymous mutants were generated by a single nucleotide substitution but differed significantly in fitness as well as temperature-mediated spatial distribution and expression. The synonymous mutant ending in AT was more common in cold regions and was more expressed at lower experimental temperature than the synonymous mutant ending in GC and vice versa. CONCLUSION Our results are consistent with the argument that synonymous mutations can modulate the adaptive evolution of species including pathogens and have important implications for sustainable disease management, especially under climate change.
Collapse
Affiliation(s)
- Oswald Nkurikiyimfura
- Institute of Plant Virology, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Abdul Waheed
- Institute of Plant Virology, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Hanmei Fang
- Institute of Plant Virology, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Xiaoxian Yuan
- Institute of Plant Virology, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Lixia Chen
- Fujian Key Laboratory on Conservation and Sustainable Utilization of Marine Biodiversity, Fuzhou Institute of Oceanography, Minjiang University, Fuzhou, 350108, China
- College of Resources and Environment, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Yan-Ping Wang
- College of Chemistry and Life Sciences, Sichuan Provincial Key Laboratory for Development and Utilization of Characteristic Horticultural Biological Resources, Chengdu Normal University, Chengdu, Sichuan, 611130, China
| | - Guodong Lu
- Department of Plant Pathology, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
| | - Jiasui Zhan
- Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Uppsala, 75007, Sweden.
| | - Lina Yang
- Fujian Key Laboratory on Conservation and Sustainable Utilization of Marine Biodiversity, Fuzhou Institute of Oceanography, Minjiang University, Fuzhou, 350108, China.
| |
Collapse
|
4
|
Tsimenidis S, Vrochidou E, Papakostas GA. Omics Data and Data Representations for Deep Learning-Based Predictive Modeling. Int J Mol Sci 2022; 23:12272. [PMID: 36293133 PMCID: PMC9603455 DOI: 10.3390/ijms232012272] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/03/2022] [Accepted: 10/12/2022] [Indexed: 11/25/2022] Open
Abstract
Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data stream since it has advanced quickly with there being successive innovations. However, an obstacle to scientific progress emerges: the difficulty of applying DL to biology, and this because both fields are evolving at a breakneck pace, thus making it hard for an individual to occupy the front lines of both of them. This paper aims to bridge the gap and help computer scientists bring their valuable expertise into the life sciences. This work provides an overview of the most common types of biological data and data representations that are used to train DL models, with additional information on the models themselves and the various tasks that are being tackled. This is the essential information a DL expert with no background in biology needs in order to participate in DL-based research projects in biomedicine, biotechnology, and drug discovery. Alternatively, this study could be also useful to researchers in biology to understand and utilize the power of DL to gain better insights into and extract important information from the omics data.
Collapse
Affiliation(s)
| | | | - George A. Papakostas
- MLV Research Group, Department of Computer Science, International Hellenic University, 65404 Kavala, Greece
| |
Collapse
|
5
|
Chandra S, Gupta K, Khare S, Kohli P, Asok A, Mohan SV, Gowda H, Varadarajan R. The High Mutational Sensitivity of ccdA Antitoxin Is Linked to Codon Optimality. Mol Biol Evol 2022; 39:msac187. [PMID: 36069948 PMCID: PMC9555053 DOI: 10.1093/molbev/msac187] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Deep mutational scanning studies suggest that synonymous mutations are typically silent and that most exposed, nonactive-site residues are tolerant to mutations. Here, we show that the ccdA antitoxin component of the Escherichia coli ccdAB toxin-antitoxin system is unusually sensitive to mutations when studied in the operonic context. A large fraction (∼80%) of single-codon mutations, including many synonymous mutations in the ccdA gene shows inactive phenotype, but they retain native-like binding affinity towards cognate toxin, CcdB. Therefore, the observed phenotypic effects are largely not due to alterations in protein structure/stability, consistent with a large region of CcdA being intrinsically disordered. E. coli codon preference and strength of ribosome-binding associated with translation of downstream ccdB gene are found to be major contributors of the observed ccdA mutant phenotypes. In select cases, proteomics studies reveal altered ratios of CcdA:CcdB protein levels in vivo, suggesting that the ccdA mutations likely alter relative translation efficiencies of the two genes in the operon. We extend these results by studying single-site synonymous mutations that lead to loss of function phenotypes in the relBE operon upon introduction of rarer codons. Thus, in their operonic context, genes are likely to be more sensitive to both synonymous and nonsynonymous point mutations than inferred previously.
Collapse
Affiliation(s)
- Soumyanetra Chandra
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Kritika Gupta
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Shruti Khare
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Pehu Kohli
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Aparna Asok
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | | | - Harsha Gowda
- Institute of Bioinformatics, Bangalore 560100, India
| | | |
Collapse
|
6
|
Daron J, Bravo IG. Variability in Codon Usage in Coronaviruses Is Mainly Driven by Mutational Bias and Selective Constraints on CpG Dinucleotide. Viruses 2021; 13:v13091800. [PMID: 34578381 PMCID: PMC8473333 DOI: 10.3390/v13091800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/18/2022] Open
Abstract
The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.
Collapse
Affiliation(s)
- Josquin Daron
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Correspondence:
| | - Ignacio G. Bravo
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Center for Research on the Ecology and Evolution of Diseases (CREES), 34394 Montpellier, France
| |
Collapse
|
7
|
Dench J, Hinz A, Aris‐Brosou S, Kassen R. Identifying the drivers of computationally detected correlated evolution among sites under antibiotic selection. Evol Appl 2020; 13:781-793. [PMID: 32211067 PMCID: PMC7086105 DOI: 10.1111/eva.12900] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 10/02/2019] [Accepted: 11/14/2019] [Indexed: 11/29/2022] Open
Abstract
The ultimate causes of correlated evolution among sites in a genome remain difficult to tease apart. To address this problem directly, we performed a high-throughput search for correlated evolution among sites associated with resistance to a fluoroquinolone antibiotic using whole-genome data from clinical strains of Pseudomonas aeruginosa, before validating our computational predictions experimentally. We show that for at least two sites, this correlation is underlain by epistasis. Our analysis also revealed eight additional pairs of synonymous substitutions displaying correlated evolution underlain by physical linkage, rather than selection associated with antibiotic resistance. Our results provide direct evidence that both epistasis and physical linkage among sites can drive the correlated evolution identified by high-throughput computational tools. In other words, the observation of correlated evolution is not by itself sufficient evidence to guarantee that the sites in question are epistatic; such a claim requires additional evidence, ideally coming from direct estimates of epistasis, based on experimental evidence.
Collapse
Affiliation(s)
- Jonathan Dench
- Department of BiologyUniversity of OttawaOttawaOntarioCanada
| | - Aaron Hinz
- Department of BiologyUniversity of OttawaOttawaOntarioCanada
| | - Stéphane Aris‐Brosou
- Department of BiologyUniversity of OttawaOttawaOntarioCanada
- Department of Mathematics and StatisticsUniversity of OttawaOttawaOntarioCanada
| | - Rees Kassen
- Department of BiologyUniversity of OttawaOttawaOntarioCanada
| |
Collapse
|
8
|
Flynn JM, Rossouw A, Cote-Hammarlof P, Fragata I, Mavor D, Hollins C, Bank C, Bolon DN. Comprehensive fitness maps of Hsp90 show widespread environmental dependence. eLife 2020; 9:53810. [PMID: 32129763 PMCID: PMC7069724 DOI: 10.7554/elife.53810] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Accepted: 03/03/2020] [Indexed: 12/29/2022] Open
Abstract
Gene-environment interactions have long been theorized to influence molecular evolution. However, the environmental dependence of most mutations remains unknown. Using deep mutational scanning, we engineered yeast with all 44,604 single codon changes encoding 14,160 amino acid variants in Hsp90 and quantified growth effects under standard conditions and under five stress conditions. To our knowledge, these are the largest determined comprehensive fitness maps of point mutants. The growth of many variants differed between conditions, indicating that environment can have a large impact on Hsp90 evolution. Multiple variants provided growth advantages under individual conditions; however, these variants tended to exhibit growth defects in other environments. The diversity of Hsp90 sequences observed in extant eukaryotes preferentially contains variants that supported robust growth under all tested conditions. Rather than favoring substitutions in individual conditions, the long-term selective pressure on Hsp90 may have been that of fluctuating environments, leading to robustness under a variety of conditions.
Collapse
Affiliation(s)
- Julia M Flynn
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Ammeret Rossouw
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Pamela Cote-Hammarlof
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Inês Fragata
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | - David Mavor
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Carl Hollins
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Claudia Bank
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | - Daniel Na Bolon
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| |
Collapse
|
9
|
Kemble H, Nghe P, Tenaillon O. Recent insights into the genotype-phenotype relationship from massively parallel genetic assays. Evol Appl 2019; 12:1721-1742. [PMID: 31548853 PMCID: PMC6752143 DOI: 10.1111/eva.12846] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/21/2019] [Accepted: 07/02/2019] [Indexed: 12/20/2022] Open
Abstract
With the molecular revolution in Biology, a mechanistic understanding of the genotype-phenotype relationship became possible. Recently, advances in DNA synthesis and sequencing have enabled the development of deep mutational scanning assays, capable of scoring comprehensive libraries of genotypes for fitness and a variety of phenotypes in massively parallel fashion. The resulting empirical genotype-fitness maps pave the way to predictive models, potentially accelerating our ability to anticipate the behaviour of pathogen and cancerous cell populations from sequencing data. Besides from cellular fitness, phenotypes of direct application in industry (e.g. enzyme activity) and medicine (e.g. antibody binding) can be quantified and even selected directly by these assays. This review discusses the technological basis of and recent developments in massively parallel genetics, along with the trends it is uncovering in the genotype-phenotype relationship (distribution of mutation effects, epistasis), their possible mechanistic bases and future directions for advancing towards the goal of predictive genetics.
Collapse
Affiliation(s)
- Harry Kemble
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Philippe Nghe
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Olivier Tenaillon
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
| |
Collapse
|
10
|
Lebeuf-Taylor E, McCloskey N, Bailey SF, Hinz A, Kassen R. The distribution of fitness effects among synonymous mutations in a gene under directional selection. eLife 2019; 8:45952. [PMID: 31322500 PMCID: PMC6692132 DOI: 10.7554/elife.45952] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2019] [Accepted: 07/18/2019] [Indexed: 12/21/2022] Open
Abstract
The fitness effects of synonymous mutations, nucleotide changes that do not alter the encoded amino acid, have often been assumed to be neutral, but a growing body of evidence suggests otherwise. We used site-directed mutagenesis coupled with direct measures of competitive fitness to estimate the distribution of fitness effects among synonymous mutations for a gene under directional selection and capable of adapting via synonymous nucleotide changes. Synonymous mutations had highly variable fitness effects, both deleterious and beneficial, resembling those of nonsynonymous mutations in the same gene. This variation in fitness was underlain by changes in transcription linked to the creation of internal promoter sites. A positive correlation between fitness and the presence of synonymous substitutions across a phylogeny of related Pseudomonads suggests these mutations may be common in nature. Taken together, our results provide the most compelling evidence to date that synonymous mutations with non-neutral fitness effects may in fact be commonplace.
Collapse
Affiliation(s)
| | - Nick McCloskey
- Department of Biology, University of Ottawa, Ottawa, Canada
| | - Susan F Bailey
- Department of Biology, Clarkson University, Potsdam, United States
| | - Aaron Hinz
- Department of Biology, University of Ottawa, Ottawa, Canada
| | - Rees Kassen
- Department of Biology, University of Ottawa, Ottawa, Canada
| |
Collapse
|
11
|
LaBella AL, Opulente DA, Steenwyk JL, Hittinger CT, Rokas A. Variation and selection on codon usage bias across an entire subphylum. PLoS Genet 2019; 15:e1008304. [PMID: 31365533 PMCID: PMC6701816 DOI: 10.1371/journal.pgen.1008304] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 08/20/2019] [Accepted: 07/11/2019] [Indexed: 01/04/2023] Open
Abstract
Variation in synonymous codon usage is abundant across multiple levels of organization: between codons of an amino acid, between genes in a genome, and between genomes of different species. It is now well understood that variation in synonymous codon usage is influenced by mutational bias coupled with both natural selection for translational efficiency and genetic drift, but how these processes shape patterns of codon usage bias across entire lineages remains unexplored. To address this question, we used a rich genomic data set of 327 species that covers nearly one third of the known biodiversity of the budding yeast subphylum Saccharomycotina. We found that, while genome-wide relative synonymous codon usage (RSCU) for all codons was highly correlated with the GC content of the third codon position (GC3), the usage of codons for the amino acids proline, arginine, and glycine was inconsistent with the neutral expectation where mutational bias coupled with genetic drift drive codon usage. Examination between genes' effective numbers of codons and their GC3 contents in individual genomes revealed that nearly a quarter of genes (381,174/1,683,203; 23%), as well as most genomes (308/327; 94%), significantly deviate from the neutral expectation. Finally, by evaluating the imprint of translational selection on codon usage, measured as the degree to which genes' adaptiveness to the tRNA pool were correlated with selective pressure, we show that translational selection is widespread in budding yeast genomes (264/327; 81%). These results suggest that the contribution of translational selection and drift to patterns of synonymous codon usage across budding yeasts varies across codons, genes, and genomes; whereas drift is the primary driver of global codon usage across the subphylum, the codon bias of large numbers of genes in the majority of genomes is influenced by translational selection.
Collapse
Affiliation(s)
- Abigail L. LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Dana A. Opulente
- Laboratory of Genetics, Genome Center of Wisconsin, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin–Madison, Wisconsin, United States of America
| | - Jacob L. Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Chris Todd Hittinger
- Laboratory of Genetics, Genome Center of Wisconsin, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin–Madison, Wisconsin, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
12
|
Soh YS, Moncla LH, Eguia R, Bedford T, Bloom JD. Comprehensive mapping of adaptation of the avian influenza polymerase protein PB2 to humans. eLife 2019; 8:45079. [PMID: 31038123 PMCID: PMC6491042 DOI: 10.7554/elife.45079] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 03/31/2019] [Indexed: 12/11/2022] Open
Abstract
Viruses like influenza are infamous for their ability to adapt to new hosts. Retrospective studies of natural zoonoses and passaging in the lab have identified a modest number of host-adaptive mutations. However, it is unclear if these mutations represent all ways that influenza can adapt to a new host. Here we take a prospective approach to this question by completely mapping amino-acid mutations to the avian influenza virus polymerase protein PB2 that enhance growth in human cells. We identify numerous previously uncharacterized human-adaptive mutations. These mutations cluster on PB2’s surface, highlighting potential interfaces with host factors. Some previously uncharacterized adaptive mutations occur in avian-to-human transmission of H7N9 influenza, showing their importance for natural virus evolution. But other adaptive mutations do not occur in nature because they are inaccessible via single-nucleotide mutations. Overall, our work shows how selection at key molecular surfaces combines with evolutionary accessibility to shape viral host adaptation. Viruses copy themselves by hijacking the cells of an infected host, but this comes with some limitations. Cells from different species have different molecular machinery and so viruses often have to specialize to a narrow group of species. This specialization consists largely of fine-tuning the way that viral proteins interact with host proteins. For instance, in bird flu viruses, a protein known as PB2 does not interact well with the machinery in human cells. Because PB2 proteins form part of the viral polymerase (the structure that copies the viral genome), this prevents bird flu viruses from replicating efficiently in humans. Sometimes however, changes in the PB2 protein allow bird flu viruses to better replicate in humans, potentially leading to deadly flu pandemics. To understand exactly how this happens, researchers have previously used two approaches: examining the changes that have happened in past flu viruses, and monitoring the evolution of bird flu viruses grown in human cells in the lab. However, these approaches can only look at a small number of the many possible genetic changes to the virus. This makes it hard to anticipate the new ways that flu might adapt to human cells in the future. To overcome this problem, Soh et al. systematically created all of the single changes to the bird flu PB2, altering every element of the protein sequence one-by-one. They then tested which of the changes to PB2 helped the virus grow better in human cells. The modifications that made the viruses thrive were on the surface of the protein, suggesting that they might improve interaction with the cell machinery of the host. Some changes have been found in bird flu viruses that have recently jumped into humans in nature, although fortunately none of these viruses have yet spread widely to cause a pandemic. Many factors affect the evolution of viruses, and their ability to infect new species. Understanding which changes in proteins help these microbes adapt to new hosts is an important element that scientists could consider to assess future risks of pandemics.
Collapse
Affiliation(s)
- Yq Shirleen Soh
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Louise H Moncla
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Rachel Eguia
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Trevor Bedford
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Jesse D Bloom
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Howard Hughes Medical Institute, Seattle, United States
| |
Collapse
|
13
|
Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol 2019; 34:69-82. [DOI: 10.1016/j.tree.2018.10.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 10/16/2018] [Accepted: 10/17/2018] [Indexed: 01/28/2023]
|