1
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. G3 (BETHESDA, MD.) 2024; 14:jkae031. [PMID: 38365205 PMCID: PMC11090462 DOI: 10.1093/g3journal/jkae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/10/2023] [Accepted: 01/29/2024] [Indexed: 02/18/2024]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in nonmodel species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to nonmodel genomes. We apply ABC-MK to the human proteome and a set of known virus interacting proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| |
Collapse
|
2
|
Salazar-Tortosa DF, Huang YF, Enard D. Assessing the Presence of Recent Adaptation in the Human Genome With Mixture Density Regression. Genome Biol Evol 2023; 15:evad170. [PMID: 37713622 PMCID: PMC10563788 DOI: 10.1093/gbe/evad170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Revised: 08/30/2023] [Accepted: 09/04/2023] [Indexed: 09/17/2023] Open
Abstract
How much genome differences between species reflect neutral or adaptive evolution is a central question in evolutionary genomics. In humans and other mammals, the presence of adaptive versus neutral genomic evolution has proven particularly difficult to quantify. The difficulty notably stems from the highly heterogeneous organization of mammalian genomes at multiple levels (functional sequence density, recombination, etc.) which complicates the interpretation and distinction of adaptive versus neutral evolution signals. In this study, we introduce mixture density regressions (MDRs) for the study of the determinants of recent adaptation in the human genome. MDRs provide a flexible regression model based on multiple Gaussian distributions. We use MDRs to model the association between recent selection signals and multiple genomic factors likely to affect the occurrence/detection of positive selection, if the latter was present in the first place to generate these associations. We find that an MDR model with two Gaussian distributions provides an excellent fit to the genome-wide distribution of a common sweep summary statistic (integrated haplotype score), with one of the two distributions likely enriched in positive selection. We further find several factors associated with signals of recent adaptation, including the recombination rate, the density of regulatory elements in immune cells, GC content, gene expression in immune cells, the density of mammal-wide conserved elements, and the distance to the nearest virus-interacting gene. These results support the presence of strong positive selection in recent human evolution and highlight MDRs as a powerful tool to make sense of signals of recent genomic adaptation.
Collapse
Affiliation(s)
- Diego F Salazar-Tortosa
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, USA
- Department of Ecology, University of Granada, Granada, Spain
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, State College, Pennsylvania, PA 16801, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, State College, Pennsylvania, PA 16801, USA
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, USA
| |
Collapse
|
3
|
Petak C, Frati L, Brennan RS, Pespeni MH. Whole-Genome Sequencing Reveals That Regulatory and Low Pleiotropy Variants Underlie Local Adaptation to Environmental Variability in Purple Sea Urchins. Am Nat 2023; 202:571-586. [PMID: 37792925 DOI: 10.1086/726013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]
Abstract
AbstractOrganisms experience environments that vary across both space and time. Such environmental heterogeneity shapes standing genetic variation and may influence species' capacity to adapt to rapid environmental change. However, we know little about the kind of genetic variation that is involved in local adaptation to environmental variability. To address this gap, we sequenced the whole genomes of 140 purple sea urchins (Strongylocentrotus purpuratus) from seven populations that vary in their degree of pH variability. Despite no evidence of global population structure, we found a suite of single-nucleotide polymorphisms (SNPs) tightly correlated with local pH variability (outlier SNPs), which were overrepresented in regions putatively involved in gene regulation (long noncoding RNA and enhancers), supporting the idea that variation in regulatory regions is important for local adaptation to variability. In addition, outliers in genes were found to be (i) enriched for biomineralization and ion homeostasis functions related to low pH response, (ii) less central to the protein-protein interaction network, and (iii) underrepresented among genes highly expressed during early development. Taken together, these results suggest that loci that underlie local adaptation to pH variability in purple sea urchins fall in regions with potentially low pleiotropic effects (based on analyses involving regulatory regions, network centrality, and expression time) involved in low pH response (based on functional enrichment).
Collapse
|
4
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.29.555322. [PMID: 37693550 PMCID: PMC10491248 DOI: 10.1101/2023.08.29.555322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in non-model species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to non-model genomes. We apply ABC-MK to the human proteome and a set of known Virus Interacting Proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| |
Collapse
|
5
|
Cutter AD. Speciation and development. Evol Dev 2023; 25:289-327. [PMID: 37545126 DOI: 10.1111/ede.12454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 06/13/2023] [Accepted: 07/20/2023] [Indexed: 08/08/2023]
Abstract
Understanding general principles about the origin of species remains one of the foundational challenges in evolutionary biology. The genomic divergence between groups of individuals can spawn hybrid inviability and hybrid sterility, which presents a tantalizing developmental problem. Divergent developmental programs may yield either conserved or divergent phenotypes relative to ancestral traits, both of which can be responsible for reproductive isolation during the speciation process. The genetic mechanisms of developmental evolution involve cis- and trans-acting gene regulatory change, protein-protein interactions, genetic network structures, dosage, and epigenetic regulation, all of which also have roots in population genetic and molecular evolutionary processes. Toward the goal of demystifying Darwin's "mystery of mysteries," this review integrates microevolutionary concepts of genetic change with principles of organismal development, establishing explicit links between population genetic process and developmental mechanisms in the production of macroevolutionary pattern. This integration aims to establish a more unified view of speciation that binds process and mechanism.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
6
|
Padró J, De Panis DN, Luisi P, Dopazo H, Szajnman S, Hasson E, Soto IM. Ortholog genes from cactophilic Drosophila provide insight into human adaptation to hallucinogenic cacti. Sci Rep 2022; 12:13180. [PMID: 35915153 PMCID: PMC9343604 DOI: 10.1038/s41598-022-17118-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 07/20/2022] [Indexed: 11/18/2022] Open
Abstract
Cultural transformations of lifestyles and dietary practices have been key drivers of human evolution. However, while most of the evidence of genomic adaptations is related to the hunter-gatherer transition to agricultural societies, little is known on the influence of other major cultural manifestations. Shamanism is considered the oldest religion that predominated throughout most of human prehistory and still prevails in many indigenous populations. Several lines of evidence from ethno-archeological studies have demonstrated the continuity and importance of psychoactive plants in South American cultures. However, despite the well-known importance of secondary metabolites in human health, little is known about its role in the evolution of ethnic differences. Herein, we identified candidate genes of adaptation to hallucinogenic cactus in Native Andean populations with a long history of shamanic practices. We used genome-wide expression data from the cactophilic fly Drosophila buzzatii exposed to a hallucinogenic columnar cactus, also consumed by humans, to identify ortholog genes exhibiting adaptive footprints of alkaloid tolerance. Genomic analyses in human populations revealed a suite of ortholog genes evolving under recent positive selection in indigenous populations of the Central Andes. Our results provide evidence of selection in genetic variants related to alkaloids toxicity, xenobiotic metabolism, and neuronal plasticity in Aymara and Quechua populations, suggesting a possible process of gene-culture coevolution driven by religious practices.
Collapse
Affiliation(s)
- Julian Padró
- INIBIOMA-CONICET, Universidad Nacional del Comahue, Quintral 1250, R8400FRF, San Carlos de Bariloche, Argentina.
| | - Diego N De Panis
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Pierre Luisi
- Facultad de Filosofía y Humanidades, Universidad Nacional de Córdoba (FFyH-UNC), Córdoba, Argentina.,Microbial Paleogenomics Unit, Institut Pasteur, 25-28 Rue du Dr Roux, 75015, Paris, France
| | - Hernan Dopazo
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Sergio Szajnman
- Departamento de Química Orgánica and UMYMFOR (CONICET-FCEyN), Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Esteban Hasson
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Ignacio M Soto
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| |
Collapse
|
7
|
Chen Q, Yang H, Feng X, Chen Q, Shi S, Wu CI, He Z. Two decades of suspect evidence for adaptive molecular evolution – Negative selection confounding positive selection signals. Natl Sci Rev 2021; 9:nwab217. [PMID: 35663241 PMCID: PMC9154339 DOI: 10.1093/nsr/nwab217] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 11/21/2021] [Indexed: 11/21/2022] Open
Abstract
There has been a large literature in the last two decades affirming adaptive DNA sequence evolution between species. The main lines of evidence are from (i) the McDonald-Kreitman (MK) test, which compares divergence and polymorphism data, and (ii) the phylogenetic analysis by maximum likelihood (PAML) test, which analyzes multispecies divergence data. Here, we apply these two tests concurrently to genomic data of Drosophila and Arabidopsis. To our surprise, the >100 genes identified by the two tests do not overlap beyond random expectation. Because the non-concordance could be due to low powers leading to high false negatives, we merge every 20–30 genes into a ‘supergene’. At the supergene level, the power of detection is large but the calls still do not overlap. We rule out methodological reasons for the non-concordance. In particular, extensive simulations fail to find scenarios whereby positive selection can only be detected by either MK or PAML, but not both. Since molecular evolution is governed by positive and negative selection concurrently, a fundamental assumption for estimating one of these (say, positive selection) is that the other is constant. However, in a broad survey of primates, birds, Drosophila and Arabidopsis, we found that negative selection rarely stays constant for long in evolution. As a consequence, the variation in negative selection is often misconstrued as a signal of positive selection. In conclusion, MK, PAML and any method that examines genomic sequence evolution has to explicitly address the variation in negative selection before estimating positive selection. In a companion study, we propose a possible path forward in two stages—first, by mapping out the changes in negative selection and then using this map to estimate positive selection. For now, the large literature on positive selection between species has to await reassessment.
Collapse
Affiliation(s)
- Qipian Chen
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Hao Yang
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Xiao Feng
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Qingjian Chen
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Suhua Shi
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Chung-I Wu
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| | - Ziwen He
- State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
8
|
Di C, Murga Moreno J, Salazar-Tortosa DF, Lauterbur ME, Enard D. Decreased recent adaptation at human mendelian disease genes as a possible consequence of interference between advantageous and deleterious variants. eLife 2021; 10:69026. [PMID: 34636724 PMCID: PMC8526059 DOI: 10.7554/elife.69026] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 10/02/2021] [Indexed: 11/27/2022] Open
Abstract
Advances in genome sequencing have improved our understanding of the genetic basis of human diseases, and thousands of human genes have been associated with different diseases. Recent genomic adaptation at disease genes has not been well characterized. Here, we compare the rate of strong recent adaptation in the form of selective sweeps between mendelian, non-infectious disease genes and non-disease genes across distinct human populations from the 1000 Genomes Project. We find that mendelian disease genes have experienced far less selective sweeps compared to non-disease genes especially in Africa. Investigating further the possible causes of the sweep deficit at disease genes, we find that this deficit is very strong at disease genes with both low recombination rates and with high numbers of associated disease variants, but is almost non-existent at disease genes with higher recombination rates or lower numbers of associated disease variants. Because segregating recessive deleterious variants have the ability to interfere with adaptive ones, these observations strongly suggest that adaptation has been slowed down by the presence of interfering recessive deleterious variants at disease genes. These results suggest that disease genes suffer from a transient inability to adapt as fast as the rest of the genome.
Collapse
Affiliation(s)
- Chenlu Di
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| | - Jesus Murga Moreno
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | | | - M Elise Lauterbur
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| |
Collapse
|
9
|
Huang YF. Dissecting genomic determinants of positive selection with an evolution-guided regression model. Mol Biol Evol 2021; 39:6379733. [PMID: 34597406 PMCID: PMC8763110 DOI: 10.1093/molbev/msab291] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
In evolutionary genomics, it is fundamentally important to understand how characteristics of genomic sequences, such as gene expression level, determine the rate of adaptive evolution. While numerous statistical methods, such as the McDonald–Kreitman (MK) test, are available to examine the association between genomic features and the rate of adaptation, we currently lack a statistical approach to disentangle the independent effect of a genomic feature from the effects of other correlated genomic features. To address this problem, I present a novel statistical model, the MK regression, which augments the MK test with a generalized linear model. Analogous to the classical multiple regression model, the MK regression can analyze multiple genomic features simultaneously to infer the independent effect of a genomic feature, holding constant all other genomic features. Using the MK regression, I identify numerous genomic features driving positive selection in chimpanzees. These features include well-known ones, such as local mutation rate, residue exposure level, tissue specificity, and immune genes, as well as new features not previously reported, such as gene expression level and metabolic genes. In particular, I show that highly expressed genes may have a higher adaptation rate than their weakly expressed counterparts, even though a higher expression level may impose stronger negative selection. Also, I show that metabolic genes may have a higher adaptation rate than their nonmetabolic counterparts, possibly due to recent changes in diet in primate evolution. Overall, the MK regression is a powerful approach to elucidate the genomic basis of adaptation.
Collapse
Affiliation(s)
- Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA.,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
10
|
Souilmi Y, Lauterbur ME, Tobler R, Huber CD, Johar AS, Moradi SV, Johnston WA, Krogan NJ, Alexandrov K, Enard D. An ancient viral epidemic involving host coronavirus interacting genes more than 20,000 years ago in East Asia. Curr Biol 2021; 31:3504-3514.e9. [PMID: 34171302 PMCID: PMC8223470 DOI: 10.1016/j.cub.2021.05.067] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/22/2021] [Accepted: 05/28/2021] [Indexed: 11/30/2022]
Abstract
The current severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has emphasized the vulnerability of human populations to novel viral pressures, despite the vast array of epidemiological and biomedical tools now available. Notably, modern human genomes contain evolutionary information tracing back tens of thousands of years, which may help identify the viruses that have impacted our ancestors-pointing to which viruses have future pandemic potential. Here, we apply evolutionary analyses to human genomic datasets to recover selection events involving tens of human genes that interact with coronaviruses, including SARS-CoV-2, that likely started more than 20,000 years ago. These adaptive events were limited to the population ancestral to East Asian populations. Multiple lines of functional evidence support an ancient viral selective pressure, and East Asia is the geographical origin of several modern coronavirus epidemics. An arms race with an ancient coronavirus, or with a different virus that happened to use similar interactions as coronaviruses with human hosts, may thus have taken place in ancestral East Asian populations. By learning more about our ancient viral foes, our study highlights the promise of evolutionary information to better predict the pandemics of the future. Importantly, adaptation to ancient viral epidemics in specific human populations does not necessarily imply any difference in genetic susceptibility between different human populations, and the current evidence points toward an overwhelming impact of socioeconomic factors in the case of coronavirus disease 2019 (COVID-19).
Collapse
Affiliation(s)
- Yassine Souilmi
- Australian Centre for Ancient DNA, School of Biological Sciences, University of Adelaide, Adelaide, SA 5005, Australia; National Centre for Indigenous Genomics, Australian National University, Canberra, ACT 0200, Australia
| | - M Elise Lauterbur
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, AZ, USA
| | - Ray Tobler
- Australian Centre for Ancient DNA, School of Biological Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Christian D Huber
- Australian Centre for Ancient DNA, School of Biological Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Angad S Johar
- Australian Centre for Ancient DNA, School of Biological Sciences, University of Adelaide, Adelaide, SA 5005, Australia
| | - Shayli Varasteh Moradi
- CSIRO-QUT Synthetic Biology Alliance, Centre for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, QLD 4001, Australia
| | - Wayne A Johnston
- CSIRO-QUT Synthetic Biology Alliance, Centre for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, QLD 4001, Australia
| | - Nevan J Krogan
- QBI COVID-19 Research Group (QCRG), San Francisco, CA, USA; Quantitative Biosciences Institute (QBI), University of California, San Francisco, San Francisco, CA, USA; J. David Gladstone Institutes, San Francisco, CA, USA; Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA, USA; Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Kirill Alexandrov
- CSIRO-QUT Synthetic Biology Alliance, Centre for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, QLD 4001, Australia.
| | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, AZ, USA.
| |
Collapse
|
11
|
Fagny M, Austerlitz F. Polygenic Adaptation: Integrating Population Genetics and Gene Regulatory Networks. Trends Genet 2021; 37:631-638. [PMID: 33892958 DOI: 10.1016/j.tig.2021.03.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/15/2021] [Accepted: 03/16/2021] [Indexed: 12/13/2022]
Abstract
The adaptation of populations to local environments often relies on the selection of optimal values for polygenic traits. Here, we first summarize the results obtained from different quantitative genetics and population genetics models, about the genetic architecture of polygenic traits and their response to directional selection. We then highlight the contribution of systems biology to the understanding of the molecular bases of polygenic traits and the evolution of gene regulatory networks involved in these traits. Finally, we discuss the need for a unifying framework merging the fields of population genetics, quantitative genetics and systems biology to better understand the molecular bases of polygenic traits adaptation.
Collapse
Affiliation(s)
- Maud Fagny
- UMR7206 Eco-Anthropologie, Muséum National d'Histoire Naturelle, Centre National de la Recherche Scientifique, Université de Paris, Paris, France.
| | - Frédéric Austerlitz
- UMR7206 Eco-Anthropologie, Muséum National d'Histoire Naturelle, Centre National de la Recherche Scientifique, Université de Paris, Paris, France
| |
Collapse
|
12
|
Liu J, Robinson-Rechavi M. Robust inference of positive selection on regulatory sequences in the human brain. SCIENCE ADVANCES 2020; 6:6/48/eabc9863. [PMID: 33246961 PMCID: PMC7695467 DOI: 10.1126/sciadv.abc9863] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 10/16/2020] [Indexed: 05/07/2023]
Abstract
A longstanding hypothesis is that divergence between humans and chimpanzees might have been driven more by regulatory level adaptations than by protein sequence adaptations. This has especially been suggested for regulatory adaptations in the evolution of the human brain. We present a new method to detect positive selection on transcription factor binding sites on the basis of measuring predicted affinity change with a machine learning model of binding. Unlike other methods, this approach requires neither defining a priori neutral sites nor detecting accelerated evolution, thus removing major sources of bias. We scanned the signals of positive selection for CTCF binding sites in 29 human and 11 mouse tissues or cell types. We found that human brain-related cell types have the highest proportion of positive selection. This result is consistent with the view that adaptive evolution to gene regulation has played an important role in evolution of the human brain.
Collapse
Affiliation(s)
- Jialin Liu
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
13
|
Enard D, Petrov DA. Ancient RNA virus epidemics through the lens of recent adaptation in human genomes. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190575. [PMID: 33012231 PMCID: PMC7702803 DOI: 10.1098/rstb.2019.0575] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Over the course of the last several million years of evolution, humans probably have been plagued by hundreds or perhaps thousands of epidemics. Little is known about such ancient epidemics and a deep evolutionary perspective on current pathogenic threats is lacking. The study of past epidemics has typically been limited in temporal scope to recorded history, and in physical scope to pathogens that left sufficient DNA behind, such as Yersinia pestis during the Great Plague. Host genomes, however, offer an indirect way to detect ancient epidemics beyond the current temporal and physical limits. Arms races with pathogens have shaped the genomes of the hosts by driving a large number of adaptations at many genes, and these signals can be used to detect and further characterize ancient epidemics. Here, we detect the genomic footprints left by ancient viral epidemics that took place in the past approximately 50 000 years in the 26 human populations represented in the 1000 Genomes Project. By using the enrichment in signals of adaptation at approximately 4500 host loci that interact with specific types of viruses, we provide evidence that RNA viruses have driven a particularly large number of adaptive events across diverse human populations. These results suggest that different types of viruses may have exerted different selective pressures during human evolution. Knowledge of these past selective pressures will provide a deeper evolutionary perspective on current pathogenic threats. This article is part of the theme issue ‘Insights into health and disease from ancient biomolecules’.
Collapse
Affiliation(s)
- David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
14
|
Ferro D, Bakiu R, Pucciarelli S, Miceli C, Vallesi A, Irato P, Santovito G. Molecular Characterization, Protein-Protein Interaction Network, and Evolution of Four Glutathione Peroxidases from Tetrahymena thermophila. Antioxidants (Basel) 2020; 9:antiox9100949. [PMID: 33023127 PMCID: PMC7600574 DOI: 10.3390/antiox9100949] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 09/29/2020] [Accepted: 10/01/2020] [Indexed: 11/23/2022] Open
Abstract
Glutathione peroxidases (GPxs) form a broad family of antioxidant proteins essential for maintaining redox homeostasis in eukaryotic cells. In this study, we used an integrative approach that combines bioinformatics, molecular biology, and biochemistry to investigate the role of GPxs in reactive oxygen species detoxification in the unicellular eukaryotic model organism Tetrahymena thermophila. Both phylogenetic and mechanistic empirical model analyses provided indications about the evolutionary relationships among the GPXs of Tetrahymena and the orthologous enzymes of phylogenetically related species. In-silico gene characterization and text mining were used to predict the functional relationships between GPxs and other physiologically-relevant processes. The GPx genes contain conserved transcriptional regulatory elements in the promoter region, which suggest that transcription is under tight control of specialized signaling pathways. The bioinformatic findings were next experimentally validated by studying the time course of gene transcription and enzymatic activity after copper (Cu) exposure. Results emphasize the role of GPxs in the detoxification pathways that, by complex regulation of GPx gene expression, enable Tethraymena to survive in high Cu concentrations and the associated redox environment.
Collapse
Affiliation(s)
- Diana Ferro
- BIO5 Institute, University of Arizona, Tucson, AZ 85719, USA;
- Department of Pediatrics, Children’s Mercy Hospital and Clinics, Kansas City, MO 64108, USA
| | - Rigers Bakiu
- Department of Aquaculture and Fisheries, Agricultural University of Tirana, 1000 Tiranë, Albania;
| | - Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy; (S.P.); (C.M.); (A.V.)
| | - Cristina Miceli
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy; (S.P.); (C.M.); (A.V.)
| | - Adriana Vallesi
- School of Biosciences and Veterinary Medicine, University of Camerino, 62032 Camerino, Italy; (S.P.); (C.M.); (A.V.)
| | - Paola Irato
- Department of Biology, University of Padova, 35131 Padova, Italy;
| | - Gianfranco Santovito
- Department of Biology, University of Padova, 35131 Padova, Italy;
- Correspondence:
| |
Collapse
|
15
|
Kulmuni J, Nouhaud P, Pluckrose L, Satokangas I, Dhaygude K, Butlin RK. Instability of natural selection at candidate barrier loci underlying speciation in wood ants. Mol Ecol 2020; 29:3988-3999. [PMID: 32854139 DOI: 10.1111/mec.15606] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 08/12/2020] [Accepted: 08/13/2020] [Indexed: 12/11/2022]
Abstract
Speciation underlies the generation of novel biodiversity. Yet, there is much to learn about how natural selection shapes genomes during speciation. Selection is assumed to act against gene flow at barrier loci, promoting reproductive isolation. However, evidence for gene flow and selection is often indirect and we know very little about the temporal stability of barrier loci. Here we utilize haplodiploidy to identify candidate male barrier loci in hybrids between two wood ant species. As ant males are haploid, they are expected to reveal recessive barrier loci, which can be masked in diploid females if heterozygous. We then test for barrier stability in a sample collected 10 years later and use survival analysis to provide a direct measure of natural selection acting on candidate male barrier loci. We find multiple candidate male barrier loci scattered throughout the genome. Surprisingly, a proportion of them are not stable after 10 years, natural selection apparently switching from acting against to favouring introgression in the later sample. Instability of the barrier effect and natural selection for introgressed alleles could be due to environment-dependent selection, emphasizing the need to consider temporal variation in the strength of natural selection and the stability of the barrier effect at putative barrier loci in future speciation work.
Collapse
Affiliation(s)
- Jonna Kulmuni
- Organismal & Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland.,Tvärminne Zoological Station, University of Helsinki, Hanko, Finland
| | - Pierre Nouhaud
- Organismal & Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
| | - Lucy Pluckrose
- Organismal & Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
| | - Ina Satokangas
- Organismal & Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
| | - Kishor Dhaygude
- Organismal & Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
| | - Roger K Butlin
- Department of Animal and Plant Sciences, Alfred Denny Building, University of Sheffield, Sheffield, UK.,Department of Marine Science, University of Gothenburg, Gothenburg, Sweden
| |
Collapse
|
16
|
Roberts WR, Roalson EH. Co-expression clustering across flower development identifies modules for diverse floral forms in Achimenes (Gesneriaceae). PeerJ 2020; 8:e8778. [PMID: 32201652 PMCID: PMC7071821 DOI: 10.7717/peerj.8778] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 02/21/2020] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Genetic pathways involved with flower color and shape are thought to play an important role in the development of flowers associated with different pollination syndromes, such as those associated with bee, butterfly, or hummingbird pollination. Because pollination syndromes are complex traits that are orchestrated by multiple genes and pathways, the gene regulatory networks have not been explored. Gene co-expression networks provide a systems level approach to identify important contributors to floral diversification. METHODS RNA-sequencing was used to assay gene expression across two stages of flower development (an early bud and an intermediate stage) in 10 species of Achimenes (Gesneriaceae). Two stage-specific co-expression networks were created from 9,503 orthologs and analyzed to identify module hubs and the network periphery. Module association with bee, butterfly, and hummingbird pollination syndromes was tested using phylogenetic mixed models. The relationship between network connectivity and evolutionary rates (d N/d S) was tested using linear models. RESULTS Networks contained 65 and 62 modules that were largely preserved between developmental stages and contained few stage-specific modules. Over a third of the modules in both networks were associated with flower color, shape, and pollination syndrome. Within these modules, several hub nodes were identified that related to the production of anthocyanin and carotenoid pigments and the development of flower shape. Evolutionary rates were decreased in highly connected genes and elevated in peripheral genes. DISCUSSION This study aids in the understanding of the genetic architecture and network properties underlying the development of floral form and provides valuable candidate modules and genes for future studies.
Collapse
Affiliation(s)
- Wade R. Roberts
- School of Biological Sciences, Washington State University, Pullman, WA, USA
- Biological Sciences, University of Arkansas, Fayetteville, AR, USA
| | - Eric H. Roalson
- School of Biological Sciences, Washington State University, Pullman, WA, USA
| |
Collapse
|
17
|
Alvarez-Ponce D, Aguilar-Rodríguez J, Fares MA. Molecular Chaperones Accelerate the Evolution of Their Protein Clients in Yeast. Genome Biol Evol 2020; 11:2360-2375. [PMID: 31297528 PMCID: PMC6735891 DOI: 10.1093/gbe/evz147] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/05/2019] [Indexed: 12/23/2022] Open
Abstract
Protein stability is a major constraint on protein evolution. Molecular chaperones, also known as heat-shock proteins, can relax this constraint and promote protein evolution by diminishing the deleterious effect of mutations on protein stability and folding. This effect, however, has only been stablished for a few chaperones. Here, we use a comprehensive chaperone–protein interaction network to study the effect of all yeast chaperones on the evolution of their protein substrates, that is, their clients. In particular, we analyze how yeast chaperones affect the evolutionary rates of their clients at two very different evolutionary time scales. We first study the effect of chaperone-mediated folding on protein evolution over the evolutionary divergence of Saccharomyces cerevisiae and S. paradoxus. We then test whether yeast chaperones have left a similar signature on the patterns of standing genetic variation found in modern wild and domesticated strains of S. cerevisiae. We find that genes encoding chaperone clients have diverged faster than genes encoding non-client proteins when controlling for their number of protein–protein interactions. We also find that genes encoding client proteins have accumulated more intraspecific genetic diversity than those encoding non-client proteins. In a number of multivariate analyses, controlling by other well-known factors that affect protein evolution, we find that chaperone dependence explains the largest fraction of the observed variance in the rate of evolution at both evolutionary time scales. Chaperones affecting rates of protein evolution mostly belong to two major chaperone families: Hsp70s and Hsp90s. Our analyses show that protein chaperones, by virtue of their ability to buffer destabilizing mutations and their role in modulating protein genotype–phenotype maps, have a considerable accelerating effect on protein evolution.
Collapse
Affiliation(s)
- David Alvarez-Ponce
- Biology Department, University of Nevada, Reno.,Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, Valencia, Spain
| | - José Aguilar-Rodríguez
- Department of Biology, Stanford University, CA.,Department of Chemical and Systems Biology, Stanford University School of Medicine, CA
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, Valencia, Spain.,Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Ireland
| |
Collapse
|
18
|
Hämälä T, Gorton AJ, Moeller DA, Tiffin P. Pleiotropy facilitates local adaptation to distant optima in common ragweed (Ambrosia artemisiifolia). PLoS Genet 2020; 16:e1008707. [PMID: 32210431 PMCID: PMC7135370 DOI: 10.1371/journal.pgen.1008707] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 04/06/2020] [Accepted: 03/05/2020] [Indexed: 12/23/2022] Open
Abstract
Pleiotropy, the control of multiple phenotypes by a single locus, is expected to slow the rate of adaptation by increasing the chance that beneficial alleles also have deleterious effects. However, a prediction arising from classical theory of quantitative trait evolution states that pleiotropic alleles may have a selective advantage when phenotypes are distant from their selective optima. We examine the role of pleiotropy in regulating adaptive differentiation among populations of common ragweed (Ambrosia artemisiifolia); a species that has recently expanded its North American range due to human-mediated habitat change. We employ a phenotype-free approach by using connectivity in gene networks as a proxy for pleiotropy. First, we identify loci bearing footprints of local adaptation, and then use genotype-expression mapping and co-expression networks to infer the connectivity of the genes. Our results indicate that the putatively adaptive loci are highly pleiotropic, as they are more likely than expected to affect the expression of other genes, and they reside in central positions within the gene networks. We propose that the conditionally advantageous alleles at these loci avoid the cost of pleiotropy by having large phenotypic effects that are beneficial when populations are far from their selective optima. We further use evolutionary simulations to show that these patterns are in agreement with a model where populations face novel selective pressures, as expected during a range expansion. Overall, our results suggest that highly connected genes may be targets of positive selection during environmental change, even though they likely experience strong purifying selection in stable selective environments.
Collapse
Affiliation(s)
- Tuomas Hämälä
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, United States of America
| | - Amanda J. Gorton
- Department of Ecology, Evolution and Behavior, University of Minnesota, St. Paul, Minnesota, United States of America
| | - David A. Moeller
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, United States of America
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota, United States of America
| |
Collapse
|
19
|
Glinsky GV. A Catalogue of 59,732 Human-Specific Regulatory Sequences Reveals Unique-to-Human Regulatory Patterns Associated with Virus-Interacting Proteins, Pluripotency, and Brain Development. DNA Cell Biol 2019; 39:126-143. [PMID: 31730374 DOI: 10.1089/dna.2019.4988] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Extensive searches for genomic regions harboring various types of candidate human-specific regulatory sequences (HSRS) identified thousands' HSRS using high-resolution next-generation sequencing technologies and methodologically diverse comparative analyses of human and nonhuman primates' (NHPs) reference genomes. In this study, a comprehensive catalogue of 59,732 genomic loci harboring candidate HSRS has been assembled to facilitate the systematic analyses of genomic sequences that were either inherited from extinct common ancestors (ECAs) or created de novo in human genomes. These analyses identified thousands of candidate HSRS and HSRS-harboring loci that appear inherited from ECAs, yet absent in genomes of our closest evolutionary relatives, chimpanzee and bonobo, presumably due to the incomplete lineage sorting and/or species-specific loss or regulatory DNA. This pattern is particularly prominent for HSRS-harboring loci that have been putatively associated with human-specific gene expression changes in cerebral organoid models. A prominent majority of regions harboring human-specific mutations associated with human-specific expression changes during brain development is highly conserved in chimpanzee, bonobo, and gorilla genomes. Among NHPs, dominant fractions of HSRS-harboring loci associated with human-specific gene expression in both excitatory neurons (347 loci; 67%) and radial glia (683 loci; 72%) are highly conserved in the gorilla genome. Analysis of 4433 genes encoding virus-interacting proteins (VIPs) revealed that 95.9% of human VIPs are components of human-specific regulatory networks that appear to operate in distinct types of human cells from preimplantation embryos to adult dorsolateral prefrontal cortex. These analyses demonstrate that modern humans captured unique genome-wide combinations of regulatory sequences, divergent subsets of which are highly conserved in distinct species of six NHP separated by 30 million years of evolution. Concurrently, this unique-to-human mosaic of genomic regulatory patterns inherited from ECAs was supplemented with 12,486 created de novo HSRS. Genes encoding VIPs appear to represent a principal genomic target during evolution of human-specific regulatory networks, which contribute to fitness of Homo sapiens and affect a functionally diverse spectrum of biological and cellular processes controlled by VIP-containing liquid-liquid phase-separated condensates.
Collapse
Affiliation(s)
- Gennadi V Glinsky
- Institute of Engineering in Medicine, University of California, San Diego, La Jolla, California
| |
Collapse
|
20
|
Dobon B, Montanucci L, Peretó J, Bertranpetit J, Laayouni H. Gene connectivity and enzyme evolution in the human metabolic network. Biol Direct 2019; 14:17. [PMID: 31481097 PMCID: PMC6724310 DOI: 10.1186/s13062-019-0248-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 08/21/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Determining the factors involved in the likelihood of a gene being under adaptive selection is still a challenging goal in Evolutionary Biology. Here, we perform an evolutionary analysis of the human metabolic genes to explore the associations between network structure and the presence and strength of natural selection in the genes whose products are involved in metabolism. Purifying and positive selection are estimated at interspecific (among mammals) and intraspecific (among human populations) levels, and the connections between enzymatic reactions are differentiated between incoming (in-degree) and outgoing (out-degree) links. RESULTS We confirm that purifying selection has been stronger in highly connected genes. Long-term positive selection has targeted poorly connected enzymes, whereas short-term positive selection has targeted different enzymes depending on whether the selective sweep has reached fixation in the population: genes under a complete selective sweep are poorly connected, whereas those under an incomplete selective sweep have high out-degree connectivity. The last steps of pathways are more conserved due to stronger purifying selection, with long-term positive selection targeting preferentially enzymes that catalyze the first steps. However, short-term positive selection has targeted enzymes that catalyze the last steps in the metabolic network. Strong signals of positive selection have been found for metabolic processes involved in lipid transport and membrane fluidity and permeability. CONCLUSIONS Our analysis highlights the importance of analyzing the same biological system at different evolutionary timescales to understand the evolution of metabolic genes and of distinguishing between incoming and outgoing links in a metabolic network. Short-term positive selection has targeted enzymes with a different connectivity profile depending on the completeness of the selective sweep, while long-term positive selection has targeted genes with fewer connections that code for enzymes that catalyze the first steps in the network. REVIEWERS This article was reviewed by Diamantis Sellis and Brandon Invergo.
Collapse
Affiliation(s)
- Begoña Dobon
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader 88, 08003, Barcelona, Catalonia, Spain
| | - Ludovica Montanucci
- Dipartimento di Biomedicina Comparata e Alimentazione, Università degli Studi di Padova, Padua, Italy
| | - Juli Peretó
- Institute for Integrative Systems Biology I2SysBio (University of Valencia-CSIC) and Department of Biochemistry and Molecular Biology, University of Valencia, Valencia, Spain
| | - Jaume Bertranpetit
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader 88, 08003, Barcelona, Catalonia, Spain.
| | - Hafid Laayouni
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader 88, 08003, Barcelona, Catalonia, Spain. .,Bioinformatics Studies, ESCI-UPF, Pg.Pujades 1, 08003, Barcelona, Catalonia, Spain.
| |
Collapse
|
21
|
Enard D, Petrov DA. Evidence that RNA Viruses Drove Adaptive Introgression between Neanderthals and Modern Humans. Cell 2019; 175:360-371.e13. [PMID: 30290142 DOI: 10.1016/j.cell.2018.08.034] [Citation(s) in RCA: 98] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2018] [Revised: 07/04/2018] [Accepted: 08/16/2018] [Indexed: 01/01/2023]
Abstract
Neanderthals and modern humans interbred at least twice in the past 100,000 years. While there is evidence that most introgressed DNA segments from Neanderthals to modern humans were removed by purifying selection, less is known about the adaptive nature of introgressed sequences that were retained. We hypothesized that interbreeding between Neanderthals and modern humans led to (1) the exposure of each species to novel viruses and (2) the exchange of adaptive alleles that provided resistance against these viruses. Here, we find that long, frequent-and more likely adaptive-segments of Neanderthal ancestry in modern humans are enriched for proteins that interact with viruses (VIPs). We found that VIPs that interact specifically with RNA viruses were more likely to belong to introgressed segments in modern Europeans. Our results show that retained segments of Neanderthal ancestry can be used to detect ancient epidemics.
Collapse
Affiliation(s)
- David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA.
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
22
|
Torres-Sánchez M, Gower DJ, Alvarez-Ponce D, Creevey CJ, Wilkinson M, San Mauro D. What lies beneath? Molecular evolution during the radiation of caecilian amphibians. BMC Genomics 2019; 20:354. [PMID: 31072350 PMCID: PMC6507065 DOI: 10.1186/s12864-019-5694-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 04/15/2019] [Indexed: 12/12/2022] Open
Abstract
Background Evolution leaves an imprint in species through genetic change. At the molecular level, evolutionary changes can be explored by studying ratios of nucleotide substitutions. The interplay among molecular evolution, derived phenotypes, and ecological ranges can provide insights into adaptive radiations. Caecilians (order Gymnophiona), probably the least known of the major lineages of vertebrates, are limbless tropical amphibians, with adults of most species burrowing in soils (fossoriality). This enigmatic order of amphibians are very distinct phenotypically from other extant amphibians and likely from the ancestor of Lissamphibia, but little to nothing is known about the molecular changes underpinning their radiation. We hypothesised that colonization of various depths of tropical soils and of freshwater habitats presented new ecological opportunities to caecilians. Results A total of 8540 candidate groups of orthologous genes from transcriptomic data of five species of caecilian amphibians and the genome of the frog Xenopus tropicalis were analysed in order to investigate the genetic machinery behind caecilian diversification. We found a total of 168 protein-coding genes with signatures of positive selection at different evolutionary times during the radiation of caecilians. The majority of these genes were related to functional elements of the cell membrane and extracellular matrix with expression in several different tissues. The first colonization of the tropical soils was connected to the largest number of protein-coding genes under positive selection in our analysis. From the results of our study, we highlighted molecular changes in genes involved in perception, reduction-oxidation processes, and aging that likely were involved in the adaptation to different soil strata. Conclusions The genes inferred to have been under positive selection provide valuable insights into caecilian evolution, potentially underpin adaptations of caecilians to their extreme environments, and contribute to a better understanding of fossorial adaptations and molecular evolution in vertebrates. Electronic supplementary material The online version of this article (10.1186/s12864-019-5694-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- María Torres-Sánchez
- Department of Biodiversity, Ecology and Evolution, Complutense University of Madrid, 28040, Madrid, Spain. .,Present address: Department of Neuroscience, Spinal Cord and Brain Injury Research Center & Ambystoma Genetic Stock Center, University of Kentucky, Lexington, KY, 40536, USA.
| | - David J Gower
- Department of Life Sciences, The Natural History Museum, London, SW7 5BD, UK
| | | | - Christopher J Creevey
- Institute for Global Food Security, Queen's University Belfast, University Road, Belfast, BT7 1NN, Northern Ireland, UK
| | - Mark Wilkinson
- Department of Life Sciences, The Natural History Museum, London, SW7 5BD, UK
| | - Diego San Mauro
- Department of Biodiversity, Ecology and Evolution, Complutense University of Madrid, 28040, Madrid, Spain
| |
Collapse
|
23
|
Alvarez-Ponce D, Feyertag F, Chakraborty S. Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network. Genome Biol Evol 2018; 9:1742-1756. [PMID: 28854629 PMCID: PMC5570066 DOI: 10.1093/gbe/evx117] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/01/2017] [Indexed: 02/06/2023] Open
Abstract
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network—a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets.
Collapse
|
24
|
Jabbari K, Heger P, Sharma R, Wiehe T. The Diverging Routes of BORIS and CTCF: An Interactomic and Phylogenomic Analysis. Life (Basel) 2018; 8:life8010004. [PMID: 29385718 PMCID: PMC5871936 DOI: 10.3390/life8010004] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Revised: 01/25/2018] [Accepted: 01/25/2018] [Indexed: 12/11/2022] Open
Abstract
The CCCTC-binding factor (CTCF) is multi-functional, ubiquitously expressed, and highly conserved from Drosophila to human. It has important roles in transcriptional insulation and the formation of a high-dimensional chromatin structure. CTCF has a paralog called “Brother of Regulator of Imprinted Sites” (BORIS) or “CTCF-like” (CTCFL). It binds DNA at sites similar to those of CTCF. However, the expression profiles of the two proteins are quite different. We investigated the evolutionary trajectories of the two proteins after the duplication event using a phylogenomic and interactomic approach. We find that CTCF has 52 direct interaction partners while CTCFL only has 19. Almost all interactors already existed before the emergence of CTCF and CTCFL. The unique secondary loss of CTCF from several nematodes is paralleled by a loss of two of its interactors, the polycomb repressive complex subunit SuZ12 and the multifunctional transcription factor TYY1. In contrast to earlier studies reporting the absence of BORIS from birds, we present evidence for a multigene synteny block containing CTCFL that is conserved in mammals, reptiles, and several species of birds, indicating that not the entire lineage of birds experienced a loss of CTCFL. Within this synteny block, BORIS and its genomic neighbors seem to be partitioned into two nested chromatin loops. The high expression of SPO11, RAE1, RBM38, and PMEPA1 in male tissues suggests a possible link between CTCFL, meiotic recombination, and fertility-associated phenotypes. Using the 65,700 exomes and the 1000 genomes data, we observed a higher number of intergenic, non-synonymous, and loss-of-function mutations in CTCFL than in CTCF, suggesting a reduced strength of purifying selection, perhaps due to less functional constraint.
Collapse
Affiliation(s)
- Kamel Jabbari
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Peter Heger
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Ranu Sharma
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| | - Thomas Wiehe
- Cologne Biocenter, Institute for Genetics, University of Cologne, Zülpicher Straße 47a, 50674 Köln, Germany.
| |
Collapse
|
25
|
Ebel ER, Telis N, Venkataram S, Petrov DA, Enard D. High rate of adaptation of mammalian proteins that interact with Plasmodium and related parasites. PLoS Genet 2017; 13:e1007023. [PMID: 28957326 PMCID: PMC5634635 DOI: 10.1371/journal.pgen.1007023] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Revised: 10/10/2017] [Accepted: 09/15/2017] [Indexed: 11/18/2022] Open
Abstract
Plasmodium parasites, along with their Piroplasm relatives, have caused malaria-like illnesses in terrestrial mammals for millions of years. Several Plasmodium-protective alleles have recently evolved in human populations, but little is known about host adaptation to blood parasites over deeper evolutionary timescales. In this work, we analyze mammalian adaptation in ~500 Plasmodium- or Piroplasm- interacting proteins (PPIPs) manually curated from the scientific literature. We show that (i) PPIPs are enriched for both immune functions and pleiotropy with other pathogens, and (ii) the rate of adaptation across mammals is significantly elevated in PPIPs, compared to carefully matched control proteins. PPIPs with high pathogen pleiotropy show the strongest signatures of adaptation, but this pattern is fully explained by their immune enrichment. Several pieces of evidence suggest that blood parasites specifically have imposed selection on PPIPs. First, even non-immune PPIPs that lack interactions with other pathogens have adapted at twice the rate of matched controls. Second, PPIP adaptation is linked to high expression in the liver, a critical organ in the parasite life cycle. Finally, our detailed investigation of alpha-spectrin, a major red blood cell membrane protein, shows that domains with particularly high rates of adaptation are those known to interact specifically with P. falciparum. Overall, we show that host proteins that interact with Plasmodium and Piroplasm parasites have experienced elevated rates of adaptation across mammals, and provide evidence that some of this adaptation has likely been driven by blood parasites.
Collapse
Affiliation(s)
- Emily R. Ebel
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (ERE); (DE)
| | - Natalie Telis
- Program in Biomedical Informatics, Stanford University, Stanford, California, United States of America
| | - Sandeep Venkataram
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - David Enard
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (ERE); (DE)
| |
Collapse
|
26
|
Gouy A, Daub JT, Excoffier L. Detecting gene subnetworks under selection in biological pathways. Nucleic Acids Res 2017; 45:e149. [PMID: 28934485 PMCID: PMC5766194 DOI: 10.1093/nar/gkx626] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2017] [Revised: 07/04/2017] [Accepted: 07/10/2017] [Indexed: 12/30/2022] Open
Abstract
Advances in high throughput sequencing technologies have created a gap between data production and functional data analysis. Indeed, phenotypes result from interactions between numerous genes, but traditional methods treat loci independently, missing important knowledge brought by network-level emerging properties. Therefore, detecting selection acting on multiple genes affecting the evolution of complex traits remains challenging. In this context, gene network analysis provides a powerful framework to study the evolution of adaptive traits and facilitates the interpretation of genome-wide data. We developed a method to analyse gene networks that is suitable to evidence polygenic selection. The general idea is to search biological pathways for subnetworks of genes that directly interact with each other and that present unusual evolutionary features. Subnetwork search is a typical combinatorial optimization problem that we solve using a simulated annealing approach. We have applied our methodology to find signals of adaptation to high-altitude in human populations. We show that this adaptation has a clear polygenic basis and is influenced by many genetic components. Our approach, implemented in the R package signet, improves on gene-level classical tests for selection by identifying both new candidate genes and new biological processes involved in adaptation to altitude.
Collapse
Affiliation(s)
- Alexandre Gouy
- Institute of Ecology and Evolution, University of Berne, Baltzerstrasse 6, 3012 Berne, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Joséphine T. Daub
- Institute of Evolutionary Biology, Universitat Pompeu Fabra – CSIC, 08003 Barcelona, Spain
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Baltzerstrasse 6, 3012 Berne, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
27
|
Connectivity in gene coexpression networks negatively correlates with rates of molecular evolution in flowering plants. PLoS One 2017; 12:e0182289. [PMID: 28759647 PMCID: PMC5536297 DOI: 10.1371/journal.pone.0182289] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 07/14/2017] [Indexed: 12/22/2022] Open
Abstract
Gene coexpression networks are a useful tool for summarizing transcriptomic data and providing insight into patterns of gene regulation in a variety of species. Though there has been considerable interest in studying the evolution of network topology across species, less attention has been paid to the relationship between network position and patterns of molecular evolution. Here, we generated coexpression networks from publicly available expression data for seven flowering plant taxa (Arabidopsis thaliana, Glycine max, Oryza sativa, Populus spp., Solanum lycopersicum, Vitis spp., and Zea mays) to investigate the relationship between network position and rates of molecular evolution. We found a significant negative correlation between network connectivity and rates of molecular evolution, with more highly connected (i.e., “hub”) genes having significantly lower nonsynonymous substitution rates and dN/dS ratios compared to less highly connected (i.e., “peripheral”) genes across the taxa surveyed. These findings suggest that more centrally located hub genes are, on average, subject to higher levels of evolutionary constraint than are genes located on the periphery of gene coexpression networks. The consistency of this result across disparate taxa suggests that it holds for flowering plants in general, as opposed to being a species-specific phenomenon.
Collapse
|
28
|
Feyertag F, Berninsone PM, Alvarez-Ponce D. Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation. Mol Biol Evol 2017; 34:692-706. [PMID: 28007979 DOI: 10.1093/molbev/msw268] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation.
Collapse
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada, Reno, Reno, NV
| | | | | |
Collapse
|
29
|
Kulmuni J, Westram AM. Intrinsic incompatibilities evolving as a by-product of divergent ecological selection: Considering them in empirical studies on divergence with gene flow. Mol Ecol 2017; 26:3093-3103. [PMID: 28423210 DOI: 10.1111/mec.14147] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Revised: 03/31/2017] [Accepted: 04/03/2017] [Indexed: 12/31/2022]
Abstract
The possibility of intrinsic barriers to gene flow is often neglected in empirical research on local adaptation and speciation with gene flow, for example when interpreting patterns observed in genome scans. However, we draw attention to the fact that, even with gene flow, divergent ecological selection may generate intrinsic barriers involving both ecologically selected and other interacting loci. Mechanistically, the link between the two types of barriers may be generated by genes that have multiple functions (i.e., pleiotropy), and/or by gene interaction networks. Because most genes function in complex networks, and their evolution is not independent of other genes, changes evolving in response to ecological selection can generate intrinsic barriers as a by-product. A crucial question is to what extent such by-product barriers contribute to divergence and speciation-that is whether they stably reduce gene flow. We discuss under which conditions by-product barriers may increase isolation. However, we also highlight that, depending on the conditions (e.g., the amount of gene flow and the strength of selection acting on the intrinsic vs. the ecological barrier component), the intrinsic incompatibility may actually destabilize barriers to gene flow. In practice, intrinsic barriers generated as a by-product of divergent ecological selection may generate peaks in genome scans that cannot easily be interpreted. We argue that empirical studies on divergence with gene flow should consider the possibility of both ecological and intrinsic barriers. Future progress will likely come from work combining population genomic studies, experiments quantifying fitness and molecular studies on protein function and interactions.
Collapse
Affiliation(s)
- J Kulmuni
- Centre of Excellence in Biological Interactions, Department of Biosciences, University of Helsinki, Helsinki, Finland.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| | - A M Westram
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| |
Collapse
|
30
|
Josephs EB, Wright SI, Stinchcombe JR, Schoen DJ. The Relationship between Selection, Network Connectivity, and Regulatory Variation within a Population of Capsella grandiflora. Genome Biol Evol 2017; 9:1099-1109. [PMID: 28402527 PMCID: PMC5408089 DOI: 10.1093/gbe/evx068] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2017] [Indexed: 12/12/2022] Open
Abstract
Interactions between genes can have important consequences for how selection shapes sequence variation at these genes. Specifically, genes that have pleiotropic effects by affecting the expression level of many other genes may be under stronger selective constraint. We used coexpression networks to measure connectivity between genes and investigated the relationship between gene connectivity and selection in a natural population of the plant Capsella grandiflora. We observed that network connectivity was negatively correlated with genetic divergence due to stronger negative selection on highly-connected genes even when controlling for variation in gene expression level. However, the presence of local regulatory variation for a gene's expression level was also associated with reduced negative selection and lower gene connectivity. While it is difficult to disentangle the causal relationships between these factors, our results show that both connectivity and local regulatory variation are important factors for explaining variation in selection between genes.
Collapse
Affiliation(s)
- Emily B. Josephs
- Department of Evolution and Ecology, University of California, Davis
| | - Stephen I. Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - John R. Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - Daniel J. Schoen
- Department of Biology, McGill University, Stewart Biology Building, Montreal, Quebec, Canada
| |
Collapse
|
31
|
Structural divergence of essential triad ribbon synapse proteins among placental mammals - Implications for preclinical trials in photoreceptor transplantation therapy. Exp Eye Res 2017; 159:156-167. [PMID: 28322827 DOI: 10.1016/j.exer.2017.03.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Revised: 01/24/2017] [Accepted: 03/17/2017] [Indexed: 11/22/2022]
Abstract
As photoreceptor transplantation rapidly moves closer to the clinic, verifying graft efficacy in animal models may have unforeseen xenogeneic barriers. Although photoreceptor transplants have most convincingly exhibited functional synaptogenesis in conspecific studies, such evidence (while ruling out false-positives due to: viral graft labeling, fusion/cytosolic transfer, or neuroprotection) has not yet been shown for discordant xenografts. From this, a fundamental question should be raised: is useful xenosynaptogenesis likely between human photoreceptors and mouse retina? The triad ribbon synapse (TRS) that would normally form is unique and contains trans-synaptic proteins essential to its formation and function. Thus, could interspecific structural divergence be present that may inhibit this trans-synaptic bridge in discordant xenografts? In an effort to address this question computationally, we compared eight recently confirmed (including subcellular location) TRS specific (or predominantly expressed at the TRS) proteins among placental mammals (1-to-1 orthologs) using HyPhy selection analysis (a predictive measure of structural divergence) and by using Phyre2 tertiary structural modeling. Here, selection analysis revealed strong positive (diversifying) selection acting on a particularly important TRS protein: pikachurin. This positive selection was localized to its second Laminin-G (LG)-like domain and on its N-terminal domain - a putative region of trans-synaptic interaction. Localization of structural divergence to the N-terminus of each putative post-translational cleavage (PTC) product may suggest neofunctionalization from ancestral uncleaved pikachurin - this would be consistent with a recent counter-paradigm report of pikachurin cleavage predominating at the TRS. From this, we suggest a dual role after cleavage where the N-terminal fragment can still mediate the trans-synaptic bridge, while the C-terminal fragment may act as a diffusible trophic or "homing" factor for bipolar cell dendrite migration. Tertiary structural models mirrored the conformational divergence predicted by selection analysis. With human and mouse pikachurin (as well as other TRS proteins) likely to diverge considerably in structure among placental mammals - alongside known inter-mammalian variation in TRS phenotype and protein repertoire, high levels of diversifying selection acting on genes involving sensation, considerable timespans allowing for genetic drift that can create xenogeneic epistasis, and uncertainty surrounding the extent of xenosynaptogenesis in PPC transplant studies to date - use of distantly related hosts to test human photoreceptor graft therapeutic efficacy should be considered with caution.
Collapse
|
32
|
Alvarez-Ponce D, Sabater-Muñoz B, Toft C, Ruiz-González MX, Fares MA. Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli. Genome Biol Evol 2016; 8:2914-2927. [PMID: 27566759 PMCID: PMC5630975 DOI: 10.1093/gbe/evw205] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length.
Collapse
Affiliation(s)
| | - Beatriz Sabater-Muñoz
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| | - Christina Toft
- Department of Genetics, University of Valencia, Valencia, Spain Departamento de Biotecnología, Instituto de Agroquímica y Tecnología de los Alimentos (CSIC), Valencia, Spain
| | - Mario X Ruiz-González
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Current Address: Secretaría de Educación Superior, Ciencia, Tecnología e Innovación, Proyecto Prometeo; Departamento de Ciencias Biológicas, Universidad Tócnica Particular de Loja, Loja, Ecuador
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|
33
|
Enard D, Cai L, Gwennap C, Petrov DA. Viruses are a dominant driver of protein adaptation in mammals. eLife 2016; 5. [PMID: 27187613 PMCID: PMC4869911 DOI: 10.7554/elife.12469] [Citation(s) in RCA: 163] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 04/04/2016] [Indexed: 12/12/2022] Open
Abstract
Viruses interact with hundreds to thousands of proteins in mammals, yet adaptation against viruses has only been studied in a few proteins specialized in antiviral defense. Whether adaptation to viruses typically involves only specialized antiviral proteins or affects a broad array of virus-interacting proteins is unknown. Here, we analyze adaptation in ~1300 virus-interacting proteins manually curated from a set of 9900 proteins conserved in all sequenced mammalian genomes. We show that viruses (i) use the more evolutionarily constrained proteins within the cellular functions they interact with and that (ii) despite this high constraint, virus-interacting proteins account for a high proportion of all protein adaptation in humans and other mammals. Adaptation is elevated in virus-interacting proteins across all functional categories, including both immune and non-immune functions. We conservatively estimate that viruses have driven close to 30% of all adaptive amino acid changes in the part of the human proteome conserved within mammals. Our results suggest that viruses are one of the most dominant drivers of evolutionary change across mammalian and human proteomes.
Collapse
Affiliation(s)
- David Enard
- Department of Biology, Stanford University, Stanford, United States
| | - Le Cai
- Department of Biology, Stanford University, Stanford, United States
| | - Carina Gwennap
- Department of Biology, Stanford University, Stanford, United States
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, United States
| |
Collapse
|
34
|
Positive Selection and Centrality in the Yeast and Fly Protein-Protein Interaction Networks. BIOMED RESEARCH INTERNATIONAL 2016; 2016:4658506. [PMID: 27119079 PMCID: PMC4826914 DOI: 10.1155/2016/4658506] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2015] [Accepted: 03/07/2016] [Indexed: 01/28/2023]
Abstract
Proteins within a molecular network are expected to be subject to different selective pressures depending on their relative hierarchical positions. However, it is not obvious what genes within a network should be more likely to evolve under positive selection. On one hand, only mutations at genes with a relatively high degree of control over adaptive phenotypes (such as those encoding highly connected proteins) are expected to be “seen” by natural selection. On the other hand, a high degree of pleiotropy at these genes is expected to hinder adaptation. Previous analyses of the human protein-protein interaction network have shown that genes under long-term, recurrent positive selection (as inferred from interspecific comparisons) tend to act at the periphery of the network. It is unknown, however, whether these trends apply to other organisms. Here, we show that long-term positive selection has preferentially targeted the periphery of the yeast interactome. Conversely, in flies, genes under positive selection encode significantly more connected and central proteins. These observations are not due to covariation of genes' adaptability and centrality with confounding factors. Therefore, the distribution of proteins encoded by genes under recurrent positive selection across protein-protein interaction networks varies from one species to another.
Collapse
|
35
|
Roque E, Fares MA, Yenush L, Rochina MC, Wen J, Mysore KS, Gómez-Mena C, Beltrán JP, Cañas LA. Evolution by gene duplication of Medicago truncatula PISTILLATA-like transcription factors. JOURNAL OF EXPERIMENTAL BOTANY 2016; 67:1805-1817. [PMID: 26773809 PMCID: PMC4783364 DOI: 10.1093/jxb/erv571] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
PISTILLATA (PI) is a member of the B-function MADS-box gene family, which controls the identity of both petals and stamens in Arabidopsis thaliana. In Medicago truncatula (Mt), there are two PI-like paralogs, known as MtPI and MtNGL9. These genes differ in their expression patterns, but it is not known whether their functions have also diverged. Describing the evolution of certain duplicated genes, such as transcription factors, remains a challenge owing to the complex expression patterns and functional divergence between the gene copies. Here, we report a number of functional studies, including analyses of gene expression, protein-protein interactions, and reverse genetic approaches designed to demonstrate the respective contributions of each M. truncatula PI-like paralog to the B-function in this species. Also, we have integrated molecular evolution approaches to determine the mode of evolution of Mt PI-like genes after duplication. Our results demonstrate that MtPI functions as a master regulator of B-function in M. truncatula, maintaining the overall ancestral function, while MtNGL9 does not seem to have a role in this regard, suggesting that the pseudogenization could be the functional evolutionary fate for this gene. However, we provide evidence that purifying selection is the primary evolutionary force acting on this paralog, pinpointing the conservation of its biochemical function and, alternatively, the acquisition of a new role for this gene.
Collapse
Affiliation(s)
- Edelín Roque
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Lynne Yenush
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Mari Cruz Rochina
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Jiangqi Wen
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA
| | - Kirankumar S Mysore
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA
| | - Concepción Gómez-Mena
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - José Pío Beltrán
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Luis A Cañas
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| |
Collapse
|
36
|
Vedanayagam JP, Garrigan D. The effects of natural selection across molecular pathways in Drosophila melanogaster. BMC Evol Biol 2015; 15:203. [PMID: 26391223 PMCID: PMC4578789 DOI: 10.1186/s12862-015-0472-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Accepted: 08/30/2015] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Whole-genome RNA interference post-transcriptional silencing (RNAi) is a widely used method for studying the phenotypic effects of knocking down individual genes. In this study, we use a population genomic approach to characterize the rate of evolution for proteins affecting 26 RNAi knockdown phenotypes in Drosophila melanogaster. RESULTS We find that only two of the 26 RNAi knockdown phenotypes are enriched for rapidly evolving proteins: innate immunity and regulation of Hedgehog signaling. Among all genes associated with an RNAi knockdown phenotype, we note examples in which the adaptively evolving proteins play a well-defined role in a given molecular pathway. However, most adaptively evolving proteins are found to perform more general cellular functions. When RNAi phenotypes are grouped into categories according to cellular function, we find that genes involved in the greatest number of phenotypic categories are also significantly more likely to have a history of rapid protein evolution. CONCLUSIONS We show that genes that have been demonstrated to have a measurable effect on multiple molecular phenotypes show higher rates of protein evolution than genes having an effect on a single category of phenotype. Defining pleiotropy in this way yields very different results than previous studies that define pleiotropy by the number of physical interactions, which show highly connected proteins tend to evolve more slowly than lowly connected proteins. We suggest that a high degree of pleiotropy may increase the likelihood of compensatory substitution, consistent with modern theoretical work on adaptation.
Collapse
Affiliation(s)
| | - Daniel Garrigan
- Department of Biology, University of Rochester, Rochester, New York, 14627, USA.
| |
Collapse
|
37
|
Suleiman SH, Koko ME, Nasir WH, Elfateh O, Elgizouli UK, Abdallah MOE, Alfarouk KO, Hussain A, Faisal S, Ibrahim FMA, Romano M, Sultan A, Banks L, Newport M, Baralle F, Elhassan AM, Mohamed HS, Ibrahim ME. Exome sequencing of a colorectal cancer family reveals shared mutation pattern and predisposition circuitry along tumor pathways. Front Genet 2015; 6:288. [PMID: 26442106 PMCID: PMC4584935 DOI: 10.3389/fgene.2015.00288] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Accepted: 08/28/2015] [Indexed: 01/04/2023] Open
Abstract
The molecular basis of cancer and cancer multiple phenotypes are not yet fully understood. Next Generation Sequencing promises new insight into the role of genetic interactions in shaping the complexity of cancer. Aiming to outline the differences in mutation patterns between familial colorectal cancer cases and controls we analyzed whole exomes of cancer tissues and control samples from an extended colorectal cancer pedigree, providing one of the first data sets of exome sequencing of cancer in an African population against a background of large effective size typically with excess of variants. Tumors showed hMSH2 loss of function SNV consistent with Lynch syndrome. Sets of genes harboring insertions-deletions in tumor tissues revealed, however, significant GO enrichment, a feature that was not seen in control samples, suggesting that ordered insertions-deletions are central to tumorigenesis in this type of cancer. Network analysis identified multiple hub genes of centrality. ELAVL1/HuR showed remarkable centrality, interacting specially with genes harboring non-synonymous SNVs thus reinforcing the proposition of targeted mutagenesis in cancer pathways. A likely explanation to such mutation pattern is DNA/RNA editing, suggested here by nucleotide transition-to-transversion ratio that significantly departed from expected values (p-value 5e-6). NFKB1 also showed significant centrality along with ELAVL1, raising the suspicion of viral etiology given the known interaction between oncogenic viruses and these proteins.
Collapse
Affiliation(s)
| | - Mahmoud E Koko
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Wafaa H Nasir
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Ommnyiah Elfateh
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Ubai K Elgizouli
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Mohammed O E Abdallah
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Khalid O Alfarouk
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Ayman Hussain
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Shima Faisal
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Fathelrahamn M A Ibrahim
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum Khartoum, Sudan
| | - Maurizio Romano
- International Centre for Genetic Engineering and Biotechnology Trieste, Italy
| | - Ali Sultan
- Weill Cornell Medical College Doha, Qatar
| | - Lawrence Banks
- International Centre for Genetic Engineering and Biotechnology Trieste, Italy
| | | | - Francesco Baralle
- International Centre for Genetic Engineering and Biotechnology Trieste, Italy
| | | | - Hiba S Mohamed
- Faculty of Medicine, University of Khartoum Khartoum, Sudan
| | | |
Collapse
|