1
|
Mello B, Schrago CG. Modeling Substitution Rate Evolution across Lineages and Relaxing the Molecular Clock. Genome Biol Evol 2024; 16:evae199. [PMID: 39332907 PMCID: PMC11430275 DOI: 10.1093/gbe/evae199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/08/2024] [Indexed: 09/29/2024] Open
Abstract
Relaxing the molecular clock using models of how substitution rates change across lineages has become essential for addressing evolutionary problems. The diversity of rate evolution models and their implementations are substantial, and studies have demonstrated their impact on divergence time estimates can be as significant as that of calibration information. In this review, we trace the development of rate evolution models from the proposal of the molecular clock concept to the development of sophisticated Bayesian and non-Bayesian methods that handle rate variation in phylogenies. We discuss the various approaches to modeling rate evolution, provide a comprehensive list of available software, and examine the challenges and advancements of the prevalent Bayesian framework, contrasting them to faster non-Bayesian methods. Lastly, we offer insights into potential advancements in the field in the era of big data.
Collapse
Affiliation(s)
- Beatriz Mello
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, RJ 21941-617, Brazil
| | - Carlos G Schrago
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, RJ 21941-617, Brazil
| |
Collapse
|
2
|
Ohadi M, Arabfard M, Khamse S, Alizadeh S, Vafadar S, Bayat H, Tajeddin N, Maddi AMA, Delbari A, Khorram Khorshid HR. Novel crossover and recombination hotspots massively spread across primate genomes. Biol Direct 2024; 19:70. [PMID: 39169390 PMCID: PMC11340189 DOI: 10.1186/s13062-024-00508-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 07/29/2024] [Indexed: 08/23/2024] Open
Abstract
BACKGROUND The recombination landscape and subsequent natural selection have vast consequences forevolution and speciation. However, most of the crossover and recombination hotspots are yet to be discovered. We previously reported the relevance of C and G trinucleotide two-repeat units (CG-TTUs) in crossovers and recombination. METHODS On a genome-wide scale, here we mapped all combinations of A and T trinucleotide two-repeat units (AT-TTUs) in human, consisting of AATAAT, ATAATA, ATTATT, TTATTA, TATTAT, and TAATAA. We also compared a number of the colonies formed by the AT-TTUs (distance between consecutive AT-TTUs < 500 bp) in several other primates and mouse. RESULTS We found that the majority of the AT-TTUs (> 96%) resided in approximately 1.4 million colonies, spread throughout the human genome. In comparison to the CG-TTU colonies, the AT-TTU colonies were significantly more abundant and larger in size. Pure units and overlapping units of the pure units were readily detectable in the same colonies, signifying that the units were the sites of unequal crossover. We discovered dynamic sharedness of several of the colonies across the primate species studied, which mainly reached maximum complexity and size in human. CONCLUSIONS We report novel crossover and recombination hotspots of the finest molecular resolution, massively spread and shared across the genomes of human and several other primates. With respect to crossovers and recombination, these genomes are far more dynamic than previously envisioned.
Collapse
Affiliation(s)
- Mina Ohadi
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
| | - Masoud Arabfard
- Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran.
| | - Safoura Khamse
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - Samira Alizadeh
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - Sara Vafadar
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - Hadi Bayat
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Biochemical Neuroendocrinology, Montreal Clinical and Research Institute (IRCM, affiliated to the McGill University, Montreal, QC, H2W 1R7, Canada
| | - Nahid Tajeddin
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran
| | - Ali M A Maddi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Ahmad Delbari
- Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
| | - Hamid R Khorram Khorshid
- Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran
| |
Collapse
|
3
|
Joseph J, Prentout D, Laverré A, Tricou T, Duret L. High prevalence of PRDM9-independent recombination hotspots in placental mammals. Proc Natl Acad Sci U S A 2024; 121:e2401973121. [PMID: 38809707 PMCID: PMC11161765 DOI: 10.1073/pnas.2401973121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 04/26/2024] [Indexed: 05/31/2024] Open
Abstract
In many mammals, recombination events are concentrated in hotspots directed by a sequence-specific DNA-binding protein named PRDM9. Intriguingly, PRDM9 has been lost several times in vertebrates, and notably among mammals, it has been pseudogenized in the ancestor of canids. In the absence of PRDM9, recombination hotspots tend to occur in promoter-like features such as CpG islands. It has thus been proposed that one role of PRDM9 could be to direct recombination away from PRDM9-independent hotspots. However, the ability of PRDM9 to direct recombination hotspots has been assessed in only a handful of species, and a clear picture of how much recombination occurs outside of PRDM9-directed hotspots in mammals is still lacking. In this study, we derived an estimator of past recombination activity based on signatures of GC-biased gene conversion in substitution patterns. We quantified recombination activity in PRDM9-independent hotspots in 52 species of boreoeutherian mammals. We observe a wide range of recombination rates at these loci: several species (such as mice, humans, some felids, or cetaceans) show a deficit of recombination, while a majority of mammals display a clear peak of recombination. Our results demonstrate that PRDM9-directed and PRDM9-independent hotspots can coexist in mammals and that their coexistence appears to be the rule rather than the exception. Additionally, we show that the location of PRDM9-independent hotspots is relatively more stable than that of PRDM9-directed hotspots, but that PRDM9-independent hotspots nevertheless evolve slowly in concert with DNA hypomethylation.
Collapse
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| | - Djivan Prentout
- Department of Biological Sciences, Columbia University, New York, NY10027
| | - Alexandre Laverré
- Department of Ecology and Evolution, University of Lausanne, LausanneCH-1015, Switzerland
- Swiss Institute of Bioinformatics, LausanneCH-1015, Switzerland
| | - Théo Tricou
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| |
Collapse
|
4
|
Joseph J. Increased Positive Selection in Highly Recombining Genes Does not Necessarily Reflect an Evolutionary Advantage of Recombination. Mol Biol Evol 2024; 41:msae107. [PMID: 38829800 PMCID: PMC11173204 DOI: 10.1093/molbev/msae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/08/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024] Open
Abstract
It is commonly thought that the long-term advantage of meiotic recombination is to dissipate genetic linkage, allowing natural selection to act independently on different loci. It is thus theoretically expected that genes with higher recombination rates evolve under more effective selection. On the other hand, recombination is often associated with GC-biased gene conversion (gBGC), which theoretically interferes with selection by promoting the fixation of deleterious GC alleles. To test these predictions, several studies assessed whether selection was more effective in highly recombining genes (due to dissipation of genetic linkage) or less effective (due to gBGC), assuming a fixed distribution of fitness effects (DFE) for all genes. In this study, I directly derive the DFE from a gene's evolutionary history (shaped by mutation, selection, drift, and gBGC) under empirical fitness landscapes. I show that genes that have experienced high levels of gBGC are less fit and thus have more opportunities for beneficial mutations. Only a small decrease in the genome-wide intensity of gBGC leads to the fixation of these beneficial mutations, particularly in highly recombining genes. This results in increased positive selection in highly recombining genes that is not caused by more effective selection. Additionally, I show that the death of a recombination hotspot can lead to a higher dN/dS than its birth, but with substitution patterns biased towards AT, and only at selected positions. This shows that controlling for a substitution bias towards GC is therefore not sufficient to rule out the contribution of gBGC to signatures of accelerated evolution. Finally, although gBGC does not affect the fixation probability of GC-conservative mutations, I show that by altering the DFE, gBGC can also significantly affect nonsynonymous GC-conservative substitution patterns.
Collapse
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne, France
| |
Collapse
|
5
|
Galtier N. Half a Century of Controversy: The Neutralist/Selectionist Debate in Molecular Evolution. Genome Biol Evol 2024; 16:evae003. [PMID: 38311843 PMCID: PMC10839204 DOI: 10.1093/gbe/evae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/01/2024] [Indexed: 02/06/2024] Open
Abstract
The neutral and nearly neutral theories, introduced more than 50 yr ago, have raised and still raise passionate discussion regarding the forces governing molecular evolution and their relative importance. The debate, initially focused on the amount of within-species polymorphism and constancy of the substitution rate, has spread, matured, and now underlies a wide range of topics and questions. The neutralist/selectionist controversy has structured the field and influences the way molecular evolutionary scientists conceive their research.
Collapse
Affiliation(s)
- Nicolas Galtier
- ISEM, CNRS, IRD, Université de Montpellier, Montpellier, France
| |
Collapse
|
6
|
Liu A, Wang N, Xie G, Li Y, Yan X, Li X, Zhu Z, Li Z, Yang J, Meng F, Dou M, Chen W, Ma N, Jiang Y, Gao Y, Wang Y. GC-biased gene conversion drives accelerated evolution of ultraconserved elements in mammalian and avian genomes. Genome Res 2023; 33:1673-1689. [PMID: 37884342 PMCID: PMC10691551 DOI: 10.1101/gr.277784.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 08/23/2023] [Indexed: 10/28/2023]
Abstract
Ultraconserved elements (UCEs) are the most conserved regions among the genomes of evolutionarily distant species and are thought to play critical biological functions. However, some UCEs rapidly evolved in specific lineages, and whether they contributed to adaptive evolution is still controversial. Here, using an increased number of sequenced genomes with high taxonomic coverage, we identified 2191 mammalian UCEs and 5938 avian UCEs from 95 mammal and 94 bird genomes, respectively. Our results show that these UCEs are functionally constrained and that their adjacent genes are prone to widespread expression with low expression diversity across tissues. Functional enrichment of mammalian and avian UCEs shows different trends indicating that UCEs may contribute to adaptive evolution of taxa. Focusing on lineage-specific accelerated evolution, we discover that the proportion of fast-evolving UCEs in nine mammalian and 10 avian test lineages range from 0.19% to 13.2%. Notably, up to 62.1% of fast-evolving UCEs in test lineages are much more likely to result from GC-biased gene conversion (gBGC). A single cervid-specific gBGC region embracing the uc.359 allele significantly alters the expression of Nova1 and other neural-related genes in the rat brain. Combined with the altered regulatory activity of ancient gBGC-induced fast-evolving UCEs in eutherians, our results provide evidence that synergy between gBGC and selection shaped lineage-specific substitution patterns, even in the most constrained regulatory elements. In summary, our results show that gBGC played an important role in facilitating lineage-specific accelerated evolution of UCEs, and further support the idea that a combination of multiple evolutionary forces shapes adaptive evolution.
Collapse
Affiliation(s)
- Anguo Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nini Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Faculty of Mathematics and Natural Sciences, University of Cologne, and Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), University Hospital Cologne, Cologne 50931, Germany
| | - Guoxiang Xie
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yang Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xixi Yan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xinmei Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhenliang Zhu
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhuohui Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Jing Yang
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Fanxin Meng
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Mingle Dou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Weihuang Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nange Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Center for Functional Genomics, Institute of Future Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yuanpeng Gao
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| |
Collapse
|
7
|
Westbury MV, Cabrera AA, Rey-Iglesia A, De Cahsan B, Duchêne DA, Hartmann S, Lorenzen ED. A genomic assessment of the marine-speciation paradox within the toothed whale superfamily Delphinoidea. Mol Ecol 2023; 32:4829-4843. [PMID: 37448145 DOI: 10.1111/mec.17069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 06/21/2023] [Accepted: 07/03/2023] [Indexed: 07/15/2023]
Abstract
The impact of post-divergence gene flow in speciation has been documented across a range of taxa in recent years, and may have been especially widespread in highly mobile, wide-ranging marine species, such as cetaceans. Here, we studied individual genomes from nine species across the three families of the toothed whale superfamily Delphinoidea (Delphinidae, Phocoenidae and Monodontidae). To investigate the role of post-divergence gene flow in the speciation process, we used a multifaceted approach, including (i) phylogenomics, (ii) the distribution of shared derived alleles and (iii) demographic inference. We found the divergence of lineages within Delphinoidea did not follow a process of pure bifurcation, but was much more complex. Sliding-window phylogenomics reveal a high prevalence of discordant topologies within the superfamily, with further analyses indicating these discordances arose due to both incomplete lineage sorting and gene flow. D-statistics and f-branch analyses supported gene flow between members of Delphinoidea, with the vast majority of gene flow occurring as ancient interfamilial events. Demographic analyses provided evidence that introgressive gene flow has likely ceased between all species pairs tested, despite reports of contemporary interspecific hybrids. Our study provides the first steps towards resolving the large complexity of speciation within Delphinoidea; we reveal the prevalence of ancient interfamilial gene flow events prior to the diversification of each family, and suggest that contemporary hybridisation events may be disadvantageous, as hybrid individuals do not appear to contribute to the parental species' gene pools.
Collapse
Affiliation(s)
| | | | | | - Binia De Cahsan
- Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - David A Duchêne
- Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Stefanie Hartmann
- Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | | |
Collapse
|
8
|
Dornburg A, Mallik R, Wang Z, Bernal MA, Thompson B, Bruford EA, Nebert DW, Vasiliou V, Yohe LR, Yoder JA, Townsend JP. Placing human gene families into their evolutionary context. Hum Genomics 2022; 16:56. [PMID: 36369063 PMCID: PMC9652883 DOI: 10.1186/s40246-022-00429-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 10/12/2022] [Indexed: 11/13/2022] Open
Abstract
Following the draft sequence of the first human genome over 20 years ago, we have achieved unprecedented insights into the rules governing its evolution, often with direct translational relevance to specific diseases. However, staggering sequence complexity has also challenged the development of a more comprehensive understanding of human genome biology. In this context, interspecific genomic studies between humans and other animals have played a critical role in our efforts to decode human gene families. In this review, we focus on how the rapid surge of genome sequencing of both model and non-model organisms now provides a broader comparative framework poised to empower novel discoveries. We begin with a general overview of how comparative approaches are essential for understanding gene family evolution in the human genome, followed by a discussion of analyses of gene expression. We show how homology can provide insights into the genes and gene families associated with immune response, cancer biology, vision, chemosensation, and metabolism, by revealing similarity in processes among distant species. We then explain methodological tools that provide critical advances and show the limitations of common approaches. We conclude with a discussion of how these investigations position us to gain fundamental insights into the evolution of gene families among living organisms in general. We hope that our review catalyzes additional excitement and research on the emerging field of comparative genomics, while aiding the placement of the human genome into its existentially evolutionary context.
Collapse
Affiliation(s)
- Alex Dornburg
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA.
| | - Rittika Mallik
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Zheng Wang
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| | - Moisés A Bernal
- Department of Biological Sciences, College of Science and Mathematics, Auburn University, Auburn, AL, USA
| | - Brian Thompson
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Elspeth A Bruford
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Daniel W Nebert
- Department of Environmental Health, Center for Environmental Genetics, University of Cincinnati Medical Center, P.O. Box 670056, Cincinnati, OH, 45267, USA
- Department of Pediatrics and Molecular Developmental Biology, Division of Human Genetics, Cincinnati Children's Hospital, Cincinnati, OH, 45229, USA
| | - Vasilis Vasiliou
- Department of Environmental Health Sciences, Yale School of Public Health, New Haven, CT, USA
| | - Laurel R Yohe
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
| | - Jeffrey A Yoder
- Department of Molecular Biomedical Sciences, College of Veterinary Medicine, North Carolina State University, Raleigh, NC, USA
| | - Jeffrey P Townsend
- Department of Bioinformatics and Genomics, UNC-Charlotte, Charlotte, NC, USA
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| |
Collapse
|
9
|
Mika K, Whittington CM, McAllan BM, Lynch VJ. Gene expression phylogenies and ancestral transcriptome reconstruction resolves major transitions in the origins of pregnancy. eLife 2022; 11:e74297. [PMID: 35770963 PMCID: PMC9275820 DOI: 10.7554/elife.74297] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 06/29/2022] [Indexed: 11/13/2022] Open
Abstract
Structural and physiological changes in the female reproductive system underlie the origins of pregnancy in multiple vertebrate lineages. In mammals, the glandular portion of the lower reproductive tract has transformed into a structure specialized for supporting fetal development. These specializations range from relatively simple maternal nutrient provisioning in egg-laying monotremes to an elaborate suite of traits that support intimate maternal-fetal interactions in Eutherians. Among these traits are the maternal decidua and fetal component of the placenta, but there is considerable uncertainty about how these structures evolved. Previously, we showed that changes in uterine gene expression contributes to several evolutionary innovations during the origins of pregnancy (Mika et al., 2021b). Here, we reconstruct the evolution of entire transcriptomes ('ancestral transcriptome reconstruction') and show that maternal gene expression profiles are correlated with degree of placental invasion. These results indicate that an epitheliochorial-like placenta evolved early in the mammalian stem-lineage and that the ancestor of Eutherians had a hemochorial placenta, and suggest maternal control of placental invasiveness. These data resolve major transitions in the evolution of pregnancy and indicate that ancestral transcriptome reconstruction can be used to study the function of ancestral cell, tissue, and organ systems.
Collapse
Affiliation(s)
- Katelyn Mika
- Department of Human Genetics, University of ChicagoChicagoUnited States
- Department of Organismal Biology and Anatomy, University of ChicagoChicagoUnited States
| | | | | | - Vincent J Lynch
- Department of Biological Sciences, University at Buffalo, State University of New YorkBuffalo,NewyorkUnited States
| |
Collapse
|
10
|
Borges R, Boussau B, Szöllősi GJ, Kosiol C. Nucleotide Usage Biases Distort Inferences of the Species Tree. Genome Biol Evol 2022; 14:6496956. [PMID: 34983052 PMCID: PMC8829901 DOI: 10.1093/gbe/evab290] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/27/2021] [Indexed: 12/15/2022] Open
Abstract
Despite the importance of natural selection in species’ evolutionary history, phylogenetic methods that take into account population-level processes typically ignore selection. The assumption of neutrality is often based on the idea that selection occurs at a minority of loci in the genome and is unlikely to compromise phylogenetic inferences significantly. However, genome-wide processes like GC-bias and some variation segregating at the coding regions are known to evolve in the nearly neutral range. As we are now using genome-wide data to estimate species trees, it is natural to ask whether weak but pervasive selection is likely to blur species tree inferences. We developed a polymorphism-aware phylogenetic model tailored for measuring signatures of nucleotide usage biases to test the impact of selection in the species tree. Our analyses indicate that although the inferred relationships among species are not significantly compromised, the genetic distances are systematically underestimated in a node-height-dependent manner: that is, the deeper nodes tend to be more underestimated than the shallow ones. Such biases have implications for molecular dating. We dated the evolutionary history of 30 worldwide fruit fly populations, and we found signatures of GC-bias considerably affecting the estimated divergence times (up to 23%) in the neutral model. Our findings call for the need to account for selection when quantifying divergence or dating species evolution.
Collapse
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Bastien Boussau
- Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5558, LBBE, Villeurbanne, France
| | - Gergely J Szöllősi
- Department of Biological Physics, Eötvös University, Budapest , Hungary.,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, Budapest, Hungary.,Evolutionary Systems Research Group, Centre for Ecological Research, Hungarian Academy of Sciences, Tihany, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Centre for Biological Diversity, University of St Andrews, St Andrews, United Kingdom
| |
Collapse
|
11
|
Bonito M, D’Atanasio E, Ravasini F, Cariati S, Finocchio A, Novelletto A, Trombetta B, Cruciani F. New insights into the evolution of human Y chromosome palindromes through mutation and gene conversion. Hum Mol Genet 2021; 30:2272-2285. [PMID: 34244762 PMCID: PMC8600007 DOI: 10.1093/hmg/ddab189] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 07/01/2021] [Accepted: 07/05/2021] [Indexed: 12/16/2022] Open
Abstract
About one-quarter of the euchromatic portion of the male-specific region of the human Y chromosome consists of large duplicated sequences that are organized in eight palindromes (termed P1-P8), which undergo arm-to arm gene conversion, a proposed mechanism for maintaining their sequence integrity. Although the relevance of gene conversion in the evolution of palindromic sequences has been profoundly recognized, the dynamic of this mechanism is still nuanced. To shed light into the evolution of these genomic elements, we performed a high-depth (50×) targeted next-generation sequencing of the palindrome P6 in 157 subjects belonging to the most divergent evolutionary lineages of the Y chromosome. We found 118 new paralogous sequence variants, which were placed into the context of a robust Y chromosome phylogeny based on 7240 SNPs of the X-degenerate region. We mapped along the phylogeny 80 gene conversion events that shaped the diversity of P6 arms during recent human history. In contrast to previous studies, we demonstrated that arm-to-arm gene conversion, which occurs at a rate of 6.01 × 10 -6 conversions/base/year, is not biased toward the retention of the ancestral state of sequences. We also found a significantly lower mutation rate of the arms (6.18 × 10-10 mutations/base/year) compared with the spacer (9.16 × 10-10 mutations/base/year), a finding that may explain the observed higher inter-species conservation of arms, without invoking any bias of conversion. Finally, by formally testing the mutation/conversion balance in P6, we found that the arms of this palindrome reached a steady-state equilibrium between mutation and gene conversion.
Collapse
Affiliation(s)
- Maria Bonito
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Eugenia D’Atanasio
- Institute of Molecular Biology and Pathology (IBPM), CNR, Rome 0185, Italy
| | - Francesco Ravasini
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Selene Cariati
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Andrea Finocchio
- Department of Biology, University of Rome Tor Vergata, Rome 0133, Italy
| | - Andrea Novelletto
- Department of Biology, University of Rome Tor Vergata, Rome 0133, Italy
| | - Beniamino Trombetta
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
| | - Fulvio Cruciani
- Department of Biology and Biotechnology ‘Charles Darwin’, Sapienza University of Rome, Laboratory affiliated to Istituto Pasteur Italia-Fondazione Cenci Bolognetti, Rome 0185, Italy
- Institute of Molecular Biology and Pathology (IBPM), CNR, Rome 0185, Italy
| |
Collapse
|
12
|
Moreira A, Croze M, Delehelle F, Cussat-Blanc S, Luga H, Mollereau C, Balaresque P. Hearing Sensitivity of Primates: Recurrent and Episodic Positive Selection in Hair Cells and Stereocilia Protein-Coding Genes. Genome Biol Evol 2021; 13:6302699. [PMID: 34137817 PMCID: PMC8358225 DOI: 10.1093/gbe/evab133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/06/2021] [Indexed: 12/29/2022] Open
Abstract
The large spectrum of hearing sensitivity observed in primates results from the impact of environmental and behavioral pressures to optimize sound perception and localization. Although evidence of positive selection in auditory genes has been detected in mammals including in Hominoids, selection has never been investigated in other primates. We analyzed 123 genes highly expressed in the inner ear of 27 primate species and tested to what extent positive selection may have shaped these genes in the order Primates tree. We combined both site and branch-site tests to obtain a comprehensive picture of the positively selected genes (PSGs) involved in hearing sensitivity, and drew a detailed description of the most affected branches in the tree. We chose a conservative approach, and thus focused on confounding factors potentially affecting PSG signals (alignment, GC-biased gene conversion, duplications, heterogeneous sequencing qualities). Using site tests, we showed that around 12% of these genes are PSGs, an α selection value consistent with average human genome estimates (10-15%). Using branch-site tests, we showed that the primate tree is heterogeneously affected by positive selection, with the black snub-nosed monkey, the bushbaby, and the orangutan, being the most impacted branches. A large proportion of these genes is inclined to shape hair cells and stereocilia, which are involved in the mechanotransduction process, known to influence frequency perception. Adaptive selection, and more specifically recurrent adaptive evolution, could have acted in parallel on a set of genes (ADGRV1, USH2A, PCDH15, PTPRQ, and ATP8A2) involved in stereocilia growth and the whole complex of bundle links connecting them, in species across different habitats, including high altitude and nocturnal environments.
Collapse
Affiliation(s)
- Andreia Moreira
- Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), Faculté de Médecine Purpan, CNRS UMR5288, Université de Toulouse, Université Toulouse III Paul Sabatier, France.,Institut de Recherche en Informatique de Toulouse (IRIT), CNRS UMR5505, Université Toulouse III Paul Sabatier, France
| | - Myriam Croze
- Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), Faculté de Médecine Purpan, CNRS UMR5288, Université de Toulouse, Université Toulouse III Paul Sabatier, France
| | - Franklin Delehelle
- Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), Faculté de Médecine Purpan, CNRS UMR5288, Université de Toulouse, Université Toulouse III Paul Sabatier, France.,Institut de Recherche en Informatique de Toulouse (IRIT), CNRS UMR5505, Université Toulouse III Paul Sabatier, France
| | - Sylvain Cussat-Blanc
- Institut de Recherche en Informatique de Toulouse (IRIT), CNRS UMR5505, Université Toulouse III Paul Sabatier, France
| | - Hervé Luga
- Institut de Recherche en Informatique de Toulouse (IRIT), CNRS UMR5505, Université Toulouse III Paul Sabatier, France
| | - Catherine Mollereau
- Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), Faculté de Médecine Purpan, CNRS UMR5288, Université de Toulouse, Université Toulouse III Paul Sabatier, France
| | - Patricia Balaresque
- Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), Faculté de Médecine Purpan, CNRS UMR5288, Université de Toulouse, Université Toulouse III Paul Sabatier, France
| |
Collapse
|
13
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Collapse
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
| |
Collapse
|
14
|
Bossert S, Murray EA, Pauly A, Chernyshov K, Brady SG, Danforth BN. Gene Tree Estimation Error with Ultraconserved Elements: An Empirical Study on Pseudapis Bees. Syst Biol 2020; 70:803-821. [PMID: 33367855 DOI: 10.1093/sysbio/syaa097] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 11/18/2020] [Accepted: 12/02/2020] [Indexed: 11/12/2022] Open
Abstract
Summarizing individual gene trees to species phylogenies using two-step coalescent methods is now a standard strategy in the field of phylogenomics. However, practical implementations of summary methods suffer from gene tree estimation error, which is caused by various biological and analytical factors. Greatly understudied is the choice of gene tree inference method and downstream effects on species tree estimation for empirical data sets. To better understand the impact of this method choice on gene and species tree accuracy, we compare gene trees estimated through four widely used programs under different model-selection criteria: PhyloBayes, MrBayes, IQ-Tree, and RAxML. We study their performance in the phylogenomic framework of $>$800 ultraconserved elements from the bee subfamily Nomiinae (Halictidae). Our taxon sampling focuses on the genus Pseudapis, a distinct lineage with diverse morphological features, but contentious morphology-based taxonomic classifications and no molecular phylogenetic guidance. We approximate topological accuracy of gene trees by assessing their ability to recover two uncontroversial, monophyletic groups, and compare branch lengths of individual trees using the stemminess metric (the relative length of internal branches). We further examine different strategies of removing uninformative loci and the collapsing of weakly supported nodes into polytomies. We then summarize gene trees with ASTRAL and compare resulting species phylogenies, including comparisons to concatenation-based estimates. Gene trees obtained with the reversible jump model search in MrBayes were most concordant on average and all Bayesian methods yielded gene trees with better stemminess values. The only gene tree estimation approach whose ASTRAL summary trees consistently produced the most likely correct topology, however, was IQ-Tree with automated model designation (ModelFinder program). We discuss these findings and provide practical advice on gene tree estimation for summary methods. Lastly, we establish the first phylogeny-informed classification for Pseudapis s. l. and map the distribution of distinct morphological features of the group. [ASTRAL; Bees; concordance; gene tree estimation error; IQ-Tree; MrBayes, Nomiinae; PhyloBayes; RAxML; phylogenomics; stemminess].
Collapse
Affiliation(s)
- Silas Bossert
- Department of Entomology, Cornell University, Comstock Hall, Ithaca, NY 14853, USA.,Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA.,Department of Entomology, Washington State University, Pullman, Washington 99164, USA
| | - Elizabeth A Murray
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA.,Department of Entomology, Washington State University, Pullman, Washington 99164, USA
| | - Alain Pauly
- O.D. Taxonomy and Phylogeny, Royal Belgian Institute of Natural Sciences, Rue Vautier 29, 1000 Brussels, Belgium
| | - Kyrylo Chernyshov
- College of Arts and Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Seán G Brady
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA
| | - Bryan N Danforth
- Department of Entomology, Cornell University, Comstock Hall, Ithaca, NY 14853, USA
| |
Collapse
|
15
|
Upham NS, Esselstyn JA, Jetz W. Inferring the mammal tree: Species-level sets of phylogenies for questions in ecology, evolution, and conservation. PLoS Biol 2019; 17:e3000494. [PMID: 31800571 PMCID: PMC6892540 DOI: 10.1371/journal.pbio.3000494] [Citation(s) in RCA: 570] [Impact Index Per Article: 95.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Accepted: 10/24/2019] [Indexed: 12/18/2022] Open
Abstract
Big, time-scaled phylogenies are fundamental to connecting evolutionary processes to modern biodiversity patterns. Yet inferring reliable phylogenetic trees for thousands of species involves numerous trade-offs that have limited their utility to comparative biologists. To establish a robust evolutionary timescale for all approximately 6,000 living species of mammals, we developed credible sets of trees that capture root-to-tip uncertainty in topology and divergence times. Our "backbone-and-patch" approach to tree building applies a newly assembled 31-gene supermatrix to two levels of Bayesian inference: (1) backbone relationships and ages among major lineages, using fossil node or tip dating, and (2) species-level "patch" phylogenies with nonoverlapping in-groups that each correspond to one representative lineage in the backbone. Species unsampled for DNA are either excluded ("DNA-only" trees) or imputed within taxonomic constraints using branch lengths drawn from local birth-death models ("completed" trees). Joining time-scaled patches to backbones results in species-level trees of extant Mammalia with all branches estimated under the same modeling framework, thereby facilitating rate comparisons among lineages as disparate as marsupials and placentals. We compare our phylogenetic trees to previous estimates of mammal-wide phylogeny and divergence times, finding that (1) node ages are broadly concordant among studies, and (2) recent (tip-level) rates of speciation are estimated more accurately in our study than in previous "supertree" approaches, in which unresolved nodes led to branch-length artifacts. Credible sets of mammalian phylogenetic history are now available for download at http://vertlife.org/phylosubsets, enabling investigations of long-standing questions in comparative biology.
Collapse
Affiliation(s)
- Nathan S. Upham
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
- Center for Biodiversity & Global Change, Yale University, New Haven, Connecticut, United States of America
| | - Jacob A. Esselstyn
- Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Walter Jetz
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
- Center for Biodiversity & Global Change, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
16
|
Borges R, Szöllősi GJ, Kosiol C. Quantifying GC-Biased Gene Conversion in Great Ape Genomes Using Polymorphism-Aware Models. Genetics 2019; 212:1321-1336. [PMID: 31147380 PMCID: PMC6707462 DOI: 10.1534/genetics.119.302074] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Accepted: 05/20/2019] [Indexed: 11/18/2022] Open
Abstract
As multi-individual population-scale data become available, more complex modeling strategies are needed to quantify genome-wide patterns of nucleotide usage and associated mechanisms of evolution. Recently, the multivariate neutral Moran model was proposed. However, it was shown insufficient to explain the distribution of alleles in great apes. Here, we propose a new model that includes allelic selection. Our theoretical results constitute the basis of a new Bayesian framework to estimate mutation rates and selection coefficients from population data. We apply the new framework to a great ape dataset, where we found patterns of allelic selection that match those of genome-wide GC-biased gene conversion (gBGC). In particular, we show that great apes have patterns of allelic selection that vary in intensity-a feature that we correlated with great apes' distinct demographies. We also demonstrate that the AT/GC toggling effect decreases the probability of a substitution, promoting more polymorphisms in the base composition of great ape genomes. We further assess the impact of GC-bias in molecular analysis, and find that mutation rates and genetic distances are estimated under bias when gBGC is not properly accounted for. Our results contribute to the discussion on the tempo and mode of gBGC evolution, while stressing the need for gBGC-aware models in population genetics and phylogenetics.
Collapse
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
| | - Gergely J Szöllősi
- Department of Biological Physics, MTA-ELTE "Lendulet" Evolutionary Genomics Research Group, Eötvös University, Pázmány P. stny. 1A, Budapest 1117, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
Collapse
|
17
|
Laurin-Lemay S, Rodrigue N, Lartillot N, Philippe H. Conditional Approximate Bayesian Computation: A New Approach for Across-Site Dependency in High-Dimensional Mutation-Selection Models. Mol Biol Evol 2019; 35:2819-2834. [PMID: 30203003 DOI: 10.1093/molbev/msy173] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
A key question in molecular evolutionary biology concerns the relative roles of mutation and selection in shaping genomic data. Moreover, features of mutation and selection are heterogeneous along the genome and over time. Mechanistic codon substitution models based on the mutation-selection framework are promising approaches to separating these effects. In practice, however, several complications arise, since accounting for such heterogeneities often implies handling models of high dimensionality (e.g., amino acid preferences), or leads to across-site dependence (e.g., CpG hypermutability), making the likelihood function intractable. Approximate Bayesian Computation (ABC) could address this latter issue. Here, we propose a new approach, named Conditional ABC (CABC), which combines the sampling efficiency of MCMC and the flexibility of ABC. To illustrate the potential of the CABC approach, we apply it to the study of mammalian CpG hypermutability based on a new mutation-level parameter implying dependence across adjacent sites, combined with site-specific purifying selection on amino-acids captured by a Dirichlet process. Our proof-of-concept of the CABC methodology opens new modeling perspectives. Our application of the method reveals a high level of heterogeneity of CpG hypermutability across loci and mild heterogeneity across taxonomic groups; and finally, we show that CpG hypermutability is an important evolutionary factor in rendering relative synonymous codon usage. All source code is available as a GitHub repository (https://github.com/Simonll/LikelihoodFreePhylogenetics.git).
Collapse
Affiliation(s)
- Simon Laurin-Lemay
- Robert-Cedergren Center for Bioinformatics and Genomics, Department of Biochemistry and Molecular Medicine, Faculty of Medicine, Université de Montréal, Montréal, QC, Canada
| | - Nicolas Rodrigue
- Department of Biology, Institute of Biochemistry, and School of Mathematics and Statistics, Carleton University, Ottawa, ON, Canada
| | - Nicolas Lartillot
- Laboratoire de Biométrie et Biologie Évolutive, UMR CNRS 5558, Université Lyon 1, Lyon, France
| | - Hervé Philippe
- Robert-Cedergren Center for Bioinformatics and Genomics, Department of Biochemistry and Molecular Medicine, Faculty of Medicine, Université de Montréal, Montréal, QC, Canada.,Centre de Théorisation et de Modélisation de la Biodiversité, Station d'Écologie Théorique et Expérimentale, UMR CNRS 5321, Moulis, France
| |
Collapse
|
18
|
Gossmann TI, Bockwoldt M, Diringer L, Schwarz F, Schumann VF. Evidence for Strong Fixation Bias at 4-fold Degenerate Sites Across Genes in the Great Tit Genome. Front Ecol Evol 2018. [DOI: 10.3389/fevo.2018.00203] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
19
|
Assaf ZJ, Tilk S, Park J, Siegal ML, Petrov DA. Deep sequencing of natural and experimental populations of Drosophila melanogaster reveals biases in the spectrum of new mutations. Genome Res 2017; 27:1988-2000. [PMID: 29079675 PMCID: PMC5741049 DOI: 10.1101/gr.219956.116] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Accepted: 10/20/2017] [Indexed: 11/25/2022]
Abstract
Mutations provide the raw material of evolution, and thus our ability to study evolution depends fundamentally on having precise measurements of mutational rates and patterns. We generate a data set for this purpose using (1) de novo mutations from mutation accumulation experiments and (2) extremely rare polymorphisms from natural populations. The first, mutation accumulation (MA) lines are the product of maintaining flies in tiny populations for many generations, therefore rendering natural selection ineffective and allowing new mutations to accrue in the genome. The second, rare genetic variation from natural populations allows the study of mutation because extremely rare polymorphisms are relatively unaffected by the filter of natural selection. We use both methods in Drosophila melanogaster, first generating our own novel data set of sequenced MA lines and performing a meta-analysis of all published MA mutations (∼2000 events) and then identifying a high quality set of ∼70,000 extremely rare (≤0.1%) polymorphisms that are fully validated with resequencing. We use these data sets to precisely measure mutational rates and patterns. Highlights of our results include: a high rate of multinucleotide mutation events at both short (∼5 bp) and long (∼1 kb) genomic distances, showing that mutation drives GC content lower in already GC-poor regions, and using our precise context-dependent mutation rates to predict long-term evolutionary patterns at synonymous sites. We also show that de novo mutations from independent MA experiments display similar patterns of single nucleotide mutation and well match the patterns of mutation found in natural populations.
Collapse
Affiliation(s)
- Zoe June Assaf
- Department of Genetics, Stanford University, Stanford, California 94305, USA.,Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Susanne Tilk
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Jane Park
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Mark L Siegal
- Department of Biology, New York University, New York, New York 10003, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
20
|
Lartillot N, Phillips MJ, Ronquist F. A mixed relaxed clock model. Philos Trans R Soc Lond B Biol Sci 2017; 371:rstb.2015.0132. [PMID: 27325829 PMCID: PMC4920333 DOI: 10.1098/rstb.2015.0132] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/29/2016] [Indexed: 12/13/2022] Open
Abstract
Over recent years, several alternative relaxed clock models have been proposed in the context of Bayesian dating. These models fall in two distinct categories: uncorrelated and autocorrelated across branches. The choice between these two classes of relaxed clocks is still an open question. More fundamentally, the true process of rate variation may have both long-term trends and short-term fluctuations, suggesting that more sophisticated clock models unfolding over multiple time scales should ultimately be developed. Here, a mixed relaxed clock model is introduced, which can be mechanistically interpreted as a rate variation process undergoing short-term fluctuations on the top of Brownian long-term trends. Statistically, this mixed clock represents an alternative solution to the problem of choosing between autocorrelated and uncorrelated relaxed clocks, by proposing instead to combine their respective merits. Fitting this model on a dataset of 105 placental mammals, using both node-dating and tip-dating approaches, suggests that the two pure clocks, Brownian and white noise, are rejected in favour of a mixed model with approximately equal contributions for its uncorrelated and autocorrelated components. The tip-dating analysis is particularly sensitive to the choice of the relaxed clock model. In this context, the classical pure Brownian relaxed clock appears to be overly rigid, leading to biases in divergence time estimation. By contrast, the use of a mixed clock leads to more recent and more reasonable estimates for the crown ages of placental orders and superorders. Altogether, the mixed clock introduced here represents a first step towards empirically more adequate models of the patterns of rate variation across phylogenetic trees.This article is part of the themed issue 'Dating species divergences using rocks and clocks'.
Collapse
Affiliation(s)
- Nicolas Lartillot
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, F-69622 Villeurbanne Cedex, France
| | - Matthew J Phillips
- School of Earth, Environmental and Biological Sciences, Queensland University of Technology, Brisbane, Australia
| | - Fredrik Ronquist
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, PO Box 50007, 104 05 Stockholm, Sweden
| |
Collapse
|
21
|
Evolutionary forces affecting synonymous variations in plant genomes. PLoS Genet 2017; 13:e1006799. [PMID: 28531201 PMCID: PMC5460877 DOI: 10.1371/journal.pgen.1006799] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Revised: 06/06/2017] [Accepted: 05/04/2017] [Indexed: 01/04/2023] Open
Abstract
Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow to jointly estimating the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones. In protein coding genes, base composition strongly varies within and among plant genomes, especially at positions where changes do not alter the coded protein (synonymous variations). Some species, such as the model plant Arabidopsis thaliana, are relatively GC-poor and homogeneous while others, such as grasses, are highly heterogeneous and GC-rich. The causes of these variations are still debated: are they mainly due to selective or neutral processes? Answering to this question is important to correctly infer whether variations in base composition may have functional roles or not. We extended a population genetics method to jointly estimate the different forces that may affect synonymous variations and applied it to genomic datasets in 11 flowering plant species. We found that GC-biased gene conversion, a neutral process associated with recombination that mimics selection by favouring G and C bases, is a widespread and stronger process than selection and that it could explain the large variation in base composition observed in plant genomes. Our results bear implications for analysing plant genomes and for correctly interpreting what could be functional or not.
Collapse
|
22
|
Bossert S, Murray EA, Blaimer BB, Danforth BN. The impact of GC bias on phylogenetic accuracy using targeted enrichment phylogenomic data. Mol Phylogenet Evol 2017; 111:149-157. [PMID: 28390323 DOI: 10.1016/j.ympev.2017.03.022] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2017] [Revised: 03/06/2017] [Accepted: 03/24/2017] [Indexed: 01/08/2023]
Abstract
The field of sequence based phylogenetic analyses is currently being transformed by novel hybrid-based targeted enrichment methods, such as the use of ultraconserved elements (UCEs). Rather than analyzing relationships among organisms using a small number of genes, these methods now allow us to evaluate relationships with many hundreds to thousands of individual gene loci. However, the inclusion of thousands of loci does not necessarily overcome the long-standing challenge of incongruence among phylogenetic trees derived from different genes or gene regions. One factor that impacts the level of incongruence in phylogenomic data sets is the level of GC bias. GC rich gene regions are prone to higher recombination rates than AT rich regions, driven by a process referred to as "GC biased gene conversion". As a result, high GC content can be negatively associated with phylogenetic accuracy, but the extent to which this impacts incongruence among UCEs is currently unstudied. We investigated the impact of GC content on phylogeny reconstruction using in silico captured UCE data for the corbiculate bees (Hymenoptera: Apidae). The phylogeny of this group has been the subject of extensive study, and incongruence among gene trees is thought to be a source of phylogenetic error. We conducted coalescent- and concatenation-based analyses of 810 individual gene loci from all 13 currently available bee genomes, including 8 corbiculate taxa. Both coalescent- and concatenation-based methods converged on a single topology for the corbiculate tribes. In contrast to concatenation, the coalescent-based methods revealed significant topological conflict at nodes involving the orchid bees (Euglossini) and honeybees (Apini). Partitioning the loci by GC content reveals decreasing support for the inferred topology with increasing GC bias. Based on the results of this study, we report the first evidence that GC biased gene conversion may contribute to topological incongruence in studies based on ultraconserved elements.
Collapse
Affiliation(s)
- Silas Bossert
- Department of Entomology, Cornell University, Ithaca, New York, USA.
| | | | - Bonnie B Blaimer
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Bryan N Danforth
- Department of Entomology, Cornell University, Ithaca, New York, USA
| |
Collapse
|
23
|
Dornburg A, Townsend JP, Brooks W, Spriggs E, Eytan RI, Moore JA, Wainwright PC, Lemmon A, Lemmon EM, Near TJ. New insights on the sister lineage of percomorph fishes with an anchored hybrid enrichment dataset. Mol Phylogenet Evol 2017; 110:27-38. [PMID: 28254474 DOI: 10.1016/j.ympev.2017.02.017] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Revised: 02/22/2017] [Accepted: 02/25/2017] [Indexed: 11/17/2022]
Abstract
Percomorph fishes represent over 17,100 species, including several model organisms and species of economic importance. Despite continuous advances in the resolution of the percomorph Tree of Life, resolution of the sister lineage to Percomorpha remains inconsistent but restricted to a small number of candidate lineages. Here we use an anchored hybrid enrichment (AHE) dataset of 132 loci with over 99,000 base pairs to identify the sister lineage of percomorph fishes. Initial analyses of this dataset failed to recover a strongly supported sister clade to Percomorpha, however, scrutiny of the AHE dataset revealed a bias towards high GC content at fast-evolving codon partitions (GC bias). By combining several existing approaches aimed at mitigating the impacts of convergence in GC bias, including RY coding and analyses of amino acids, we consistently recovered a strongly supported clade comprised of Holocentridae (squirrelfishes), Berycidae (Alfonsinos), Melamphaidae (bigscale fishes), Cetomimidae (flabby whalefishes), and Rondeletiidae (redmouth whalefishes) as the sister lineage to Percomorpha. Additionally, implementing phylogenetic informativeness (PI) based metrics as a filtration method yielded this same topology, suggesting PI based approaches will preferentially filter these fast-evolving regions and act in a manner consistent with other phylogenetic approaches aimed at mitigating GC bias. Our results provide a new perspective on a key issue for studies investigating the evolutionary history of more than one quarter of all living species of vertebrates.
Collapse
Affiliation(s)
- Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC, USA.
| | - Jeffrey P Townsend
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Biostatistics, Yale University, New Haven, CT 06510, USA
| | - Willa Brooks
- North Carolina Museum of Natural Sciences, Raleigh, NC, USA
| | - Elizabeth Spriggs
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA
| | - Ron I Eytan
- Marine Biology Department, Texas A&M University at Galveston, Galveston, TX 77554, USA
| | - Jon A Moore
- Florida Atlantic University, Wilkes Honors College, Jupiter, FL 33458, USA; Florida Atlantic University, Harbor Branch Oceanographic Institution, Fort Pierce, FL 34946, USA
| | - Peter C Wainwright
- Department of Evolution & Ecology, University of California, Davis, CA 95616, USA
| | - Alan Lemmon
- Department of Scientific Computing, Florida State University, 400 Dirac Science Library, Tallahassee, FL 32306, USA
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| | - Thomas J Near
- Department of Ecology & Evolutionary Biology and Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA; Peabody Museum of Natural History, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
24
|
Symonová R, Majtánová Z, Arias-Rodriguez L, Mořkovský L, Kořínková T, Cavin L, Pokorná MJ, Doležálková M, Flajšhans M, Normandeau E, Ráb P, Meyer A, Bernatchez L. Genome Compositional Organization in Gars Shows More Similarities to Mammals than to Other Ray-Finned Fish. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2016; 328:607-619. [DOI: 10.1002/jez.b.22719] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Revised: 11/13/2016] [Accepted: 11/22/2016] [Indexed: 12/12/2022]
Affiliation(s)
- Radka Symonová
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
- Department of Zoology; Faculty of Science; Charles University; Prague 2 Czech Republic
- Research Institute for Limnology; University of Innsbruck; Mondsee Austria
| | - Zuzana Majtánová
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
- Department of Zoology; Faculty of Science; Charles University; Prague 2 Czech Republic
| | - Lenin Arias-Rodriguez
- División Académica de Ciencias Biológicas; Universidad Juárez Autónoma de Tabasco (UJAT); Villahermosa Tabasco México
| | - Libor Mořkovský
- Department of Zoology; Faculty of Science; Charles University; Prague 2 Czech Republic
| | - Tereza Kořínková
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
| | - Lionel Cavin
- Muséum d'Histoire Naturelle; Geneva 6 Switzerland
| | - Martina Johnson Pokorná
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
- Department of Ecology; Faculty of Science; Charles University; Prague 2 Czech Republic
| | - Marie Doležálková
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
- Department of Zoology; Faculty of Science; Charles University; Prague 2 Czech Republic
| | - Martin Flajšhans
- Faculty of Fisheries and Protection of Waters; South Bohemian Research Centre of Aquaculture and Biodiversity of Hydrocenoses; University of South Bohemia in České Budějovice; Vodňany Czech Republic
| | - Eric Normandeau
- IBIS, Department of Biology, University Laval, Pavillon Charles-Eugène-Marchand; Avenue de la Médecine Quebec City; Canada
| | - Petr Ráb
- Laboratory of Fish Genetics; Institute of Animal Physiology and Genetics; The Czech Academy of Sciences; Liběchov Czech Republic
| | - Axel Meyer
- Chair in Zoology and Evolutionary Biology; Department of Biology; University of Konstanz; Konstanz Germany
| | - Louis Bernatchez
- IBIS, Department of Biology, University Laval, Pavillon Charles-Eugène-Marchand; Avenue de la Médecine Quebec City; Canada
| |
Collapse
|
25
|
Ghenu AH, Bolker BM, Melnick DJ, Evans BJ. Multicopy gene family evolution on primate Y chromosomes. BMC Genomics 2016; 17:157. [PMID: 26925773 PMCID: PMC4772468 DOI: 10.1186/s12864-015-2187-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2015] [Accepted: 11/02/2015] [Indexed: 12/12/2022] Open
Abstract
Background The primate Y chromosome is distinguished by a lack of inter-chromosomal recombination along most of its length, extensive gene loss, and a prevalence of repetitive elements. A group of genes on the male-specific portion of the Y chromosome known as the “ampliconic genes” are present in multiple copies that are sometimes part of palindromes, and that undergo a form of intra-chromosomal recombination called gene conversion, wherein the nucleotides of one copy are homogenized by those of another. With the aim of further understanding gene family evolution of these genes, we collected nucleotide sequence and gene copy number information for several species of papionin monkey. We then tested for evidence of gene conversion, and developed a novel statistical framework to evaluate alternative models of gene family evolution using our data combined with other information from a human, a chimpanzee, and a rhesus macaque. Results Our results (i) recovered evidence for several novel examples of gene conversion in papionin monkeys and indicate that (ii) ampliconic gene families evolve faster than autosomal gene families and than single-copy genes on the Y chromosome and that (iii) Y-linked singleton and autosomal gene families evolved faster in humans and chimps than they do in the other Old World Monkey lineages we studied. Conclusions Rapid evolution of ampliconic genes cannot be attributed solely to residence on the Y chromosome, nor to variation between primate lineages in the rate of gene family evolution. Instead other factors, such as natural selection and gene conversion, appear to play a role in driving temporal and genomic evolutionary heterogeneity in primate gene families. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2187-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana-Hermina Ghenu
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| | - Benjamin M Bolker
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.,Department of Mathematics & Statistics, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada
| | - Don J Melnick
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, 10th Floor Schermerhorn Extension, New York, 10027, USA
| | - Ben J Evans
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| |
Collapse
|
26
|
Mugal CF, Weber CC, Ellegren H. GC-biased gene conversion links the recombination landscape and demography to genomic base composition. Bioessays 2015; 37:1317-26. [DOI: 10.1002/bies.201500058] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Carina F. Mugal
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| | - Claudia C. Weber
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
- Department of Biology; Center for Computational Genetics and Genomics; Temple University; Philadelphia PA USA
| | - Hans Ellegren
- Department of Evolutionary Biology; Evolutionary Biology Centre; Uppsala University; Uppsala Sweden
| |
Collapse
|
27
|
Bolívar P, Mugal CF, Nater A, Ellegren H. Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill-Robertson Interference, in an Avian System. Mol Biol Evol 2015; 33:216-27. [PMID: 26446902 PMCID: PMC4693978 DOI: 10.1093/molbev/msv214] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The ratio of nonsynonymous to synonymous substitution rates (ω) is often used to measure the strength of natural selection. However, ω may be influenced by linkage among different targets of selection, that is, Hill–Robertson interference (HRI), which reduces the efficacy of selection. Recombination modulates the extent of HRI but may also affect ω by means of GC-biased gene conversion (gBGC), a process leading to a preferential fixation of G:C (“strong,” S) over A:T (“weak,” W) alleles. As HRI and gBGC can have opposing effects on ω, it is essential to understand their relative impact to make proper inferences of ω. We used a model that separately estimated S-to-S, S-to-W, W-to-S, and W-to-W substitution rates in 8,423 avian genes in the Ficedula flycatcher lineage. We found that the W-to-S substitution rate was positively, and the S-to-W rate negatively, correlated with recombination rate, in accordance with gBGC but not predicted by HRI. The W-to-S rate further showed the strongest impact on both dN and dS. However, since the effects were stronger at 4-fold than at 0-fold degenerated sites, likely because the GC content of these sites is farther away from its equilibrium, ω slightly decreases with increasing recombination rate, which could falsely be interpreted as a consequence of HRI. We corroborated this hypothesis analytically and demonstrate that under particular conditions, ω can decrease with increasing recombination rate. Analyses of the site-frequency spectrum showed that W-to-S mutations were skewed toward high, and S-to-W mutations toward low, frequencies, consistent with a prevalent gBGC-driven fixation bias.
Collapse
Affiliation(s)
- Paulina Bolívar
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Carina F Mugal
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Alexander Nater
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
28
|
The relationship of recombination rate, genome structure, and patterns of molecular evolution across angiosperms. BMC Evol Biol 2015; 15:194. [PMID: 26377000 PMCID: PMC4574184 DOI: 10.1186/s12862-015-0473-3] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 09/01/2015] [Indexed: 12/31/2022] Open
Abstract
Background Although homologous recombination affects the efficacy of selection in populations, the pattern of recombination rate evolution and its effects on genome evolution across plants are largely unknown. Recombination can reduce genome size by enabling the removal of LTR retrotransposons, alter codon usage by GC biased gene conversion, contribute to complex histories of gene duplication and loss through tandem duplication, and enhance purifying selection on genes. Therefore, variation in recombination rate across species may explain some of the variation in genomic architecture as well as rates of molecular evolution. We used phylogenetic comparative methods to investigate the evolution of global meiotic recombination rate in angiosperms and its effects on genome architecture and selection at the molecular level using genetic maps and genome sequences from thirty angiosperm species. Results Recombination rate is negatively correlated with genome size, which is likely caused by the removal of LTR retrotransposons. After correcting recombination rates for euchromatin content, we also found an association between global recombination rate and average gene family size. This suggests a role for recombination in the preservation of duplicate genes or expansion of gene families. An analysis of the correlation between the ratio of nonsynonymous to synonymous substitution rates (dN/dS) and recombination rate in 3748 genes indicates that higher recombination rates are associated with an increased efficacy of purifying selection, suggesting that global recombination rates affect variation in rates of molecular evolution across distantly related angiosperm species, not just between populations. We also identified shifts in dN/dS for recombination proteins that are associated with shifts in global recombination rate across our sample of angiosperms. Conclusions Although our analyses only reveal correlations, not mechanisms, and do not include potential covariates of recombination rate, like effective population size, they suggest that global recombination rates may play an important role in shaping the macroevolutionary patterns of gene and genome evolution in plants. Interspecific recombination rate variation is tightly correlated with genome size as well as variation in overall LTR retrotransposon abundances. Recombination may shape gene-to-gene variation in dN/dS between species, which might impact the overall gene duplication and loss rates. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0473-3) contains supplementary material, which is available to authorized users.
Collapse
|
29
|
Simonti CN, Capra JA. The evolution of the human genome. Curr Opin Genet Dev 2015; 35:9-15. [PMID: 26338498 DOI: 10.1016/j.gde.2015.08.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 08/08/2015] [Accepted: 08/12/2015] [Indexed: 02/05/2023]
Abstract
Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick.
Collapse
Affiliation(s)
- Corinne N Simonti
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
30
|
Kapheim KM, Pan H, Li C, Salzberg SL, Puiu D, Magoc T, Robertson HM, Hudson ME, Venkat A, Fischman BJ, Hernandez A, Yandell M, Ence D, Holt C, Yocum GD, Kemp WP, Bosch J, Waterhouse RM, Zdobnov EM, Stolle E, Kraus FB, Helbing S, Moritz RFA, Glastad KM, Hunt BG, Goodisman MAD, Hauser F, Grimmelikhuijzen CJP, Pinheiro DG, Nunes FMF, Soares MPM, Tanaka ÉD, Simões ZLP, Hartfelder K, Evans JD, Barribeau SM, Johnson RM, Massey JH, Southey BR, Hasselmann M, Hamacher D, Biewer M, Kent CF, Zayed A, Blatti C, Sinha S, Johnston JS, Hanrahan SJ, Kocher SD, Wang J, Robinson GE, Zhang G. Social evolution. Genomic signatures of evolutionary transitions from solitary to group living. Science 2015; 348:1139-43. [PMID: 25977371 PMCID: PMC5471836 DOI: 10.1126/science.aaa4788] [Citation(s) in RCA: 239] [Impact Index Per Article: 23.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Accepted: 05/06/2015] [Indexed: 12/14/2022]
Abstract
The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks.
Collapse
Affiliation(s)
- Karen M Kapheim
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Biology, Utah State University, Logan, UT 84322, USA.
| | - Hailin Pan
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518083, China
| | - Cai Li
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518083, China. Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, 1350, Denmark
| | - Steven L Salzberg
- Departments of Biomedical Engineering, Computer Science, and Biostatistics, Johns Hopkins University, Baltimore, MD 21218, USA. Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Daniela Puiu
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Tanja Magoc
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Hugh M Robertson
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Matthew E Hudson
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Aarti Venkat
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Brielle J Fischman
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Program in Ecology and Evolutionary Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Biology, Hobart and William Smith Colleges, Geneva, NY 14456, USA
| | - Alvaro Hernandez
- Roy J. Carver Biotechnology Center, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Mark Yandell
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA. USTAR Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA
| | - Daniel Ence
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Carson Holt
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA. USTAR Center for Genetic Discovery, University of Utah, Salt Lake City, UT 84112, USA
| | - George D Yocum
- U.S. Department of Agriculture-Agricultural Research Service (USDA-ARS) Red River Valley Agricultural Research Center, Biosciences Research Laboratory, Fargo, ND 58102, USA
| | - William P Kemp
- U.S. Department of Agriculture-Agricultural Research Service (USDA-ARS) Red River Valley Agricultural Research Center, Biosciences Research Laboratory, Fargo, ND 58102, USA
| | - Jordi Bosch
- Center for Ecological Research and Forestry Applications (CREAF), Universitat Autonoma de Barcelona, 08193 Bellaterra, Spain
| | - Robert M Waterhouse
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland. Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA. The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Evgeny M Zdobnov
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland. Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Eckart Stolle
- Institute of Biology, Department Zoology, Martin-Luther-University Halle-Wittenberg, Hoher Weg 4, D-06099 Halle (Saale), Germany. Queen Mary University of London, School of Biological and Chemical Sciences Organismal Biology Research Group, London E1 4NS, UK
| | - F Bernhard Kraus
- Institute of Biology, Department Zoology, Martin-Luther-University Halle-Wittenberg, Hoher Weg 4, D-06099 Halle (Saale), Germany. Department of Laboratory Medicine, University Hospital Halle, Ernst Grube Strasse 40, D-06120 Halle (Saale), Germany
| | - Sophie Helbing
- Institute of Biology, Department Zoology, Martin-Luther-University Halle-Wittenberg, Hoher Weg 4, D-06099 Halle (Saale), Germany
| | - Robin F A Moritz
- Institute of Biology, Department Zoology, Martin-Luther-University Halle-Wittenberg, Hoher Weg 4, D-06099 Halle (Saale), Germany. German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, 04103 Leipzig, Germany
| | - Karl M Glastad
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Brendan G Hunt
- Department of Entomology, University of Georgia, Griffin, GA 30223, USA
| | | | - Frank Hauser
- Center for Functional and Comparative Insect Genomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Cornelis J P Grimmelikhuijzen
- Center for Functional and Comparative Insect Genomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Daniel Guariz Pinheiro
- Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil. Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP), 14884-900 Jaboticabal, SP, Brazil
| | - Francis Morais Franco Nunes
- Departamento de Genética e Evolução, Centro de Ciências Biológicas e da Saúde, Universidade Federal de São Carlos, 13565-905 São Carlos, SP, Brazil
| | - Michelle Prioli Miranda Soares
- Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil
| | - Érica Donato Tanaka
- Departamento de Genética, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, 14049-900 Ribeirão Preto, SP, Brazil
| | - Zilá Luz Paulino Simões
- Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, 14040-901 Ribeirão Preto, SP, Brazil
| | - Klaus Hartfelder
- Departamento de Biologia Celular e Molecular e Bioagentes Patogênicos, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, 14049-900 Ribeirão Preto, SP, Brazil
| | - Jay D Evans
- USDA-ARS Bee Research Lab, Beltsville, MD 20705 USA
| | - Seth M Barribeau
- Department of Biology, East Carolina University, Greenville, NC 27858, USA
| | - Reed M Johnson
- Department of Entomology, Ohio Agricultural Research and Development Center, Ohio State University, Wooster, OH 44691, USA
| | - Jonathan H Massey
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Bruce R Southey
- Department of Animal Sciences, University of Illinois, Urbana, IL 61801, USA
| | - Martin Hasselmann
- Department of Population Genomics, Institute of Animal Husbandry and Animal Breeding, University of Hohenheim, Germany
| | - Daniel Hamacher
- Department of Population Genomics, Institute of Animal Husbandry and Animal Breeding, University of Hohenheim, Germany
| | - Matthias Biewer
- Department of Population Genomics, Institute of Animal Husbandry and Animal Breeding, University of Hohenheim, Germany
| | - Clement F Kent
- Department of Biology, York University, Toronto, ON M3J 1P3, Canada. Janelia Farm Research Campus, Howard Hughes Medical Institue, Ashburn, VA 20147, USA
| | - Amro Zayed
- Department of Biology, York University, Toronto, ON M3J 1P3, Canada
| | - Charles Blatti
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - J Spencer Johnston
- Department of Entomology, Texas A&M University, College Station, TX 77843, USA
| | - Shawn J Hanrahan
- Department of Entomology, Texas A&M University, College Station, TX 77843, USA
| | - Sarah D Kocher
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Jun Wang
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518083, China. Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark. Princess Al Jawhara Center of Excellence in the Research of Hereditary Disorders, King Abdulaziz University, Jeddah 21589, Saudi Arabia. Macau University of Science and Technology, Avenida Wai long, Taipa, Macau 999078, China. Department of Medicine, University of Hong Kong, Hong Kong.
| | - Gene E Robinson
- Carl R. WoeseInstitute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Center for Advanced Study Professor in Entomology and Neuroscience, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
| | - Guojie Zhang
- China National GeneBank, BGI-Shenzhen, Shenzhen, 518083, China. Centre for Social Evolution, Department of Biology, Universitetsparken 15, University of Copenhagen, DK-2100 Copenhagen, Denmark.
| |
Collapse
|
31
|
Glémin S, Arndt PF, Messer PW, Petrov D, Galtier N, Duret L. Quantification of GC-biased gene conversion in the human genome. Genome Res 2015; 25:1215-28. [PMID: 25995268 PMCID: PMC4510005 DOI: 10.1101/gr.185488.114] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Accepted: 05/18/2015] [Indexed: 11/25/2022]
Abstract
Much evidence indicates that GC-biased gene conversion (gBGC) has a major impact on the evolution of mammalian genomes. However, a detailed quantification of the process is still lacking. The strength of gBGC can be measured from the analysis of derived allele frequency spectra (DAF), but this approach is sensitive to a number of confounding factors. In particular, we show by simulations that the inference is pervasively affected by polymorphism polarization errors and by spatial heterogeneity in gBGC strength. We propose a new general method to quantify gBGC from DAF spectra, incorporating polarization errors, taking spatial heterogeneity into account, and jointly estimating mutation bias. Applying it to human polymorphism data from the 1000 Genomes Project, we show that the strength of gBGC does not differ between hypermutable CpG sites and non-CpG sites, suggesting that in humans gBGC is not caused by the base-excision repair machinery. Genome-wide, the intensity of gBGC is in the nearly neutral area. However, given that recombination occurs primarily within recombination hotspots, 1%–2% of the human genome is subject to strong gBGC. On average, gBGC is stronger in African than in non-African populations, reflecting differences in effective population sizes. However, due to more heterogeneous recombination landscapes, the fraction of the genome affected by strong gBGC is larger in non-African than in African populations. Given that the location of recombination hotspots evolves very rapidly, our analysis predicts that, in the long term, a large fraction of the genome is affected by short episodes of strong gBGC.
Collapse
Affiliation(s)
- Sylvain Glémin
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France; Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Peter F Arndt
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Philipp W Messer
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
| | - Dmitri Petrov
- Department of Biology, Stanford University, Stanford, California 94305-5020, USA
| | - Nicolas Galtier
- Institut des Sciences de l'Evolution (ISEM - UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), 34095 Montpellier, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Lyon 1, 69622 Villeurbanne, France
| |
Collapse
|
32
|
Wallberg A, Glémin S, Webster MT. Extreme recombination frequencies shape genome variation and evolution in the honeybee, Apis mellifera. PLoS Genet 2015; 11:e1005189. [PMID: 25902173 PMCID: PMC4406589 DOI: 10.1371/journal.pgen.1005189] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Accepted: 04/01/2015] [Indexed: 01/10/2023] Open
Abstract
Meiotic recombination is a fundamental cellular process, with important consequences for evolution and genome integrity. However, we know little about how recombination rates vary across the genomes of most species and the molecular and evolutionary determinants of this variation. The honeybee, Apis mellifera, has extremely high rates of meiotic recombination, although the evolutionary causes and consequences of this are unclear. Here we use patterns of linkage disequilibrium in whole genome resequencing data from 30 diploid honeybees to construct a fine-scale map of rates of crossing over in the genome. We find that, in contrast to vertebrate genomes, the recombination landscape is not strongly punctate. Crossover rates strongly correlate with levels of genetic variation, but not divergence, which indicates a pervasive impact of selection on the genome. Germ-line methylated genes have reduced crossover rate, which could indicate a role of methylation in suppressing recombination. Controlling for the effects of methylation, we do not infer a strong association between gene expression patterns and recombination. The site frequency spectrum is strongly skewed from neutral expectations in honeybees: rare variants are dominated by AT-biased mutations, whereas GC-biased mutations are found at higher frequencies, indicative of a major influence of GC-biased gene conversion (gBGC), which we infer to generate an allele fixation bias 5 – 50 times the genomic average estimated in humans. We uncover further evidence that this repair bias specifically affects transitions and favours fixation of CpG sites. Recombination, via gBGC, therefore appears to have profound consequences on genome evolution in honeybees and interferes with the process of natural selection. These findings have important implications for our understanding of the forces driving molecular evolution. Evolution results from changes in allele frequencies in populations. The main forces that cause such changes are natural selection and random genetic drift. However, an additional process, GC-biased gene conversion (gBGC), associated with meiotic recombination, affects the probability that alleles are passed from one generation to the next. The honeybee, Apis mellifera, has extremely high recombination rates—more than 20 times to those observed in humans. However, the reason for this is unknown and the effects of such high recombination rates on evolution are not well understood. Here we use patterns of genetic variation in the genomes of 30 honeybees to infer variation in the rate of recombination across the genome. We find that recombination rates and levels of genetic variation are strongly correlated, which is indicative of a pervasive impact of natural selection on genetic variation. We also infer a major role of DNA methylation in determining recombination rates in genes. Patterns of genetic variation appear to be strongly skewed due to the effects of gBGC, suggesting that recombination generates a bias in transmission of alleles during meiosis. This process seems to be interfering with the efficacy of selection at removing deleterious alleles and favouring beneficial ones. Recombination therefore has a huge impact on genetic variation and evolution in honeybees and appears to play a dominant role in genome evolution.
Collapse
Affiliation(s)
- Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Sylvain Glémin
- Institut des Sciences de l’Evolution (ISEM—UMR 5554 Université de Montpellier-CNRS-IRD-EPHE), France
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Matthew T. Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- * E-mail:
| |
Collapse
|
33
|
Evolutionary consequences of DNA methylation on the GC content in vertebrate genomes. G3-GENES GENOMES GENETICS 2015; 5:441-7. [PMID: 25591920 PMCID: PMC4349097 DOI: 10.1534/g3.114.015545] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The genomes of many vertebrates show a characteristic variation in GC content. To explain its origin and evolution, mainly three mechanisms have been proposed: selection for GC content, mutation bias, and GC-biased gene conversion. At present, the mechanism of GC-biased gene conversion, i.e., short-scale, unidirectional exchanges between homologous chromosomes in the neighborhood of recombination-initiating double-strand breaks in favor for GC nucleotides, is the most widely accepted hypothesis. We here suggest that DNA methylation also plays an important role in the evolution of GC content in vertebrate genomes. To test this hypothesis, we investigated one mammalian (human) and one avian (chicken) genome. We used bisulfite sequencing to generate a whole-genome methylation map of chicken sperm and made use of a publicly available whole-genome methylation map of human sperm. Inclusion of these methylation maps into a model of GC content evolution provided significant support for the impact of DNA methylation on the local equilibrium GC content. Moreover, two different estimates of equilibrium GC content, one that neglects and one that incorporates the impact of DNA methylation and the concomitant CpG hypermutability, give estimates that differ by approximately 15% in both genomes, arguing for a strong impact of DNA methylation on the evolution of GC content. Thus, our results put forward that previous estimates of equilibrium GC content, which neglect the hypermutability of CpG dinucleotides, need to be reevaluated.
Collapse
|
34
|
Nikbakht H, Xia X, Hickey DA. The evolution of genomic GC content undergoes a rapid reversal within the genus Plasmodium. Genome 2015; 57:507-11. [PMID: 25633864 DOI: 10.1139/gen-2014-0158] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The genome of the malarial parasite Plasmodium falciparum is extremely AT rich. This bias toward a low GC content is a characteristic of several, but not all, species within the genus Plasmodium. We compared 4283 orthologous pairs of protein-coding sequences between Plasmodium falciparum and the less AT-biased Plasmodium vivax. Our results indicate that the common ancestor of these two species was also extremely AT rich. This means that, although there was a strong bias toward A+T during the early evolution of the ancestral Plasmodium lineage, there was a subsequent reversal of this trend during the more recent evolution of some species, such as P. vivax. Moreover, we show that not only is the P. vivax genome losing its AT richness, it is actually gaining a very significant degree of GC richness. This example illustrates the potential volatility of nucleotide content during the course of molecular evolution. Such reversible fluxes in nucleotide content within lineages could have important implications for phylogenetic reconstruction based on molecular sequence data.
Collapse
Affiliation(s)
- Hamid Nikbakht
- a Department of Biology, Concordia University, Montreal, QC H4B 1R6, Canada
| | | | | |
Collapse
|
35
|
Clément Y, Fustier MA, Nabholz B, Glémin S. The bimodal distribution of genic GC content is ancestral to monocot species. Genome Biol Evol 2014; 7:336-48. [PMID: 25527839 PMCID: PMC4316631 DOI: 10.1093/gbe/evu278] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
In grasses such as rice or maize, the distribution of genic GC content is well known to be bimodal. It is mainly driven by GC content at third codon positions (GC3 for short). This feature is thought to be specific to grasses as closely related species like banana have a unimodal GC3 distribution. GC3 is associated with numerous genomics features and uncovering the origin of this peculiar distribution will help understanding the potential roles and consequences of GC3 variations within and between genomes. Until recently, the origin of the peculiar GC3 distribution in grasses has remained unknown. Thanks to the recent publication of several complete genomes and transcriptomes of nongrass monocots, we studied more than 1,000 groups of one-to-one orthologous genes in seven grasses and three outgroup species (banana, palm tree, and yam). Using a maximum likelihood-based method, we reconstructed GC3 at several ancestral nodes. We found that the bimodal GC3 distribution observed in extant grasses is ancestral to both grasses and most monocot species, and that other species studied here have lost this peculiar structure. We also found that GC3 in grass lineages is globally evolving very slowly and that the decreasing GC3 gradient observed from 5′ to 3′ along coding sequences is also conserved and ancestral to monocots. This result strongly challenges the previous views on the specificity of grass genomes and we discuss its implications for the possible causes of the evolution of GC content in monocots.
Collapse
Affiliation(s)
- Yves Clément
- Montpellier SupAgro, Unité Mixte de Recherche 1334, Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, Montpellier, France Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | | | - Benoit Nabholz
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution de Montpellier, Unité Mixte de Recherche 5554, Centre National de la Recherche Scientifique, Université Montpellier, France
| |
Collapse
|
36
|
Figuet E, Ballenghien M, Romiguier J, Galtier N. Biased gene conversion and GC-content evolution in the coding sequences of reptiles and vertebrates. Genome Biol Evol 2014; 7:240-50. [PMID: 25527834 PMCID: PMC4316630 DOI: 10.1093/gbe/evu277] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins.
Collapse
Affiliation(s)
- Emeric Figuet
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| | - Marion Ballenghien
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| | - Jonathan Romiguier
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France Department of Ecology and Evolution, Biophore, University of Lausanne, Switzerland
| | - Nicolas Galtier
- CNRS, Université Montpellier 2, UMR 5554, Institut des Sciences de l'Evolution de Montpellier, France
| |
Collapse
|
37
|
Lachance J, Tishkoff SA. Biased gene conversion skews allele frequencies in human populations, increasing the disease burden of recessive alleles. Am J Hum Genet 2014; 95:408-20. [PMID: 25279983 PMCID: PMC4185123 DOI: 10.1016/j.ajhg.2014.09.008] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2014] [Revised: 08/21/2014] [Accepted: 09/10/2014] [Indexed: 10/25/2022] Open
Abstract
Gene conversion results in the nonreciprocal transfer of genetic information between two recombining sequences, and there is evidence that this process is biased toward G and C alleles. However, the strength of GC-biased gene conversion (gBGC) in human populations and its effects on hereditary disease have yet to be assessed on a genomic scale. Using high-coverage whole-genome sequences of African hunter-gatherers, agricultural populations, and primate outgroups, we quantified the effects of GC-biased gene conversion on population genomic data sets. We find that genetic distances (FST and population branch statistics) are modified by gBGC. In addition, the site frequency spectrum is left-shifted when ancestral alleles are favored by gBGC and right-shifted when derived alleles are favored by gBGC. Allele frequency shifts due to gBGC mimic the effects of natural selection. As expected, these effects are strongest in high-recombination regions of the human genome. By comparing the relative rates of fixation of unbiased and biased sites, the strength of gene conversion was estimated to be on the order of Nb ≈ 0.05 to 0.09. We also find that derived alleles favored by gBGC are much more likely to be homozygous than derived alleles at unbiased SNPs (+42.2% to 62.8%). This results in a curse of the converted, whereby gBGC causes substantial increases in hereditary disease risks. Taken together, our findings reveal that GC-biased gene conversion has important population genetic and public health implications.
Collapse
MESH Headings
- Bias
- Evolution, Molecular
- Gene Conversion
- Gene Frequency
- Genes, Recessive/genetics
- Genetic Diseases, Inborn/genetics
- Genetics, Population
- Genome, Human/genetics
- Humans
- Models, Genetic
- Models, Theoretical
- Polymorphism, Single Nucleotide/genetics
- Recombination, Genetic
- Selection, Genetic/genetics
Collapse
Affiliation(s)
- Joseph Lachance
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Sarah A Tishkoff
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
38
|
Praz CJ, Packer L. Phylogenetic position of the bee genera Ancyla and Tarsalia (Hymenoptera: Apidae): a remarkable base compositional bias and an early Paleogene geodispersal from North America to the Old World. Mol Phylogenet Evol 2014; 81:258-70. [PMID: 25238948 DOI: 10.1016/j.ympev.2014.09.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Revised: 09/05/2014] [Accepted: 09/09/2014] [Indexed: 11/24/2022]
Abstract
We address the phylogenetic position of the bee genera Tarsalia and Ancyla (currently forming the tribe Ancylaini) on the basis of morphological, molecular and combined data. We assembled a matrix of 309 morphological characters and 5246 aligned nucleotide positions from six nuclear genes (28S, EF-1a, wingless, POL2, LW-Rhodopsin, NAK). In addition to both constituent genera of Ancylaini, we include all three subtribes of the Eucerini as well as a large number of other tribes from the "eucerine line". The morphological data suggest Ancyla to be sister to Tarsalia+Eucerini and analyses of the entire molecular dataset suggest Tarsalia to be sister to Ancyla+Eucerini. However, analyses of the combined dataset suggests the Ancylaini to be monophyletic. We address possible bias within the molecular data and show that the base composition of two markers (EF-1a and NAK) is significantly heterogeneous among taxa and that this heterogeneity is strong enough to overcome the phylogenetic signal from the other markers. Analyses of a molecular matrix where the heterogeneous partitions have been RY-recoded yield trees that are better resolved and have higher nodal support values than those recovered in analyses of the non-recoded matrix, and strongly suggest the Ancylaini to be a monophyletic sister group to the Eucerini. A dated phylogeny and ancestral range reconstructions suggest that the common ancestor of the Ancylaini reached the Old World from the New World most probably via the Thulean Land Bridge in a time window between 69 and 47 mya, a period that includes the Early Eocene Climatic Optimum. No further exchanges between the New World and the Old World are implied by our data until the period between 22 mya and 13.9 mya. These more recent faunal exchanges probably involved geodispersal over the Bering Land Bridge by less thermophilic lineages.
Collapse
Affiliation(s)
- Christophe J Praz
- Institute of Biology, University of Neuchatel, Emile-Argand 11, 2000 Neuchatel, Switzerland.
| | - Laurence Packer
- Department of Biology, York University, 4700 Keele St., Toronto, ON M3J 1P3, Canada
| |
Collapse
|
39
|
Abstract
MOTIVATION Brownian models have been introduced in phylogenetics for describing variation in substitution rates through time, with applications to molecular dating or to the comparative analysis of variation in substitution patterns among lineages. Thus far, however, the Monte Carlo implementations of these models have relied on crude approximations, in which the Brownian process is sampled only at the internal nodes of the phylogeny or at the midpoints along each branch, and the unknown trajectory between these sampled points is summarized by simple branchwise average substitution rates. RESULTS A more accurate Monte Carlo approach is introduced, explicitly sampling a fine-grained discretization of the trajectory of the (potentially multivariate) Brownian process along the phylogeny. Generic Monte Carlo resampling algorithms are proposed for updating the Brownian paths along and across branches. Specific computational strategies are developed for efficient integration of the finite-time substitution probabilities across branches induced by the Brownian trajectory. The mixing properties and the computational complexity of the resulting Markov chain Monte Carlo sampler scale reasonably with the discretization level, allowing practical applications with up to a few hundred discretization points along the entire depth of the tree. The method can be generalized to other Markovian stochastic processes, making it possible to implement a wide range of time-dependent substitution models with well-controlled computational precision. AVAILABILITY The program is freely available at www.phylobayes.org.
Collapse
Affiliation(s)
- Benjamin Horvilleur
- Université de Lyon, Université Lyon 1, CNRS; UMR 5558, Laboratoire de Biométrie, Biologie Évolutive, F-69622 Villeurbanne, France
| | - Nicolas Lartillot
- Université de Lyon, Université Lyon 1, CNRS; UMR 5558, Laboratoire de Biométrie, Biologie Évolutive, F-69622 Villeurbanne, France
| |
Collapse
|
40
|
Evans BJ, Zeng K, Esselstyn JA, Charlesworth B, Melnick DJ. Reduced representation genome sequencing suggests low diversity on the sex chromosomes of tonkean macaque monkeys. Mol Biol Evol 2014; 31:2425-40. [PMID: 24987106 DOI: 10.1093/molbev/msu197] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
In species with separate sexes, social systems can differ in the relative variances of male versus female reproductive success. Papionin monkeys (macaques, mangabeys, mandrills, drills, baboons, and geladas) exhibit hallmarks of a high variance in male reproductive success, including a female-biased adult sex ratio and prominent sexual dimorphism. To explore the potential genomic consequences of such sex differences, we used a reduced representation genome sequencing approach to quantifying polymorphism at sites on autosomes and sex chromosomes of the tonkean macaque (Macaca tonkeana), a species endemic to the Indonesian island of Sulawesi. The ratio of nucleotide diversity of the X chromosome to that of the autosomes was less than the value (0.75) expected with a 1:1 sex ratio and no sex differences in the variance in reproductive success. However, the significance of this difference was dependent on which outgroup was used to standardize diversity levels. Using a new model that includes the effects of varying population size, sex differences in mutation rate between the autosomes and X chromosome, and GC-biased gene conversion (gBGC) or selection on GC content, we found that the maximum-likelihood estimate of the ratio of effective population size of the X chromosome to that of the autosomes was 0.68, which did not differ significantly from 0.75. We also found evidence for 1) a higher level of purifying selection on genic than nongenic regions, 2) gBGC or natural selection favoring increased GC content, 3) a dynamic demography characterized by population growth and contraction, 4) a higher mutation rate in males than females, and 5) a very low polymorphism level on the Y chromosome. These findings shed light on the population genomic consequences of sex differences in the variance in reproductive success, which appear to be modest in the tonkean macaque; they also suggest the occurrence of hitchhiking on the Y chromosome.
Collapse
Affiliation(s)
- Ben J Evans
- Biology Department, McMaster University, Hamilton, ON, Canada
| | - Kai Zeng
- Department of Animal and Plant Sciences, Alfred Denny Building, University of Sheffield, Sheffield, United Kingdom
| | - Jacob A Esselstyn
- Department of Biological Sciences and Museum of Natural Science, Louisiana State University
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Don J Melnick
- Department of Ecology, Evolution, and Environmental Biology, Columbia University
| |
Collapse
|
41
|
Figuet E, Romiguier J, Dutheil JY, Galtier N. Mitochondrial DNA as a tool for reconstructing past life-history traits in mammals. J Evol Biol 2014; 27:899-910. [PMID: 24720883 DOI: 10.1111/jeb.12361] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 02/27/2014] [Accepted: 02/28/2014] [Indexed: 12/23/2022]
Abstract
Reconstructing the ancestral characteristics of species is a major goal in evolutionary and comparative biology. Unfortunately, fossils are not always available and sufficiently informative, and phylogenetic methods based on models of character evolution can be unsatisfactory. Genomic data offer a new opportunity to estimate ancestral character states, through (i) the correlation between DNA evolutionary processes and species life-history traits and (ii) available reliable methods for ancestral sequence inference. Here, we assess the relevance of mitochondrial DNA--the most popular molecular marker in animals--as a predictor of ancestral life-history traits in mammals, using the order of Cetartiodactyla as a benchmark. Using the complete set of 13 mitochondrial protein-coding genes, we show that the lineage-specific nonsynonymous over synonymous substitution rate ratio (dN/dS) is closely correlated with the species body mass, longevity and age of sexual maturity in Cetartiodactyla and can be used as a marker of ancestral traits provided that the noise introduced by short branches is appropriately dealt with. Based on ancestral dN/dS estimates, we predict that the first cetartiodactyls were relatively small animals (around 20 kg). This finding is in accordance with Cope's rule and the fossil record but could not be recovered via continuous character evolution methods.
Collapse
Affiliation(s)
- E Figuet
- UMR 5554, ISEM, CNRS, Université Montpellier 2, Montpellier, France
| | | | | | | |
Collapse
|
42
|
Douzery EJP, Scornavacca C, Romiguier J, Belkhir K, Galtier N, Delsuc F, Ranwez V. OrthoMaM v8: A Database of Orthologous Exons and Coding Sequences for Comparative Genomics in Mammals. Mol Biol Evol 2014; 31:1923-8. [DOI: 10.1093/molbev/msu132] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
|
43
|
Ellegren H. Genome sequencing and population genomics in non-model organisms. Trends Ecol Evol 2014; 29:51-63. [DOI: 10.1016/j.tree.2013.09.008] [Citation(s) in RCA: 392] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2013] [Revised: 09/02/2013] [Accepted: 09/16/2013] [Indexed: 12/20/2022]
|
44
|
Weber CC, Boussau B, Romiguier J, Jarvis ED, Ellegren H. Evidence for GC-biased gene conversion as a driver of between-lineage differences in avian base composition. Genome Biol 2014; 15:549. [PMID: 25496599 PMCID: PMC4290106 DOI: 10.1186/s13059-014-0549-1] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Accepted: 11/19/2014] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND While effective population size (Ne) and life history traits such as generation time are known to impact substitution rates, their potential effects on base composition evolution are less well understood. GC content increases with decreasing body mass in mammals, consistent with recombination-associated GC biased gene conversion (gBGC) more strongly impacting these lineages. However, shifts in chromosomal architecture and recombination landscapes between species may complicate the interpretation of these results. In birds, interchromosomal rearrangements are rare and the recombination landscape is conserved, suggesting that this group is well suited to assess the impact of life history on base composition. RESULTS Employing data from 45 newly and 3 previously sequenced avian genomes covering a broad range of taxa, we found that lineages with large populations and short generations exhibit higher GC content. The effect extends to both coding and non-coding sites, indicating that it is not due to selection on codon usage. Consistent with recombination driving base composition, GC content and heterogeneity were positively correlated with the rate of recombination. Moreover, we observed ongoing increases in GC in the majority of lineages. CONCLUSIONS Our results provide evidence that gBGC may drive patterns of nucleotide composition in avian genomes and are consistent with more effective gBGC in large populations and a greater number of meioses per unit time; that is, a shorter generation time. Thus, in accord with theoretical predictions, base composition evolution is substantially modulated by species life history.
Collapse
Affiliation(s)
- Claudia C Weber
- />Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| | - Bastien Boussau
- />Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558 Villeurbanne, France
| | | | - Erich D Jarvis
- />Department of Neurobiology, Howard Hughes Medical Institute, Duke University Medical Center, Durham, NC USA
| | - Hans Ellegren
- />Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden
| |
Collapse
|
45
|
Lartillot N. A phylogenetic Kalman filter for ancestral trait reconstruction using molecular data. ACTA ACUST UNITED AC 2013; 30:488-96. [PMID: 24318999 DOI: 10.1093/bioinformatics/btt707] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Correlation between life history or ecological traits and genomic features such as nucleotide or amino acid composition can be used for reconstructing the evolutionary history of the traits of interest along phylogenies. Thus far, however, such ancestral reconstructions have been done using simple linear regression approaches that do not account for phylogenetic inertia. These reconstructions could instead be seen as a genuine comparative regression problem, such as formalized by classical generalized least-square comparative methods, in which the trait of interest and the molecular predictor are represented as correlated Brownian characters coevolving along the phylogeny. RESULTS Here, a Bayesian sampler is introduced, representing an alternative and more efficient algorithmic solution to this comparative regression problem, compared with currently existing generalized least-square approaches. Technically, ancestral trait reconstruction based on a molecular predictor is shown to be formally equivalent to a phylogenetic Kalman filter problem, for which backward and forward recursions are developed and implemented in the context of a Markov chain Monte Carlo sampler. The comparative regression method results in more accurate reconstructions and a more faithful representation of uncertainty, compared with simple linear regression. Application to the reconstruction of the evolution of optimal growth temperature in Archaea, using GC composition in ribosomal RNA stems and amino acid composition of a sample of protein-coding genes, confirms previous findings, in particular, pointing to a hyperthermophilic ancestor for the kingdom. AVAILABILITY AND IMPLEMENTATION The program is freely available at www.phylobayes.org.
Collapse
Affiliation(s)
- Nicolas Lartillot
- Laboratoire de Biométrie et Biologie Évolutive, Centre National de la Recherche Scientifique, UMR 5558. Université Lyon 1, F-69622 Villeurbanne, France and Centre Robert-Cedergren pour la Bioinformatique, Département de Biochimie, Université de Montréal, Québec, Canada
| |
Collapse
|
46
|
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Center, Uppsala University, SE-752 36 Uppsala, Sweden;
| |
Collapse
|
47
|
Morgan CC, Mc Cartney AM, Donoghue MTA, Loughran NB, Spillane C, Teeling EC, O'Connell MJ. Molecular adaptation of telomere associated genes in mammals. BMC Evol Biol 2013; 13:251. [PMID: 24237966 PMCID: PMC3833184 DOI: 10.1186/1471-2148-13-251] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 11/06/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Placental mammals display a huge range of life history traits, including size, longevity, metabolic rate and germ line generation time. Although a number of general trends have been proposed between these traits, there are exceptions that warrant further investigation. Species such as naked mole rat, human and certain bat species all exhibit extreme longevity with respect to body size. It has long been established that telomeres and telomere maintenance have a clear role in ageing but it has not yet been established whether there is evidence for adaptation in telomere maintenance proteins that could account for increased longevity in these species. RESULTS Here we carry out a molecular investigation of selective pressure variation, specifically focusing on telomere associated genes across placental mammals. In general we observe a large number of instances of positive selection acting on telomere genes. Although these signatures of selection overall are not significantly correlated with either longevity or body size we do identify positive selection in the microbat species Myotis lucifugus in functionally important regions of the telomere maintenance genes DKC1 and TERT, and in naked mole rat in the DNA repair gene BRCA1. CONCLUSION These results demonstrate the multifarious selective pressures acting across the mammal phylogeny driving lineage-specific adaptations of telomere associated genes. Our results show that regardless of the longevity of a species, these proteins have evolved under positive selection thereby removing increased longevity as the single selective force driving this rapid rate of evolution. However, evidence of molecular adaptations specific to naked mole rat and Myotis lucifugus highlight functionally significant regions in genes that may alter the way in which telomeres are regulated and maintained in these longer-lived species.
Collapse
|
48
|
Capra JA, Hubisz MJ, Kostka D, Pollard KS, Siepel A. A model-based analysis of GC-biased gene conversion in the human and chimpanzee genomes. PLoS Genet 2013; 9:e1003684. [PMID: 23966869 PMCID: PMC3744432 DOI: 10.1371/journal.pgen.1003684] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2013] [Accepted: 06/14/2013] [Indexed: 01/03/2023] Open
Abstract
GC-biased gene conversion (gBGC) is a recombination-associated process that favors the fixation of G/C alleles over A/T alleles. In mammals, gBGC is hypothesized to contribute to variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations, but its prevalence and general functional consequences remain poorly understood. gBGC is difficult to incorporate into models of molecular evolution and so far has primarily been studied using summary statistics from genomic comparisons. Here, we introduce a new probabilistic model that captures the joint effects of natural selection and gBGC on nucleotide substitution patterns, while allowing for correlations along the genome in these effects. We implemented our model in a computer program, called phastBias, that can accurately detect gBGC tracts about 1 kilobase or longer in simulated sequence alignments. When applied to real primate genome sequences, phastBias predicts gBGC tracts that cover roughly 0.3% of the human and chimpanzee genomes and account for 1.2% of human-chimpanzee nucleotide differences. These tracts fall in clusters, particularly in subtelomeric regions; they are enriched for recombination hotspots and fast-evolving sequences; and they display an ongoing fixation preference for G and C alleles. They are also significantly enriched for disease-associated polymorphisms, suggesting that they contribute to the fixation of deleterious alleles. The gBGC tracts provide a unique window into historical recombination processes along the human and chimpanzee lineages. They supply additional evidence of long-term conservation of megabase-scale recombination rates accompanied by rapid turnover of hotspots. Together, these findings shed new light on the evolutionary, functional, and disease implications of gBGC. The phastBias program and our predicted tracts are freely available. Interpreting patterns of DNA sequence variation in the genomes of closely related species is critically important for understanding the causes and functional effects of nucleotide substitutions. Classical models describe patterns of substitution in terms of the fundamental forces of mutation, recombination, neutral drift, and natural selection. However, an entirely separate force, called GC-biased gene conversion (gBGC), also appears to have an important influence on substitution patterns in many species. gBGC is a recombination-associated evolutionary process that favors the fixation of strong (G/C) over weak (A/T) alleles. In mammals, gBGC is thought to promote variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations. However, its genome-wide influence remains poorly understood, in part because, it is difficult to incorporate gBGC into statistical models of evolution. In this paper, we describe a new evolutionary model that jointly describes the effects of selection and gBGC and apply it to the human and chimpanzee genomes. Our genome-wide predictions of gBGC tracts indicate that gBGC has been an important force in recent human evolution. Our publicly available computer program, called phastBias, and our genome-wide predictions will enable other researchers to consider gBGC in their analyses.
Collapse
Affiliation(s)
- John A. Capra
- Gladstone Institutes, University of California, San Francisco, California, United States of America
| | - Melissa J. Hubisz
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Dennis Kostka
- Department of Developmental Biology and Computational & Systems Biology, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Katherine S. Pollard
- Gladstone Institutes, University of California, San Francisco, California, United States of America
- Institute for Human Genetics and Division of Biostatistics, University of California, San Francisco, California, United States of America
- * E-mail: (KSP); (AS)
| | - Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- * E-mail: (KSP); (AS)
| |
Collapse
|
49
|
Romiguier J, Ranwez V, Delsuc F, Galtier N, Douzery EJP. Less is more in mammalian phylogenomics: AT-rich genes minimize tree conflicts and unravel the root of placental mammals. Mol Biol Evol 2013; 30:2134-44. [PMID: 23813978 DOI: 10.1093/molbev/mst116] [Citation(s) in RCA: 122] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Despite the rapid increase of size in phylogenomic data sets, a number of important nodes on animal phylogeny are still unresolved. Among these, the rooting of the placental mammal tree is still a controversial issue. One difficulty lies in the pervasive phylogenetic conflicts among genes, with each one telling its own story, which may be reliable or not. Here, we identified a simple criterion, that is, the GC content, which substantially helps in determining which gene trees best reflect the species tree. We assessed the ability of 13,111 coding sequence alignments to correctly reconstruct the placental phylogeny. We found that GC-rich genes induced a higher amount of conflict among gene trees and performed worse than AT-rich genes in retrieving well-supported, consensual nodes on the placental tree. We interpret this GC effect mainly as a consequence of genome-wide variations in recombination rate. Indeed, recombination is known to drive GC-content evolution through GC-biased gene conversion and might be problematic for phylogenetic reconstruction, for instance, in an incomplete lineage sorting context. When we focused on the AT-richest fraction of the data set, the resolution level of the placental phylogeny was greatly increased, and a strong support was obtained in favor of an Afrotheria rooting, that is, Afrotheria as the sister group of all other placentals. We show that in mammals most conflicts among gene trees, which have so far hampered the resolution of the placental tree, are concentrated in the GC-rich regions of the genome. We argue that the GC content-because it is a reliable indicator of the long-term recombination rate-is an informative criterion that could help in identifying the most reliable molecular markers for species tree inference.
Collapse
Affiliation(s)
- Jonathan Romiguier
- CNRS, Université Montpellier, Institut des Sciences de l'Evolution, Montpellier, France.
| | | | | | | | | |
Collapse
|