101
Edelman NB, Frandsen PB, Miyagi M, Clavijo B, Davey J, Dikow RB, García-Accinelli G, Van Belleghem SM, Patterson N, Neafsey DE, Challis R, Kumar S, Moreira GRP, Salazar C, Chouteau M, Counterman BA, Papa R, Blaxter M, Reed RD, Dasmahapatra KK, Kronforst M, Joron M, Jiggins CD, McMillan WO, Di Palma F, Blumberg AJ, Wakeley J, Jaffe D, Mallet J. Genomic architecture and introgression shape a butterfly radiation. Science 2019; 366:594-599. [PMID: 31672890 PMCID: PMC7197882 DOI: 10.1126/science.aaw2090] [Citation(s) in RCA: 303] [Impact Index Per Article: 50.5] [Received: 12/11/2018] [Accepted: 09/16/2019] [Indexed: 12/26/2022]
Abstract
We used 20 de novo genome assemblies to probe the speciation history and architecture of gene flow in rapidly radiating Heliconius butterflies. Our tests to distinguish incomplete lineage sorting from introgression indicate that gene flow has obscured several ancient phylogenetic relationships in this group over large swathes of the genome. Introgressed loci are underrepresented in low-recombination and gene-rich regions, consistent with the purging of foreign alleles more tightly linked to incompatibility loci. Here, we identify a hitherto unknown inversion that traps a color pattern switch locus. We infer that this inversion was transferred between lineages by introgression and is convergent with a similar rearrangement in another part of the genus. These multiple de novo genome sequences enable improved understanding of the importance of introgression and selective processes in adaptive radiation.
Affiliation(s)
- Nathaniel B Edelman
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Paul B Frandsen
  - Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA
  - Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
- Miriam Miyagi
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- John Davey
  - Bioscience Technology Facility, Department of Biology, University of York, York YO10 5DD, UK
  - Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
- Rebecca B Dikow
  - Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
- Steven M Van Belleghem
  - Department of Biology, University of Puerto Rico, Río Piedras Campus, San Juan, PR 00931-3360, Puerto Rico
- Nick Patterson
  - Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Daniel E Neafsey
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
  - Harvard TH Chan School of Public Health, Boston, MA 02115, USA
- Richard Challis
  - Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
- Sujai Kumar
  - Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, UK
- Gilson R P Moreira
  - Departamento de Zoologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, 91501-970 Brasil
- Camilo Salazar
  - Biology Program, Faculty of Natural Sciences and Mathematics, Universidad del Rosario, Carrera 24, No. 63C-69, Bogotá D.C. 111221, Colombia
- Mathieu Chouteau
  - Laboratoire Ecologie, Evolution, Interactions des Systèmes Amazoniens (LEEISA), USR 3456, Université De Guyane, CNRS Guyane, 275 Route de Montabo, 97334 Cayenne, French Guiana
- Brian A Counterman
  - Department of Biological Sciences, Mississippi State University, Starkville, MS 39762, USA
- Riccardo Papa
  - Department of Biology, University of Puerto Rico, Río Piedras Campus, San Juan, PR 00931-3360, Puerto Rico
  - Molecular Sciences and Research Center, University of Puerto Rico, San Juan, PR 00931-3360, Puerto Rico
- Mark Blaxter
  - Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
- Robert D Reed
  - Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY 14853, USA
- Kanchon K Dasmahapatra
  - Bioscience Technology Facility, Department of Biology, University of York, York YO10 5DD, UK
- Marcus Kronforst
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
- Mathieu Joron
  - CEFE, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, EPHE, IRD, 34090 Montpellier, France
- Chris D Jiggins
  - Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
- W Owen McMillan
  - Smithsonian Tropical Research Institute, Apartado 0843-03092 Panamá, Panama
- Andrew J Blumberg
  - Department of Mathematics, University of Texas, Austin, TX 78712, USA
- John Wakeley
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- David Jaffe
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
  - 10x Genomics, Pleasanton, CA 94566, USA
- James Mallet
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
102
Barroso GV, Puzović N, Dutheil JY. Inference of recombination maps from a single pair of genomes and its application to ancient samples. PLoS Genet 2019; 15:e1008449. [PMID: 31725722 PMCID: PMC6879166 DOI: 10.1371/journal.pgen.1008449] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 06/17/2019] [Revised: 11/26/2019] [Accepted: 09/30/2019] [Indexed: 12/11/2022] Open
Abstract
Understanding the causes and consequences of recombination landscape evolution is a fundamental goal in genetics that requires recombination maps from across the tree of life. Such maps can be obtained from population genomic datasets, but require large sample sizes. Alternative methods are therefore necessary to research organisms where such datasets cannot be generated easily, such as non-model or ancient species. Here we extend the sequentially Markovian coalescent model to jointly infer demography and the spatial variation in recombination rate. Using extensive simulations and sequence data from humans, fruit-flies and a fungal pathogen, we demonstrate that iSMC accurately infers recombination maps under a wide range of scenarios, remarkably even from a single pair of unphased genomes. We exploit this possibility and reconstruct the recombination maps of ancient hominins. We report that the ancient and modern maps are correlated in a manner that reflects the established phylogeny of Neanderthals, Denisovans, and modern human populations.
Affiliation(s)
- Gustavo V. Barroso
  - Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße, Plön, Germany
- Nataša Puzović
  - Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße, Plön, Germany
- Julien Y. Dutheil
  - Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße, Plön, Germany
103
Spence JP, Song YS. Inference and analysis of population-specific fine-scale recombination maps across 26 diverse human populations. Sci Adv 2019; 5:eaaw9206. [PMID: 31681842 PMCID: PMC6810367 DOI: 10.1126/sciadv.aaw9206] [Citation(s) in RCA: 95] [Impact Index Per Article: 15.8] [Received: 02/05/2019] [Accepted: 09/13/2019] [Indexed: 05/28/2023]
Abstract
Fine-scale rates of meiotic recombination vary by orders of magnitude across the genome and differ between species and even populations. Studying cross-population differences has been stymied by the confounding effects of demographic history. To address this problem, we developed a demography-aware method to infer fine-scale recombination rates and applied it to 26 diverse human populations, inferring population-specific recombination maps. These maps recapitulate many aspects of the history of these populations including signatures of the trans-Atlantic slave trade and the Iberian colonization of the Americas. We also investigated modulators of the local recombination rate, finding further evidence that Polycomb group proteins and the trimethylation of H3K27 elevate recombination rates. Further differences in the recombination landscape across the genome and between populations are driven by variation in the gene that encodes the DNA binding protein PRDM9, and we quantify the weak effect of meiotic drive acting to remove its binding sites.
Affiliation(s)
- Jeffrey P. Spence
  - Graduate Group in Computational Biology, University of California, Berkeley, Berkeley, CA, USA
- Yun S. Song
  - Computer Science Division and Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
  - Chan Zuckerberg Biohub, San Francisco, CA, USA
104
Talla V, Soler L, Kawakami T, Dincă V, Vila R, Friberg M, Wiklund C, Backström N. Dissecting the Effects of Selection and Mutation on Genetic Diversity in Three Wood White (Leptidea) Butterfly Species. Genome Biol Evol 2019; 11:2875-2886. [PMID: 31580421 PMCID: PMC6795238 DOI: 10.1093/gbe/evz212] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Accepted: 09/26/2019] [Indexed: 12/12/2022] Open
Abstract
The relative role of natural selection and genetic drift in evolution is a major topic of debate in evolutionary biology. Most knowledge springs from a small group of organisms and originates from before it was possible to generate genome-wide data on genetic variation. Hence, it is necessary to extend to a larger number of taxonomic groups the descriptive and hypothesis-based research that aims to understand the proximate and ultimate mechanisms underlying both levels of genetic polymorphism and the efficiency of natural selection. In this study, we used data from 60 whole-genome resequenced individuals of three cryptic butterfly species (Leptidea sp.), together with novel gene annotation information and population recombination data. We characterized the overall prevalence of natural selection and investigated the effects of mutation and linked selection on regional variation in nucleotide diversity. Our analyses showed that genome-wide diversity and rate of adaptive substitutions were comparatively low, whereas nonsynonymous to synonymous polymorphism and substitution levels were comparatively high in Leptidea, suggesting small long-term effective population sizes. Still, negative selection on linked sites (background selection) has resulted in reduced nucleotide diversity in regions with relatively high gene density and low recombination rate. We also found a significant effect of mutation rate variation on levels of polymorphism. Finally, there were considerable population differences in levels of genetic diversity and pervasiveness of selection against slightly deleterious alleles, in line with expectations from differences in estimated effective population sizes.
Affiliation(s)
- Venkat Talla
  - Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
- Lucile Soler
  - Department of Medical Biochemistry and Microbiology, National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Uppsala, Sweden
- Takeshi Kawakami
  - Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
- Vlad Dincă
  - Department of Ecology and Genetics, University of Oulu, Finland
- Roger Vila
  - Institut de Biologia Evolutiva (CSIC-UPF), Barcelona, Spain
- Magne Friberg
  - Department of Biology, Biodiversity Unit, Lund University, Sweden
- Christer Wiklund
  - Department of Zoology, Division of Ecology, Stockholm University, Sweden
- Niclas Backström
  - Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
105
Chapman JR, Hill T, Unckless RL. Balancing Selection Drives the Maintenance of Genetic Variation in Drosophila Antimicrobial Peptides. Genome Biol Evol 2019; 11:2691-2701. [PMID: 31504505 PMCID: PMC6764478 DOI: 10.1093/gbe/evz191] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Accepted: 08/29/2019] [Indexed: 12/19/2022] Open
Abstract
Genes involved in immune defense against pathogens provide some of the most well-known examples of both directional and balancing selection. Antimicrobial peptides (AMPs) are innate immune effector genes, playing a key role in pathogen clearance in many species, including Drosophila. Conflicting lines of evidence have suggested that AMPs may be under directional, balancing, or purifying selection. Here, we use both a linear model and control-gene-based approach to show that balancing selection is an important force shaping AMP diversity in Drosophila. In Drosophila melanogaster, this is most clearly observed in ancestral African populations. Furthermore, the signature of balancing selection is even more striking once background selection has been accounted for. Balancing selection also acts on AMPs in Drosophila mauritiana, an isolated island endemic separated from D. melanogaster by about 4 Myr of evolution. This suggests that balancing selection may be broadly acting to maintain adaptive diversity in Drosophila AMPs, as has been found in other taxa.
Affiliation(s)
- Tom Hill
  - Department of Molecular Biosciences, University of Kansas
106
Howie JM, Mazzucco R, Taus T, Nolte V, Schlötterer C. DNA Motifs Are Not General Predictors of Recombination in Two Drosophila Sister Species. Genome Biol Evol 2019; 11:1345-1357. [PMID: 30980655 PMCID: PMC6490297 DOI: 10.1093/gbe/evz082] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Accepted: 04/09/2019] [Indexed: 12/11/2022] Open
Abstract
Meiotic recombination is crucial for chromosomal segregation and facilitates the spread of beneficial and removal of deleterious mutations. Recombination rates frequently vary along chromosomes and Drosophila melanogaster exhibits a remarkable pattern. Recombination rates gradually decrease toward centromeres and telomeres, with a dramatic impact on levels of variation in natural populations. Two close sister species, Drosophila simulans and Drosophila mauritiana do not only have higher recombination rates but also exhibit a much more homogeneous recombination rate that only drops sharply very close to centromeres and telomeres. Because certain sequence motifs are associated with recombination rate variation in D. melanogaster, we tested whether the difference in recombination landscape between D. melanogaster and D. simulans can be explained by the genomic distribution of recombination rate–associated sequence motifs. We constructed the first high-resolution recombination map for D. simulans based on 189 haplotypes from a natural D. simulans population and searched for short sequence motifs linked with higher than average recombination in both sister species. We identified five consensus motifs significantly associated with higher than average chromosome-wide recombination rates in at least one species and present in both. Testing fine resolution associations between motif density and recombination, we found strong and positive associations genome-wide over a range of scales in D. melanogaster, while the results were equivocal in D. simulans. Despite the strong association in D. melanogaster, we did not find a decreasing density of these short-repeat motifs toward centromeres and telomeres. We conclude that the density of recombination-associated repeat motifs cannot explain the large-scale recombination landscape in D. melanogaster, nor the differences to D. simulans. The strong association seen for the sequence motifs in D. melanogaster likely reflects their impact influencing local differences in recombination rates along the genome.
Affiliation(s)
- James M Howie
  - Institut für Populationsgenetik, Vetmeduni Vienna, Austria
- Thomas Taus
  - Institut für Populationsgenetik, Vetmeduni Vienna, Austria
  - Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Austria
- Viola Nolte
  - Institut für Populationsgenetik, Vetmeduni Vienna, Austria
107
Fraser BA, Whiting JR. What can be learned by scanning the genome for molecular convergence in wild populations? Ann N Y Acad Sci 2019; 1476:23-42. [PMID: 31241191 PMCID: PMC7586825 DOI: 10.1111/nyas.14177] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 03/29/2019] [Revised: 05/24/2019] [Accepted: 06/04/2019] [Indexed: 12/11/2022]
Abstract
Convergent evolution, where independent lineages evolve similar phenotypes in response to similar challenges, can provide valuable insight into how selection operates and the limitations it encounters. However, it has only recently become possible to explore how convergent evolution is reflected at the genomic level. The overlapping outlier approach (OOA), where genome scans of multiple independent lineages are used to find outliers that overlap and therefore identify convergently evolving loci, is becoming popular. Here, we present a quantitative analysis of 34 studies that used this approach across many sampling designs, taxa, and sampling intensities. We found that OOA studies with increased biological sampling power within replicates have increased likelihood of finding overlapping, "convergent" signals of adaptation between them. When identifying convergent loci as overlapping outliers, it is tempting to assume that any false-positive outliers derived from individual scans will fail to overlap across replicates, but this cannot be guaranteed. We highlight how population demographics and genomic context can contribute toward both true convergence and false positives in OOA studies. We finish with an exploration of emerging methods that couple genome scans with phenotype and environmental measures, leveraging added information from genome data to more directly test hypotheses of the likelihood of convergent evolution.
Affiliation(s)
- Bonnie A Fraser
  - Department of Biosciences, University of Exeter, Exeter, United Kingdom
- James R Whiting
  - Department of Biosciences, University of Exeter, Exeter, United Kingdom
108
Wang B, Mojica JP, Perera N, Lee CR, Lovell JT, Sharma A, Adam C, Lipzen A, Barry K, Rokhsar DS, Schmutz J, Mitchell-Olds T. Ancient polymorphisms contribute to genome-wide variation by long-term balancing selection and divergent sorting in Boechera stricta. Genome Biol 2019; 20:126. [PMID: 31227026 PMCID: PMC6587263 DOI: 10.1186/s13059-019-1729-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 10/27/2018] [Accepted: 06/04/2019] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND: Genomic variation is widespread, and both neutral and selective processes can generate similar patterns in the genome. These processes are not mutually exclusive, so it is difficult to infer the evolutionary mechanisms that govern population and species divergence. Boechera stricta is a perennial relative of Arabidopsis thaliana native to largely undisturbed habitats with two geographically and ecologically divergent subspecies. Here, we delineate the evolutionary processes driving the genetic diversity and population differentiation in this species. RESULTS: Using whole-genome re-sequencing data from 517 B. stricta accessions, we identify four genetic groups that diverged around 30-180 thousand years ago, with long-term small effective population sizes and recent population expansion after the Last Glacial Maximum. We find three genomic regions with elevated nucleotide diversity, totaling about 10% of the genome. These three regions of elevated nucleotide diversity show excess of intermediate-frequency alleles, higher absolute divergence (dXY), and lower relative divergence (FST) than genomic background, and significant enrichment in immune-related genes, reflecting long-term balancing selection. Scattered across the genome, we also find regions with both high FST and dXY among the groups, termed FST-islands. Population genetic signatures indicate that FST-islands with elevated divergence, which have experienced directional selection, are derived from divergent sorting of ancient polymorphisms. CONCLUSIONS: Our results suggest that long-term balancing selection on disease resistance genes may have maintained ancestral haplotypes across different geographical lineages, and unequal sorting of balanced polymorphisms may have generated genomic regions with elevated divergence. This study highlights the importance of ancestral balanced polymorphisms as crucial components of genome-wide variation.
Affiliation(s)
- Baosheng Wang
  - Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
  - Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA
- Julius P Mojica
  - Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA
- Nadeesha Perera
  - Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA
- Cheng-Ruei Lee
  - Institute of Ecology and Evolutionary Biology and Institute of Plant Biology, National Taiwan University, Taipei, 10617, Taiwan, ROC
- John T Lovell
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Aditi Sharma
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Catherine Adam
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Anna Lipzen
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Kerrie Barry
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Daniel S Rokhsar
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
- Jeremy Schmutz
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
  - Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
109
Lim MCW, Witt CC, Graham CH, Dávalos LM. Divergent Fine-Scale Recombination Landscapes between a Freshwater and Marine Population of Threespine Stickleback Fish. Genome Biol Evol 2019; 11:1573-1585. [PMID: 31028697 PMCID: PMC6553502 DOI: 10.1093/gbe/evz090] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Accepted: 04/17/2019] [Indexed: 12/27/2022] Open
Abstract
Meiotic recombination is a highly conserved process that has profound effects on genome evolution. At a fine-scale, recombination rates can vary drastically across genomes, often localized into small recombination "hotspots" with highly elevated rates, surrounded by regions with little recombination. In most species studied, the location of hotspots within genomes is highly conserved across broad evolutionary timescales. The main exception to this pattern is in mammals, where hotspot location can evolve rapidly among closely related species and even among populations within a species. Hotspot position in mammals is controlled by the gene, Prdm9, whereas in species with conserved hotspots, a functional Prdm9 is typically absent. Due to a limited number of species where recombination rates have been estimated at a fine-scale, it remains unclear whether hotspot conservation is always associated with the absence of a functional Prdm9. Threespine stickleback fish (Gasterosteus aculeatus) are an excellent model to examine the evolution of recombination over short evolutionary timescales. Using a linkage disequilibrium-based approach, we found recombination rates indeed varied at a fine-scale across the genome, with many regions organized into narrow hotspots. Hotspots had highly divergent landscapes between stickleback populations, where only ∼15% of these hotspots were shared. Our results indicate that fine-scale recombination rates may be diverging between closely related populations of threespine stickleback fish. Interestingly, we found only a weak association of a PRDM9 binding motif within hotspots, which suggests that threespine stickleback fish may possess a novel mechanism for targeting recombination hotspots at a fine-scale.
Affiliation(s)
- Marisa C W Lim
  - Department of Ecology and Evolution, Stony Brook University
- Christopher C Witt
  - Museum of Southwestern Biology and Department of Biology, University of New Mexico
- Catherine H Graham
  - Department of Ecology and Evolution, Stony Brook University
  - Swiss Federal Research Institute (WSL), Birmensdorf, Switzerland
- Liliana M Dávalos
  - Department of Ecology and Evolution, Stony Brook University
  - Consortium for Inter-Disciplinary Environmental Research, Stony Brook University
110
Ragsdale AP, Gravel S. Models of archaic admixture and recent history from two-locus statistics. PLoS Genet 2019; 15:e1008204. [PMID: 31181058 PMCID: PMC6586359 DOI: 10.1371/journal.pgen.1008204] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Received: 01/08/2019] [Revised: 06/20/2019] [Accepted: 05/17/2019] [Indexed: 11/18/2022] Open
Abstract
We learn about population history and underlying evolutionary biology through patterns of genetic polymorphism. Many approaches to reconstruct evolutionary histories focus on a limited number of informative statistics describing distributions of allele frequencies or patterns of linkage disequilibrium. We show that many commonly used statistics are part of a broad family of two-locus moments whose expectation can be computed jointly and rapidly under a wide range of scenarios, including complex multi-population demographies with continuous migration and admixture events. A full inspection of these statistics reveals that widely used models of human history fail to predict simple patterns of linkage disequilibrium. To jointly capture the information contained in classical and novel statistics, we implemented a tractable likelihood-based inference framework for demographic history. Using this approach, we show that human evolutionary models that include archaic admixture in Africa, Asia, and Europe provide a much better description of patterns of genetic diversity across the human genome. We estimate that an unidentified, deeply diverged population admixed with modern humans within Africa both before and after the split of African and Eurasian populations, contributing 4-8% genetic ancestry to individuals in world-wide populations.
Affiliation(s)
- Aaron P Ragsdale
  - Department of Human Genetics, McGill University, Montreal, QC, Canada
- Simon Gravel
  - Department of Human Genetics, McGill University, Montreal, QC, Canada
111
Hermann P, Heissl A, Tiemann‐Boege I, Futschik A. LDJump: Estimating variable recombination rates from population genetic data. Mol Ecol Resour 2019; 19:623-638. [PMID: 30666785 PMCID: PMC6519033 DOI: 10.1111/1755-0998.12994] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 06/22/2018] [Revised: 12/13/2018] [Accepted: 01/11/2019] [Indexed: 11/27/2022]
Abstract
As recombination plays an important role in evolution, its estimation and the identification of hotspot positions is of considerable interest. We propose a novel approach for estimating population recombination rates based on genotyping or sequence data that involves a sequential multiscale change point estimator. Our method also permits demography to be taken into account. It uses several summary statistics within a regression model fitted on suitable scenarios. Our proposed method is accurate, computationally fast, and provides a parsimonious solution by ensuring a type I error control against too many changes in the recombination rate. An application to human genome data suggests a good congruence between our estimated and experimentally identified hotspots. Our method is implemented in the R-package LDJump, which is freely available at https://github.com/PhHermann/LDJump.
Affiliation(s)
- Philipp Hermann
  - Department of Applied Statistics, Johannes Kepler University Linz, Linz, Austria
- Angelika Heissl
  - Institute of Biophysics, Johannes Kepler University Linz, Linz, Austria
- Andreas Futschik
  - Department of Applied Statistics, Johannes Kepler University Linz, Linz, Austria
112
Kanduri C, Bock C, Gundersen S, Hovig E, Sandve GK. Colocalization analyses of genomic elements: approaches, recommendations and challenges. Bioinformatics 2019; 35:1615-1624. [PMID: 30307532 PMCID: PMC6499241 DOI: 10.1093/bioinformatics/bty835] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Received: 05/22/2018] [Revised: 09/03/2018] [Accepted: 10/10/2018] [Indexed: 12/23/2022] Open
Abstract
MOTIVATION: Many high-throughput methods produce sets of genomic regions as one of their main outputs. Scientists often use genomic colocalization analysis to interpret such region sets, for example to identify interesting enrichments and to understand the interplay between the underlying biological processes. Although widely used, there is little standardization in how these analyses are performed. Different practices can substantially affect the conclusions of colocalization analyses. RESULTS: Here, we describe the different approaches and provide recommendations for performing genomic colocalization analysis, while also discussing common methodological challenges that may influence the conclusions. As illustrated by concrete example cases, careful attention to analysis details is needed in order to meet these challenges and to obtain a robust and biologically meaningful interpretation of genomic region set data. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chakravarthi Kanduri
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
| | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Department of Laboratory Medicine, Medical University of Vienna, Vienna, Austria
- Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Sveinung Gundersen
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
| | - Eivind Hovig
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
- Department of Tumor Biology, Institute for Cancer Research, Oslo, Norway
- Institute for Cancer Genetics and Informatics, The Norwegian Radium Hospital, Oslo, Norway
| | - Geir Kjetil Sandve
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
|
113
|
Robinson JA, Belsare S, Birnbaum S, Newman DE, Chan J, Glenn JP, Ferguson B, Cox LA, Wall JD. Analysis of 100 high-coverage genomes from a pedigreed captive baboon colony. Genome Res 2019; 29:848-856. [PMID: 30926611 PMCID: PMC6499309 DOI: 10.1101/gr.247122.118] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Received: 12/03/2018] [Accepted: 03/21/2019] [Indexed: 12/21/2022]
Abstract
Baboons (genus Papio) are broadly studied in the wild and in captivity. They are widely used as a nonhuman primate model for biomedical studies, and the Southwest National Primate Research Center (SNPRC) at Texas Biomedical Research Institute has maintained a large captive baboon colony for more than 50 yr. Unlike other model organisms, however, the genomic resources for baboons are severely lacking. This has hindered the progress of studies using baboons as a model for basic biology or human disease. Here, we describe a data set of 100 high-coverage whole-genome sequences obtained from the mixed colony of olive (P. anubis) and yellow (P. cynocephalus) baboons housed at the SNPRC. These data provide a comprehensive catalog of common genetic variation in baboons, as well as a fine-scale genetic map. We show how the data can be used to learn about ancestry and admixture and to correct errors in the colony records. Finally, we investigated the consequences of inbreeding within the SNPRC colony and found clear evidence for increased rates of infant mortality and increased homozygosity of putatively deleterious alleles in inbred individuals.
Affiliation(s)
- Jacqueline A Robinson
- Institute for Human Genetics, University of California, San Francisco, California 94143, USA
| | - Saurabh Belsare
- Institute for Human Genetics, University of California, San Francisco, California 94143, USA
| | - Shifra Birnbaum
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas 78245, USA
| | - Deborah E Newman
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas 78245, USA
| | - Jeannie Chan
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas 78245, USA
| | - Jeremy P Glenn
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas 78245, USA
| | - Betsy Ferguson
- Division of Genetics, Oregon National Primate Research Center, Beaverton, Oregon 97006, USA
- Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Laura A Cox
- Center for Precision Medicine, Department of Internal Medicine, Section of Molecular Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina 27101, USA
- Southwest National Primate Research Center, Texas Biomedical Research Institute, San Antonio, Texas 78245, USA
| | - Jeffrey D Wall
- Institute for Human Genetics, University of California, San Francisco, California 94143, USA
|
114
|
Fraïsse C, Puixeu Sala G, Vicoso B. Pleiotropy Modulates the Efficacy of Selection in Drosophila melanogaster. Mol Biol Evol 2019; 36:500-515. [PMID: 30590559 PMCID: PMC6389323 DOI: 10.1093/molbev/msy246] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Indexed: 12/11/2022] Open
Abstract
Pleiotropy is the well-established idea that a single mutation affects multiple phenotypes. If a mutation has opposite effects on fitness when expressed in different contexts, then genetic conflict arises. Pleiotropic conflict is expected to reduce the efficacy of selection by limiting the fixation of beneficial mutations through adaptation, and the removal of deleterious mutations through purifying selection. Although this has been widely discussed, in particular in the context of a putative "gender load," it has yet to be systematically quantified. In this work, we empirically estimate to which extent different pleiotropic regimes impede the efficacy of selection in Drosophila melanogaster. We use whole-genome polymorphism data from a single African population and divergence data from D. simulans to estimate the fraction of adaptive fixations (α), the rate of adaptation (ωA), and the direction of selection (DoS). After controlling for confounding covariates, we find that the different pleiotropic regimes have a relatively small, but significant, effect on selection efficacy. Specifically, our results suggest that pleiotropic sexual antagonism may restrict the efficacy of selection, but that this conflict can be resolved by limiting the expression of genes to the sex where they are beneficial. Intermediate levels of pleiotropy across tissues and life stages can also lead to maladaptation in D. melanogaster, due to inefficient purifying selection combined with low frequency of mutations that confer a selective advantage. Thus, our study highlights the need to consider the efficacy of selection in the context of antagonistic pleiotropy, and of genetic conflict in general.
Affiliation(s)
- Christelle Fraïsse
- Institute of Science and Technology Austria, Am Campus 1, Klosterneuburg 3400, Austria
| | - Gemma Puixeu Sala
- Institute of Science and Technology Austria, Am Campus 1, Klosterneuburg 3400, Austria
| | - Beatriz Vicoso
- Institute of Science and Technology Austria, Am Campus 1, Klosterneuburg 3400, Austria
|
115
|
Fast Estimation of Recombination Rates Using Topological Data Analysis. Genetics 2019; 211:1191-1204. [PMID: 30787042 DOI: 10.1534/genetics.118.301565] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 08/30/2018] [Accepted: 02/13/2019] [Indexed: 01/26/2023] Open
Abstract
Accurate estimation of recombination rates is critical for studying the origins and maintenance of genetic diversity. Because the inference of recombination rates under a full evolutionary model is computationally expensive, we developed an alternative approach using topological data analysis (TDA) on genome sequences. We find that this method can analyze datasets larger than what can be handled by any existing recombination inference software, and has accuracy comparable to commonly used model-based methods with significantly less processing time. Previous TDA methods used information contained solely in the first Betti number ([Formula: see text]) of a set of genomes, which aims to capture the number of loops that can be detected within a genealogy. These explorations have proven difficult to connect to the theory of the underlying biological process of recombination, and, consequently, have unpredictable behavior under perturbations of the data. We introduce a new topological feature, which we call ψ, with a natural connection to coalescent models, and present novel arguments relating [Formula: see text] to population genetic models. Using simulations, we show that ψ and [Formula: see text] are differentially affected by missing data, and package our approach as TREE (Topological Recombination Estimator). TREE's efficiency and accuracy make it well suited as a first-pass estimator of recombination rate heterogeneity or hotspots throughout the genome. Our work empirically and theoretically justifies the use of topological statistics as summaries of genome sequences and describes a new, unintuitive relationship between topological features of the distribution of sequence data and the footprint of recombination on genomes.
|
116
|
Martin SH, Davey JW, Salazar C, Jiggins CD. Recombination rate variation shapes barriers to introgression across butterfly genomes. PLoS Biol 2019; 17:e2006288. [PMID: 30730876 PMCID: PMC6366726 DOI: 10.1371/journal.pbio.2006288] [Citation(s) in RCA: 200] [Impact Index Per Article: 33.3] [Received: 04/08/2018] [Accepted: 01/07/2019] [Indexed: 12/30/2022] Open
Abstract
Hybridisation and introgression can dramatically alter the relationships among groups of species, leading to phylogenetic discordance across the genome and between populations. Introgression can also erode species differences over time, but selection against introgression at certain loci acts to maintain postmating species barriers. Theory predicts that species barriers made up of many loci throughout the genome should lead to a broad correlation between introgression and recombination rate, which determines the extent to which selection on deleterious foreign alleles will affect neutral alleles at physically linked loci. Here, we describe the variation in genealogical relationships across the genome among three species of Heliconius butterflies: H. melpomene (mel), H. cydno (cyd), and H. timareta (tim), using whole genomes of 92 individuals, and ask whether this variation can be explained by heterogeneous barriers to introgression. We find that species relationships vary predictably at the chromosomal scale. By quantifying recombination rate and admixture proportions, we then show that rates of introgression are predicted by variation in recombination rate. This implies that species barriers are highly polygenic, with selection acting against introgressed alleles across most of the genome. In addition, long chromosomes, which have lower recombination rates, produce stronger barriers on average than short chromosomes. Finally, we find a consistent difference between two species pairs on either side of the Andes, which suggests differences in the architecture of the species barriers. Our findings illustrate how the combined effects of hybridisation, recombination, and natural selection, acting at multitudes of loci over long periods, can dramatically sculpt the phylogenetic relationships among species.
Affiliation(s)
- Simon H. Martin
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - John W. Davey
- Department of Biology, University of York, York, United Kingdom
| | - Camilo Salazar
- Biology Program, Faculty of Natural Sciences and Mathematics, Universidad del Rosario, Bogota, Colombia
| | - Chris D. Jiggins
- Department of Zoology, University of Cambridge, Cambridge, United Kingdom
|
117
|
Kryvokhyzha D, Salcedo A, Eriksson MC, Duan T, Tawari N, Chen J, Guerrina M, Kreiner JM, Kent TV, Lagercrantz U, Stinchcombe JR, Glémin S, Wright SI, Lascoux M. Parental legacy, demography, and admixture influenced the evolution of the two subgenomes of the tetraploid Capsella bursa-pastoris (Brassicaceae). PLoS Genet 2019; 15:e1007949. [PMID: 30768594 PMCID: PMC6395008 DOI: 10.1371/journal.pgen.1007949] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Received: 10/01/2018] [Revised: 02/28/2019] [Accepted: 01/09/2019] [Indexed: 11/18/2022] Open
Abstract
Allopolyploidy is generally perceived as a major source of evolutionary novelties and as an instantaneous way to create isolation barriers. However, we do not have a clear understanding of how two subgenomes evolve and interact once they have fused in an allopolyploid species nor how isolated they are from their relatives. Here, we address these questions by analyzing genomic and transcriptomic data of allotetraploid Capsella bursa-pastoris in three differentiated populations, Asia, Europe, and the Middle East. We phased the two subgenomes, one descended from the outcrossing and highly diverse Capsella grandiflora (CbpCg) and the other one from the selfing and genetically depauperate Capsella orientalis (CbpCo). For each subgenome, we assessed its relationship with the diploid relatives, temporal changes of effective population size (Ne), signatures of positive and negative selection, and gene expression patterns. In all three regions, Ne of the two subgenomes decreased gradually over time and the CbpCo subgenome accumulated more deleterious changes than CbpCg. There were signs of widespread admixture between C. bursa-pastoris and its diploid relatives. The two subgenomes were impacted differentially depending on geographic region suggesting either strong interploidy gene flow or multiple origins of C. bursa-pastoris. Selective sweeps were more common on the CbpCg subgenome in Europe and the Middle East, and on the CbpCo subgenome in Asia. In contrast, differences in expression were limited with the CbpCg subgenome slightly more expressed than CbpCo in Europe and the Middle-East. In summary, after more than 100,000 generations of co-existence, the two subgenomes of C. bursa-pastoris still retained a strong signature of parental legacy but their evolutionary trajectory strongly varied across geographic regions.
Affiliation(s)
- Dmytro Kryvokhyzha
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Adriana Salcedo
- Department of Ecology and Evolution, University of Toronto, Toronto, Canada
| | - Mimmi C. Eriksson
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
| | - Tianlin Duan
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Nilesh Tawari
- Computational and Systems Biology Group, Genome Institute of Singapore, Agency for Science, Technology and Research (A*Star), Singapore
| | - Jun Chen
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Maria Guerrina
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Julia M. Kreiner
- Department of Ecology and Evolution, University of Toronto, Toronto, Canada
| | - Tyler V. Kent
- Department of Ecology and Evolution, University of Toronto, Toronto, Canada
| | - Ulf Lagercrantz
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Sylvain Glémin
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- CNRS, Université de Rennes 1, ECOBIO (Ecosystémes, biodiversité, évolution) - UMR 6553, F-35000 Rennes, France
| | - Stephen I. Wright
- Department of Ecology and Evolution, University of Toronto, Toronto, Canada
| | - Martin Lascoux
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
|
118
|
Flagel L, Brandvain Y, Schrider DR. The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference. Mol Biol Evol 2019; 36:220-238. [PMID: 30517664 PMCID: PMC6367976 DOI: 10.1093/molbev/msy224] [Citation(s) in RCA: 117] [Impact Index Per Article: 19.5] [Indexed: 12/28/2022] Open
Abstract
Population-scale genomic data sets have given researchers incredible amounts of information from which to infer evolutionary histories. Concomitant with this flood of data, theoretical and methodological advances have sought to extract information from genomic sequences to infer demographic events such as population size changes and gene flow among closely related populations/species, construct recombination maps, and uncover loci underlying recent adaptation. To date, most methods make use of only one or a few summaries of the input sequences and therefore ignore potentially useful information encoded in the data. The most sophisticated of these approaches involve likelihood calculations, which require theoretical advances for each new problem, and often focus on a single aspect of the data (e.g., only allele frequency information) in the interest of mathematical and computational tractability. Directly interrogating the entirety of the input sequence data in a likelihood-free manner would thus offer a fruitful alternative. Here, we accomplish this by representing DNA sequence alignments as images and using a class of deep learning methods called convolutional neural networks (CNNs) to make population genetic inferences from these images. We apply CNNs to a number of evolutionary questions and find that they frequently match or exceed the accuracy of current methods. Importantly, we show that CNNs perform accurate evolutionary model selection and parameter estimation, even on problems that have not received detailed theoretical treatments. Thus, when applied to population genetic alignments, CNNs are capable of outperforming expert-derived statistical methods and offer a new path forward in cases where no likelihood approach exists.
Affiliation(s)
- Lex Flagel
- Monsanto Company, Chesterfield, MO
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN
| | - Yaniv Brandvain
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN
| | - Daniel R Schrider
- Department of Genetics, University of North Carolina, Chapel Hill, NC
|
119
|
Female Meiosis: Synapsis, Recombination, and Segregation in Drosophila melanogaster. Genetics 2018; 208:875-908. [PMID: 29487146 PMCID: PMC5844340 DOI: 10.1534/genetics.117.300081] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Received: 07/28/2017] [Accepted: 10/18/2017] [Indexed: 12/11/2022] Open
Abstract
A century of genetic studies of the meiotic process in Drosophila melanogaster females has been greatly augmented by both modern molecular biology and major advances in cytology. These approaches, and the findings they have allowed, are the subject of this review. Specifically, these efforts have revealed that meiotic pairing in Drosophila females is not an extension of somatic pairing, but rather occurs by a poorly understood process during premeiotic mitoses. This process of meiotic pairing requires the function of several components of the synaptonemal complex (SC). When fully assembled, the SC also plays a critical role in maintaining homolog synapsis and in facilitating the maturation of double-strand breaks (DSBs) into mature crossover (CO) events. Considerable progress has been made in elucidating not only the structure, function, and assembly of the SC, but also the proteins that facilitate the formation and repair of DSBs into both COs and noncrossovers (NCOs). The events that control the decision to mature a DSB as either a CO or an NCO, as well as determining which of the two CO pathways (class I or class II) might be employed, are also being characterized by genetic and genomic approaches. These advances allow a reconsideration of meiotic phenomena such as interference and the centromere effect, which were previously described only by genetic studies. In delineating the mechanisms by which the oocyte controls the number and position of COs, it becomes possible to understand the role of CO position in ensuring the proper orientation of homologs on the first meiotic spindle. Studies of bivalent orientation have occurred in the context of numerous investigations into the assembly, structure, and function of the first meiotic spindle. Additionally, studies have examined the mechanisms ensuring the segregation of chromosomes that have failed to undergo crossing over.
|
120
|
Dapper AL, Payseur BA. Connecting theory and data to understand recombination rate evolution. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0469. [PMID: 29109228 DOI: 10.1098/rstb.2016.0469] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Accepted: 07/07/2017] [Indexed: 02/03/2023] Open
Abstract
Meiotic recombination is necessary for successful gametogenesis in most sexually reproducing organisms and is a fundamental genomic parameter, influencing the efficacy of selection and the fate of new mutations. The molecular and evolutionary functions of recombination should impose strong selective constraints on the range of recombination rates. Yet, variation in recombination rate is observed on a variety of genomic and evolutionary scales. In the past decade, empirical studies have described variation in recombination rate within genomes, between individuals, between sexes, between populations and between species. At the same time, theoretical work has provided an increasingly detailed picture of the evolutionary advantages to recombination. Perhaps surprisingly, the causes of natural variation in recombination rate remain poorly understood. We argue that empirical and theoretical approaches to understand the evolution of recombination have proceeded largely independently of each other. Most models that address the evolution of recombination rate were created to explain the evolutionary advantage of recombination rather than quantitative differences in rate among individuals. Conversely, most empirical studies aim to describe variation in recombination rate, rather than to test evolutionary hypotheses. In this Perspective, we argue that efforts to integrate the rich bodies of empirical and theoretical work on recombination rate are crucial to moving this field forward. We provide new directions for the development of theory and the production of data that will jointly close this gap. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Amy L Dapper
- Laboratory of Genetics, University of Wisconsin, Madison, WI 53706, USA
| | - Bret A Payseur
- Laboratory of Genetics, University of Wisconsin, Madison, WI 53706, USA
|
121
|
Stapley J, Feulner PGD, Johnston SE, Santure AW, Smadja CM. Variation in recombination frequency and distribution across eukaryotes: patterns and processes. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0455. [PMID: 29109219 PMCID: PMC5698618 DOI: 10.1098/rstb.2016.0455] [Citation(s) in RCA: 249] [Impact Index Per Article: 35.6] [Accepted: 09/08/2017] [Indexed: 01/04/2023] Open
Abstract
Recombination, the exchange of DNA between maternal and paternal chromosomes during meiosis, is an essential feature of sexual reproduction in nearly all multicellular organisms. While the role of recombination in the evolution of sex has received theoretical and empirical attention, less is known about how recombination rate itself evolves and what influence this has on evolutionary processes within sexually reproducing organisms. Here, we explore the patterns of, and processes governing recombination in eukaryotes. We summarize patterns of variation, integrating current knowledge with an analysis of linkage map data in 353 organisms. We then discuss proximate and ultimate processes governing recombination rate variation and consider how these influence evolutionary processes. Genome-wide recombination rates (cM/Mb) can vary more than tenfold across eukaryotes, and there is large variation in the distribution of recombination events across closely related taxa, populations and individuals. We discuss how variation in rate and distribution relates to genome architecture, genetic and epigenetic mechanisms, sex, environmental perturbations and variable selective pressures. There has been great progress in determining the molecular mechanisms governing recombination, and with the continued development of new modelling and empirical approaches, there is now also great opportunity to further our understanding of how and why recombination rate varies. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Jessica Stapley
- Centre for Adaptation to a Changing Environment, IBZ, ETH Zürich, 8092 Zürich, Switzerland
| | - Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, 6047 Kastanienbaum, Switzerland
- Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland
| | - Susan E Johnston
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JY, UK
| | - Anna W Santure
- School of Biological Sciences, University of Auckland, Auckland 1142, New Zealand
| | - Carole M Smadja
- Institut des Sciences de l'Evolution UMR 5554, CNRS, IRD, EPHE, Université de Montpellier, 3095 Montpellier cedex 05, France
|
122
|
Abstract
Recombination often differs markedly between males and females. Here we present the first analysis of sex-specific recombination in Gasterosteus sticklebacks. Using whole-genome sequencing of 15 crosses between G. aculeatus and G. nipponicus, we localized 698 crossovers with a median resolution of 2.3 kb. We also used a bioinformatic approach to infer historical sex-averaged recombination patterns for both species. Recombination is greater in females than males on all chromosomes, and overall map length is 1.64 times longer in females. The locations of crossovers differ strikingly between sexes. Crossovers cluster toward chromosome ends in males, but are distributed more evenly across chromosomes in females. Suppression of recombination near the centromeres in males causes crossovers to cluster at the ends of long arms in acrocentric chromosomes, and greatly reduces crossing over on short arms. The effect of centromeres on recombination is much weaker in females. Genomic differentiation between G. aculeatus and G. nipponicus is strongly correlated with recombination rate, and patterns of differentiation along chromosomes are strongly influenced by male-specific telomere and centromere effects. We found no evidence for fine-scale correlations between recombination and local gene content in either sex. We discuss hypotheses for the origin of sexual dimorphism in recombination and its consequences for sexually antagonistic selection and sex chromosome evolution.
|
123
|
Comeron JM. Background selection as null hypothesis in population genomics: insights and challenges from Drosophila studies. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0471. [PMID: 29109230 PMCID: PMC5698629 DOI: 10.1098/rstb.2016.0471] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Accepted: 09/04/2017] [Indexed: 12/11/2022] Open
Abstract
The consequences of selection at linked sites are multiple and widespread across the genomes of most species. Here, I first review the main concepts behind models of selection and linkage in recombining genomes, present the difficulty in parametrizing these models simply as a reduction in effective population size (Ne) and discuss the predicted impact of recombination rates on levels of diversity across genomes. Arguments are then put forward in favour of using a model of selection and linkage with neutral and deleterious mutations (i.e. the background selection model, BGS) as a sensible null hypothesis for investigating the presence of other forms of selection, such as balancing or positive. I also describe and compare two studies that have generated high-resolution landscapes of the predicted consequences of selection at linked sites in Drosophila melanogaster. Both studies show that BGS can explain a very large fraction of the observed variation in diversity across the whole genome, thus supporting its use as null model. Finally, I identify and discuss a number of caveats and challenges in studies of genetic hitchhiking that have been often overlooked, with several of them sharing a potential bias towards overestimating the evidence supporting recent selective sweeps to the detriment of a BGS explanation. One potential source of bias is the analysis of non-equilibrium populations: it is precisely because models of selection and linkage predict variation in Ne across chromosomes that demographic dynamics are not expected to be equivalent chromosome- or genome-wide. Other challenges include the use of incomplete genome annotations, the assumption of temporally stable recombination landscapes, the presence of genes under balancing selection and the consequences of ignoring non-crossover (gene conversion) recombination events. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, IA 52242, USA
- Interdisciplinary Program in Genetics, University of Iowa, Iowa City, IA 52242, USA
|
124
|
Schumer M, Xu C, Powell DL, Durvasula A, Skov L, Holland C, Blazier JC, Sankararaman S, Andolfatto P, Rosenthal GG, Przeworski M. Natural selection interacts with recombination to shape the evolution of hybrid genomes. Science 2018; 360:656-660. [PMID: 29674434 PMCID: PMC6069607 DOI: 10.1126/science.aar3684] [Citation(s) in RCA: 260] [Impact Index Per Article: 37.1] [Received: 10/31/2017] [Accepted: 03/23/2018] [Indexed: 12/29/2022]
Abstract
To investigate the consequences of hybridization between species, we studied three replicate hybrid populations that formed naturally between two swordtail fish species, estimating their fine-scale genetic map and inferring ancestry along the genomes of 690 individuals. In all three populations, ancestry from the "minor" parental species is more common in regions of high recombination and where there is linkage to fewer putative targets of selection. The same patterns are apparent in a reanalysis of human and archaic admixture. These results support models in which ancestry from the minor parental species is more likely to persist when rapidly uncoupled from alleles that are deleterious in hybrids. Our analyses further indicate that selection on swordtail hybrids stems predominantly from deleterious combinations of epistatically interacting alleles.
Affiliation(s)
- Molly Schumer
- Howard Hughes Medical Institute (HHMI), Boston, MA, USA.
- Harvard Society of Fellows, Harvard University, Cambridge, MA, USA
- Department of Biological Sciences, Columbia University, New York, NY, USA
- Centro de Investigaciones Científicas de las Huastecas "Aguazarca," Calnali, Hidalgo, Mexico
| | - Chenling Xu
- Center for Computational Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Daniel L Powell
- Centro de Investigaciones Científicas de las Huastecas "Aguazarca," Calnali, Hidalgo, Mexico
- Department of Biology, Texas A&M University, College Station, TX, USA
| | - Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Laurits Skov
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| | - Chris Holland
- Centro de Investigaciones Científicas de las Huastecas "Aguazarca," Calnali, Hidalgo, Mexico
- Department of Biology, Texas A&M University, College Station, TX, USA
| | - John C Blazier
- Department of Biology, Texas A&M University, College Station, TX, USA
- Texas A&M Institute for Genome Sciences and Society, College Station, TX, USA
| | - Sriram Sankararaman
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Gil G Rosenthal
- Centro de Investigaciones Científicas de las Huastecas "Aguazarca," Calnali, Hidalgo, Mexico
- Department of Biology, Texas A&M University, College Station, TX, USA
| | - Molly Przeworski
- Department of Biological Sciences, Columbia University, New York, NY, USA.
- Department of Systems Biology, Columbia University, New York, NY, USA
| |
|
125
|
Weigand H, Leese F. Detecting signatures of positive selection in non-model species using genomic data. Zool J Linn Soc 2018. [DOI: 10.1093/zoolinnean/zly007] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Indexed: 11/14/2022]
Affiliation(s)
- Hannah Weigand
- Aquatic Ecosystem Research, University of Duisburg-Essen, Universitätsstraße, Essen, Germany
| | - Florian Leese
- Aquatic Ecosystem Research, University of Duisburg-Essen, Universitätsstraße, Essen, Germany
- Centre for Water and Environmental Research (ZWU), University of Duisburg-Essen, Universitätsstraße, Essen, Germany
| |
|
126
|
Schrider DR, Ayroles J, Matute DR, Kern AD. Supervised machine learning reveals introgressed loci in the genomes of Drosophila simulans and D. sechellia. PLoS Genet 2018; 14:e1007341. [PMID: 29684059 PMCID: PMC5933812 DOI: 10.1371/journal.pgen.1007341] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Received: 09/25/2017] [Revised: 05/03/2018] [Accepted: 03/28/2018] [Indexed: 12/30/2022] Open
Abstract
Hybridization and gene flow between species appears to be common. Even though it is clear that hybridization is widespread across all surveyed taxonomic groups, the magnitude and consequences of introgression are still largely unknown. Thus it is crucial to develop the statistical machinery required to uncover which genomic regions have recently acquired haplotypes via introgression from a sister population. We developed a novel machine learning framework, called FILET (Finding Introgressed Loci via Extra-Trees) capable of revealing genomic introgression with far greater power than competing methods. FILET works by combining information from a number of population genetic summary statistics, including several new statistics that we introduce, that capture patterns of variation across two populations. We show that FILET is able to identify loci that have experienced gene flow between related species with high accuracy, and in most situations can correctly infer which population was the donor and which was the recipient. Here we describe a data set of outbred diploid Drosophila sechellia genomes, and combine them with data from D. simulans to examine recent introgression between these species using FILET. Although we find that these populations may have split more recently than previously appreciated, FILET confirms that there has indeed been appreciable recent introgression (some of which might have been adaptive) between these species, and reveals that this gene flow is primarily in the direction of D. simulans to D. sechellia.
Affiliation(s)
- Daniel R. Schrider
- Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
| | - Julien Ayroles
- Ecology and Evolutionary Biology Department, Princeton University, Princeton, New Jersey, United States of America
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Daniel R. Matute
- Biology Department, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Andrew D. Kern
- Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
| |
|
127
|
Hervas S, Sanz E, Casillas S, Pool JE, Barbadilla A. PopFly: the Drosophila population genomics browser. Bioinformatics 2018; 33:2779-2780. [PMID: 28472360 DOI: 10.1093/bioinformatics/btx301] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Received: 02/24/2017] [Accepted: 05/02/2017] [Indexed: 11/15/2022] Open
Abstract
Summary: The recent compilation of over 1100 worldwide wild-derived Drosophila melanogaster genome sequences reassembled using a standardized pipeline provides a unique resource for population genomic studies (Drosophila Genome Nexus, DGN). A visual display of the estimated metrics describing genome-wide variation and selection patterns would allow gaining a global view and understanding of the evolutionary forces shaping genome variation. Availability and implementation: Here, we present PopFly, a population genomics-oriented genome browser, based on JBrowse software, that contains a complete inventory of population genomic parameters estimated from DGN data. This browser is designed for the automatic analysis and display of genetic variation data within and between populations along the D. melanogaster genome. PopFly allows the visualization and retrieval of functional annotations, estimates of nucleotide diversity metrics, linkage disequilibrium statistics, recombination rates, a battery of neutrality tests, and population differentiation parameters at different window sizes through the euchromatic chromosomes. PopFly is open and freely available at http://popfly.uab.cat. Contact: sergi.hervas@uab.cat or antonio.barbadilla@uab.cat.
Affiliation(s)
- Sergi Hervas
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| | - Esteve Sanz
- Servei de Genòmica i Bioinformàtica, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| | - Sònia Casillas
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| | - John E Pool
- Laboratory of Genetics, University of Wisconsin, Madison, WI 53706, USA
| | - Antonio Barbadilla
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès 08193, Spain
| |
|
128
|
Vy HMT, Won YJ, Kim Y. Multiple Modes of Positive Selection Shaping the Patterns of Incomplete Selective Sweeps over African Populations of Drosophila melanogaster. Mol Biol Evol 2018; 34:2792-2807. [PMID: 28981697 DOI: 10.1093/molbev/msx207] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Indexed: 11/14/2022] Open
Abstract
It remains a challenge in evolutionary genetics to elucidate how beneficial mutations arise and propagate in a population and how selective pressures on mutant alleles are structured over space and time. By identifying "sweeping haplotypes (SHs)" that putatively carry beneficial alleles and are increasing (or have increased) rapidly in frequency, and surveying the geographic distribution of SH frequencies, we can indirectly infer how selective sweeps unfold in time and thus which modes of positive selection underlie those sweeps. Using population genomic data from African Drosophila melanogaster, we identified SHs from 37 candidate loci under selection. At more than half of loci, we identify single SHs. However, many other loci harbor multiple independent SHs, namely soft selective sweeps, either due to parallel evolution across space or a high beneficial mutation rate. At about a quarter of the loci, intermediate SH frequencies are found across multiple populations, which cannot be explained unless a certain form of frequency-dependent positive selection, such as heterozygote advantage, is invoked given the reasonable range of migration rates between African populations. At one locus, many independent SHs are observed over multiple populations but always together with ancestral haplotypes. This complex pattern is compatible with a large number of mutational targets in a gene and frequency-dependent selection on new variants. We conclude that very diverse modes of positive selection are operating at different sets of loci in D. melanogaster populations.
Affiliation(s)
- Ha My T Vy
- Division of EcoScience, Ewha Womans University, Seoul, Korea
| | - Yong-Jin Won
- Division of EcoScience, Ewha Womans University, Seoul, Korea
- Department of Life Science, Ewha Womans University, Seoul, Korea
| | - Yuseob Kim
- Division of EcoScience, Ewha Womans University, Seoul, Korea
- Department of Life Science, Ewha Womans University, Seoul, Korea
| |
|
129
|
Stukenbrock EH, Dutheil JY. Fine-Scale Recombination Maps of Fungal Plant Pathogens Reveal Dynamic Recombination Landscapes and Intragenic Hotspots. Genetics 2018; 208:1209-1229. [PMID: 29263029 PMCID: PMC5844332 DOI: 10.1534/genetics.117.300502] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Received: 11/12/2017] [Accepted: 12/15/2017] [Indexed: 11/18/2022] Open
Abstract
Meiotic recombination is an important driver of evolution. Variability in the intensity of recombination across chromosomes can affect sequence composition, nucleotide variation, and rates of adaptation. In many organisms, recombination events are concentrated within short segments termed recombination hotspots. The variation in recombination rate and positions of recombination hotspots can be studied using population genomics data and statistical methods. In this study, we conducted population genomics analyses to address the evolution of recombination in two closely related fungal plant pathogens: the prominent wheat pathogen Zymoseptoria tritici and a sister species infecting wild grasses, Z. ardabiliae. We specifically addressed whether recombination landscapes, including hotspot positions, are conserved in the two recently diverged species and if recombination contributes to rapid evolution of pathogenicity traits. We conducted a detailed simulation analysis to assess the performance of methods of recombination rate estimation based on patterns of linkage disequilibrium, in particular in the context of high nucleotide diversity. Our analyses reveal overall high recombination rates, a lack of suppressed recombination in centromeres, and significantly lower recombination rates on chromosomes that are known to be accessory. The comparison of the recombination landscapes of the two species reveals a strong correlation of recombination rate at the megabase scale, but little correlation at smaller scales. The recombination landscapes in both pathogen species are dominated by frequent recombination hotspots across the genome including coding regions, suggesting a strong impact of recombination on gene evolution. A significant but small fraction of these hotspots colocalize between the two species, suggesting that hotspot dynamics contribute to the overall pattern of fast evolving recombination in these species.
Affiliation(s)
- Eva H Stukenbrock
- Environmental Genomics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
- Environmental Genomics, Christian-Albrechts University of Kiel, 24118, Germany
| | - Julien Y Dutheil
- Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
- Institut des Sciences de L'Évolution de Montpellier, Centre National de la Recherche Scientifique, Université Montpellier 2, 34095, France
| |
|
130
|
Doyle SR, Laing R, Bartley DJ, Britton C, Chaudhry U, Gilleard JS, Holroyd N, Mable BK, Maitland K, Morrison AA, Tait A, Tracey A, Berriman M, Devaney E, Cotton JA, Sargison ND. A Genome Resequencing-Based Genetic Map Reveals the Recombination Landscape of an Outbred Parasitic Nematode in the Presence of Polyploidy and Polyandry. Genome Biol Evol 2018; 10:396-409. [PMID: 29267942 PMCID: PMC5793844 DOI: 10.1093/gbe/evx269] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Accepted: 12/15/2017] [Indexed: 12/27/2022] Open
Abstract
The parasitic nematode Haemonchus contortus is an economically and clinically important pathogen of small ruminants, and a model system for understanding the mechanisms and evolution of traits such as anthelmintic resistance. Anthelmintic resistance is widespread and is a major threat to the sustainability of livestock agriculture globally; however, little is known about the genome architecture and parameters such as recombination that will ultimately influence the rate at which resistance may evolve and spread. Here, we performed a genetic cross between two divergent strains of H. contortus, and subsequently used whole-genome resequencing of a female worm and her brood to identify the distribution of genome-wide variation that characterizes these strains. Using a novel bioinformatic approach to identify variants that segregate as expected in a pseudotestcross, we characterized linkage groups and estimated genetic distances between markers to generate a chromosome-scale F1 genetic map. We exploited this map to reveal the recombination landscape, the first for any helminth species, demonstrating extensive variation in recombination rate within and between chromosomes. Analyses of these data also revealed the extent of polyandry, whereby at least eight males were found to have contributed to the genetic variation of the progeny analyzed. Triploid offspring were also identified, which we hypothesize are the result of nondisjunction during female meiosis or polyspermy. These results expand our knowledge of the genetics of parasitic helminths and the unusual life-history of H. contortus, and enhance ongoing efforts to understand the genetic basis of resistance to the drugs used to control these worms and for related species that infect livestock and humans throughout the world. 
This study also demonstrates the feasibility of using whole-genome resequencing data to directly construct a genetic map in a single generation cross from a noninbred nonmodel organism with a complex lifecycle.
Affiliation(s)
- Stephen R Doyle
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
| | - Roz Laing
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - David J Bartley
- Moredun Research Institute, Pentlands Science Park, Penicuik, United Kingdom
| | - Collette Britton
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - Umer Chaudhry
- Royal (Dick) School of Veterinary Studies, University of Edinburgh, United Kingdom
| | - John S Gilleard
- Department of Comparative Biology and Experimental Medicine, Faculty of Veterinary Medicine, University of Calgary, Alberta, Canada
| | - Nancy Holroyd
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
| | - Barbara K Mable
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - Kirsty Maitland
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - Alison A Morrison
- Moredun Research Institute, Pentlands Science Park, Penicuik, United Kingdom
| | - Andy Tait
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - Alan Tracey
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
| | - Eileen Devaney
- Institute of Biodiversity Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, United Kingdom
| | - James A Cotton
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
| | - Neil D Sargison
- Royal (Dick) School of Veterinary Studies, University of Edinburgh, United Kingdom
| |
|
131
|
Dapper AL, Payseur BA. Effects of Demographic History on the Detection of Recombination Hotspots from Linkage Disequilibrium. Mol Biol Evol 2018; 35:335-353. [PMID: 29045724 PMCID: PMC5850621 DOI: 10.1093/molbev/msx272] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Indexed: 12/28/2022] Open
Abstract
In some species, meiotic recombination is concentrated in small genomic regions. These "recombination hotspots" leave signatures in fine-scale patterns of linkage disequilibrium, raising the prospect that the genomic landscape of hotspots can be characterized from sequence variation. This approach has led to the inference that hotspots evolve rapidly in some species, but are conserved in others. Historic demographic events, such as population bottlenecks, are known to affect patterns of linkage disequilibrium across the genome, violating population genetic assumptions of this approach. Although such events are prevalent, demographic history is generally ignored when making inferences about the evolution of recombination hotspots. To determine the effect of demography on the detection of recombination hotspots, we use the coalescent to simulate haplotypes with a known recombination landscape. We measure the ability of popular linkage disequilibrium-based programs to detect hotspots across a range of demographic histories, including population bottlenecks, hidden population structure, population expansions, and population contractions. We find that demographic events have the potential to greatly reduce the power and increase the false positive rate of hotspot discovery. Neither the power nor the false positive rate of hotspot detection can be predicted without also knowing the demographic history of the sample. Our results suggest that ignoring demographic history likely overestimates the power to detect hotspots and therefore underestimates the degree of hotspot sharing between species. We suggest strategies for incorporating demographic history into population genetic inferences about recombination hotspots.
Affiliation(s)
- Amy L Dapper
- Laboratory of Genetics, University of Wisconsin, Madison, WI
| | - Bret A Payseur
- Laboratory of Genetics, University of Wisconsin, Madison, WI
| |
|
132
|
Tiemann-Boege I, Schwarz T, Striedner Y, Heissl A. The consequences of sequence erosion in the evolution of recombination hotspots. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160462. [PMID: 29109225 PMCID: PMC5698624 DOI: 10.1098/rstb.2016.0462] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Accepted: 09/05/2017] [Indexed: 12/18/2022] Open
Abstract
Meiosis is initiated by a double-strand break (DSB) introduced in the DNA by a highly controlled process that is repaired by recombination. In many organisms, recombination occurs at specific and narrow regions of the genome, known as recombination hotspots, which overlap with regions enriched for DSBs. In recent years, it has been demonstrated that conversions and mutations resulting from the repair of DSBs lead to a rapid sequence evolution at recombination hotspots eroding target sites for DSBs. We still do not fully understand the effect of this erosion in the recombination activity, but evidence has shown that the binding of trans-acting factors like PRDM9 is affected. PRDM9 is a meiosis-specific, multi-domain protein that recognizes DNA target motifs by its zinc finger domain and directs DSBs to these target sites. Here we discuss the changes in affinity of PRDM9 to eroded recognition sequences, and explain how these changes in affinity of PRDM9 can affect recombination, leading sometimes to sterility in the context of hybrid crosses. We also present experimental data showing that DNA methylation reduces PRDM9 binding in vitro. Finally, we discuss PRDM9-independent hotspots, posing the question how these hotspots evolve and change with sequence erosion. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Irene Tiemann-Boege
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Theresa Schwarz
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Yasmin Striedner
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| | - Angelika Heissl
- Institute of Biophysics, Johannes Kepler University, Linz, Gruberstraße 40, 4020 Linz, Austria
| |
|
133
|
Stevison LS, Sefick S, Rushton C, Graze RM. Recombination rate plasticity: revealing mechanisms by design. Philos Trans R Soc Lond B Biol Sci 2017; 372:20160459. [PMID: 29109222 PMCID: PMC5698621 DOI: 10.1098/rstb.2016.0459] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Accepted: 08/01/2017] [Indexed: 12/13/2022] Open
Abstract
For over a century, scientists have known that meiotic recombination rates can vary considerably among individuals, and that environmental conditions can modify recombination rates relative to the background. A variety of external and intrinsic factors such as temperature, age, sex and starvation can elicit 'plastic' responses in recombination rate. The influence of recombination rate plasticity on genetic diversity of the next generation has interesting and important implications for how populations evolve. Further, many questions remain regarding the mechanisms and molecular processes that contribute to recombination rate plasticity. Here, we review 100 years of experimental work on recombination rate plasticity conducted in Drosophila melanogaster. We categorize this work into four major classes of experimental designs, which we describe via classic studies in D. melanogaster. Based on these studies, we highlight molecular mechanisms that are supported by experimental results and relate these findings to studies in other systems. We synthesize lessons learned from this model system into experimental guidelines for using recent advances in genotyping technologies, to study recombination rate plasticity in non-model organisms. Specifically, we recommend (1) using fine-scale genome-wide markers, (2) collecting time-course data, (3) including crossover distribution measurements, and (4) using mixed effects models to analyse results. To illustrate this approach, we present an application adhering to these guidelines from empirical work we conducted in Drosophila pseudoobscura. This article is part of the themed issue 'Evolutionary causes and consequences of recombination rate variation in sexual organisms'.
Affiliation(s)
- Laurie S Stevison
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Stephen Sefick
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Chase Rushton
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Rita M Graze
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| |
|
134
|
Petit M, Astruc JM, Sarry J, Drouilhet L, Fabre S, Moreno CR, Servin B. Variation in Recombination Rate and Its Genetic Determinism in Sheep Populations. Genetics 2017; 207:767-784. [PMID: 28978774 PMCID: PMC5629338 DOI: 10.1534/genetics.117.300123] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Received: 06/30/2017] [Accepted: 07/31/2017] [Indexed: 01/19/2023] Open
Abstract
Recombination is a complex biological process that results from a cascade of multiple events during meiosis. Understanding the genetic determinism of recombination can help to understand if and how these events are interacting. To tackle this question, we studied the patterns of recombination in sheep, using multiple approaches and data sets. We constructed male recombination maps in a dairy breed from the south of France (the Lacaune breed) at a fine scale by combining meiotic recombination rates from a large pedigree genotyped with a 50K SNP array and historical recombination rates from a sample of unrelated individuals genotyped with a 600K SNP array. This analysis revealed recombination patterns in sheep similar to other mammals but also genome regions that have likely been affected by directional and diversifying selection. We estimated the average recombination rate of Lacaune sheep at 1.5 cM/Mb, identified ∼50,000 crossover hotspots on the genome, and found a high correlation between historical and meiotic recombination rate estimates. A genome-wide association study revealed two major loci affecting interindividual variation in recombination rate in Lacaune, including the RNF212 and HEI10 genes and possibly two other loci of smaller effects including the KCNJ15 and FSHR genes. The comparison of these new results to those obtained previously in a distantly related population of domestic sheep (the Soay) revealed that Soay and Lacaune males have a very similar distribution of recombination along the genome. The two data sets were thus combined to create more precise male meiotic recombination maps in Sheep. However, despite their similar recombination maps, Soay and Lacaune males were found to exhibit different heritabilities and QTL effects for interindividual variation in genome-wide recombination rates. This highlights the robustness of recombination patterns to underlying variation in their genetic determinism.
Affiliation(s)
- Morgane Petit
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| | | | - Julien Sarry
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| | - Laurence Drouilhet
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| | - Stéphane Fabre
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| | - Carole R Moreno
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| | - Bertrand Servin
- INRA, Génétique, Physiologie et Systèmes d'Elevage, F-31326 Castanet-Tolosan, France
| |
|
135
|
Arenas M, Araujo NM, Branco C, Castelhano N, Castro-Nallar E, Pérez-Losada M. Mutation and recombination in pathogen evolution: Relevance, methods and controversies. Infect Genet Evol 2017; 63:295-306. [PMID: 28951202 DOI: 10.1016/j.meegid.2017.09.029] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 07/26/2017] [Revised: 09/20/2017] [Accepted: 09/21/2017] [Indexed: 02/06/2023]
Abstract
Mutation and recombination drive the evolution of most pathogens by generating the genetic variants upon which selection operates. Those variants can, for example, confer resistance to host immune systems and drug therapies or lead to epidemic outbreaks. Given their importance, diverse evolutionary studies have investigated the abundance and consequences of mutation and recombination in pathogen populations. However, some controversies persist regarding the contribution of each evolutionary force to the development of particular phenotypic observations (e.g., drug resistance). In this study, we revise the importance of mutation and recombination in the evolution of pathogens at both intra-host and inter-host levels. We also describe state-of-the-art analytical methodologies to detect and quantify these two evolutionary forces, including biases that are often ignored in evolutionary studies. Finally, we present some of our former studies involving pathogenic taxa where mutation and recombination played crucial roles in the recovery of pathogenic fitness, the generation of interspecific genetic diversity, or the design of centralized vaccines. This review also illustrates several common controversies and pitfalls in the analysis and in the evaluation and interpretation of mutation and recombination outcomes.
Affiliation(s)
- Miguel Arenas
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain; Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal; Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
| | - Natalia M Araujo
- Laboratory of Molecular Virology, Oswaldo Cruz Institute, FIOCRUZ, Rio de Janeiro, Brazil.
| | - Catarina Branco
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal; Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
| | - Nadine Castelhano
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal; Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
| | - Eduardo Castro-Nallar
- Universidad Andrés Bello, Center for Bioinformatics and Integrative Biology, Facultad de Ciencias Biológicas, Santiago, Chile.
| | - Marcos Pérez-Losada
- Computational Biology Institute, Milken Institute School of Public Health, George Washington University, Ashburn, VA 20147, Washington, DC, United States; CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão 4485-661, Portugal.
|
136
|
Booker TR, Ness RW, Keightley PD. The Recombination Landscape in Wild House Mice Inferred Using Population Genomic Data. Genetics 2017; 207:297-309. [PMID: 28751421 PMCID: PMC5586380 DOI: 10.1534/genetics.117.300063] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Received: 02/27/2017] [Accepted: 07/19/2017] [Indexed: 11/29/2022] Open
Abstract
Characterizing variation in the rate of recombination across the genome is important for understanding several evolutionary processes. Previous analysis of the recombination landscape in laboratory mice has revealed that the different subspecies have different suites of recombination hotspots. It is unknown, however, whether hotspots identified in laboratory strains reflect the hotspot diversity of natural populations or whether broad-scale variation in the rate of recombination is conserved between subspecies. In this study, we constructed fine-scale recombination rate maps for a natural population of the Eastern house mouse, Mus musculus castaneus. We performed simulations to assess the accuracy of recombination rate inference in the presence of phase errors, and we used a novel approach to quantify phase error. The spatial distribution of recombination events is strongly positively correlated between our castaneus map and a map constructed using inbred lines derived predominantly from M. m. domesticus. Recombination hotspots in wild castaneus show little overlap, however, with the locations of double-strand breaks in wild-derived house mouse strains. Finally, we also find that genetic diversity in M. m. castaneus is positively correlated with the rate of recombination, consistent with pervasive natural selection operating in the genome. Our study suggests that recombination rate variation is conserved at broad scales between house mouse subspecies, but it is not strongly conserved at fine scales.
Affiliation(s)
- Tom R Booker
- Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL, United Kingdom
| | - Rob W Ness
- Department of Biology, University of Toronto Mississauga, Ontario, L5L 1C6, Canada
| | - Peter D Keightley
- Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL, United Kingdom
|
137
|
Johnston SE, Huisman J, Ellis PA, Pemberton JM. A High-Density Linkage Map Reveals Sexual Dimorphism in Recombination Landscapes in Red Deer (Cervus elaphus). G3 (Bethesda, Md.) 2017; 7:2859-2870. [PMID: 28667018 PMCID: PMC5555489 DOI: 10.1534/g3.117.044198] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Received: 04/05/2017] [Accepted: 06/27/2017] [Indexed: 11/29/2022]
Abstract
High-density linkage maps are an important tool to gain insight into the genetic architecture of traits of evolutionary and economic interest, and provide a resource to characterize variation in recombination landscapes. Here, we used information from the cattle genome and the 50 K Cervine Illumina BeadChip to inform and refine a high-density linkage map in a wild population of red deer (Cervus elaphus). We constructed a predicted linkage map of 38,038 SNPs and a skeleton map of 10,835 SNPs across 34 linkage groups. We identified several chromosomal rearrangements in the deer lineage relative to sheep and cattle, including six chromosome fissions, one fusion, and two large inversions. Otherwise, our findings showed strong concordance with map orders in the cattle genome. The sex-averaged linkage map length was 2739.7 cM and the genome-wide autosomal recombination rate was 1.04 cM/Mb. The female autosomal map length was 1.21 times longer than that of males (2767.4 cM vs. 2280.8 cM, respectively). Sex differences in map length were driven by high female recombination rates in peri-centromeric regions, a pattern that is unusual relative to other mammal species. This effect was more pronounced in fission chromosomes that would have had to produce new centromeres. We propose two hypotheses to explain this effect: (1) that this mechanism may have evolved to counteract centromeric drive associated with meiotic asymmetry in oocyte production; and/or (2) that sequence and structural characteristics suppressing recombination in close proximity to the centromere may not have evolved at neo-centromeres. Our study provides insight into how recombination landscapes vary and evolve in mammals, and will provide a valuable resource for studies of evolution, genetic improvement, and population management in red deer and related species.
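The sex difference quoted in this abstract is a ratio of two map lengths, and it can be checked directly from the reported figures. A minimal sketch, with illustrative variable names that are not taken from the paper:

```python
# Autosomal map lengths quoted in the abstract, in centimorgans.
female_cm = 2767.4
male_cm = 2280.8

# Female-to-male ratio of autosomal map length.
ratio = female_cm / male_cm
print(f"female/male map length ratio: {ratio:.2f}")
```

The result rounds to the factor of 1.21 given in the abstract.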
Affiliation(s)
- Susan E Johnston
- Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jisca Huisman
- Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL, United Kingdom
| | - Philip A Ellis
- Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL, United Kingdom
|
138
|
Kawakami T, Mugal CF, Suh A, Nater A, Burri R, Smeds L, Ellegren H. Whole-genome patterns of linkage disequilibrium across flycatcher populations clarify the causes and consequences of fine-scale recombination rate variation in birds. Mol Ecol 2017; 26:4158-4172. [DOI: 10.1111/mec.14197] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Received: 04/08/2017] [Revised: 05/02/2017] [Accepted: 05/15/2017] [Indexed: 12/17/2022]
Affiliation(s)
- Takeshi Kawakami
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
- Department of Animal and Plant Sciences; University of Sheffield; Sheffield UK
| | - Carina F. Mugal
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
| | - Alexander Suh
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
| | - Alexander Nater
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
- Department of Evolutionary Biology and Environmental Studies; University of Zurich; Zürich Switzerland
| | - Reto Burri
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
- Department of Population Ecology; Friedrich Schiller University Jena; Jena Germany
| | - Linnéa Smeds
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology; Evolutionary Biology Centre (EBC); Uppsala University; Uppsala Sweden
|
139
|
Vijay N, Weissensteiner M, Burri R, Kawakami T, Ellegren H, Wolf JBW. Genomewide patterns of variation in genetic diversity are shared among populations, species and higher-order taxa. Mol Ecol 2017; 26:4284-4295. [PMID: 28570015 DOI: 10.1111/mec.14195] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Received: 12/28/2016] [Revised: 05/10/2017] [Accepted: 05/17/2017] [Indexed: 12/15/2022]
Abstract
Genomewide screens of genetic variation within and between populations can reveal signatures of selection implicated in adaptation and speciation. Genomic regions with low genetic diversity and elevated differentiation reflective of locally reduced effective population sizes (Ne) are candidates for barrier loci contributing to population divergence. Yet, such candidate genomic regions need not arise as a result of selection promoting adaptation or advancing reproductive isolation. Linked selection unrelated to lineage-specific adaptation or population divergence can generate comparable signatures. It is challenging to distinguish between these processes, particularly when diverging populations share ancestral genetic variation. In this study, we took a comparative approach using population assemblages from distant clades assessing genomic parallelism of variation in Ne. Utilizing population-level polymorphism data from 444 resequenced genomes of three avian clades spanning 50 million years of evolution, we tested whether population genetic summary statistics reflecting genomewide variation in Ne would covary among populations within clades, and importantly, also among clades where lineage sorting has been completed. All statistics including population-scaled recombination rate (ρ), nucleotide diversity (π) and measures of genetic differentiation between populations (FST, PBS, dxy) were significantly correlated across all phylogenetic distances. Moreover, genomic regions with elevated levels of genetic differentiation were associated with inferred pericentromeric and subtelomeric regions. The phylogenetic stability of diversity landscapes and stable association with genomic features support a role of linked selection not necessarily associated with adaptation and speciation in shaping patterns of genomewide heterogeneity in genetic diversity.
Affiliation(s)
- Nagarjun Vijay
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden.,Lab of Molecular and Genomic Evolution, Department of Ecology and Evolutionary Biology, College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI, USA
| | - Matthias Weissensteiner
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden.,Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Reto Burri
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden.,Department of Population Ecology, Friedrich Schiller University Jena, Jena, Germany
| | - Takeshi Kawakami
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| | - Hans Ellegren
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Jochen B W Wolf
- Department of Evolutionary Biology and SciLifeLab, Uppsala University, Uppsala, Sweden.,Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
|
140
|
Baker Z, Schumer M, Haba Y, Bashkirova L, Holland C, Rosenthal GG, Przeworski M. Repeated losses of PRDM9-directed recombination despite the conservation of PRDM9 across vertebrates. eLife 2017; 6:e24133. [PMID: 28590247 PMCID: PMC5519329 DOI: 10.7554/elife.24133] [Citation(s) in RCA: 89] [Impact Index Per Article: 11.1] [Received: 12/10/2016] [Accepted: 06/03/2017] [Indexed: 01/01/2023] Open
Abstract
Studies of highly diverged species have revealed two mechanisms by which meiotic recombination is directed to the genome (through PRDM9 binding or by targeting promoter-like features) that lead to dramatically different evolutionary dynamics of hotspots. Here, we identify PRDM9 orthologs from genome and transcriptome data in 225 species. We find the complete PRDM9 ortholog across distantly related vertebrates but, despite this broad conservation, infer a minimum of six partial and three complete losses. Strikingly, taxa carrying the complete ortholog of PRDM9 are precisely those with rapid evolution of its predicted binding affinity, suggesting that all domains are necessary for directing recombination. Indeed, as we show, swordtail fish carrying only a partial but conserved ortholog share recombination properties with PRDM9 knock-outs.
Affiliation(s)
- Zachary Baker
- Department of Systems Biology, Columbia University, New York City, United States
| | - Molly Schumer
- Department of Biological Sciences, Columbia University, New York City, United States
- Harvard Society of Fellows, Harvard University, Cambridge, United States
- Centro de Investigaciones Científicas de las Huastecas 'Aguazarca', Hidalgo, Mexico
| | - Yuki Haba
- Department of Evolution, Ecology and Environmental Biology, Columbia University, New York City, United States
| | - Lisa Bashkirova
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York City, United States
| | - Chris Holland
- Centro de Investigaciones Científicas de las Huastecas 'Aguazarca', Hidalgo, Mexico
- Department of Biology, Texas A&M University, College Station, United States
| | - Gil G Rosenthal
- Centro de Investigaciones Científicas de las Huastecas 'Aguazarca', Hidalgo, Mexico
- Department of Biology, Texas A&M University, College Station, United States
| | - Molly Przeworski
- Department of Systems Biology, Columbia University, New York City, United States
- Department of Biological Sciences, Columbia University, New York City, United States
|
141
|
Ragsdale AP, Gutenkunst RN. Inferring Demographic History Using Two-Locus Statistics. Genetics 2017; 206:1037-1048. [PMID: 28413158 PMCID: PMC5499162 DOI: 10.1534/genetics.117.201251] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Received: 02/14/2017] [Accepted: 04/07/2017] [Indexed: 11/18/2022] Open
Abstract
Population demographic history may be learned from contemporary genetic variation data. Methods based on aggregating the statistics of many single loci into an allele frequency spectrum (AFS) have proven powerful, but such methods ignore potentially informative patterns of linkage disequilibrium (LD) between neighboring loci. To leverage such patterns, we developed a composite-likelihood framework for inferring demographic history from aggregated statistics of pairs of loci. Using this framework, we show that two-locus statistics are more sensitive to demographic history than single-locus statistics such as the AFS. In particular, two-locus statistics escape the notorious confounding of depth and duration of a bottleneck, and they provide a means to estimate effective population size based on the recombination rather than mutation rate. We applied our approach to a Zambian population of Drosophila melanogaster. Notably, using both single- and two-locus statistics, we inferred a substantially lower ancestral effective population size than previous works and did not infer a bottleneck history. Together, our results demonstrate the broad potential for two-locus statistics to enable powerful population genetic inference.
Affiliation(s)
- Aaron P Ragsdale
- Program in Applied Mathematics, University of Arizona, Tucson, Arizona 85721
| | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, Arizona 85721
|
142
|
Abstract
Molecular population genetics aims to explain genetic variation and molecular evolution from population genetics principles. The field was born 50 years ago with the first measures of genetic variation in allozyme loci, continued with the nucleotide sequencing era, and is currently in the era of population genomics. During this period, molecular population genetics has been revolutionized by progress in data acquisition and theoretical developments. The conceptual elegance of the neutral theory of molecular evolution or the footprint carved by natural selection on the patterns of genetic variation are two examples of the vast number of inspiring findings of population genetics research. Since the inception of the field, Drosophila has been the prominent model species: molecular variation in populations was first described in Drosophila and most of the population genetics hypotheses were tested in Drosophila species. In this review, we describe the main concepts, methods, and landmarks of molecular population genetics, using the Drosophila model as a reference. We describe the different genetic data sets made available by advances in molecular technologies, and the theoretical developments fostered by these data. Finally, we review the results and new insights provided by the population genomics approach, and conclude by enumerating challenges and new lines of inquiry posed by increasingly large population scale sequence data.
|
143
|
Variation in Recombination Rate: Adaptive or Not? Trends Genet 2017; 33:364-374. [DOI: 10.1016/j.tig.2017.03.003] [Citation(s) in RCA: 100] [Impact Index Per Article: 12.5] [Received: 12/19/2016] [Revised: 03/06/2017] [Accepted: 03/07/2017] [Indexed: 01/30/2023]
|
144
|
Weissensteiner MH, Pang AWC, Bunikis I, Höijer I, Vinnere-Petterson O, Suh A, Wolf JBW. Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications. Genome Res 2017; 27:697-708. [PMID: 28360231 PMCID: PMC5411765 DOI: 10.1101/gr.215095.116] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Received: 08/26/2016] [Accepted: 03/10/2017] [Indexed: 12/27/2022]
Abstract
Accurate and contiguous genome assembly is key to a comprehensive understanding of the processes shaping genomic diversity and evolution. Yet, it is frequently constrained by constitutive heterochromatin, usually characterized by highly repetitive DNA. As a key feature of genome architecture associated with centromeric and subtelomeric regions, it locally influences meiotic recombination. In this study, we assess the impact of large tandem repeat arrays on the recombination rate landscape in an avian speciation model, the Eurasian crow. We assembled two high-quality genome references using single-molecule real-time sequencing (long-read assembly [LR]) and single-molecule optical maps (optical map assembly [OM]). A three-way comparison including the published short-read assembly (SR) constructed for the same individual allowed assessing assembly properties and pinpointing misassemblies. By combining information from all three assemblies, we characterized 36 previously unidentified large repetitive regions in the proximity of sequence assembly breakpoints, the majority of which contained complex arrays of a 14-kb satellite repeat or its 1.2-kb subunit. Using whole-genome population resequencing data, we estimated the population-scaled recombination rate (ρ) and found it to be significantly reduced in these regions. These findings are consistent with an effect of low recombination in regions adjacent to centromeric or subtelomeric heterochromatin and add to our understanding of the processes generating widespread heterogeneity in genetic diversity and differentiation along the genome. By combining three different technologies, our results highlight the importance of adding a layer of information on genome structure that is inaccessible to each approach independently.
Affiliation(s)
- Matthias H Weissensteiner
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilian University of Munich, 82152 Planegg-Martinsried, Germany
| | - Ignas Bunikis
- SciLife Lab Uppsala, Uppsala University SE-751 85 Uppsala, Sweden
| | - Ida Höijer
- SciLife Lab Uppsala, Uppsala University SE-751 85 Uppsala, Sweden
| | - Alexander Suh
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Jochen B W Wolf
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilian University of Munich, 82152 Planegg-Martinsried, Germany
|
145
|
Han F, Lamichhaney S, Grant BR, Grant PR, Andersson L, Webster MT. Gene flow, ancient polymorphism, and ecological adaptation shape the genomic landscape of divergence among Darwin's finches. Genome Res 2017; 27:1004-1015. [PMID: 28442558 PMCID: PMC5453315 DOI: 10.1101/gr.212522.116] [Citation(s) in RCA: 116] [Impact Index Per Article: 14.5] [Received: 07/08/2016] [Accepted: 02/14/2017] [Indexed: 12/22/2022]
Abstract
Genomic comparisons of closely related species have identified “islands” of locally elevated sequence divergence. Genomic islands may contain functional variants involved in local adaptation or reproductive isolation and may therefore play an important role in the speciation process. However, genomic islands can also arise through evolutionary processes unrelated to speciation, and examination of their properties can illuminate how new species evolve. Here, we performed scans for regions of high relative divergence (FST) in 12 species pairs of Darwin's finches at different genetic distances. In each pair, we identify genomic islands that are, on average, elevated in both relative divergence (FST) and absolute divergence (dXY). This signal indicates that haplotypes within these genomic regions became isolated from each other earlier than the rest of the genome. Interestingly, similar numbers of genomic islands of elevated dXY are observed in sympatric and allopatric species pairs, suggesting that recent gene flow is not a major factor in their formation. We find that two of the most pronounced genomic islands contain the ALX1 and HMGA2 loci, which are associated with variation in beak shape and size, respectively, suggesting that they are involved in ecological adaptation. A subset of genomic island regions, including these loci, appears to represent anciently diverged haplotypes that evolved early during the radiation of Darwin's finches. Comparative genomics data indicate that these loci, and genomic islands in general, have exceptionally low recombination rates, which may play a role in their establishment.
Affiliation(s)
- Fan Han
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75123 Uppsala, Sweden
| | - Sangeet Lamichhaney
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75123 Uppsala, Sweden
| | - B Rosemary Grant
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544-2016, USA
| | - Peter R Grant
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544-2016, USA
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75123 Uppsala, Sweden.,Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden.,Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843-4461, USA
| | - Matthew T Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 75123 Uppsala, Sweden
|
146
|
Puzey JR, Willis JH, Kelly JK. Population structure and local selection yield high genomic variation in Mimulus guttatus. Mol Ecol 2017; 26:519-535. [PMID: 27859786 PMCID: PMC5274581 DOI: 10.1111/mec.13922] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Received: 11/12/2015] [Revised: 09/30/2016] [Accepted: 11/07/2016] [Indexed: 12/30/2022]
Abstract
Across western North America, Mimulus guttatus exists as many local populations adapted to site-specific environmental challenges. Gene flow between locally adapted populations will affect genetic diversity both within demes and across the larger metapopulation. Here, we analyse 34 whole-genome sequences from the intensively studied Iron Mountain population (IM) in conjunction with sequences from 22 Mimulus individuals sampled from across western North America. Three striking features of these data address hypotheses about migration and selection in a locally adapted population. First, we find very high levels of intrapopulation polymorphism (synonymous π = 0.033). Variation outside of genes is likely even higher but difficult to estimate because excessive divergence reduces the efficiency of read mapping. Second, IM exhibits a significantly positive genomewide average for Tajima's D. This indicates allele frequencies are typically more intermediate than expected from neutrality, opposite the pattern observed in many other species. Third, IM exhibits a distinctive haplotype structure with a genomewide excess of positive associations between rarer alleles at linked loci. This suggests an important effect of gene flow from other Mimulus populations, although a residual effect of population founding might also contribute. The combination of multiple analyses, including a novel tree-based analytic method, illustrates how the balance of local selection, limited dispersal and metapopulation dynamics manifests across the genome. The overall genomic pattern of sequence diversity suggests successful gene flow of divergent immigrant genotypes into IM. However, many loci show patterns indicative of local adaptation, particularly at SNPs associated with chromosomal inversions.
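The abstract above reads a positive Tajima's D as a sign that allele frequencies are more intermediate than expected under neutrality. The sketch below illustrates that interpretation using the standard textbook definition of the statistic (Tajima 1989) on two toy data sets. It is not the authors' pipeline; the function name, the 0/1 encoding of alleles and the example haplotypes are all assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(haplotypes):
    """Return Tajima's D for a list of equal-length 0/1 allele sequences."""
    n = len(haplotypes)
    if n < 4:
        # For fewer than 4 sequences the variance term vanishes or is undefined.
        raise ValueError("need at least 4 sequences")
    n_sites = len(haplotypes[0])
    # S: number of segregating sites (columns carrying more than one allele).
    s = sum(1 for j in range(n_sites) if len({h[j] for h in haplotypes}) > 1)
    if s == 0:
        raise ValueError("no segregating sites")
    # pi: mean number of pairwise differences between sequences.
    pairs = list(combinations(haplotypes, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)
    # Normalising constants from the standard definition.
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n**2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)
    # D contrasts pi with Watterson's estimator S / a1.
    return (pi - s / a1) / sqrt(e1 * s + e2 * s * (s - 1))


# Toy data: every variant at intermediate frequency (2 of 4 copies).
intermediate = [[1, 1, 1, 1, 1], [1, 1, 1, 1, 1], [0, 0, 0, 0, 0], [0, 0, 0, 0, 0]]
# Toy data: every variant rare (1 of 4 copies).
rare = [[1, 0, 0, 0, 0], [0, 1, 0, 0, 0], [0, 0, 1, 1, 0], [0, 0, 0, 0, 1]]

print("intermediate-frequency variants:", tajimas_d(intermediate))
print("rare variants:", tajimas_d(rare))
```

In this toy example the intermediate-frequency data set gives a positive D and the rare-variant data set gives a negative D, which matches the direction of the interpretation in the abstract.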
Affiliation(s)
- Joshua R. Puzey
- Department of Biology, College of William and Mary, Williamsburg, Virginia, 23187
- Department of Biology, Duke University, Durham, North Carolina, 27708
| | - John H. Willis
- Department of Biology, Duke University, Durham, North Carolina, 27708
| | - John K. Kelly
- Department of Ecology and Evolution, University of Kansas, Lawrence, Kansas, 27708
|
147
|
Vijay N, Bossu CM, Poelstra JW, Weissensteiner MH, Suh A, Kryukov AP, Wolf JBW. Evolution of heterogeneous genome differentiation across multiple contact zones in a crow species complex. Nat Commun 2016; 7:13195. [PMID: 27796282 PMCID: PMC5095515 DOI: 10.1038/ncomms13195] [Citation(s) in RCA: 132] [Impact Index Per Article: 14.7] [Received: 02/10/2016] [Accepted: 09/09/2016] [Indexed: 12/31/2022] Open
Abstract
Uncovering the genetic basis of species diversification is a central goal in evolutionary biology. Yet, the link between the accumulation of genomic changes during population divergence and the evolutionary forces promoting reproductive isolation is poorly understood. Here, we analysed 124 genomes of crow populations with various degrees of genome-wide differentiation, with parallelism of a sexually selected plumage phenotype, and ongoing hybridization. Overall, heterogeneity in genetic differentiation along the genome was best explained by linked selection exposed on a shared genome architecture. Superimposed on this common background, we identified genomic regions with signatures of selection specific to independent phenotypic contact zones. Candidate pigmentation genes with evidence for divergent selection were only partly shared, suggesting context-dependent selection on a multigenic trait architecture and parallelism by pathway rather than by repeated single-gene effects. This study provides insight into how various forms of selection shape genome-wide patterns of genomic differentiation as populations diverge.
Affiliation(s)
- Nagarjun Vijay
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden
| | - Christen M Bossu
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden.,Department of Zoology, Population Genetics, Stockholm University, Stockholm SE-106 91, Sweden
| | - Jelmer W Poelstra
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden
| | - Matthias H Weissensteiner
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden
| | - Alexander Suh
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden
| | - Alexey P Kryukov
- Laboratory of Evolutionary Zoology and Genetics, Institute of Biology and Soil Science, Far East Branch Russian Academy of Sciences, Vladivostok 690022, Russia
| | - Jochen B W Wolf
- Department of Evolutionary Biology and Science for Life Laboratories, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden.,Division of Evolutionary Biology, Ludwig Maximilian University of Munich, Grosshaderner Street 2, Planegg-Martinsried 82152, Germany
|
148
|
Dukić M, Berner D, Roesti M, Haag CR, Ebert D. A high-density genetic map reveals variation in recombination rate across the genome of Daphnia magna. BMC Genet 2016; 17:137. [PMID: 27737627 PMCID: PMC5064971 DOI: 10.1186/s12863-016-0445-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Received: 03/21/2016] [Accepted: 10/04/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: Recombination rate is an essential parameter for many genetic analyses. Recombination rates are highly variable across species, populations, individuals and different genomic regions. Due to the profound influence that recombination can have on intraspecific diversity and interspecific divergence, characterization of recombination rate variation emerges as a key resource for population genomic studies and emphasises the importance of high-density genetic maps as tools for studying genome biology. Here we present such a high-density genetic map for Daphnia magna, and analyse patterns of recombination rate across the genome. RESULTS: An F2 intercross panel was genotyped by Restriction-site Associated DNA sequencing to construct the third-generation linkage map of D. magna. The resulting high-density map included 4037 markers covering 813 scaffolds and contigs that sum up to 77% of the currently available genome draft sequence (v2.4) and 55% of the estimated genome size (238 Mb). The total genetic length of the map presented here is 1614.5 cM and the genome-wide recombination rate is estimated at 6.78 cM/Mb. Merging genetic and physical information, we consistently found that recombination rate estimates are high towards the peripheral parts of the chromosomes, while chromosome centres, harbouring centromeres in D. magna, show very low recombination rate estimates. CONCLUSIONS: Due to its high density, the third-generation linkage map for D. magna can be coupled with the draft genome assembly, providing an essential tool for genome investigation in this model organism. Thus, our linkage map can be used for the on-going improvements of the genome assembly, but more importantly, it has enabled us to characterize variation in recombination rate across the genome of D. magna for the first time. These new insights can provide valuable assistance in future studies of genome evolution, mapping of quantitative traits and population genetic studies.
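A genome-wide recombination rate in cM/Mb is the total genetic map length divided by a physical length. As a hedged reading of the figures in this abstract (the abstract does not state which denominator was used), dividing the map length by the full estimated genome size appears to reproduce the reported rate. Variable names below are illustrative:

```python
# Figures quoted in the abstract.
map_length_cm = 1614.5   # total genetic length of the linkage map, in cM
genome_size_mb = 238.0   # estimated genome size, in Mb

# Genome-wide average recombination rate, assuming the estimated
# genome size is the denominator (an assumption, not stated in the abstract).
rate_cm_per_mb = map_length_cm / genome_size_mb
print(f"genome-wide recombination rate: {rate_cm_per_mb:.2f} cM/Mb")
```

Under that assumption the value rounds to the 6.78 cM/Mb given in the abstract.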
Affiliation(s)
- Marinela Dukić
  - University of Basel, Zoological Institute, Vesalgasse 1, Basel, CH-4051, Switzerland
- Daniel Berner
  - University of Basel, Zoological Institute, Vesalgasse 1, Basel, CH-4051, Switzerland
- Marius Roesti
  - University of Basel, Zoological Institute, Vesalgasse 1, Basel, CH-4051, Switzerland
  - Biodiversity Research Centre and Zoology Department, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
- Christoph R Haag
  - Centre d'Ecologie Fonctionnelle et Evolutive - CEFE UMR 5175, CNRS - Université de Montpellier - Université Paul-Valéry Montpellier - EPHE, campus CNRS, 1919, route de Mende, 34293, Montpellier Cedex 5, France
  - Department of Biology, Ecology and Evolution, University of Fribourg, Chemin du Musée 10, 1700, Fribourg, Switzerland
- Dieter Ebert
  - University of Basel, Zoological Institute, Vesalgasse 1, Basel, CH-4051, Switzerland
|
149
|
Adrian AB, Corchado JC, Comeron JM. Predictive Models of Recombination Rate Variation across the Drosophila melanogaster Genome. Genome Biol Evol 2016; 8:2597-612. [PMID: 27492232 PMCID: PMC5010912 DOI: 10.1093/gbe/evw181] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Indexed: 12/14/2022] Open
Abstract
In all eukaryotic species examined, meiotic recombination, and crossovers in particular, occur non-randomly along chromosomes. The cause of this non-random distribution remains poorly understood, but some specific DNA sequence motifs have been shown to be enriched near crossover hotspots in a number of species. We present analyses using machine learning algorithms to investigate whether DNA motif distribution across the genome can be used to predict crossover variation in Drosophila melanogaster, a species without hotspots. Our study exposes a combinatorial non-linear influence of motif presence able to account for a significant fraction of the genome-wide variation in crossover rates at all genomic scales investigated, from 20% at 5-kb to almost 70% at 2,500-kb scale. The models are particularly predictive for regions with the highest and lowest crossover rates and remain highly informative after removing sub-telomeric and -centromeric regions known to have strongly reduced crossover rates. Transcriptional activity during early meiosis and differences in motif use between autosomes and the X chromosome add to the predictive power of the models. Moreover, we show that population-specific differences in crossover rates can be partly explained by differences in motif presence. Our results suggest that crossover distribution in Drosophila is influenced by both meiosis-specific chromatin dynamics and very local constitutive open chromatin associated with DNA motifs that prevent nucleosome stabilization. These findings provide new information on the genetic factors influencing variation in recombination rates and a baseline to study epigenetic mechanisms responsible for plastic recombination as a response to different biotic and abiotic conditions and stresses.
Affiliation(s)
- Josep M Comeron
  - Department of Biology, University of Iowa
  - Interdisciplinary Graduate Program in Genetics, University of Iowa
|
150
|
Estimating the Effective Population Size from Temporal Allele Frequency Changes in Experimental Evolution. Genetics 2016; 204:723-735. [PMID: 27542959 PMCID: PMC5068858 DOI: 10.1534/genetics.116.191197] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Received: 05/04/2016] [Accepted: 07/30/2016] [Indexed: 01/22/2023] Open
Abstract
The effective population size (Ne) is a major factor determining allele frequency changes in natural and experimental populations. Temporal methods provide a powerful and simple approach to estimate short-term Ne. They use allele frequency shifts between temporal samples to calculate the standardized variance, which is directly related to Ne. Here we focus on experimental evolution studies that often rely on repeated sequencing of samples in pools (Pool-seq). Pool-seq is cost-effective and often outperforms individual-based sequencing in estimating allele frequencies, but it is associated with atypical sampling properties: in addition to the sampling of individuals, sequencing DNA in pools leads to a second round of sampling, which increases the variance of allele frequency estimates. We propose a new estimator of Ne, which relies on allele frequency changes in temporal data and corrects for the variance in both sampling steps. In simulations, we obtain accurate Ne estimates, as long as the drift variance is not too small compared to the sampling and sequencing variance. In addition to genome-wide Ne estimates, we extend our method using a recursive partitioning approach to estimate Ne locally along the chromosome. Since the type I error is controlled, our method permits the identification of genomic regions that differ significantly in their Ne estimates. We present an application to Pool-seq data from experimental evolution with Drosophila and provide recommendations for whole-genome data. The estimator is computationally efficient and available as an R package at https://github.com/ThomasTaus/Nest.
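To make the temporal approach concrete, the Python sketch below implements the classical estimator (standardized variance in the Nei and Tajima form, with the usual correction for a single round of sampling of diploid individuals). It is not the Pool-seq estimator proposed in this paper, which additionally corrects for the second, sequencing-related round of sampling, and it is not the interface of the authors' R package. All function and parameter names are hypothetical.

```python
# Minimal sketch of the classical temporal Ne estimator.
# Assumptions (not from the abstract): Nei-Tajima form of the standardized
# variance, and a sampling correction for diploid samples of size s0 and st.
# The Pool-seq read-sampling variance discussed in the abstract is NOT modelled.

def nei_tajima_fc(p0, pt):
    """Standardized variance of allele frequency change, averaged over loci."""
    terms = []
    for x, y in zip(p0, pt):
        denom = (x + y) / 2 - x * y
        if denom > 0:  # skip loci fixed for the same allele in both samples
            terms.append((x - y) ** 2 / denom)
    if not terms:
        raise ValueError("no informative loci")
    return sum(terms) / len(terms)

def temporal_ne(p0, pt, generations, s0, st):
    """Ne from frequencies sampled `generations` apart in s0 and st diploids."""
    fc = nei_tajima_fc(p0, pt)
    # Remove the variance contributed by sampling individuals at both time points.
    drift = fc - 1 / (2 * s0) - 1 / (2 * st)
    if drift <= 0:
        return float("inf")  # sampling noise swamps the drift signal
    return generations / (2 * drift)

# One locus moving from 0.5 to 0.6 over 10 generations, 100 diploids per sample.
print(temporal_ne([0.5], [0.6], generations=10, s0=100, st=100))  # roughly 167
```

The infinite estimate returned when the corrected variance is non-positive mirrors the abstract's caveat that accuracy depends on the drift variance not being too small relative to the sampling variance.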
|