1
|
Soni V, Jensen JD. Inferring demographic and selective histories from population genomic data using a 2-step approach in species with coding-sparse genomes: an application to human data. G3 (BETHESDA, MD.) 2025; 15:jkaf019. [PMID: 39883523 PMCID: PMC12005166 DOI: 10.1093/g3journal/jkaf019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 01/14/2025] [Accepted: 01/27/2025] [Indexed: 01/31/2025]
Abstract
The demographic history of a population, and the distribution of fitness effects (DFE) of newly arising mutations in functional genomic regions, are fundamental factors dictating both genetic variation and evolutionary trajectories. Although both demographic and DFE inference has been performed extensively in humans, these approaches have generally either been limited to simple demographic models involving a single population, or, where a complex population history has been inferred, without accounting for the potentially confounding effects of selection at linked sites. Taking advantage of the coding-sparse nature of the genome, we propose a 2-step approach in which coalescent simulations are first used to infer a complex multi-population demographic model, utilizing large non-functional regions that are likely free from the effects of background selection. We then use forward-in-time simulations to perform DFE inference in functional regions, conditional on the complex demography inferred and utilizing expected background selection effects in the estimation procedure. Throughout, recombination and mutation rate maps were used to account for the underlying empirical rate heterogeneity across the human genome. Importantly, within this framework it is possible to utilize and fit multiple aspects of the data, and this inference scheme represents a generalized approach for such large-scale inference in species with coding-sparse genomes.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
2
|
Terbot JW, Soni V, Versoza CJ, Pfeifer SP, Jensen JD. Inferring the Demographic History of Aye-Ayes (Daubentonia madagascariensis) from High-Quality, Whole-Genome, Population-Level Data. Genome Biol Evol 2025; 17:evae281. [PMID: 39749927 PMCID: PMC11746965 DOI: 10.1093/gbe/evae281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Revised: 12/28/2024] [Accepted: 12/30/2024] [Indexed: 01/04/2025] Open
Abstract
The nocturnal aye-aye, Daubentonia madagascariensis, is one of the most elusive lemurs on the island of Madagascar. The timing of its activity and arboreal lifestyle has generally made it difficult to obtain accurate assessments of population size using traditional census methods. Therefore, alternative estimates provided by population genetic inference are essential for yielding much needed information for conservation measures and for enabling ecological and evolutionary studies of this species. Here, we utilize genomic data from 17 individuals-including 5 newly sequenced, high-coverage genomes-to estimate this history. Essential to this estimation are recently published annotations of the aye-aye genome which allow for variation at putatively neutral genomic regions to be included in the estimation procedures, and regions subject to selective constraints, or in linkage to such sites, to be excluded owing to the biasing effects of selection on demographic inference. By comparing a variety of demographic estimation tools to develop a well-supported model of population history, we find strong support for two demes, separating northern Madagascar from the rest of the island. Additionally, we find that the aye-aye has experienced two severe reductions in population size. The first occurred rapidly, ∼3,000 to 5,000 years ago, and likely corresponded with the arrival of humans to Madagascar. The second occurred over the past few decades and is likely related to substantial habitat loss, suggesting that the species is still undergoing population decline and remains at great risk for extinction.
Collapse
Affiliation(s)
- John W Terbot
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Vivak Soni
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Cyril J Versoza
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
3
|
Soni V, Versoza CJ, Pfeifer SP, Jensen JD. Estimating the distribution of fitness effects in aye-ayes ( Daubentonia madagascariensis ), accounting for population history as well as mutation and recombination rate heterogeneity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.02.631144. [PMID: 39803457 PMCID: PMC11722344 DOI: 10.1101/2025.01.02.631144] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/20/2025]
Abstract
The distribution of fitness effects (DFE) characterizes the range of selection coefficients from which new mutations are sampled, and thus holds a fundamentally important role in evolutionary genomics. To date, DFE inference in primates has been largely restricted to haplorrhines, with limited data availability leaving the other suborder of primates, strepsirrhines, largely under-explored. To advance our understanding of the population genetics of this important taxonomic group, we here map exonic divergence in aye-ayes ( Daubentonia madagascariensis ) - the only extant member of the Daubentoniidae family of the Strepsirrhini suborder. We further infer the DFE in this highly-endangered species, utilizing a recently published high-quality annotated reference genome, a well-supported model of demographic history, as well as both direct and indirect estimates of underlying mutation and recombination rates. The inferred distribution is generally characterized by a greater proportion of deleterious mutations relative to humans, providing evidence of a larger long-term effective population size. In addition however, both immune-related and sensory-related genes were found to be amongst the most rapidly evolving in the aye-aye genome.
Collapse
|
4
|
Soni V, Versoza CJ, Terbot JW, Jensen JD, Pfeifer SP. Inferring fine-scale mutation and recombination rate maps in aye-ayes ( Daubentonia madagascariensis ). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.28.630620. [PMID: 39763842 PMCID: PMC11703150 DOI: 10.1101/2024.12.28.630620] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/20/2025]
Abstract
The rate of input of new genetic mutations, and the rate at which that variation is reshuffled, are key evolutionary processes shaping genomic diversity. Importantly, these rates vary not just across populations and species, but also across individual genomes. Despite previous studies having demonstrated that failing to account for rate heterogeneity across the genome can bias the inference of both selective and neutral population genetic processes, mutation and recombination rate maps have to date only been generated for a relatively small number of organisms. Here, we infer such fine-scale maps for the aye-aye ( Daubentonia madagascariensis ) - a highly endangered strepsirrhine that represents one of the earliest splits in the primate clade, and thus stands as an important outgroup to the more commonly-studied haplorrhines - utilizing a recently released fully-annotated genome combined with high-quality population sequencing data. We compare our indirectly inferred rates to previous pedigree-based estimates, finding further evidence of relatively low mutation and recombination rates in aye-ayes compared to other primates.
Collapse
|
5
|
Carvajal-Rodríguez A. iHDSel software: The price equation and the population stability index to detect genomic patterns compatible with selective sweeps. An example with SARS-CoV-2. Biol Methods Protoc 2024; 9:bpae089. [PMID: 39679303 PMCID: PMC11646571 DOI: 10.1093/biomethods/bpae089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2024] [Revised: 11/19/2024] [Accepted: 11/25/2024] [Indexed: 12/17/2024] Open
Abstract
A large number of methods have been developed and continue to evolve for detecting the signatures of selective sweeps in genomes. Significant advances have been made, including the combination of different statistical strategies and the incorporation of artificial intelligence (machine learning) methods. Despite these advances, several common problems persist, such as the unknown null distribution of the statistics used, necessitating simulations and resampling to assign significance to the statistics. Additionally, it is not always clear how deviations from the specific assumptions of each method might affect the results. In this work, allelic classes of haplotypes are used along with the informational interpretation of the Price equation to design a statistic with a known distribution that can detect genomic patterns caused by selective sweeps. The statistic consists of Jeffreys divergence, also known as the population stability index, applied to the distribution of allelic classes of haplotypes in two samples. Results with simulated data show optimal performance of the statistic in detecting divergent selection. Analysis of real severe acute respiratory syndrome coronavirus 2 genome data also shows that some of the sites playing key roles in the virus's fitness and immune escape capability are detected by the method. The new statistic, called JHAC , is incorporated into the iHDSel (informed HacDivSel) software available at https://acraaj.webs.uvigo.es/iHDSel.html.
Collapse
Affiliation(s)
- Antonio Carvajal-Rodríguez
- Centro de Investigación Mariña (CIM), Departamento de Bioquímica, Genética e Inmunología, Universidade de Vigo, Vigo, 36310 Spain
| |
Collapse
|
6
|
Soni V, Jensen JD. Inferring demographic and selective histories from population genomic data using a two-step approach in species with coding-sparse genomes: an application to human data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.19.613979. [PMID: 39605418 PMCID: PMC11601476 DOI: 10.1101/2024.09.19.613979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
The demographic history of a population, and the distribution of fitness effects (DFE) of newly arising mutations in functional genomic regions, are fundamental factors dictating both genetic variation and evolutionary trajectories. Although both demographic and DFE inference has been performed extensively in humans, these approaches have generally either been limited to simple demographic models involving a single population, or, where a complex population history has been inferred, without accounting for the potentially confounding effects of selection at linked sites. Taking advantage of the coding-sparse nature of the genome, we propose a 2-step approach in which coalescent simulations are first used to infer a complex multi-population demographic model, utilizing large non-functional regions that are likely free from the effects of background selection. We then use forward-in-time simulations to perform DFE inference in functional regions, conditional on the complex demography inferred and utilizing expected background selection effects in the estimation procedure. Throughout, recombination and mutation rate maps were used to account for the underlying empirical rate heterogeneity across the human genome. Importantly, within this framework it is possible to utilize and fit multiple aspects of the data, and this inference scheme represents a generalized approach for such large-scale inference in species with coding-sparse genomes.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, US
| | - Jeffrey D. Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, US
| |
Collapse
|
7
|
Terbot JW, Soni V, Versoza CJ, Pfeifer SP, Jensen JD. Inferring the demographic history of aye-ayes ( Daubentonia madagascariensis) from high-quality, whole-genome, population-level data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.08.622659. [PMID: 39605532 PMCID: PMC11601231 DOI: 10.1101/2024.11.08.622659] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
The nocturnal aye-aye, Daubentonia madagascariensis, is one of the most elusive lemurs on the island of Madagascar. The timing of its activity and arboreal lifestyle has generally made it difficult to obtain accurate assessments of population size using traditional census methods. Therefore, alternative estimates provided by population genetic inference are essential for yielding much needed information for conservation measures and for enabling ecological and evolutionary studies of this species. Here, we utilize genomic data from 17 unrelated individuals - including 5 newly sequenced, high-coverage genomes - to estimate this history. Essential to this estimation are recently published annotations of the aye-aye genome which allow for variation at putatively neutral genomic regions to be included in the estimation procedures, and regions subject to selective constraints, or in linkage to such sites, to be excluded owing to the biasing effects of selection on demographic inference. By comparing a variety of demographic estimation tools to develop a well-supported model of population history, we find strong support for the species to consist of two demes, separating northern Madagascar from the rest of the island. Additionally, we find that the aye-aye has experienced two severe reductions in population size. The first occurred rapidly, approximately 3,000 to 5,000 years ago, and likely corresponded with the arrival of humans to Madagascar. The second occurred over the past few decades and is likely related to substantial habitat loss, suggesting that the species is still undergoing population decline and remains at great risk for extinction.
Collapse
Affiliation(s)
- John W. Terbot
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Vivak Soni
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Cyril J. Versoza
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Susanne P. Pfeifer
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D. Jensen
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
8
|
Soni V, Terbot JW, Versoza CJ, Pfeifer SP, Jensen JD. A whole-genome scan for evidence of recent positive and balancing selection in aye-ayes ( Daubentonia madagascariensis) utilizing a well-fit evolutionary baseline model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.08.622667. [PMID: 39605496 PMCID: PMC11601216 DOI: 10.1101/2024.11.08.622667] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
The aye-aye (Daubentonia madagascariensis) is one of the 25 most endangered primate species in the world, maintaining amongst the lowest genetic diversity of any primate measured to date. Characterizing patterns of genetic variation within aye-aye populations, and the relative influences of neutral and selective processes in shaping that variation, is thus important for future conservation efforts. In this study, we performed the first whole-genome scans for recent positive and balancing selection in the species, utilizing high-coverage population genomic data from newly sequenced individuals. We generated null thresholds for our genomic scans by creating an evolutionarily appropriate baseline model that incorporates the demographic history of this aye-aye population, and identified a small number of candidate genes. Most notably, a suite of genes involved in olfaction - a key trait in these nocturnal primates - were identified as experiencing long-term balancing selection. We also conducted analyses to quantify the expected statistical power to detect positive and balancing selection in this population using site frequency spectrum-based inference methods, once accounting for the potentially confounding contributions of population history, recombination and mutation rate variation, and purifying and background selection. This work, presenting the first high-quality, genome-wide polymorphism data across the functional regions of the aye-aye genome, thus provides important insights into the landscape of episodic selective forces in this highly endangered species.
Collapse
Affiliation(s)
- Vivak Soni
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - John W. Terbot
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Cyril J. Versoza
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Susanne P. Pfeifer
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D. Jensen
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
9
|
Gouy A, Wang X, Kapopoulou A, Neuenschwander S, Schmid E, Excoffier L, Heckel G. Genomes of Microtus Rodents Highlight the Importance of Olfactory and Immune Systems in Their Fast Radiation. Genome Biol Evol 2024; 16:evae233. [PMID: 39445808 PMCID: PMC11579656 DOI: 10.1093/gbe/evae233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Revised: 10/02/2024] [Accepted: 10/07/2024] [Indexed: 10/25/2024] Open
Abstract
The characterization of genes and biological functions underlying functional diversification and the formation of species is a major goal of evolutionary biology. In this study, we investigated the fast radiation of Microtus voles, one of the most speciose group of mammals, which shows strong genetic divergence despite few readily observable morphological differences. We produced an annotated reference genome for the common vole, Microtus arvalis, and resequenced the genomes of 10 different species and evolutionary lineages spanning the Microtus speciation continuum. Our full-genome sequences illustrate the recent and fast diversification of this group, and we identified genes in highly divergent genomic windows that have likely particular roles in their radiation. We found three biological functions enriched for highly divergent genes in most Microtus species and lineages: olfaction, immunity and metabolism. In particular, olfaction-related genes (mostly olfactory receptors and vomeronasal receptors) are fast evolving in all Microtus species indicating the exceptional importance of the olfactory system in the evolution of these rodents. Of note is e.g. the shared signature among vole species on Olfr1019 which has been associated with fear responses against predator odors in rodents. Our analyses provide a genome-wide basis for the further characterization of the ecological factors and processes of natural and sexual selection that have contributed to the fast radiation of Microtus voles.
Collapse
Affiliation(s)
- Alexandre Gouy
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Xuejing Wang
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Adamandia Kapopoulou
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | - Emanuel Schmid
- Vital-IT, Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Gerald Heckel
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
10
|
Marsh JI, Johri P. Biases in ARG-Based Inference of Historical Population Size in Populations Experiencing Selection. Mol Biol Evol 2024; 41:msae118. [PMID: 38874402 PMCID: PMC11245712 DOI: 10.1093/molbev/msae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/05/2024] [Accepted: 06/11/2024] [Indexed: 06/15/2024] Open
Abstract
Inferring the demographic history of populations provides fundamental insights into species dynamics and is essential for developing a null model to accurately study selective processes. However, background selection and selective sweeps can produce genomic signatures at linked sites that mimic or mask signals associated with historical population size change. While the theoretical biases introduced by the linked effects of selection have been well established, it is unclear whether ancestral recombination graph (ARG)-based approaches to demographic inference in typical empirical analyses are susceptible to misinference due to these effects. To address this, we developed highly realistic forward simulations of human and Drosophila melanogaster populations, including empirically estimated variability of gene density, mutation rates, recombination rates, purifying, and positive selection, across different historical demographic scenarios, to broadly assess the impact of selection on demographic inference using a genealogy-based approach. Our results indicate that the linked effects of selection minimally impact demographic inference for human populations, although it could cause misinference in populations with similar genome architecture and population parameters experiencing more frequent recurrent sweeps. We found that accurate demographic inference of D. melanogaster populations by ARG-based methods is compromised by the presence of pervasive background selection alone, leading to spurious inferences of recent population expansion, which may be further worsened by recurrent sweeps, depending on the proportion and strength of beneficial mutations. Caution and additional testing with species-specific simulations are needed when inferring population history with non-human populations using ARG-based approaches to avoid misinference due to the linked effects of selection.
Collapse
Affiliation(s)
- Jacob I Marsh
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
11
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (BETHESDA, MD.) 2024; 14:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum- and linkage disequilibrium-based methods under a variety of evolutionarily realistic null models. We find that whilst site frequency spectrum-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, linkage disequilibrium-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that site frequency spectrum-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst linkage disequilibrium-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g. population structure and admixture) as well as of alternative selective processes (e.g. partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
12
|
Soni V, Terbot JW, Jensen JD. Population genetic considerations regarding the interpretation of within-patient SARS-CoV-2 polymorphism data. Nat Commun 2024; 15:3240. [PMID: 38627371 PMCID: PMC11021480 DOI: 10.1038/s41467-024-46261-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/29/2024] [Indexed: 04/19/2024] Open
Affiliation(s)
- Vivak Soni
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA
| | - John W Terbot
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA
- Division of Biological Sciences, University of Montana, Missoula, MT, USA
| | - Jeffrey D Jensen
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA.
| |
Collapse
|
13
|
Soni V, Pfeifer SP, Jensen JD. The Effects of Mutation and Recombination Rate Heterogeneity on the Inference of Demography and the Distribution of Fitness Effects. Genome Biol Evol 2024; 16:evae004. [PMID: 38207127 PMCID: PMC10834165 DOI: 10.1093/gbe/evae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/12/2023] [Accepted: 01/07/2024] [Indexed: 01/13/2024] Open
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavor; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modeled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination before utilizing population genomic data to quantify the effects of genetic drift (i.e. as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modeled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
14
|
Zheng L, Wang H, Lin J, Zhou Y, Xiao J, Li K. Population genomics provides insights into the genetic diversity and adaptation of the Pieris rapae in China. PLoS One 2023; 18:e0294521. [PMID: 37972203 PMCID: PMC10653512 DOI: 10.1371/journal.pone.0294521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 11/02/2023] [Indexed: 11/19/2023] Open
Abstract
The cabbage white butterfly (Pieris rapae), a major agricultural pest, has become one of the most abundant and destructive butterflies in the world. It is widely distributed in a large variety of climates and terrains of China due to its strong adaptability. To gain insight into the population genetic characteristics of P. rapae in China, we resequenced the genome of 51 individuals from 19 areas throughout China. Using population genomics approaches, a dense variant map of P. rapae was observed, indicating a high level of polymorphism that could result in adaptation to a changing environment. The feature of the genetic structure suggested considerable genetic admixture in different geographical groups. Additionally, our analyses suggest that physical barriers may have played a more important role than geographic distance in driving genetic differentiation. Population history showed the effective population size of P. rapae was greatly affected by global temperature changes, with mild periods (i.e., temperatures warmer than those during glaciation but not excessively hot) leading to an increase in population size. Furthermore, by comparing populations from south and north China, we have identified selected genes related to sensing temperature, growth, neuromodulation and immune response, which may reveal the genetic basis of adaptation to different environments. Our study is the first to illustrate the genetic signatures of P. rapae in China at the population genomic level, providing fundamental knowledge of the genetic diversity and adaptation of P. rapae.
Collapse
Affiliation(s)
- Linlin Zheng
- College of Biological Science and Medical Engineering, Donghua University, Songjiang District, Shanghai, China
| | - Huan Wang
- Department of Plant Science and Technology, Shanghai Vocational College of Agriculture and Forestry, Shanghai, China
| | - Junjie Lin
- College of Biological Science and Medical Engineering, Donghua University, Songjiang District, Shanghai, China
| | - Yuxun Zhou
- College of Biological Science and Medical Engineering, Donghua University, Songjiang District, Shanghai, China
| | - Junhua Xiao
- College of Biological Science and Medical Engineering, Donghua University, Songjiang District, Shanghai, China
| | - Kai Li
- College of Biological Science and Medical Engineering, Donghua University, Songjiang District, Shanghai, China
| |
Collapse
|
15
|
Soni V, Pfeifer SP, Jensen JD. The effects of mutation and recombination rate heterogeneity on the inference of demography and the distribution of fitness effects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.11.566703. [PMID: 38014252 PMCID: PMC10680612 DOI: 10.1101/2023.11.11.566703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Susanne P. Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| |
Collapse
|