1
|
The inference of sex-biased human demography from whole-genome data. PLoS Genet 2019; 15:e1008293. [PMID: 31539367 PMCID: PMC6774570 DOI: 10.1371/journal.pgen.1008293] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 10/02/2019] [Accepted: 07/07/2019] [Indexed: 01/06/2023] Open
Abstract
Sex-biased demographic events ("sex-bias") involve unequal numbers of females and males. These events are typically inferred from the relative amount of X-chromosomal to autosomal genetic variation and have led to conflicting conclusions about human demographic history. Though population size changes alter the relative amount of X-chromosomal to autosomal genetic diversity even in the absence of sex-bias, this has generally not been accounted for in sex-bias estimators to date. Here, we present a novel method to identify sex-bias from genetic sequence data that models population size changes and estimates the female fraction of the effective population size during each time epoch. Compared to recent sex-bias inference methods, our approach can detect sex-bias that changes on a single population branch without requiring data from an outgroup or knowledge of divergence events. When applied to simulated data, conventional sex-bias estimators are biased by population size changes, especially recent growth or bottlenecks, while our estimator is unbiased. We next apply our method to high-coverage exome data from the 1000 Genomes Project and estimate a male bias in Yorubans (47% female) and Europeans (44%), possibly due to stronger background selection on the X chromosome than on the autosomes. Finally, we apply our method to the 1000 Genomes Project Phase 3 high-coverage Complete Genomics whole-genome data and estimate a female bias in Yorubans (63% female), Europeans (84%), Punjabis (82%), as well as Peruvians (56%), and a male bias in the Southern Han Chinese (45%). Our method additionally identifies a male-biased migration out of Africa based on data from Europeans (20% female). Our results demonstrate that modeling population size change is necessary to estimate sex-bias parameters accurately. Our approach gives insight into signatures of sex-bias in sexual species, and the demographic models it produces can serve as more accurate null models for tests of selection.
Collapse
|
2
|
Abstract
Levels and patterns of genetic diversity can provide insights into a population’s history. In species with sex chromosomes, differences between genomic regions with unique inheritance patterns can be used to distinguish between different sets of possible demographic and selective events. This review introduces the differences in population history for sex chromosomes and autosomes, provides the expectations for genetic diversity across the genome under different evolutionary scenarios, and gives an introductory description for how deviations in these expectations are calculated and can be interpreted. Predominantly, diversity on the sex chromosomes has been used to explore and address three research areas: 1) Mating patterns and sex-biased variance in reproductive success, 2) signatures of selection, and 3) evidence for modes of speciation and introgression. After introducing the theory, this review catalogs recent studies of genetic diversity on the sex chromosomes across species within the major research areas that sex chromosomes are typically applied to, arguing that there are broad similarities not only between male-heterogametic (XX/XY) and female-heterogametic (ZZ/ZW) sex determination systems but also any mating system with reduced recombination in a sex-determining region. Further, general patterns of reduced diversity in nonrecombining regions are shared across plants and animals. There are unique patterns across populations with vastly different patterns of mating and speciation, but these do not tend to cluster by taxa or sex determination system.
Collapse
Affiliation(s)
- Melissa A Wilson Sayres
- School of Life Sciences, Center for Evolution and Medicine, The Biodesign Institute, Arizona State University
| |
Collapse
|
3
|
Evolutionary processes in populations of Cryptosporidium inferred from gp60 sequence data. Parasitol Res 2017; 116:1855-1861. [PMID: 28502018 DOI: 10.1007/s00436-017-5459-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Accepted: 04/24/2017] [Indexed: 10/19/2022]
Abstract
Cryptosporidiosis is one of the most common human infectious diseases globally. The gp60 gene has been adopted as a key marker for molecular epidemiological investigations into this protozoan disease because of the capability to characterize genotypes and detect variants within Cryptosporidium species infecting humans. However, we know relatively little about the potential spatial and temporal variation in population demography that can be inferred from this gene beyond that it is recognized to be under selective pressure. Here, we analyzed the genetic variation in time and space within two putative populations of Cryptosporidium in New Zealand to infer the processes behind the patterns of sequence polymorphism. Analyses using Tajima's D, Fu, and Li's D* and F* tests show significant departures from neutrality in some populations and indicate the selective maintenance of alleles within some populations. Demographic analyses showed distortions in the pattern of the genetic variability caused by high recombination rates and population expansion, which was observed in case notification data. Our results showed that processes acting on populations that have similar effects can be distinguished from one another and multiple processes can be detected acting at the same time. These results are significant for prediction of the parasite dynamics and potential mechanisms of long-term changes in the risk of cryptosporidiosis in humans.
Collapse
|
4
|
Vadivalagan C, Karthika P, Murugan K, Panneerselvam C, Del Serrone P, Benelli G. Exploring genetic variation in haplotypes of the filariasis vector Culex quinquefasciatus (Diptera: Culicidae) through DNA barcoding. Acta Trop 2017; 169:43-50. [PMID: 28126462 DOI: 10.1016/j.actatropica.2017.01.020] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Revised: 01/20/2017] [Accepted: 01/21/2017] [Indexed: 12/14/2022]
Abstract
Culex quinquefasciatus (Diptera: Culicidae) is a vector of many pathogens and parasites of humans, as well as domestic and wild animals. In urban and semi-urban Asian countries, Cx. quinquefasciatus is a main vector of nematodes causing lymphatic filariasis. In the African region, it vectors the Rift Valley fever virus, while in the USA it transmits West Nile, St. Louis encephalitis and Western equine encephalitis virus. In this study, DNA barcoding was used to explore the genetic variation of Cx. quinquefasciatus populations from 88 geographical regions. We presented a comprehensive approach analyzing the effectiveness of two gene markers, i.e. CO1 and 16S rRNA. The high threshold genetic divergence of CO1 (0.47%) gene was reported as an ideal marker for molecular identification of this mosquito vector. Furthermore, null substitutions were lower in CO1 if compared to 16S rRNA, which influenced its differentiating potential among Indian haplotypes. NJ tree was well supported with high branch values for CO1 gene than 16S rRNA, indicating ideal genetic differentiation among haplotypes. TCS haplotype network revealed 14 distinct clusters. The intra- and inter-population polymorphism were calculated among the global and Indian Cx. quinquefasciatus lineages. The genetic diversity index Tajima' D showed negative values for all the 4 intra-population clusters (G2-4, G10). Fu's FS showed negative value for G10 cluster, which was significant and indicated recent population expansion. However, the G2-G4 (i.e. Indian lineages) had positive values, suggesting a bottleneck effect. Overall, our research firstly shed light on the genetic differences among the haplotypes of Cx. quinquefasciatus species complex, adding basic knowledge to the molecular ecology of this important mosquito vector.
Collapse
Affiliation(s)
- Chithravel Vadivalagan
- Division of Entomology, Department of Zoology, School of Life Sciences, Bharathiar University, Coimbatore 641 046, Tamil Nadu, India
| | - Pushparaj Karthika
- Department of Zoology, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore 641 043, Tamil Nadu, India
| | - Kadarkarai Murugan
- Division of Entomology, Department of Zoology, School of Life Sciences, Bharathiar University, Coimbatore 641 046, Tamil Nadu, India; Department of Biotechnology, Thiruvalluvar University, Serkkadu, Vellore 632 115, Tamil Nadu, India
| | | | - Paola Del Serrone
- Council for Agricultural Research and Economics (CREA), Animal Production Research Centre, Via Salaria 31, 00016 Monterotondo Scalo, Roma, Italy
| | - Giovanni Benelli
- Department of Agriculture, Food and Environment, University of Pisa, Via del Borghetto 80, 56124 Pisa, Italy.
| |
Collapse
|
5
|
Narang P, Wilson Sayres MA. Variable Autosomal and X Divergence Near and Far from Genes Affects Estimates of Male Mutation Bias in Great Apes. Genome Biol Evol 2016; 8:3393-3405. [PMID: 27702816 PMCID: PMC5203777 DOI: 10.1093/gbe/evw232] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Male mutation bias, when more mutations are passed on via the male germline than via the female germline, is observed across mammals. One common way to infer the magnitude of male mutation bias, α, is to compare levels of neutral sequence divergence between genomic regions that spend different amounts of time in the male and female germline. For great apes, including human, we show that estimates of divergence are reduced in putatively unconstrained regions near genes relative to unconstrained regions far from genes. Divergence increases with increasing distance from genes on both the X chromosome and autosomes, but increases faster on the X chromosome than autosomes. As a result, ratios of X/A divergence increase with increasing distance from genes and corresponding estimates of male mutation bias are significantly higher in intergenic regions near genes versus far from genes. Future studies in other species will need to carefully consider the effect that genomic location will have on estimates of male mutation bias.
Collapse
Affiliation(s)
- Pooja Narang
- School of Life Sciences, Arizona State University, Tempe
| | - Melissa A Wilson Sayres
- School of Life Sciences, Arizona State University, Tempe .,Center for Evolution and Medicine, The Biodesign Institute, Arizona State University, Tempe
| |
Collapse
|
6
|
Mattila TM, Aalto EA, Toivainen T, Niittyvuopio A, Piltonen S, Kuittinen H, Savolainen O. Selection for population-specific adaptation shaped patterns of variation in the photoperiod pathway genes in Arabidopsis lyrata during post-glacial colonization. Mol Ecol 2016; 25:581-97. [PMID: 26600237 DOI: 10.1111/mec.13489] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Revised: 11/16/2015] [Accepted: 11/18/2015] [Indexed: 12/20/2022]
Abstract
Spatially varying selection can lead to population-specific adaptation, which is often recognized at the phenotypic level; however, the genetic evidence is weaker in many groups of organisms. In plants, environmental shifts that occur due to colonization of a novel environment may require adaptive changes in the timing of growth and flowering, which are often governed by location-specific environmental cues such as day length. We studied locally varying selection in 19 flowering time loci in nine populations of the perennial herb Arabidopsis lyrata, which has a wide but patchy distribution in temperate and boreal regions of the northern hemisphere. The populations differ in their recent population demographic and colonization histories and current environmental conditions, especially in the growing season length. We searched for population-specific molecular signatures of directional selection by comparing a set of candidate flowering time loci with a genomic reference set within each population using multiple approaches and contrasted the patterns of different populations. The candidate loci possessed approximately 20% of the diversity of the reference loci. On average the flowering time loci had more rare alleles (a smaller Tajima's D) and an excess of highly differentiated sites relative to the reference, suggesting positive selection. The strongest signal of selection was detected in photoperiodic pathway loci in the colonizing populations of Northwestern Europe, whereas no evidence of positive selection was detected in the Central European populations. These findings emphasized the population-specific nature of selection and suggested that photoperiodic adaptation was important during postglacial colonization of the species.
Collapse
Affiliation(s)
- Tiina M Mattila
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland
| | - Esa A Aalto
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland
| | - Tuomas Toivainen
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland.,Biocenter Oulu, University of Oulu, 90014, Oulu, Finland
| | - Anne Niittyvuopio
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland
| | - Susanna Piltonen
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland
| | - Helmi Kuittinen
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland
| | - Outi Savolainen
- Department of Genetics and Physiology, University of Oulu, 90014, Oulu, Finland.,Biocenter Oulu, University of Oulu, 90014, Oulu, Finland
| |
Collapse
|
7
|
Tyagi S, Das A. Mitochondrial population genomic analyses reveal population structure and demography of Indian Plasmodium falciparum. Mitochondrion 2015; 24:9-21. [PMID: 26149324 DOI: 10.1016/j.mito.2015.06.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Revised: 06/29/2015] [Accepted: 06/29/2015] [Indexed: 11/30/2022]
Abstract
Inference on the genetic diversity of Plasmodium falciparum populations could help in better management of malaria. A very recent study with mitochondrial (mt) genomes in global P. falciparum had revealed interesting evolutionary genetic patterns of Indian isolates in comparison to global ones. However, no population genetic study using the whole mt genome sequences of P. falciparum isolates collected in the entire distribution range in India has yet been performed. We herewith have analyzed 85 whole mt genomes (48 already published and 37 entirely new) sampled from eight differentially endemic Indian locations to estimate genetic diversity and infer population structure and historical demography of Indian P. falciparum. We found 19 novel Indian-specific Single Nucleotide Polymorphisms (SNPs) and 22 novel haplotypes segregating in Indian P. falciparum. Accordingly, high haplotype and nucleotide diversities were detected in Indian P. falciparum in comparison to many other global isolates. Indian P. falciparum populations were found to be moderately sub-structured with four different genetic clusters. Interestingly, group of local populations aggregate to form each cluster; while samples from Jharkhand and Odisha formed a single cluster, P. falciparum isolates from Asom formed an independent one. Similarly, Surat, Bilaspur and Betul formed a single cluster and Goa and Mangalore formed another. Interestingly, P. falciparum isolates from the two later populations were significantly genetically differentiated from isolates collected in other six Indian locations. Signature of historical population expansion was evident in five population samples, and the onset of expansion event was found to be very similar to African P. falciparum. In agreement with the previous finding, the estimated Time to Most Recent Common Ancestor (TMRCA) and the effective population size were high in Indian P. falciparum. All these genetic features of Indian P. falciparum with high mt genome diversity are somehow similar to Africa, but quite different from other Asian population samples.
Collapse
Affiliation(s)
- Suchi Tyagi
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, New Delhi, India
| | - Aparup Das
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, New Delhi, India.
| |
Collapse
|
8
|
Dixit J, Arunyawat U, Huong NT, Das A. Multilocus nuclear DNA markers reveal population structure and demography of Anopheles minimus. Mol Ecol 2014; 23:5599-618. [PMID: 25266341 DOI: 10.1111/mec.12943] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Revised: 09/21/2014] [Accepted: 09/22/2014] [Indexed: 01/11/2023]
Abstract
Utilization of multiple putatively neutral DNA markers for inferring evolutionary history of species population is considered to be the most robust approach. Molecular population genetic studies have been conducted in many species of Anopheles genus, but studies based on single nucleotide polymorphism (SNP) data are still very scarce. Anopheles minimus is one of the principal malaria vectors of Southeast (SE) Asia including the Northeastern (NE) India. Although population genetic studies with mitochondrial genetic variation data have been utilized to infer phylogeography of the SE Asian populations of this species, limited information on the population structure and demography of Indian An. minimus is available. We herewith have developed multilocus nuclear genetic approach with SNP markers located in X chromosome of An. minimus in eight Indian and two SE Asian population samples (121 individual mosquitoes in total) to infer population history and test several hypotheses on the phylogeography of this species. While the Thai population sample of An. minimus presented the highest nucleotide diversity, majority of the Indian samples were also fairly diverse. In general, An. minimus populations were moderately substructured in the distribution range covering SE Asia and NE India, largely falling under three distinct genetic clusters. Moreover, demographic expansion events could be detected in the majority of the presently studied populations of An. minimus. Additional DNA sequencing of the mitochondrial COII region in a subset of the samples (40 individual mosquitoes) corroborated the existing hypothesis of Indian An. minimus falling under the earlier reported mitochondrial lineage B.
Collapse
Affiliation(s)
- Jyotsana Dixit
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Sector-8, Dwarka, New Delhi, 110077, India
| | | | | | | |
Collapse
|
9
|
Arbiza L, Gottipati S, Siepel A, Keinan A. Contrasting X-linked and autosomal diversity across 14 human populations. Am J Hum Genet 2014; 94:827-44. [PMID: 24836452 DOI: 10.1016/j.ajhg.2014.04.011] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Accepted: 04/15/2014] [Indexed: 12/29/2022] Open
Abstract
Contrasting the genetic diversity of the human X chromosome (X) and autosomes has facilitated understanding historical differences between males and females and the influence of natural selection. Previous studies based on smaller data sets have left questions regarding how empirical patterns extend to additional populations and which forces can explain them. Here, we address these questions by analyzing the ratio of X-to-autosomal (X/A) nucleotide diversity with the complete genomes of 569 females from 14 populations. Results show that X/A diversity is similar within each continental group but notably lower in European (EUR) and East Asian (ASN) populations than in African (AFR) populations. X/A diversity increases in all populations with increasing distance from genes, highlighting the stronger impact of diversity-reducing selection on X than on the autosomes. However, relative X/A diversity (between two populations) is invariant with distance from genes, suggesting that selection does not drive the relative reduction in X/A diversity in non-Africans (0.842 ± 0.012 for EUR-to-AFR and 0.820 ± 0.032 for ASN-to-AFR comparisons). Finally, an array of models with varying population bottlenecks, expansions, and migration from the latest studies of human demographic history account for about half of the observed reduction in relative X/A diversity from the expected value of 1. They predict values between 0.91 and 0.94 for EUR-to-AFR comparisons and between 0.91 and 0.92 for ASN-to-AFR comparisons. Further reductions can be predicted by more extreme demographic events in excess of those captured by the latest studies but, in the absence of these, also by historical sex-biased demographic events or other processes.
Collapse
|
10
|
Khan N, Chittoria A, Pande V, Jaiswal YK, Das A. Development of multilocus putatively neutral DNA markers in the X-chromosome for population genetic studies in humans. Ann Hum Biol 2012; 39:281-9. [PMID: 22656191 DOI: 10.3109/03014460.2012.689326] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
BACKGROUND It has now been well documented that the type (coding, non-coding) and location (nuclear, mitochondrial etc.) of genetic markers heavily influence evolutionary inferences; realistic assumptions can be drawn if multiple putatively neutral DNA fragments spread across the genome are used. AIM To infer human population history, Single Nucleotide Polymorphisms (SNPs), located in the non-coding regions of different genes in the X-chromosome have been developed as 'putatively neutral markers'. SUBJECTS AND METHODS A population sample consisting of 16 male individuals from the western part of India was utilized for sequencing eight DNA fragments located in introns of three genes (Duchenne muscular dystrophy, Factor IX and Pyruvate dehydrogenase E1 sub-unit) on the human X-chromosome. PCR amplification and DNA sequencing confirmed the polymorphic status of all the fragments. RESULTS Twenty nine SNPs were found to be segregating in the Western Indian population samples. Using these SNPs the nucleotide diversity and demographic parameters of the Western Indian population were estimated. Several tests of neutrality ascertained that all eight fragments evolve putatively neutrally. Further, linkage disequilibrium analyses confirmed this fact. CONCLUSION All eight DNA fragments seem to bear the characteristics to be considered as 'putatively neutral genetic markers' and thus, could be utilized for inference of human population and demographic histories.
Collapse
Affiliation(s)
- Naazneen Khan
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Sector 8, Dwarka, New Delhi - 110077, India
| | | | | | | | | |
Collapse
|
11
|
Harpur BA, Minaei S, Kent CF, Zayed A. Management increases genetic diversity of honey bees via admixture. Mol Ecol 2012; 21:4414-21. [PMID: 22564213 DOI: 10.1111/j.1365-294x.2012.05614.x] [Citation(s) in RCA: 112] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The process of domestication often brings about profound changes in levels of genetic variation in animals and plants. The honey bee, Apis mellifera, has been managed by humans for centuries for both honey and wax production and crop pollination. Human management and selective breeding are believed to have caused reductions in genetic diversity in honey bee populations, thereby contributing to the global declines threatening this ecologically and economically important insect. However, previous studies supporting this claim mostly relied on population genetic comparisons of European and African (or Africanized) honey bee races; such conclusions require reassessment given recent evidence demonstrating that the honey bee originated in Africa and colonized Europe via two independent expansions. We sampled honey bee workers from two managed populations in North America and Europe as well as several old-world progenitor populations in Africa, East and West Europe. Managed bees had highly introgressed genomes representing admixture between East and West European progenitor populations. We found that managed honey bees actually have higher levels of genetic diversity compared with their progenitors in East and West Europe, providing an unusual example whereby human management increases genetic diversity by promoting admixture. The relationship between genetic diversity and honey bee declines is tenuous given that managed bees have more genetic diversity than their progenitors and many viable domesticated animals.
Collapse
Affiliation(s)
- Brock A Harpur
- Department of Biology York University, Toronto, Ontario, Canada M3J 1P3
| | | | | | | |
Collapse
|
12
|
Peters JL, Roberts TE, Winker K, McCracken KG. Heterogeneity in genetic diversity among non-coding loci fails to fit neutral coalescent models of population history. PLoS One 2012; 7:e31972. [PMID: 22384117 PMCID: PMC3285185 DOI: 10.1371/journal.pone.0031972] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Accepted: 01/17/2012] [Indexed: 12/26/2022] Open
Abstract
Inferring aspects of the population histories of species using coalescent analyses of non-coding nuclear DNA has grown in popularity. These inferences, such as divergence, gene flow, and changes in population size, assume that genetic data reflect simple population histories and neutral evolutionary processes. However, violating model assumptions can result in a poor fit between empirical data and the models. We sampled 22 nuclear intron sequences from at least 19 different chromosomes (a genomic transect) to test for deviations from selective neutrality in the gadwall (Anas strepera), a Holarctic duck. Nucleotide diversity among these loci varied by nearly two orders of magnitude (from 0.0004 to 0.029), and this heterogeneity could not be explained by differences in substitution rates alone. Using two different coalescent methods to infer models of population history and then simulating neutral genetic diversity under these models, we found that the observed among-locus heterogeneity in nucleotide diversity was significantly higher than expected for these simple models. Defining more complex models of population history demonstrated that a pre-divergence bottleneck was also unlikely to explain this heterogeneity. However, both selection and interspecific hybridization could account for the heterogeneity observed among loci. Regardless of the cause of the deviation, our results illustrate that violating key assumptions of coalescent models can mislead inferences of population history.
Collapse
Affiliation(s)
- Jeffrey L Peters
- Department of Biological Sciences, Wright State University, Dayton, Ohio, United States of America.
| | | | | | | |
Collapse
|
13
|
GUPTA BHAVNA, SRIVASTAVA NALINI, DAS APARUP. Inferring the evolutionary history of IndianPlasmodium vivaxfrom population genetic analyses of multilocus nuclear DNA fragments. Mol Ecol 2012; 21:1597-616. [DOI: 10.1111/j.1365-294x.2012.05480.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
14
|
Emery LS, Felsenstein J, Akey JM. Estimators of the human effective sex ratio detect sex biases on different timescales. Am J Hum Genet 2010; 87:848-56. [PMID: 21109223 DOI: 10.1016/j.ajhg.2010.10.021] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2010] [Revised: 10/15/2010] [Accepted: 10/22/2010] [Indexed: 12/25/2022] Open
Abstract
Determining historical sex ratios throughout human evolution can provide insight into patterns of genomic variation, the structure and composition of ancient populations, and the cultural factors that influence the sex ratio (e.g., sex-specific migration rates). Although numerous studies have suggested that unequal sex ratios have existed in human evolutionary history, a coherent picture of sex-biased processes has yet to emerge. For example, two recent studies compared human X chromosome to autosomal variation to make inferences about historical sex ratios but reached seemingly contradictory conclusions, with one study finding evidence for a male bias and the other study identifying a female bias. Here, we show that a large part of this discrepancy can be explained by methodological differences. Specifically, through reanalysis of empirical data, derivation of explicit analytical formulae, and extensive simulations we demonstrate that two estimators of the effective sex ratio based on population structure and nucleotide diversity preferentially detect biases that have occurred on different timescales. Our results clarify apparently contradictory evidence on the role of sex-biased processes in human evolutionary history and show that extant patterns of human genomic variation are consistent with both a recent male bias and an earlier, persistent female bias.
Collapse
Affiliation(s)
- Leslie S Emery
- Department of Genome Sciences, University of Washington, Seattle, 98195, USA
| | | | | |
Collapse
|
15
|
Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants. Nat Genet 2010; 42:969-72. [PMID: 20890277 DOI: 10.1038/ng.680] [Citation(s) in RCA: 272] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2010] [Accepted: 09/08/2010] [Indexed: 12/19/2022]
Abstract
Targeted capture combined with massively parallel exome sequencing is a promising approach to identify genetic variants implicated in human traits. We report exome sequencing of 200 individuals from Denmark with targeted capture of 18,654 coding genes and sequence coverage of each individual exome at an average depth of 12-fold. On average, about 95% of the target regions were covered by at least one read. We identified 121,870 SNPs in the sample population, including 53,081 coding SNPs (cSNPs). Using a statistical method for SNP calling and an estimation of allelic frequencies based on our population data, we derived the allele frequency spectrum of cSNPs with a minor allele frequency greater than 0.02. We identified a 1.8-fold excess of deleterious, non-syonomyous cSNPs over synonymous cSNPs in the low-frequency range (minor allele frequencies between 2% and 5%). This excess was more pronounced for X-linked SNPs, suggesting that deleterious substitutions are primarily recessive.
Collapse
|
16
|
Keinan A, Reich D. Human population differentiation is strongly correlated with local recombination rate. PLoS Genet 2010; 6:e1000886. [PMID: 20361044 PMCID: PMC2845648 DOI: 10.1371/journal.pgen.1000886] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2009] [Accepted: 02/24/2010] [Indexed: 11/19/2022] Open
Abstract
Allele frequency differences across populations can provide valuable information both for studying population structure and for identifying loci that have been targets of natural selection. Here, we examine the relationship between recombination rate and population differentiation in humans by analyzing two uniformly-ascertained, whole-genome data sets. We find that population differentiation as assessed by inter-continental F(ST) shows negative correlation with recombination rate, with F(ST) reduced by 10% in the tenth of the genome with the highest recombination rate compared with the tenth of the genome with the lowest recombination rate (P<<10(-12)). This pattern cannot be explained by the mutagenic properties of recombination and instead must reflect the impact of selection in the last 100,000 years since human continental populations split. The correlation between recombination rate and F(ST) has a qualitatively different relationship for F(ST) between African and non-African populations and for F(ST) between European and East Asian populations, suggesting varying levels or types of selection in different epochs of human history.
Collapse
Affiliation(s)
- Alon Keinan
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America.
| | | |
Collapse
|
17
|
Lambert CA, Connelly CF, Madeoy J, Qiu R, Olson MV, Akey JM. Highly punctuated patterns of population structure on the X chromosome and implications for African evolutionary history. Am J Hum Genet 2010; 86:34-44. [PMID: 20085712 DOI: 10.1016/j.ajhg.2009.12.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2009] [Revised: 11/22/2009] [Accepted: 12/01/2009] [Indexed: 01/24/2023] Open
Abstract
It is well known that average levels of population structure are higher on the X chromosome compared to autosomes in humans. However, there have been surprisingly few analyses on the spatial distribution of population structure along the X chromosome. With publicly available data from the HapMap Project and Perlegen Sciences, we show a strikingly punctuated pattern of X chromosome population structure. Specifically, 87% of X-linked HapMap SNPs within the top 1% of F(ST) values cluster into five distinct loci. The largest of these regions spans 5.4 Mb and contains 66% of the most highly differentiated HapMap SNPs on the X chromosome. We demonstrate that the extreme clustering of highly differentiated SNPs on the X chromosome is not an artifact of ascertainment bias, nor is it specific to the populations genotyped in the HapMap Project. Rather, additional analyses and resequencing data suggest that these five regions have been substrates of recent and strong adaptive evolution. Finally, we discuss the implications that patterns of X-linked population structure have on the evolutionary history of African populations.
Collapse
Affiliation(s)
- Charla A Lambert
- Department of Genetics, University of Pennsylvania, Philadelphia, 19104, USA
| | | | | | | | | | | |
Collapse
|
18
|
Miyake T, Takebayashi N, Wolf DE. Possible diversifying selection in the imprinted gene, MEDEA, in Arabidopsis. Mol Biol Evol 2009; 26:843-57. [PMID: 19126870 PMCID: PMC2727397 DOI: 10.1093/molbev/msp001] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2008] [Indexed: 11/12/2022] Open
Abstract
Coevolutionary conflict among imprinted genes that influence traits such as offspring growth may arise when maternal and paternal genomes have different evolutionary optima. This conflict is expected in outcrossing taxa with multiple paternity, but not self-fertilizing taxa. MEDEA (MEA) is an imprinted plant gene that influences seed growth. Disagreement exists regarding the type of selection acting on this gene. We present new data and analyses of sequence diversity of MEA in self-fertilizing and outcrossing Arabidopsis and its relatives, to help clarify the form of selection acting on this gene. Codon-based branch analysis among taxa (PAML) suggests that selection on the coding region is changing over time, and nonsynonymous substitution is elevated in at least one outcrossing branch. Codon-based analysis of diversity within outcrossing Arabidopsis lyrata ssp. petraea (OmegaMap) suggests that diversifying selection is acting on a portion of the gene, to cause elevated nonsynonymous polymorphism. Providing further support for balancing selection in A. lyrata, Hudson, Kreitman and Aguadé analysis indicates that diversity/divergence at silent sites in the MEA promoter and genic region is elevated relative to reference genes, and there are deviations from the neutral frequency spectrum. This combination of positive selection as well as balancing and diversifying selection in outcrossing lineages is consistent with other genes influence by evolutionary conflict, such as disease resistance genes. Consistent with predictions that conflict would be eliminated in self-fertilizing taxa, we found no evidence of positive, balancing, or diversifying selection in A. thaliana promoter or genic region.
Collapse
Affiliation(s)
- Takashi Miyake
- University of Alaska Fairbanks, Institute of Arctic Biology
| | | | | |
Collapse
|
19
|
Accelerated genetic drift on chromosome X during the human dispersal out of Africa. Nat Genet 2008; 41:66-70. [PMID: 19098910 PMCID: PMC2612098 DOI: 10.1038/ng.303] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2008] [Accepted: 11/03/2008] [Indexed: 11/30/2022]
Abstract
Comparisons of chromosome X and the autosomes can illuminate differences in the histories of males and females as well as the forces of natural selection. We compared the patterns of variation in these parts of the genome using two data sets that we assembled for this study that are both genomic in scale. Three independent analyses show that around the time of the dispersal of modern humans out of Africa, chromosome X experienced much more genetic drift than is expected from the pattern on the autosomes. This is not predicted by known episodes of demographic history, and we found no similar patterns associated with the dispersals into East Asia and Europe. We conclude that a gender-biased process that reduced the female effective population size, or an episode of natural selection unusually affecting chromosome X, was associated with the founding of non-African populations.
Collapse
|
20
|
Wall JD, Cox MP, Mendez FL, Woerner A, Severson T, Hammer MF. A novel DNA sequence database for analyzing human demographic history. Genome Res 2008; 18:1354-61. [PMID: 18493019 DOI: 10.1101/gr.075630.107] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
While there are now extensive databases of human genomic sequences from both private and public efforts to catalog human nucleotide variation, there are very few large-scale surveys designed for the purpose of analyzing human population history. Demographic inference from patterns of SNP variation in current large public databases is complicated by ascertainment biases associated with SNP discovery and the ways that populations and regions of the genome are sampled. Here, we present results from a resequencing survey of 40 independent intergenic regions on the autosomes and X chromosome comprising ~210 kb from each of 90 humans from six geographically diverse populations (i.e., a total of ~18.9 Mb). Unlike other public DNA sequence databases, we include multiple indigenous populations that serve as important reservoirs of human genetic diversity, such as the San of Namibia, the Biaka of the Central African Republic, and Melanesians from Papua New Guinea. In fact, only 20% of the SNPs that we find are contained in the HapMap database. We identify several key differences in patterns of variability in our database compared with other large public databases, including higher levels of nucleotide diversity within populations, greater levels of differentiation between populations, and significant differences in the frequency spectrum. Because variants at loci included in this database are less likely to be subject to ascertainment biases or linked to sites under selection, these data will be more useful for accurately reconstructing past changes in size and structure of human populations.
Collapse
Affiliation(s)
- Jeffrey D Wall
- Institute for Human Genetics and Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA 94143, USA
| | | | | | | | | | | |
Collapse
|
21
|
Abstract
The house mouse is a well-established model organism, particularly for studying the genetics of complex traits. However, most studies of mice use classical inbred strains, whose genomes derive from multiple species. Relatively little is known about the distribution of genetic variation among these species or how variation among strains relates to variation in the wild. We sequenced intronic regions of five X-linked loci in large samples of wild Mus domesticus and M. musculus, and we found low levels of nucleotide diversity in both species. We compared these data to published data from short portions of six X-linked and 18 autosomal loci in wild mice. We estimate that M. domesticus and M. musculus diverged <500,000 years ago. Consistent with this recent divergence, some gene genealogies were reciprocally monophyletic between these species, while others were paraphyletic or polyphyletic. In general, the X chromosome was more differentiated than the autosomes. We resequenced classical inbred strains for all 29 loci and found that inbred strains contain only a small amount of the genetic variation seen in wild mice. Notably, the X chromosome contains proportionately less variation among inbred strains than do the autosomes. Moreover, variation among inbred strains derives from differences between species as well as from differences within species, and these proportions differ in different genomic regions. Wild mice thus provide a reservoir of additional genetic variation that may be useful for mapping studies. Together these results suggest that wild mice will be a valuable complement to laboratory strains for studying the genetics of complex traits.
Collapse
|
22
|
Pilkington MM, Wilder JA, Mendez FL, Cox MP, Woerner A, Angui T, Kingan S, Mobasher Z, Batini C, Destro-Bisol G, Soodyall H, Strassmann BI, Hammer MF. Contrasting signatures of population growth for mitochondrial DNA and Y chromosomes among human populations in Africa. Mol Biol Evol 2008; 25:517-25. [PMID: 18093995 DOI: 10.1093/molbev/msm279] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A history of Pleistocene population expansion has been inferred from the frequency spectrum of polymorphism in the mitochondrial DNA (mtDNA) of many human populations. Similar patterns are not typically observed for autosomal and X-linked loci. One explanation for this discrepancy is a recent population bottleneck, with different rates of recovery for haploid and autosomal loci as a result of their different effective population sizes. This hypothesis predicts that mitochondrial and Y chromosomal DNA will show a similar skew in the frequency spectrum in populations that have experienced a recent increase in effective population size. We test this hypothesis by resequencing 6.6 kb of noncoding Y chromosomal DNA and 780 basepairs of the mtDNA cytochrome c oxidase subunit III (COIII) gene in 172 males from 5 African populations. Four tests of population expansion are employed for each locus in each population: Fu's Fs statistic, the R(2) statistic, coalescent simulations, and the mismatch distribution. Consistent with previous results, patterns of mtDNA polymorphism better fit a model of constant population size for food-gathering populations and a model of population expansion for food-producing populations. In contrast, none of the tests reveal evidence of Y chromosome growth for either food-gatherers or food-producers. The distinct mtDNA and Y chromosome polymorphism patterns most likely reflect sex-biased demographic processes in the recent history of African populations. We hypothesize that males experienced smaller effective population sizes and/or lower rates of migration during the Bantu expansion, which occurred over the last 5,000 years.
Collapse
|
23
|
Tarazona-Santos E, Bernig T, Burdett L, Magalhaes WC, Fabbri C, Liao J, Redondo RA, Welch R, Yeager M, Chanock SJ. CYBB, an NADPH-oxidase gene: restricted diversity in humans and evidence for differential long-term purifying selection on transmembrane and cytosolic domains. Hum Mutat 2008; 29:623-32. [DOI: 10.1002/humu.20667] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
24
|
|
25
|
Underhill PA, Kivisild T. Use of Y Chromosome and Mitochondrial DNA Population Structure in Tracing Human Migrations. Annu Rev Genet 2007; 41:539-64. [PMID: 18076332 DOI: 10.1146/annurev.genet.41.110306.130407] [Citation(s) in RCA: 308] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Peter A. Underhill
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120;
| | - Toomas Kivisild
- Leverhulme Center of Human Evolutionary Studies, University of Cambridge, Cambridge CB2 1QH, United Kingdom;
| |
Collapse
|
26
|
Battilana J, Cardoso-Silva L, Barrantes R, Hill K, Hurtado AM, Salzano FM, Bonatto SL. Molecular variability of the 16p13.3 region in Amerindians and its anthropological significance. Ann Hum Genet 2007; 71:64-76. [PMID: 17227477 DOI: 10.1111/j.1469-1809.2006.00296.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A total of 1558 base pairs in the 16p13.3 region were investigated in 98 individuals of Mongolian, Northern Arctic and Amerindian affiliation, and the results compared with those obtained in a previous worldwide study of the same genomic region. Fifty-five polymorphic sites could be classified into thirty-five haplotypes from the total data. A median joining network based on the haplotypes revealed two distinct clusters: one with low diversity, with haplotypes found in all five geographic-ethnic categories; while the other, with the most divergent haplotypes, was composed mainly of Africans and a few Amerindians. Almost all neutrality parameters yielded significantly negative values. Demographic simulations with the exclusively Amerindian dataset rejected all scenarios, including a bottleneck beginning more than 12,000 years ago. The demographic scenarios tested considering population growth were similar among the Amerindian and worldwide or Eurasian data sets. The results suggest that Amerindians are a representative sample of Eurasian populations, preserving the signal of demographic growth from the out of Africa exodus and, together with data from uniparental markers, support a scenario of a bottleneck of moderate intensity during the peopling of the New World.
Collapse
Affiliation(s)
- J Battilana
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Caixa Postal 15053, 91501-970 Porto Alegre, RS, Brazil
| | | | | | | | | | | | | |
Collapse
|
27
|
Schmegner C, Hoegel J, Vogel W, Assum G. A comparison of the variability spectra of two genomic loci in a European group of individuals reveals fundamental differences pointing to selection or a population bottleneck. Ann Hum Genet 2007; 71:370-8. [PMID: 17222291 DOI: 10.1111/j.1469-1809.2006.00342.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Knowledge about the variability spectra of neutrally evolving sequences in a population is a prerequisite for the identification of genes, which may have been under positive selection during recent human evolution. Here, we report the results of a re-sequencing project of a presumably neutrally evolving chromosome 22 locus with a severely reduced recombination frequency in a group of 24 individuals of German origin. The comparison of these data with the results of a similar analysis of a chromosome 17 locus revealed striking differences, although the same group of individuals was used. For the chromosome 17 locus two well-separated groups of sequences, a positive value of Tajima's D and a TMRCA of 700,000 years were observed. In contrast, the sequences from the chromosome 22 locus were found to be relatively homogeneous, with no deep splits between subgroups; the obtained value for Tajima's D was negative and the TMRCA was only 260,000 years. These discrepancies may be explained by selection or demographic processes. Regarding demography, the most plausible explanation is the assumption of a severe bottleneck in the history of the European population: in the case of the chromosome 17 locus two ancient lineages passed this bottleneck; for the chromosome 22 locus it was only one ancient lineage.
Collapse
Affiliation(s)
- C Schmegner
- Institut für Humangenetik, Universität Ulm, Albert-Einstein-Allee 11, D-89081 Ulm, Germany
| | | | | | | |
Collapse
|
28
|
Gonder MK, Mortensen HM, Reed FA, de Sousa A, Tishkoff SA. Whole-mtDNA Genome Sequence Analysis of Ancient African Lineages. Mol Biol Evol 2006; 24:757-68. [PMID: 17194802 DOI: 10.1093/molbev/msl209] [Citation(s) in RCA: 197] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Studies of human mitochondrial (mt) DNA genomes demonstrate that the root of the human phylogenetic tree occurs in Africa. Although 2 mtDNA lineages with an African origin (haplogroups M and N) were the progenitors of all non-African haplogroups, macrohaplogroup L (including haplogroups L0-L6) is limited to sub-Saharan Africa. Several L haplogroup lineages occur most frequently in eastern Africa (e.g., L0a, L0f, L5, and L3g), but some are specific to certain ethnic groups, such as haplogroup lineages L0d and L0k that previously have been found nearly exclusively among southern African "click" speakers. Few studies have included multiple mtDNA genome samples belonging to haplogroups that occur in eastern and southern Africa but are rare or absent elsewhere. This lack of sampling in eastern Africa makes it difficult to infer relationships among mtDNA haplogroups or to examine events that occurred early in human history. We sequenced 62 complete mtDNA genomes of ethnically diverse Tanzanians, southern African Khoisan speakers, and Bakola Pygmies and compared them with a global pool of 226 mtDNA genomes. From these, we infer phylogenetic relationships amongst mtDNA haplogroups and estimate the time to most recent common ancestor (TMRCA) for haplogroup lineages. These data suggest that Tanzanians have high genetic diversity and possess ancient mtDNA haplogroups, some of which are either rare (L0d and L5) or absent (L0f) in other regions of Africa. We propose that a large and diverse human population has persisted in eastern Africa and that eastern Africa may have been an ancient source of dispersion of modern humans both within and outside of Africa.
Collapse
|
29
|
Reed FA, Tishkoff SA. African human diversity, origins and migrations. Curr Opin Genet Dev 2006; 16:597-605. [PMID: 17056248 DOI: 10.1016/j.gde.2006.10.008] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2006] [Accepted: 10/06/2006] [Indexed: 01/09/2023]
Abstract
The continent of Africa is thought to be the site of origin of all modern humans and is the more recent origin of millions of African Americans. Although Africa has the highest levels of human genetic diversity both within and between populations, it is under-represented in studies of human genetics. Recent advances have been made in understanding the origins of modern humans within Africa, the rate of adaptations due to positive selection, the routes taken in the first migrations of modern humans out of Africa, and the degree of admixture with archaic populations. Africa is also in dire need of effective medical interventions, and studies of genetic variation in Africans will shed light on the genetic basis of diseases and resistance to infectious diseases. Thus, we have tremendous potential to learn about human variation and evolutionary history and to positively impact human health care from studies of genetic diversity in Africa.
Collapse
Affiliation(s)
- Floyd A Reed
- Department of Biology, University of Maryland, College Park, MD 20742, USA
| | | |
Collapse
|
30
|
Abstract
Analyses of recently acquired genomic sequence data are leading to important insights into the early evolution of anatomically modern humans, as well as into the more recent demographic processes that accompanied the global radiation of Homo sapiens. Some of the new results contradict early, but still influential, conclusions that were based on analyses of gene trees from mitochondrial DNA and Y-chromosome sequences. In this review, we discuss the different genetic and statistical methods that are available for studying human population history, and identify the most plausible models of human evolution that can accommodate the contrasting patterns observed at different loci throughout the genome.
Collapse
Affiliation(s)
- Daniel Garrigan
- Division of Biotechnology, University of Arizona, Tucson, AZ 85721, USA
| | | |
Collapse
|
31
|
Fredman D, Sawyer SL, Strömqvist L, Mottagui-Tabar S, Kidd KK, Wahlestedt C, Chanock SJ, Brookes AJ. Nonsynonymous SNPs: validation characteristics, derived allele frequency patterns, and suggestive evidence for natural selection. Hum Mutat 2006; 27:173-86. [PMID: 16429399 DOI: 10.1002/humu.20289] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We experimentally investigated more than 1,200 entries in dbSNP that would change amino-acids (nsSNPs), using various subsets of DNA samples drawn from 18 global populations (approximately 1,000 subjects in total). First, we mined the data for any SNP features that correlated with a high validation rate. Useful predictors of valid SNPs included multiple submissions to dbSNP, having a dbSNP validation statement, and being present in a low number of ESTs. Together, these features improved validation rates by almost 10-fold. Higher-abundance SNPs (e.g., T/C variants) also validated more frequently. Second, we considered derived alleles and noted a considerably (approximately 10%) increased average derived allele frequency (DAF) in Europeans vs. Africans, plus a further increase in some other populations. This was not primarily due to an SNP ascertainment bias, nor to the effects of natural selection. Instead, it can be explained as a drift-based, progressive increase in DAF that occurs over many generations and becomes exaggerated during population bottlenecks. This observation could be used as the basis for novel DAF-based tests for comparing demographic histories. Finally, we considered individual marker patterns and identified 37 SNPs with allele frequency variance or FST values consistent with the effects of population-specific natural selection. Four particularly striking clusters of these markers were apparent, and three of these coincide with genes/regions from among only several dozen such domains previously suggested by others to carry signatures of selection.
Collapse
Affiliation(s)
- David Fredman
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Panhuis TM, Swanson WJ. Molecular evolution and population genetic analysis of candidate female reproductive genes in Drosophila. Genetics 2006; 173:2039-47. [PMID: 16783023 PMCID: PMC1569687 DOI: 10.1534/genetics.105.053611] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Molecular analyses in several taxa have consistently shown that genes involved in reproduction are rapidly evolving and subjected to positive selection. The mechanism behind this evolution is not clear, but several proposed hypotheses involve the coevolution between males and females. In Drosophila, several male reproductive proteins (Acps) involved in male-male and male-female interactions show evidence of rapid adaptive evolution. What has been missing from the Drosophila literature is the identification and analysis of female reproductive genes. Recently, an evolutionary expressed sequence tag analysis of Drosophila female reproductive tract genes identified 169 candidate female reproductive genes. Many of these candidate genes still await further molecular analysis and independent verification of positive selection. Our goal was to expand our understanding of the molecular evolution of Drosophila female reproductive genes with a detailed polymorphism and divergence study on seven additional candidate female reproductive genes and a reanalysis of two genes from the above study. We demonstrate that 6 candidate female genes of the 9 genes surveyed show evidence of positive selection using both polymorphism and divergence data. One of these proteins (CG17012) is modeled to reveal that the sites under selection fall around and within the active site of this protease, suggesting potential differences between species. We discuss our results in light of potential function as well as interaction with male reproductive proteins.
Collapse
Affiliation(s)
- Tami M Panhuis
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | |
Collapse
|
33
|
Zhao Z, Yu N, Fu YX, Li WH. Nucleotide variation and haplotype diversity in a 10-kb noncoding region in three continental human populations. Genetics 2006; 174:399-409. [PMID: 16783003 PMCID: PMC1569808 DOI: 10.1534/genetics.106.060301] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Noncoding regions are usually less subject to natural selection than coding regions and so may be more useful for studying human evolution. The recent surveys of worldwide DNA variation in four 10-kb noncoding regions revealed many interesting but also some incongruent patterns. Here we studied another 10-kb noncoding region, which is in 6p22. Sixty-six single-nucleotide polymorphisms were found among the 122 worldwide human sequences, resulting in 46 genotypes, from which 48 haplotypes were inferred. The distribution patterns of DNA variation, genotypes, and haplotypes suggest rapid population expansion in relatively recent times. The levels of polymorphism within human populations and divergence between humans and chimpanzees at this locus were generally similar to those for the other four noncoding regions. Fu and Li's tests rejected the neutrality assumption in the total sample and in the African sample but Tajima's test did not reject neutrality. A detailed examination of the contributions of various types of mutations to the parameters used in the neutrality tests clarified the discrepancy between these test results. The age estimates suggest a relatively young history in this region. Combining three autosomal noncoding regions, we estimated the long-term effective population size of humans to be 11,000 +/- 2800 using Tajima's estimator and 17,600 +/- 4700 using Watterson's estimator and the age of the most recent common ancestor to be 860,000 +/- 258,000 years ago.
Collapse
Affiliation(s)
- Zhongming Zhao
- Department of Psychiatry, Virginia Commonwealth University, Richmond, Virginia 23298, USA
| | | | | | | |
Collapse
|
34
|
Hamblin MT, Casa AM, Sun H, Murray SC, Paterson AH, Aquadro CF, Kresovich S. Challenges of detecting directional selection after a bottleneck: lessons from Sorghum bicolor. Genetics 2006; 173:953-64. [PMID: 16547110 PMCID: PMC1526520 DOI: 10.1534/genetics.105.054312] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2005] [Accepted: 03/13/2006] [Indexed: 11/18/2022] Open
Abstract
Multilocus surveys of sequence variation can be used to identify targets of directional selection, which are expected to have reduced levels of variation. Following a population bottleneck, the signal of directional selection may be hard to detect because many loci may have low variation by chance and the frequency spectrum of variation may be perturbed in ways that resemble the effects of selection. Cultivated Sorghum bicolor contains a subset of the genetic diversity found in its wild ancestor(s) due to the combined effects of a domestication bottleneck and human selection on traits associated with agriculture. As a framework for distinguishing between the effects of demography and selection, we sequenced 204 loci in a diverse panel of 17 cultivated S. bicolor accessions. Genomewide patterns of diversity depart strongly from equilibrium expectations with regard to the variance of the number of segregating sites, the site frequency spectrum, and haplotype configuration. Furthermore, gene genealogies of most loci with an excess of low frequency variants and/or an excess of segregating sites do not show the characteristic signatures of directional and diversifying selection, respectively. A simple bottleneck model provides an improved but inadequate fit to the data, suggesting the action of other population-level factors, such as population structure and migration. Despite a known history of recent selection, we find little evidence for directional selection, likely due to low statistical power and lack of an appropriate null model.
Collapse
Affiliation(s)
- Martha T Hamblin
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
The beneficial substitution of an allele shapes patterns of genetic variation at linked sites. Thus, in principle, adaptations can be mapped by looking for the signature of directional selection in polymorphism data. In practice, such efforts are hampered by the need for an accurate characterization of the demographic history of the species and of the effects of positive selection. In an attempt to circumvent these difficulties, researchers are increasingly taking a purely empirical approach, in which a large number of genomic regions are ordered by summaries of the polymorphism data, and loci with extreme values are considered to be likely targets of positive selection. We evaluated the reliability of the "empirical" approach, focusing on applications to human data and to maize. To do so, we considered a coalescent model of directional selection in a sensible demographic setting, allowing for selection on standing variation as well as on a new mutation. Our simulations suggest that while empirical approaches will identify several interesting candidates, they will also miss many--in some cases, most--loci of interest. The extent of the trade-off depends on the mode of positive selection and the demographic history of the population. Specifically, the false-discovery rate is higher when directional selection involves a recessive rather than a co-dominant allele, when it acts on a previously neutral rather than a new allele, and when the population has experienced a population bottleneck rather than maintained a constant size. One implication of these results is that, insofar as attributes of the beneficial mutation (e.g., the dominance coefficient) affect the power to detect targets of selection, genomic scans will yield an unrepresentative subset of loci that contribute to adaptations.
Collapse
Affiliation(s)
- Kosuke M. Teshima
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| | - Graham Coop
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| |
Collapse
|
36
|
Geraldes A, Ferrand N, Nachman MW. Contrasting patterns of introgression at X-linked loci across the hybrid zone between subspecies of the European rabbit (Oryctolagus cuniculus). Genetics 2006; 173:919-33. [PMID: 16582441 PMCID: PMC1526523 DOI: 10.1534/genetics.105.054106] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Hybrid zones provide an excellent opportunity for studying the consequences of genetic changes between closely related taxa. Here we investigate patterns of genetic variability and gene flow at four X-linked loci within and between the two subspecies of European rabbit (Oryctolagus cuniculus cuniculus and O. c. algirus). Two of these genes are located near the centromere and two are located near the telomeres. We observed a deep split in the genealogy of each gene with the root located along the deepest branch in each case, consistent with the evolution of these subspecies in allopatry. The two centromeric loci showed low levels of variability, high levels of linkage disequilibrium, and little introgression between subspecies. In contrast, the two telomeric loci showed high levels of variability, low levels of linkage disequilibrium, and considerable introgression between subspecies. These data are consistent with suppression of recombination near the centromere of the rabbit X chromosome. These observations support a view of speciation where genomic incompatibilities at different loci in the genome create localized differences in levels of gene flow between nascent species.
Collapse
Affiliation(s)
- Armando Geraldes
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Vairão, Spain.
| | | | | |
Collapse
|
37
|
Ray N, Currat M, Berthier P, Excoffier L. Recovering the geographic origin of early modern humans by realistic and spatially explicit simulations. Genome Res 2005; 15:1161-7. [PMID: 16077015 PMCID: PMC1182229 DOI: 10.1101/gr.3708505] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2005] [Accepted: 05/18/2005] [Indexed: 11/24/2022]
Abstract
Most genetic and archeological evidence argue in favor of a recent and unique origin of modern humans in sub-Saharan Africa, but no attempt has ever been made at quantifying the likelihood of this model, relative to alternative hypotheses of human evolution. In this paper, we investigate the possibility of using multilocus genetic data to correctly infer the geographic origin of humans, and to distinguish between a unique origin (UO) and a multiregional evolution (ME) model. We introduce here an approach based on realistic simulations of the genetic diversity expected after an expansion process of modern humans into the Old World from different possible areas and their comparison to observed data. We find that the geographic origin of the expansion can be correctly recovered provided that a large number of independent markers are used, and that precise information on past demography and potential places of origins is available. In that case, it is also possible to unambiguously distinguish between a unique origin and a multiregional model of human evolution. Application to a real human data set of 377 STR markers tested in 22 populations points toward a unique but surprising North African origin of modern humans. We show that this result could be due to ascertainment bias in favor of markers selected to be polymorphic in Europeans. A new estimation modeling this bias explicitly reveals that East Africa is the most likely place of origin for modern humans.
Collapse
Affiliation(s)
- Nicolas Ray
- Computational and Molecular Population Genetics Lab, Zoological Institute, University of Bern, 3012 Bern, Switzerland.
| | | | | | | |
Collapse
|
38
|
Saunders MA, Slatkin M, Garner C, Hammer MF, Nachman MW. The extent of linkage disequilibrium caused by selection on G6PD in humans. Genetics 2005; 171:1219-29. [PMID: 16020776 PMCID: PMC1456824 DOI: 10.1534/genetics.105.048140] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The gene coding for glucose-6-phosphate dehydrogenase (G6PD) is subject to positive selection by malaria in some human populations. The G6PD A- allele, which is common in sub-Saharan Africa, is associated with deficient enzyme activity and protection from severe malaria. To delimit the impact of selection on patterns of linkage disequilibrium (LD) and nucleotide diversity, we resequenced 5.1 kb at G6PD and approximately 2-3 kb at each of eight loci in a 2.5-Mb region roughly centered on G6PD in a diverse sub-Saharan African panel of 51 unrelated men (including 20 G6PD A-, 11 G6PD A+, and 20 G6PD B chromosomes). The signature of selection is evident in the absence of genetic variation at G6PD and at three neighboring loci within 0.9 Mb from G6PD among all individuals bearing G6PD A- alleles. A genomic region of approximately 1.6 Mb around G6PD was characterized by long-range LD associated with the A- alleles. These patterns of nucleotide variability and LD suggest that G6PD A- is younger than previous age estimates and has increased in frequency in sub-Saharan Africa due to strong selection (0.1 < s < 0.2). These results also show that selection can lead to nonrandom associations among SNPs over great physical and genetic distances, even in African populations.
Collapse
Affiliation(s)
- Matthew A Saunders
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | | | | | |
Collapse
|
39
|
Borge T, Webster MT, Andersson G, Saetre GP. Contrasting patterns of polymorphism and divergence on the Z chromosome and autosomes in two Ficedula flycatcher species. Genetics 2005; 171:1861-73. [PMID: 15956661 PMCID: PMC1456110 DOI: 10.1534/genetics.105.045120] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In geographic areas where pied and collared flycatchers (Ficedula hypoleuca and F. albicollis) breed in sympatry, hybridization occurs, leading to gene flow (introgression) between the two recently diverged species. Notably, while such introgression is observable at autosomal loci it is apparently absent at the Z chromosome, suggesting an important role for genes on the Z chromosome in creating reproductive isolation during speciation. To further understand the role of Z-linked loci in the formation of new species, we studied genetic variation of the two species from regions where they live in allopatry. We analyzed patterns of polymorphism and divergence in introns from 9 Z-linked and 23 autosomal genes in pied and collared flycatcher males. Average variation on the Z chromosome is greatly reduced compared to neutral expectations based on autosomal diversity in both species. We also observe significant heterogeneity between patterns of polymorphism and divergence at Z-linked loci and a relative absence of polymorphisms that are shared by the two species on the Z chromosome compared to the autosomes. We suggest that these observations may indicate the action of recurrent selective sweeps on the Z chromosome during the evolution of the two species, which may be caused by sexual selection acting on Z-linked genes. Alternatively, reduced variation on the Z chromosome could result from substantially higher levels of introgression at autosomal than at Z-linked loci or from a complex demographic history, such as a population bottleneck.
Collapse
Affiliation(s)
- Thomas Borge
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, N-0316 Oslo. Norway
| | | | | | | |
Collapse
|
40
|
Garrigan D, Mobasher Z, Kingan SB, Wilder JA, Hammer MF. Deep haplotype divergence and long-range linkage disequilibrium at xp21.1 provide evidence that humans descend from a structured ancestral population. Genetics 2005; 170:1849-56. [PMID: 15937130 PMCID: PMC1449746 DOI: 10.1534/genetics.105.041095] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Fossil evidence links human ancestry with populations that evolved from modern gracile morphology in Africa 130,000-160,000 years ago. Yet fossils alone do not provide clear answers to the question of whether the ancestors of all modern Homo sapiens comprised a single African population or an amalgamation of distinct archaic populations. DNA sequence data have consistently supported a single-origin model in which anatomically modern Africans expanded and completely replaced all other archaic hominin populations. Aided by a novel experimental design, we present the first genetic evidence that statistically rejects the null hypothesis that our species descends from a single, historically panmictic population. In a global sample of 42 X chromosomes, two African individuals carry a lineage of noncoding 17.5-kb sequence that has survived for >1 million years without any clear traces of ongoing recombination with other lineages at this locus. These patterns of deep haplotype divergence and long-range linkage disequilibrium are best explained by a prolonged period of ancestral population subdivision followed by relatively recent interbreeding. This inference supports human evolution models that incorporate admixture between divergent African branches of the genus Homo.
Collapse
Affiliation(s)
- Daniel Garrigan
- Genomic Analysis and Technology Core, University of Arizona, Tucson, Arizona 85721, USA
| | | | | | | | | |
Collapse
|
41
|
Hey J. On the number of New World founders: a population genetic portrait of the peopling of the Americas. PLoS Biol 2005; 3:e193. [PMID: 15898833 PMCID: PMC1131883 DOI: 10.1371/journal.pbio.0030193] [Citation(s) in RCA: 270] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2004] [Accepted: 03/25/2005] [Indexed: 12/03/2022] Open
Abstract
The founding of New World populations by Asian peoples is the focus of considerable archaeological and genetic research, and there persist important questions on when and how these events occurred. Genetic data offer great potential for the study of human population history, but there are significant challenges in discerning distinct demographic processes. A new method for the study of diverging populations was applied to questions on the founding and history of Amerind-speaking Native American populations. The model permits estimation of founding population sizes, changes in population size, time of population formation, and gene flow. Analyses of data from nine loci are consistent with the general portrait that has emerged from archaeological and other kinds of evidence. The estimated effective size of the founding population for the New World is fewer than 80 individuals, approximately 1% of the effective size of the estimated ancestral Asian population. By adding a splitting parameter to population divergence models it becomes possible to develop detailed portraits of human demographic history. Analyses of Asian and New World data support a model of a recent founding of the New World by a population of quite small effective size.
Collapse
Affiliation(s)
- Jody Hey
- Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, New Jersey, USA.
| |
Collapse
|
42
|
Vander Molen J, Frisse LM, Fullerton SM, Qian Y, Del Bosque-Plata L, Hudson RR, Di Rienzo A. Population genetics of CAPN10 and GPR35: implications for the evolution of type 2 diabetes variants. Am J Hum Genet 2005; 76:548-60. [PMID: 15696418 PMCID: PMC1199293 DOI: 10.1086/428784] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2004] [Accepted: 01/12/2005] [Indexed: 12/30/2022] Open
Abstract
A positional cloning study of type 2 diabetes in Mexican Americans identified a region, termed "NIDDM1," on chromosome 2q37 with significant linkage evidence. Haplotype combinations at the calpain-10 gene (CAPN10) within this region were shown to increase diabetes risk in several populations. On the basis of the thrifty genotype hypothesis, variants that increase susceptibility to type 2 diabetes under modern lifestyle conditions provided a survival advantage in past environments by increasing the efficiency of energy use and storage. Here, our goal is to make inferences about the evolutionary forces shaping variation in genes in the NIDDM1 region and to investigate the population genetics models that may underlie the thrifty genotype hypothesis. To this end, we surveyed sequence variation in CAPN10 and in an adjacent gene, G-protein-coupled receptor 35 (GPR35), in four population samples from different ethnic groups. These data revealed two distinct deviations from the standard neutral model in CAPN10, whereas GPR35 variation was largely consistent with neutrality. CAPN10 showed a significant deficit of variation in the haplotype class defined by the derived allele at SNP44, a polymorphism that is significantly associated with diabetes in meta-analysis studies. This suggests that this haplotype class was quickly driven to high frequency by positive natural selection. Interestingly, the derived allele at SNP44 is protective against diabetes. CAPN10 also showed a local excess of polymorphism and linkage disequilibrium decay in intron 13. Simulations show that this pattern may be explained by long-standing balancing selection that maintains multiple selected alleles. Alternatively, it is possible that the local mutation and recombination rates changed since the divergence of human and chimpanzee; this scenario does not require the action of natural selection on intron 13 variation.
Collapse
Affiliation(s)
- J Vander Molen
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | | | | | | | | | | | | |
Collapse
|
43
|
Garrigan D, Mobasher Z, Severson T, Wilder JA, Hammer MF. Evidence for archaic Asian ancestry on the human X chromosome. Mol Biol Evol 2004; 22:189-92. [PMID: 15483323 DOI: 10.1093/molbev/msi013] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The human RRM2P4 pseudogene has a pattern of nucleotide polymorphism that is unlike any locus published to date. A gene tree constructed from a 2.4-kb fragment of the RRM2P4 locus sequenced in a sample of 41 worldwide humans clearly roots in East Asia and has a most-recent common ancestor approximately 2 Myr before present. The presence of this basal lineage exclusively in Asia results in higher nucleotide diversity among non-Africans than among Africans. A global survey of a single-nucleotide polymorphism that is diagnostic for the basal, Asian lineage in 570 individuals shows that it occurs at frequencies up to 53% in south China, whereas only one of 177 surveyed Africans carries this archaic lineage. We suggest that this ancient lineage is a remnant of introgressive hybridization between expanding anatomically modern humans emerging from Africa and archaic populations in Eurasia.
Collapse
|