151
Wang J, Street NR, Scofield DG, Ingvarsson PK. Variation in Linked Selection and Recombination Drive Genomic Divergence during Allopatric Speciation of European and American Aspens. Mol Biol Evol 2016; 33:1754-67. [PMID: 26983554 PMCID: PMC4915356 DOI: 10.1093/molbev/msw051] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Indexed: 12/30/2022]
Abstract
Despite the global economic and ecological importance of forest trees, the genomic basis of differential adaptation and speciation in tree species is still poorly understood. Populus tremula and Populus tremuloides are two of the most widespread tree species in the Northern Hemisphere. Using whole-genome re-sequencing data of 24 P. tremula and 22 P. tremuloides individuals, we find that the two species diverged ∼2.2–3.1 million years ago, coinciding with the severing of the Bering land bridge and the onset of dramatic climatic oscillations during the Pleistocene. Both species have experienced substantial population expansions following long-term declines after species divergence. We detect widespread and heterogeneous genomic differentiation between species, and in accordance with the expectation of allopatric speciation, coalescent simulations suggest that neutral evolutionary processes can account for most of the observed patterns of genetic differentiation. However, there is an excess of regions exhibiting extreme differentiation relative to those expected under demographic simulations, which is indicative of the action of natural selection. Overall genetic differentiation is negatively associated with recombination rate in both species, providing strong support for a role of linked selection in generating the heterogeneous genomic landscape of differentiation between species. Finally, we identify a number of candidate regions and genes that may have been subject to positive and/or balancing selection during the speciation process.
Affiliation(s)
- Jing Wang
  - Department of Ecology and Environmental Science, Umeå University, Umeå, SE, Sweden
- Nathaniel R Street
  - Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, SE, Sweden
- Douglas G Scofield
  - Department of Ecology and Environmental Science, Umeå University, Umeå, SE, Sweden; Department of Ecology and Genetics: Evolutionary Biology, Uppsala University, Uppsala, Sweden; Uppsala Multidisciplinary Center for Advanced Computational Science, Uppsala University, Uppsala, Sweden
- Pär K Ingvarsson
  - Department of Ecology and Environmental Science, Umeå University, Umeå, SE, Sweden
152
Voorter CEM, Gerritsen KEH, Groeneweg M, Wieten L, Tilanus MGJ. The role of gene polymorphism in HLA class I splicing. Int J Immunogenet 2016; 43:65-78. [PMID: 26920492 DOI: 10.1111/iji.12256] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Received: 12/11/2015] [Revised: 01/28/2016] [Accepted: 02/04/2016] [Indexed: 01/15/2023]
Abstract
Among the large number of human leucocyte antigen (HLA) alleles, only a few have been identified with a nucleotide polymorphism impairing correct splicing. Those alleles show aberrant expression levels, due to either a direct effect of the polymorphism on the normal splice site or to the creation of an alternative splice site. Furthermore, in several studies, the presence of alternatively spliced HLA transcripts co-expressed with the mature spliced transcripts was reported. We evaluated the splice site sequences of all known HLA class I alleles and found that, beside the consensus GT and AG sequences at the intron borders, there were some other highly conserved nucleotides for the different class I genes. In this review, we summarize the splicing mechanism and evaluate what is known today about alternative splicing of HLA class I genes.
Affiliation(s)
- C E M Voorter
  - Department of Transplantation Immunology, Tissue Typing Laboratory, Maastricht University Medical Centre, Maastricht, the Netherlands
- K E H Gerritsen
  - Department of Transplantation Immunology, Tissue Typing Laboratory, Maastricht University Medical Centre, Maastricht, the Netherlands
- M Groeneweg
  - Department of Transplantation Immunology, Tissue Typing Laboratory, Maastricht University Medical Centre, Maastricht, the Netherlands
- L Wieten
  - Department of Transplantation Immunology, Tissue Typing Laboratory, Maastricht University Medical Centre, Maastricht, the Netherlands
- M G J Tilanus
  - Department of Transplantation Immunology, Tissue Typing Laboratory, Maastricht University Medical Centre, Maastricht, the Netherlands
153
Zhong X, Peng J, Shen QS, Chen JY, Gao H, Luan X, Yan S, Huang X, Zhang SJ, Xu L, Zhang X, Tan BCM, Li CY. RhesusBase PopGateway: Genome-Wide Population Genetics Atlas in Rhesus Macaque. Mol Biol Evol 2016; 33:1370-5. [PMID: 26882984 PMCID: PMC4839223 DOI: 10.1093/molbev/msw025] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Indexed: 01/20/2023]
Abstract
Although population genetics studies have significantly accelerated the evolutionary and functional interrogations of genes and regulations, limited polymorphism data are available for rhesus macaque, the model animal closely related to human. Here, we report the first genome-wide effort to identify and visualize the population genetics profile in rhesus macaque. On the basis of the whole-genome sequencing of 31 independent macaque animals, we profiled a comprehensive polymorphism map with 46,146,548 sites. The allele frequency for each polymorphism site, the haplotype structure, as well as multiple population genetics parameters were then calculated on a genome-wide scale. We further developed a specific interface, the RhesusBase PopGateway, to facilitate the visualization of these annotations, and highlighted the applications of this highly integrative platform in clarifying the selection signatures of genes and regulations in the context of the primate evolution. Overall, the updated RhesusBase provides a comprehensive monkey population genetics framework for in-depth evolutionary studies of human biology.
Affiliation(s)
- Xiaoming Zhong
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Jiguang Peng
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Qing Sunny Shen
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Jia-Yu Chen
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Han Gao
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Xuke Luan
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China; Peking-Tsinghua Center for Life Sciences, Beijing, China; Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Shouyu Yan
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Xin Huang
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Shi-Jian Zhang
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Luying Xu
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Xiuqin Zhang
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
- Bertrand Chin-Ming Tan
  - Department of Biomedical Sciences and Graduate Institute of Biomedical Sciences, College of Medicine, Chang Gung University, Tao-Yuan, Taiwan; Molecular Medicine Research Center, Chang Gung University, Tao-Yuan, Taiwan
- Chuan-Yun Li
  - Beijing Key Laboratory of Cardiometabolic Molecular Medicine, Institute of Molecular Medicine, Peking University, Beijing, China
154
Voorter CE, Groeneweg M, Groeneveld L, Tilanus MG. Uncommon HLA alleles identified by hemizygous ultra-high Sanger sequencing: haplotype associations and reconsideration of their assignment in the Common and Well-Documented catalogue. Hum Immunol 2016; 77:184-90. [DOI: 10.1016/j.humimm.2015.11.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 02/27/2015] [Revised: 10/23/2015] [Accepted: 11/19/2015] [Indexed: 01/24/2023]
155
de Filippo C, Key FM, Ghirotto S, Benazzo A, Meneu JR, Weihmann A, Parra G, Green ED, Andrés AM. Recent Selection Changes in Human Genes under Long-Term Balancing Selection. Mol Biol Evol 2016; 33:1435-47. [PMID: 26831942 DOI: 10.1093/molbev/msw023] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Indexed: 01/22/2023]
Abstract
Balancing selection is an important evolutionary force that maintains genetic and phenotypic diversity in populations. Most studies in humans have focused on long-standing balancing selection, which persists over long periods of time and is generally shared across populations. But balanced polymorphisms can also promote fast adaptation, especially when the environment changes. To better understand the role of previously balanced alleles in novel adaptations, we analyzed in detail four loci as case examples of this mechanism. These loci show hallmark signatures of long-term balancing selection in African populations, but not in Eurasian populations. The disparity between populations is due to changes in allele frequencies, with intermediate frequency alleles in Africans (likely due to balancing selection) segregating instead at low- or high-derived allele frequency in Eurasia. We explicitly tested the support for different evolutionary models with an approximate Bayesian computation approach and show that the patterns in PKDREJ, SDR39U1, and ZNF473 are best explained by recent changes in selective pressure in certain populations. Specifically, we infer that alleles previously under long-term balancing selection, or alleles linked to them, were recently targeted by positive selection in Eurasian populations. Balancing selection thus likely served as a source of functional alleles that mediated subsequent adaptations to novel environments.
Affiliation(s)
- Cesare de Filippo
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Felix M Key
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Silvia Ghirotto
  - Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
- Andrea Benazzo
  - Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
- Juan R Meneu
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Antje Weihmann
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Genís Parra
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Eric D Green
  - National Human Genome Research Institute, National Institutes of Health, Bethesda, MD
- Aida M Andrés
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
156
The influence of genetic drift on the formation and stability of polymorphisms arising from negative frequency-dependent selection. J Theor Biol 2016; 391:51-64. [DOI: 10.1016/j.jtbi.2015.11.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 06/18/2015] [Revised: 11/13/2015] [Accepted: 11/17/2015] [Indexed: 11/20/2022]
157
Genes with monoallelic expression contribute disproportionately to genetic diversity in humans. Nat Genet 2016; 48:231-237. [PMID: 26808112 PMCID: PMC4942303 DOI: 10.1038/ng.3493] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Received: 12/14/2014] [Accepted: 12/23/2015] [Indexed: 12/20/2022]
Abstract
An unexpectedly large number of human autosomal genes are subject to monoallelic expression (MAE). Our analysis of 4,227 such genes uncovers surprisingly high genetic variation across human populations. This increased diversity is unlikely to reflect relaxed purifying selection. Remarkably, MAE genes exhibit an elevated recombination rate and an increased density of hypermutable sequence contexts. However, these factors do not fully account for the increased diversity. We find that the elevated nucleotide diversity of MAE genes is also associated with greater allelic age: variants in these genes tend to be older and are enriched in polymorphisms shared by Neanderthals and chimpanzees. Both synonymous and nonsynonymous alleles of MAE genes have elevated average population frequencies. We also observed strong enrichment of the MAE signature among genes reported to evolve under balancing selection. We propose that an important biological function of widespread MAE might be the generation of cell-to-cell heterogeneity; the increased genetic variation contributes to this heterogeneity.
158
Deschamps M, Laval G, Fagny M, Itan Y, Abel L, Casanova JL, Patin E, Quintana-Murci L. Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes. Am J Hum Genet 2016; 98:5-21. [PMID: 26748513 DOI: 10.1016/j.ajhg.2015.11.014] [Citation(s) in RCA: 174] [Impact Index Per Article: 19.3] [Received: 08/17/2015] [Accepted: 11/06/2015] [Indexed: 01/25/2023]
Abstract
Human genes governing innate immunity provide a valuable tool for the study of the selective pressure imposed by microorganisms on host genomes. A comprehensive, genome-wide study of how selective constraints and adaptations have driven the evolution of innate immunity genes is missing. Using full-genome sequence variation from the 1000 Genomes Project, we first show that innate immunity genes have globally evolved under stronger purifying selection than the remainder of protein-coding genes. We identify a gene set under the strongest selective constraints, mutations in which are likely to predispose individuals to life-threatening disease, as illustrated by STAT1 and TRAF3. We then evaluate the occurrence of local adaptation and detect 57 high-scoring signals of positive selection at innate immunity genes, variation in which has been associated with susceptibility to common infectious or autoimmune diseases. Furthermore, we show that most adaptations targeting coding variation have occurred in the last 6,000-13,000 years, the period at which populations shifted from hunting and gathering to farming. Finally, we show that innate immunity genes present higher Neandertal introgression than the remainder of the coding genome. Notably, among the genes presenting the highest Neandertal ancestry, we find the TLR6-TLR1-TLR10 cluster, which also contains functional adaptive variation in Europeans. This study identifies highly constrained genes that fulfill essential, non-redundant functions in host survival and reveals others that are more permissive to change (containing variation acquired from archaic hominins or adaptive variants in specific populations), improving our understanding of the relative biological importance of innate immunity pathways in natural conditions.
Affiliation(s)
- Matthieu Deschamps
  - Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
- Guillaume Laval
  - Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
- Maud Fagny
  - Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
- Yuval Itan
  - St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA
- Laurent Abel
  - St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France
- Jean-Laurent Casanova
  - St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France; Howard Hughes Medical Institute, New York, NY 10065, USA; Pediatric Hematology-Immunology Unit, Necker Hospital for Sick Children, 75015 Paris, France
- Etienne Patin
  - Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
- Lluis Quintana-Murci
  - Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
159
Chakraborty M, Fry JD. Evidence that Environmental Heterogeneity Maintains a Detoxifying Enzyme Polymorphism in Drosophila melanogaster. Curr Biol 2015; 26:219-223. [PMID: 26748852 DOI: 10.1016/j.cub.2015.11.049] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Received: 09/25/2015] [Revised: 11/09/2015] [Accepted: 11/11/2015] [Indexed: 11/28/2022]
Abstract
Environmental heterogeneity is thought to be an important process maintaining genetic variation in populations [1-4]: if alternative alleles are favored in different environments, a stable polymorphism can be maintained [1, 5, 6]. This situation has been hypothesized to occur in genes encoding multi-substrate enzymes [7], in which changes that increase activity with one substrate typically decrease activity with others [8-10], but examples of polymorphisms maintained by this mechanism are rare. Here, we present evidence that a polymorphism in an enzyme gene in Drosophila melanogaster is maintained by such a trade-off. The mitochondrially localized aldehyde dehydrogenase in D. melanogaster has two important functions: detoxifying acetaldehyde derived from dietary ethanol [11] and detoxifying larger aldehydes produced as byproducts of oxidative phosphorylation [12]. A derived variant of the enzyme, Leu479Phe, is present in moderate frequencies in most temperate populations but is rare in more ethanol-averse tropical populations. Using purified recombinant protein, we show that the Leu-Phe substitution increases turnover rate of acetaldehyde but decreases turnover rate of larger aldehydes. Furthermore, using transgenic fly lines, we show that the substitution increases lifetime fitness on medium supplemented with an ecologically relevant ethanol concentration but decreases fitness on medium lacking ethanol. The strong, opposing selection pressures, coupled with documented highly variable ethanol concentrations in breeding sites of temperate populations, implicate an essential role for environmental heterogeneity in maintaining the polymorphism.
Affiliation(s)
- Mahul Chakraborty
  - Department of Biology, University of Rochester, Rochester, NY 14627, USA
- James D Fry
  - Department of Biology, University of Rochester, Rochester, NY 14627, USA
160
He Y, Wang M, Huang X, Li R, Xu H, Xu S, Jin L. A probabilistic method for testing and estimating selection differences between populations. Genome Res 2015; 25:1903-1909. [PMID: 26463656 PMCID: PMC4665011 DOI: 10.1101/gr.192336.115] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Received: 03/22/2015] [Accepted: 10/13/2015] [Indexed: 01/18/2023]
Abstract
Human populations around the world encounter various environmental challenges and, consequently, develop genetic adaptations to different selection forces. Identifying the differences in natural selection between populations is critical for understanding the roles of specific genetic variants in evolutionary adaptation. Although numerous methods have been developed to detect genetic loci under recent directional selection, a probabilistic solution for testing and quantifying selection differences between populations is lacking. Here we report the development of a probabilistic method for testing and estimating selection differences between populations. By use of a probabilistic model of genetic drift and selection, we showed that logarithm odds ratios of allele frequencies provide estimates of the differences in selection coefficients between populations. The estimates approximate a normal distribution, and variance can be estimated using genome-wide variants. This allows us to quantify differences in selection coefficients and to determine the confidence intervals of the estimate. Our work also revealed the link between genetic association testing and hypothesis testing of selection differences. It therefore supplies a solution for hypothesis testing of selection differences. This method was applied to a genome-wide data analysis of Han and Tibetan populations. The results confirmed that both the EPAS1 and EGLN1 genes are under statistically different selection in Han and Tibetan populations. We further estimated differences in the selection coefficients for genetic variants involved in melanin formation and determined their confidence intervals between continental population groups. Application of the method to empirical data demonstrated the outstanding capability of this novel approach for testing and quantifying differences in natural selection.
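The central quantity in this abstract, a log odds ratio of allele frequencies compared against a genome-wide background, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy frequencies, the centring on the background mean, and the omission of any scaling by divergence time are all assumptions made here for illustration.

```python
import math
from statistics import mean, stdev


def log_odds_ratio(p1: float, p2: float) -> float:
    """Log odds ratio of one allele's frequency in population 1 vs population 2.

    Both frequencies must lie strictly between 0 and 1.
    """
    return math.log(p1 / (1.0 - p1)) - math.log(p2 / (1.0 - p2))


def selection_difference(p1, p2, background):
    """Return (estimate, z_score, ci95) for a focal variant.

    `background` is a list of (freq_pop1, freq_pop2) pairs for genome-wide
    variants; their log odds ratios supply the mean and standard deviation
    used for the normal approximation (an assumption of this sketch).
    """
    bg = [log_odds_ratio(a, b) for a, b in background]
    mu, sd = mean(bg), stdev(bg)
    est = log_odds_ratio(p1, p2)
    z = (est - mu) / sd
    ci95 = (est - 1.96 * sd, est + 1.96 * sd)
    return est, z, ci95


# Hypothetical frequencies, not taken from the paper.
background = [(0.30, 0.28), (0.55, 0.60), (0.10, 0.12), (0.72, 0.70), (0.45, 0.41)]
est, z, ci95 = selection_difference(0.80, 0.20, background)
print(f"estimate={est:.3f}, z={z:.2f}, 95% CI=({ci95[0]:.3f}, {ci95[1]:.3f})")
```

A large absolute z-score flags a variant whose frequency difference between the two populations is unusual relative to the genome-wide background, which is the kind of signal the abstract describes for EPAS1 and EGLN1.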
Affiliation(s)
- Yungang He
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Minxian Wang
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Xin Huang
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Ran Li
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Hongyang Xu
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Shuhua Xu
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Li Jin
  - Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China; State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200433, China
161
Schrider DR, Kern AD. Inferring Selective Constraint from Population Genomic Data Suggests Recent Regulatory Turnover in the Human Brain. Genome Biol Evol 2015; 7:3511-28. [PMID: 26590212 PMCID: PMC4700959 DOI: 10.1093/gbe/evv228] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Indexed: 12/23/2022]
Abstract
The comparative genomics revolution of the past decade has enabled the discovery of functional elements in the human genome via sequence comparison. While that is so, an important class of elements, those specific to humans, is entirely missed by searching for sequence conservation across species. Here we present an analysis based on variation data among human genomes that utilizes a supervised machine learning approach for the identification of human-specific purifying selection in the genome. Using only allele frequency information from the complete low-coverage 1000 Genomes Project data set in conjunction with a support vector machine trained from known functional and nonfunctional portions of the genome, we are able to accurately identify portions of the genome constrained by purifying selection. Our method identifies previously known human-specific gains or losses of function and uncovers many novel candidates. Candidate targets for gain and loss of function along the human lineage include numerous putative regulatory regions of genes essential for normal development of the central nervous system, including a significant enrichment of gain of function events near neurotransmitter receptor genes. These results are consistent with regulatory turnover being a key mechanism in the evolution of human-specific characteristics of brain development. Finally, we show that the majority of the genome is unconstrained by natural selection currently, in agreement with what has been estimated from phylogenetic methods but in sharp contrast to estimates based on transcriptomics or other high-throughput functional methods.
Affiliation(s)
- Andrew D Kern
  - Department of Genetics, Rutgers University, Piscataway; Human Genetics Institute of New Jersey, Piscataway, New Jersey
162
Review: can diet influence the selective advantage of mitochondrial DNA haplotypes? Biosci Rep 2015; 35:BSR20150232. [PMID: 26543031 PMCID: PMC4708006 DOI: 10.1042/bsr20150232] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Received: 09/10/2015] [Accepted: 11/05/2015] [Indexed: 01/12/2023]
Abstract
This review explores the potential for changes in dietary macronutrients to differentially influence mitochondrial bioenergetics and thereby the frequency of mtDNA haplotypes in natural populations. Such dietary modification may be seasonal or result from biogeographic or demographic shifts. Mechanistically, mtDNA haplotypes may influence the activity of the electron transport system (ETS), retrograde signalling to the nuclear genome and affect epigenetic modifications. Thus, differential provisioning by macronutrients may lead to selection through changes in the levels of ATP production, modulation of metabolites (including AMP, reactive oxygen species (ROS) and the NAD+/NADH ratio) and potentially complex epigenetic effects. The exquisite complexity of dietary influence on haplotype frequency is further illustrated by the fact that macronutrients may differentially influence the selective advantage of specific mutations in different life-history stages. In Drosophila, complex I mutations may affect larval growth because dietary nutrients are fed through this complex in immaturity. In contrast, the majority of electrons are provided to complex III in adult flies. We conclude the review with a case study that considers specific interactions between diet and complex I of the ETS. Complex I is the first enzyme of the mitochondrial ETS and co-ordinates in the oxidation of NADH and transfer of electrons to ubiquinone. Although the supposition that mtDNA variants may be selected upon by dietary macronutrients could be intuitively consistent to some and counter intuitive to others, it must face a multitude of scientific hurdles before it can be recognized.
163
Band G, Rockett KA, Spencer CCA, Kwiatkowski DP. A novel locus of resistance to severe malaria in a region of ancient balancing selection. Nature 2015; 526:253-7. [PMID: 26416757 PMCID: PMC4629224 DOI: 10.1038/nature15390] [Citation(s) in RCA: 131] [Impact Index Per Article: 13.1] [Received: 09/19/2014] [Accepted: 08/10/2015] [Indexed: 12/13/2022]
Abstract
The high prevalence of sickle haemoglobin in Africa shows that malaria has been a major force for human evolutionary selection, but surprisingly few other polymorphisms have been proven to confer resistance to malaria in large epidemiological studies. To address this problem, we conducted a multi-centre genome-wide association study (GWAS) of life-threatening Plasmodium falciparum infection (severe malaria) in over 11,000 African children, with replication data in a further 14,000 individuals. Here we report a novel malaria resistance locus close to a cluster of genes encoding glycophorins that are receptors for erythrocyte invasion by P. falciparum. We identify a haplotype at this locus that provides 33% protection against severe malaria (odds ratio = 0.67, 95% confidence interval = 0.60-0.76, P value = 9.5 × 10⁻¹¹) and is linked to polymorphisms that have previously been shown to have features of ancient balancing selection, on the basis of haplotype sharing between humans and chimpanzees. Taken together with previous observations on the malaria-protective role of blood group O, these data reveal that two of the strongest GWAS signals for severe malaria lie in or close to genes encoding the glycosylated surface coat of the erythrocyte cell membrane, both within regions of the genome where it appears that evolution has maintained diversity for millions of years. These findings provide new insights into the host-parasite interactions that are critical in determining the outcome of malaria infection.
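The abstract pairs "33% protection" with an odds ratio of 0.67, which is consistent with reading protection as one minus the odds ratio. The short sketch below reproduces that arithmetic and, as an assumption not stated in the abstract, applies the same conversion to the confidence limits; the function name is hypothetical.

```python
def protection_percent(odds_ratio: float) -> float:
    """Percent reduction in odds implied by an odds ratio below 1."""
    return (1.0 - odds_ratio) * 100.0


point = protection_percent(0.67)
# The upper odds-ratio limit gives the lower protection limit, and vice versa.
lower = protection_percent(0.76)
upper = protection_percent(0.60)
print(f"{point:.0f}% protection (95% CI {lower:.0f}% to {upper:.0f}%)")
```

Strictly, this is a reduction in the odds of severe malaria; it approximates a reduction in risk only when the outcome is rare in the population studied.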
|
164
|
Manjurano A, Sepúlveda N, Nadjm B, Mtove G, Wangai H, Maxwell C, Olomi R, Reyburn H, Drakeley CJ, Riley EM, Clark TG. USP38, FREM3, SDC1, DDC, and LOC727982 Gene Polymorphisms and Differential Susceptibility to Severe Malaria in Tanzania. J Infect Dis 2015; 212:1129-39. [PMID: 25805752 PMCID: PMC4559194 DOI: 10.1093/infdis/jiv192] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Received: 01/29/2015] [Accepted: 03/16/2015] [Indexed: 12/20/2022] Open
Abstract
Populations exposed to Plasmodium falciparum infection develop genetic mechanisms of protection against severe malarial disease. Despite decades of genetic epidemiological research, the sickle cell trait (HbAS) sickle cell polymorphism, ABO blood group, and other hemoglobinopathies remain the few major determinants in severe malaria to be replicated across different African populations and study designs. Within a case-control study in a region of high transmission in Tanzania (n = 983), we investigated the role of 40 new loci identified in recent genome-wide studies. In 32 loci passing quality control procedures, we found polymorphisms in USP38, FREM3, SDC1, DDC, and LOC727982 genes to be putatively associated with differential susceptibility to severe malaria. Established candidates explained 7.4% of variation in severe malaria risk (HbAS polymorphism, 6.3%; α-thalassemia, 0.3%; ABO group, 0.3%; and glucose-6-phosphate dehydrogenase deficiency, 0.5%) and the new polymorphisms, another 4.3%. The regions encompassing the loci identified are promising targets for the design of future treatment and control interventions.
Affiliation(s)
- Alphaxard Manjurano
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
- National Institute for Medical Research, Dar es Salaam, Tanzania
| | - Nuno Sepúlveda
- Departments of Immunology and Infection
- Centre of Statistics and Applications, University of Lisbon, Portugal
| | | | - George Mtove
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
- National Institute for Medical Research, Dar es Salaam, Tanzania
| | - Hannah Wangai
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
| | - Caroline Maxwell
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
| | - Raimos Olomi
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
| | - Hugh Reyburn
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
- Departments of Immunology and Infection
| | - Christopher J. Drakeley
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
- Departments of Immunology and Infection
| | - Eleanor M. Riley
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi
- Departments of Immunology and Infection
| | - Taane G. Clark
- Pathogen Molecular Biology
- Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, United Kingdom
|
165
|
Haasl RJ, Payseur BA. Fifteen years of genomewide scans for selection: trends, lessons and unaddressed genetic sources of complication. Mol Ecol 2015. [PMID: 26224644 DOI: 10.1111/mec.13339] [Citation(s) in RCA: 109] [Impact Index Per Article: 10.9] [Indexed: 12/13/2022]
Abstract
Genomewide scans for natural selection (GWSS) have become increasingly common over the last 15 years due to increased availability of genome-scale genetic data. Here, we report a representative survey of GWSS from 1999 to present and find that (i) between 1999 and 2009, 35 of 49 (71%) GWSS focused on human, while from 2010 to present, only 38 of 83 (46%) of GWSS focused on human, indicating increased focus on nonmodel organisms; (ii) the large majority of GWSS incorporate interpopulation or interspecific comparisons using, for example, F_ST, cross-population extended haplotype homozygosity or the ratio of nonsynonymous to synonymous substitutions; (iii) most GWSS focus on detection of directional selection rather than other modes such as balancing selection; and (iv) in human GWSS, there is a clear shift after 2004 from microsatellite markers to dense SNP data. A survey of GWSS meant to identify loci positively selected in response to severe hypoxic conditions supports an approach to GWSS in which a list of a priori candidate genes based on potential selective pressures is used to filter the list of significant hits a posteriori. We also discuss four frequently ignored determinants of genomic heterogeneity that complicate GWSS: mutation, recombination, selection and the genetic architecture of adaptive traits. We recommend that GWSS methodology should better incorporate aspects of genomewide heterogeneity using empirical estimates of relevant parameters and/or realistic, whole-chromosome simulations to improve interpretation of GWSS results. Finally, we argue that knowledge of potential selective agents improves interpretation of GWSS results and that new methods focused on correlations between environmental variables and genetic variation can help automate this approach.
Affiliation(s)
- Ryan J Haasl
- Department of Biology, University of Wisconsin-Platteville, 1 University Plaza, Platteville, WI, 53818, USA
| | - Bret A Payseur
- Laboratory of Genetics, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA
|
166
|
Azevedo L, Serrano C, Amorim A, Cooper DN. Trans-species polymorphism in humans and the great apes is generally maintained by balancing selection that modulates the host immune response. Hum Genomics 2015; 9:21. [PMID: 26337052 PMCID: PMC4559023 DOI: 10.1186/s40246-015-0043-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Received: 07/09/2015] [Accepted: 08/20/2015] [Indexed: 12/20/2022] Open
Abstract
Known examples of ancient identical-by-descent genetic variants being shared between evolutionarily related species, known as trans-species polymorphisms (TSPs), result from counterbalancing selective forces acting on target genes to confer resistance against infectious agents. To date, putative TSPs between humans and other primate species have been identified for the highly polymorphic major histocompatibility complex (MHC), the histo-blood ABO group, two antiviral genes (ZC3HAV1 and TRIM5), an autoimmunity-related gene LAD1 and several non-coding genomic segments with a putative regulatory role. Although the number of well-characterized TSPs under long-term balancing selection is still very small, these examples are connected by a common thread, namely that they involve genes with key roles in the immune system and, in heterozygosity, appear to confer genetic resistance to pathogens. Here, we review known cases of shared polymorphism that appear to be under long-term balancing selection in humans and the great apes. Although the specific selective agent(s) responsible are still unknown, these TSPs may nevertheless be seen as constituting important adaptive events that have occurred during the evolution of the primate immune system.
Affiliation(s)
- Luisa Azevedo
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.
- IPATIMUP-Institute of Molecular Pathology and Immunology, University of Porto, Rua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal.
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, s/n, 4169-007, Porto, Portugal.
| | - Catarina Serrano
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.
- IPATIMUP-Institute of Molecular Pathology and Immunology, University of Porto, Rua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal.
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, s/n, 4169-007, Porto, Portugal.
| | - Antonio Amorim
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Porto, Portugal.
- IPATIMUP-Institute of Molecular Pathology and Immunology, University of Porto, Rua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal.
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, s/n, 4169-007, Porto, Portugal.
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK.
|
167
|
Fish I, Boissinot S. Contrasted patterns of variation and evolutionary convergence at the antiviral OAS1 gene in old world primates. Immunogenetics 2015; 67:487-99. [PMID: 26156123 PMCID: PMC4809017 DOI: 10.1007/s00251-015-0855-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 05/04/2015] [Accepted: 06/24/2015] [Indexed: 11/13/2022]
Abstract
The oligoadenylate synthetase 1 (OAS1) enzyme acts as an innate sensor of viral infection and plays a major role in the defense against a wide diversity of viruses. Polymorphisms at OAS1 have been shown to correlate with differential susceptibility to several infections of great public health significance, including hepatitis C virus, SARS coronavirus, and West Nile virus. Population genetics analyses in hominoids have revealed interesting evolutionary patterns. In Central African chimpanzee, OAS1 has evolved under long-term balancing selection, resulting in the persistence of polymorphisms since the origin of hominoids, whereas human populations have acquired and retained OAS1 alleles from Neanderthal and Denisovan origin. We decided to further investigate the evolution of OAS1 in primates by characterizing intra-specific variation in four species commonly used as models in infectious disease research: the rhesus macaque, the cynomolgus macaque, the olive baboon, and the Guinea baboon. In baboons, OAS1 harbors a very low level of variation. In contrast, OAS1 in macaques exhibits a level of polymorphism far greater than the genomic average, which is consistent with the action of balancing selection. The region of the enzyme that directly interacts with viral RNA, the RNA-binding domain, contains a number of polymorphisms likely to affect the RNA-binding affinity of OAS1. This strongly suggests that pathogen-driven balancing selection acting on the RNA-binding domain of OAS1 is maintaining variation at this locus. Interestingly, we found that a number of polymorphisms involved in RNA-binding were shared between macaques and chimpanzees. This represents an unusual case of convergent polymorphism.
Affiliation(s)
- Ian Fish
- Biology Department, Queens College, the City University of New York, Flushing, NY USA
- Graduate Center, the City University of New York, New York, NY USA
|
168
|
Jackson JA. Immunology in wild nonmodel rodents: an ecological context for studies of health and disease. Parasite Immunol 2015; 37:220-32. [PMID: 25689683 PMCID: PMC7167918 DOI: 10.1111/pim.12180] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Received: 09/22/2014] [Accepted: 02/04/2015] [Indexed: 12/16/2022]
Abstract
Transcriptomic methods are set to revolutionize the study of the immune system in naturally occurring nonmodel organisms. With this in mind, the present article focuses on ways in which the use of 'nonmodel' rodents (not the familiar laboratory species) can advance studies into the classical, but ever relevant, epidemiologic triad of immune defence, infectious disease and environment. For example, naturally occurring rodents are an interesting system in which to study the environmental stimuli that drive the development and homeostasis of the immune system and, by extension, to identify where these stimuli are altered in anthropogenic environments leading to the formation of immunopathological phenotypes. Measurement of immune expression may help define individual heterogeneity in infectious disease susceptibility and transmission and facilitate our understanding of infection dynamics and risk in the natural environment; furthermore, it may provide a means of surveillance that can filter individuals carrying previously unknown acute infections of potential ecological or zoonotic importance. Finally, the study of immunology in wild animals may reveal interactions within the immune system and between immunity and other organismal traits that are not observable under restricted laboratory conditions. Potentiating much of this is the possibility of combining gene expression profiles with analytical tools derived from ecology and systems biology to reverse engineer interaction networks between immune responses, other organismal traits and the environment (including symbiont exposures), revealing regulatory architecture. Such holistic studies promise to link ecology, epidemiology and immunology in natural systems in a unified approach that can illuminate important problems relevant to human health and animal welfare and production.
Affiliation(s)
- J A Jackson
- IBERS, Aberystwyth University, Aberystwyth, Ceredigion, UK
|
169
|
Hunter-Zinck H, Clark AG. Aberrant Time to Most Recent Common Ancestor as a Signature of Natural Selection. Mol Biol Evol 2015; 32:2784-97. [PMID: 26093129 DOI: 10.1093/molbev/msv142] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Indexed: 02/07/2023] Open
Abstract
Natural selection inference methods often target one mode of selection of a particular age and strength. However, detecting multiple modes simultaneously, or with atypical representations, would be advantageous for understanding a population's evolutionary history. We have developed an anomaly detection algorithm using distributions of pairwise time to most recent common ancestor (TMRCA) to simultaneously detect multiple modes of natural selection in whole-genome sequences. As natural selection distorts local genealogies in distinct ways, the method uses pairwise TMRCA distributions, which approximate genealogies at a nonrecombining locus, to detect distortions without targeting a specific mode of selection. We evaluate the performance of our method, TSel, for both positive and balancing selection over different time-scales and selection strengths and compare TSel's performance with that of other methods. We then apply TSel to the Complete Genomics diversity panel, a set of human whole-genome sequences, and recover loci previously inferred to be under positive or balancing selection.
Affiliation(s)
- Haley Hunter-Zinck
- Department of Biological Statistics and Computational Biology, Cornell University
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University
|
170
|
Fijarczyk A, Babik W. Detecting balancing selection in genomes: limits and prospects. Mol Ecol 2015; 24:3529-45. [DOI: 10.1111/mec.13226] [Citation(s) in RCA: 144] [Impact Index Per Article: 14.4] [Received: 02/17/2015] [Revised: 04/27/2015] [Accepted: 04/30/2015] [Indexed: 12/17/2022]
Affiliation(s)
- Anna Fijarczyk
- Institute of Environmental Sciences; Jagiellonian University; Gronostajowa 7 30-387 Kraków Poland
| | - Wiesław Babik
- Institute of Environmental Sciences; Jagiellonian University; Gronostajowa 7 30-387 Kraków Poland
|
171
|
Trans-Species Polymorphism in Immune Genes: General Pattern or MHC-Restricted Phenomenon? J Immunol Res 2015; 2015:838035. [PMID: 26090501 PMCID: PMC4458282 DOI: 10.1155/2015/838035] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Received: 03/25/2015] [Accepted: 05/04/2015] [Indexed: 11/24/2022] Open
Abstract
Immunity exhibits extraordinarily high levels of variation. Evolution of the immune system in response to host-pathogen interactions in particular ecological contexts appears to be frequently associated with diversifying selection increasing the genetic variability. Many studies have documented that immunologically relevant polymorphism observed today may be tens of millions of years old and may predate the emergence of present species. This pattern can be explained by the concept of trans-species polymorphism (TSP) predicting the maintenance and sharing of favourable functionally important alleles of immune-related genes between species due to ongoing balancing selection. Despite the generality of this concept explaining the long-lasting adaptive variation inherited from ancestors, current research in TSP has focused overwhelmingly on the major histocompatibility complex (MHC). In this review we summarise the evidence available on TSP in human and animal immune genes to reveal that TSP is not an MHC-specific evolutionary pattern. Further research should clearly pay more attention to the investigation of TSP in innate immune genes and especially pattern recognition receptors which are promising candidates for this type of evolution. More effort should also be made to distinguish TSP from convergent evolution and adaptive introgression. Identification of balanced TSP variants may represent an accurate approach in evolutionary medicine to recognise disease-resistance alleles.
|
172
|
Halldórsdóttir K, Árnason E. Trans-species polymorphism at antimicrobial innate immunity cathelicidin genes of Atlantic cod and related species. PeerJ 2015; 3:e976. [PMID: 26038731 PMCID: PMC4451034 DOI: 10.7717/peerj.976] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Received: 03/24/2015] [Accepted: 05/05/2015] [Indexed: 12/27/2022] Open
Abstract
Natural selection, the most important force in evolution, comes in three forms. Negative purifying selection removes deleterious variation and maintains adaptations. Positive directional selection fixes beneficial variants, producing new adaptations. Balancing selection maintains variation in a population. Important mechanisms of balancing selection include heterozygote advantage, frequency-dependent advantage of rarity, and local and fluctuating episodic selection. A rare pathogen gains an advantage because host defenses are predominantly effective against prevalent types. Similarly, a rare immune variant gives its host an advantage because the prevalent pathogens cannot escape the host's apostatic defense. Due to the stochastic nature of evolution, neutral variation may accumulate on genealogical branches, but trans-species polymorphisms are rare under neutrality and are strong evidence for balancing selection. Balanced polymorphism maintains diversity at the major histocompatibility complex (MHC) in vertebrates. The Atlantic cod is missing genes for both MHC-II and CD4, vital parts of the adaptive immune system. Nevertheless, cod are healthy in their ecological niche, maintaining large populations that support major commercial fisheries. Innate immunity is of interest from an evolutionary perspective, particularly in taxa lacking adaptive immunity. Here, we analyze extensive amino acid and nucleotide polymorphisms of the cathelicidin gene family in Atlantic cod and closely related taxa. There are three major clusters, Cath1, Cath2, and Cath3, that we consider to be paralogous genes. There is extensive nucleotide and amino acid allelic variation between and within clusters. The major feature of the results is that the variation clusters by alleles and not by species in phylogenetic trees and discriminant analysis of principal components. 
Variation within the three groups shows trans-species polymorphism that is older than speciation and that is suggestive of balancing selection maintaining the variation. Using Bayesian and likelihood methods positive and negative selection is evident at sites in the conserved part of the genes and, to a larger extent, in the active part which also shows episodic diversifying selection, further supporting the argument for balancing selection.
Affiliation(s)
- Katrín Halldórsdóttir
- Institute of Life and Environmental Sciences, University of Iceland, Reykjavík, Iceland
| | - Einar Árnason
- Institute of Life and Environmental Sciences, University of Iceland, Reykjavík, Iceland
|
173
|
Teixeira JC, de Filippo C, Weihmann A, Meneu JR, Racimo F, Dannemann M, Nickel B, Fischer A, Halbwax M, Andre C, Atencia R, Meyer M, Parra G, Pääbo S, Andrés AM. Long-Term Balancing Selection in LAD1 Maintains a Missense Trans-Species Polymorphism in Humans, Chimpanzees, and Bonobos. Mol Biol Evol 2015; 32:1186-96. [PMID: 25605789 DOI: 10.1093/molbev/msv007] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Indexed: 12/22/2022] Open
Abstract
Balancing selection maintains advantageous genetic and phenotypic diversity in populations. When selection acts for long evolutionary periods selected polymorphisms may survive species splits and segregate in present-day populations of different species. Here, we investigate the role of long-term balancing selection in the evolution of protein-coding sequences in the Homo-Pan clade. We sequenced the exome of 20 humans, 20 chimpanzees, and 20 bonobos and detected eight coding trans-species polymorphisms (trSNPs) that are shared among the three species and have segregated for approximately 14 My of independent evolution. Although the majority of these trSNPs were found in three genes of the major histocompatibility locus cluster, we also uncovered one coding trSNP (rs12088790) in the gene LAD1. All these trSNPs show clustering of sequences by allele rather than by species and also exhibit other signatures of long-term balancing selection, such as segregating at intermediate frequency and lying in a locus with high genetic diversity. Here, we focus on the trSNP in LAD1, a gene that encodes for Ladinin-1, a collagenous anchoring filament protein of basement membrane that is responsible for maintaining cohesion at the dermal-epidermal junction; the gene is also an autoantigen responsible for linear IgA disease. This trSNP results in a missense change (Leucine257Proline) and, besides altering the protein sequence, is associated with changes in gene expression of LAD1.
Affiliation(s)
- João C Teixeira
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Cesare de Filippo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Antje Weihmann
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Juan R Meneu
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Fernando Racimo
- Department of Integrative Biology, University of California, Berkeley
| | - Michael Dannemann
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Birgit Nickel
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Anne Fischer
- International Center for Insect Physiology and Ecology, Nairobi, Kenya
| | - Michel Halbwax
- Clinique vétérinaire du Dr. Jacquemin, Maisons-Alfort, France
| | - Claudine Andre
- Lola Ya Bonobo sanctuary, Kinshasa, Democratic Republic Congo
| | - Rebeca Atencia
- Réserve Naturelle Sanctuaire à Chimpanzés de Tchimpounga, Jane Goodall Institute, Pointe-Noire, Republic of Congo
| | - Matthias Meyer
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Genís Parra
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Aida M Andrés
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
|
174
|
De Wit P, Pespeni MH, Palumbi SR. SNP genotyping and population genomics from expressed sequences - current advances and future possibilities. Mol Ecol 2015; 24:2310-23. [DOI: 10.1111/mec.13165] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Received: 12/13/2014] [Revised: 03/13/2015] [Accepted: 03/18/2015] [Indexed: 02/01/2023]
Affiliation(s)
- Pierre De Wit
- Department of Biology and Environmental Sciences; University of Gothenburg; Sven Lovén Centre for Marine Science - Tjärnö; Hättebäcksvägen 7 Strömstad SE-452 96 Sweden
| | - Melissa H. Pespeni
- Department of Biology; University of Vermont; Marsh Life Science; Rm 326A 109 Carrigan Drive Burlington VT 05405 USA
| | - Stephen R. Palumbi
- Department of Biology; Stanford University; Hopkins Marine Station 120 Ocean view Blvd. Pacific Grove CA 93950 USA
|
175
|
Gulisija D, Kim Y. Emergence of long-term balanced polymorphism under cyclic selection of spatially variable magnitude. Evolution 2015; 69:979-92. [PMID: 25707330 DOI: 10.1111/evo.12630] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Received: 04/09/2014] [Accepted: 02/15/2015] [Indexed: 01/09/2023]
Abstract
A fundamental question in evolutionary biology is what promotes genetic variation at nonneutral loci, a major precursor to adaptation in changing environments. In particular, balanced polymorphism under realistic evolutionary models of temporally varying environments in finite natural populations remains to be demonstrated. Here, we propose a novel mechanism of balancing selection under temporally varying fitnesses. Using forward-in-time computer simulations and mathematical analysis, we show that cyclic selection that spatially varies in magnitude, such as along an environmental gradient, can lead to elevated levels of nonneutral genetic polymorphism in finite populations. Balanced polymorphism is more likely with an increase in gene flow, magnitude and period of fitness oscillations, and spatial heterogeneity. This polymorphism-promoting effect is robust to small systematic fitness differences between competing alleles or to random environmental perturbation. Furthermore, we demonstrate analytically that protected polymorphism arises as spatially heterogeneous cyclic fitness oscillations generate a type of storage effect that leads to negative frequency dependent selection. Our findings imply that spatially variable cyclic environments can promote elevated levels of nonneutral genetic variation in natural populations.
Affiliation(s)
- Davorka Gulisija
- Department of Zoology, University of Wisconsin, Madison, Wisconsin, 53706; Current Address: Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, 19104
|
176
|
Mapping Bias Overestimates Reference Allele Frequencies at the HLA Genes in the 1000 Genomes Project Phase I Data. G3-GENES GENOMES GENETICS 2015; 5:931-41. [PMID: 25787242 PMCID: PMC4426377 DOI: 10.1534/g3.114.015784] [Citation(s) in RCA: 121] [Impact Index Per Article: 12.1] [Indexed: 11/18/2022]
Abstract
Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, such as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using NGS data at other genomic regions of high diversity.
|
177
|
Dellicour S, Michez D, Rasplus JY, Mardulyn P. Impact of past climatic changes and resource availability on the population demography of three food-specialist bees. Mol Ecol 2015; 24:1074-90. [PMID: 25612734 DOI: 10.1111/mec.13085] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Received: 04/08/2014] [Revised: 12/24/2014] [Accepted: 01/15/2015] [Indexed: 12/01/2022]
Abstract
Past climate change is known to have strongly impacted current patterns of genetic variation of animals and plants in Europe. However, ecological factors also have the potential to influence demographic history and thus patterns of genetic variation. In this study, we investigated the impact of past climate, and also the potential impact of host plant species abundance, on intraspecific genetic variation in three codistributed and related specialized solitary bees of the genus Melitta with very similar life history traits and dispersal capacities. We sequenced five independent loci in samples collected from the three species. Our analyses revealed that the species associated with the most abundant host plant species (Melitta leporina) displays unusually high genetic variation, to an extent that is seldom reported in phylogeographic studies of animals and plants. This suggests a potential role of food resource abundance in determining current patterns of genetic variation in specialized herbivorous insects. Patterns of genetic variation in the two other species indicated lower overall levels of diversity, and that M. nigricans could have experienced a recent range expansion. Ecological niche modelling of the three Melitta species and their main host plant species suggested a strong reduction in range size during the last glacial maximum. Comparing observed sequence data with data simulated using spatially explicit models of coalescence suggests that M. leporina recovered a range and population size close to their current levels at the end of the last glaciation, and confirms recent range expansion as the most likely scenario for M. nigricans. Overall, this study illustrates that both demographic history and ecological factors may have contributed to shape current phylogeographic patterns.
Affiliation(s)
- Simon Dellicour
- Evolutionary Biology and Ecology, Université Libre de Bruxelles, av. FD Roosevelt 50, 1050, Brussels, Belgium
178
Cagliani R, Forni D, Biasin M, Comabella M, Guerini FR, Riva S, Pozzoli U, Agliardi C, Caputo D, Malhotra S, Montalban X, Bresolin N, Clerici M, Sironi M. Ancient and recent selective pressures shaped genetic diversity at AIM2-like nucleic acid sensors. Genome Biol Evol 2015; 6:830-45. [PMID: 24682156 PMCID: PMC4007548 DOI: 10.1093/gbe/evu066] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Indexed: 12/28/2022]
Abstract
AIM2-like receptors (ALRs) are a family of nucleic acid sensors essential for innate immune responses against viruses and bacteria. We performed an evolutionary analysis of ALR genes (MNDA, PYHIN1, IFI16, and AIM2) by analyzing inter- and intraspecies diversity. Maximum-likelihood analyses indicated that IFI16 and AIM2 evolved adaptively in primates, with branch-specific selection at the catarrhini lineage for IFI16. Application of a population genetics–phylogenetics approach also allowed identification of positive selection events in the human lineage. Positive selection in primates targeted sites located at the DNA-binding interface in both IFI16 and AIM2. In IFI16, several sites positively selected in primates and in the human lineage were located in the PYD domain, which is involved in protein–protein interaction and is bound by a human cytomegalovirus immune evasion protein. Finally, positive selection was found to target nuclear localization signals in IFI16 and the spacer region separating the two HIN domains. Population genetic analysis in humans revealed that an IFI16 genic region has been a target of long-standing balancing selection, possibly acting on two nonsynonymous polymorphisms located in the spacer region. Data herein indicate that ALRs have been repeatedly targeted by natural selection. The balancing selection region in IFI16 carries a variant with opposite risk effect for distinct autoimmune diseases, suggesting antagonistic pleiotropy. We propose that the underlying scenario is the result of an ancestral and still ongoing host–pathogen arms race and that the maintenance of susceptibility alleles for autoimmune diseases at IFI16 represents an evolutionary trade-off.
Affiliation(s)
- Rachele Cagliani
- Bioinformatics Laboratory, Scientific Institute IRCCS E. Medea, Bosisio Parini (LC), Italy
179
Manjurano A, Sepulveda N, Nadjm B, Mtove G, Wangai H, Maxwell C, Olomi R, Reyburn H, Riley EM, Drakeley CJ, Clark TG. African glucose-6-phosphate dehydrogenase alleles associated with protection from severe malaria in heterozygous females in Tanzania. PLoS Genet 2015; 11:e1004960. [PMID: 25671784 PMCID: PMC4335500 DOI: 10.1371/journal.pgen.1004960] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Received: 09/05/2014] [Accepted: 12/17/2014] [Indexed: 11/24/2022]
Abstract
X-linked Glucose-6-phosphate dehydrogenase (G6PD) A- deficiency is prevalent in sub-Saharan Africa populations, and has been associated with protection from severe malaria. Whether females and/or males are protected by G6PD deficiency is uncertain, due in part to G6PD and malaria phenotypic complexity and misclassification. Almost all large association studies have genotyped a limited number of G6PD SNPs (e.g. G6PD202 / G6PD376), and this approach has been too blunt to capture the complete epidemiological picture. Here we have identified 68 G6PD polymorphisms and analysed 29 of these (i.e. those with a minor allele frequency greater than 1%) in 983 severe malaria cases and controls in Tanzania. We establish, across a number of SNPs including G6PD376, that only female heterozygotes are protected from severe malaria. Haplotype analysis reveals the G6PD locus to be under balancing selection, suggesting a mechanism of protection relying on alleles at modest frequency and avoiding fixation, where protection provided by G6PD deficiency against severe malaria is offset by increased risk of life-threatening complications. Our study also demonstrates that the much-needed large-scale studies of severe malaria and G6PD enzymatic function across African populations require the identification and analysis of the full repertoire of G6PD genetic markers. Glucose-6-phosphate dehydrogenase (G6PD) is an essential enzyme that protects red blood cells from oxidative damage. Numerous genetic variants of G6PD, residing in the X chromosome, are found among African populations: mutations causing A- deficiency can lead to serious clinical outcomes (including hemolytic anemia) but also confer protection against severe malaria. Epidemiological studies have used some of the genetic markers that cause A- deficiency to establish who is protected from severe malaria, with differing results. 
Whether females, with one or two copies of mutant genes, males with one copy, or both genders are protected is uncertain. This uncertainty is due to G6PD and malaria phenotypic complexity and misclassification, and to genetic differences between populations and the limited numbers of genetic markers (usually 2) considered. In this study we analysed more than 30 G6PD genetic markers in 506 Tanzanian children with severe malaria and 477 without malaria. We found that only females with one normal and one mutant copy of the gene (heterozygotes) were protected from severe malaria. Further, we established that the G6PD gene is under evolutionary pressure with the likely mechanism being selection by malaria. Our work demonstrates that studies of severe malaria and G6PD enzymatic function across African populations require, in addition to complete and accurate G6PD phenotypic classification, the identification and analysis of the full repertoire of G6PD genetic markers.
Affiliation(s)
- Alphaxard Manjurano
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Nuno Sepulveda
- Department of Infection and Immunology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
- Behzad Nadjm
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- George Mtove
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Hannah Wangai
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Caroline Maxwell
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Raimos Olomi
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Hugh Reyburn
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Department of Infection and Immunology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
- Eleanor M. Riley
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Department of Infection and Immunology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
- Christopher J. Drakeley
- Joint Malaria Programme, Kilimanjaro Christian Medical College, Moshi, Tanzania
- Department of Infection and Immunology, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
- Taane G. Clark
- Pathogen Molecular Biology Department, Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
- Department of Infectious Disease Epidemiology, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, United Kingdom
- MalariaGEN Consortium
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
180
Gao Z, Przeworski M, Sella G. Footprints of ancient-balanced polymorphisms in genetic variation data from closely related species. Evolution 2015; 69:431-46. [PMID: 25403856 PMCID: PMC4335603 DOI: 10.1111/evo.12567] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Received: 06/24/2014] [Accepted: 10/28/2014] [Indexed: 01/17/2023]
Abstract
When long-lasting, balancing selection can lead to “trans-species” polymorphisms that are shared by two or more species identical by descent. In such cases, the gene genealogy at the selected site clusters by allele instead of by species, and nearby neutral sites also have unusual genealogies because of linkage. While this scenario is expected to leave discernible footprints in genetic variation data, the specific patterns remain poorly characterized. Motivated by recent findings in primates, we focus on the case of a biallelic polymorphism under ancient balancing selection and derive approximations for summaries of the polymorphism data from two species. Specifically, we characterize the length of the segment that carries most of the footprints, the expected number of shared neutral single nucleotide polymorphisms (SNPs), and the patterns of allelic associations among them. We confirm the accuracy of our approximations by coalescent simulations. We further show that for humans and chimpanzees—more generally, for pairs of species with low genetic diversity levels—these patterns are highly unlikely to be generated by neutral recurrent mutations. We discuss the implications for the design and interpretation of genome scans for ancient balanced polymorphisms in primates and other taxa.
Affiliation(s)
- Ziyue Gao
- Committee on Genetics, Genomics and Systems Biology, University of Chicago, Chicago, Illinois, 60637.
181
Causes of natural variation in fitness: evidence from studies of Drosophila populations. Proc Natl Acad Sci U S A 2015; 112:1662-9. [PMID: 25572964 DOI: 10.1073/pnas.1423275112] [Citation(s) in RCA: 130] [Impact Index Per Article: 13.0] [Indexed: 11/18/2022]
Abstract
DNA sequencing has revealed high levels of variability within most species. Statistical methods based on population genetics theory have been applied to the resulting data and suggest that most mutations affecting functionally important sequences are deleterious but subject to very weak selection. Quantitative genetic studies have provided information on the extent of genetic variation within populations in traits related to fitness and the rate at which variability in these traits arises by mutation. This paper attempts to combine the available information from applications of the two approaches to populations of the fruitfly Drosophila in order to estimate some important parameters of genetic variation, using a simple population genetics model of mutational effects on fitness components. Analyses based on this model suggest the existence of a class of mutations with much larger fitness effects than those inferred from sequence variability and that contribute most of the standing variation in fitness within a population caused by the input of mildly deleterious mutations. However, deleterious mutations explain only part of this standing variation, and other processes such as balancing selection appear to make a large contribution to genetic variation in fitness components in Drosophila.
182
Abstract
Natural selection is expected to drive adaptive evolution in genes involved in host–pathogen interactions. In this study, we use molecular population genetic analyses to understand how natural selection operates on the immune system of Anopheles coluzzii (formerly A. gambiae “M form”). We analyzed patterns of intraspecific and interspecific genetic variation in 20 immune-related genes and 17 nonimmune genes from a wild population of A. coluzzii and asked if patterns of genetic variation in the immune genes are consistent with pathogen-driven selection shaping the evolution of defense. We found evidence of a balanced polymorphism in CTLMA2, which encodes a C-type lectin involved in regulation of the melanization response. The two CTLMA2 haplotypes, which are distinguished by fixed amino acid differences near the predicted peptide cleavage site, are also segregating in the sister species A. gambiae (“S form”) and A. arabiensis. Comparison of the two haplotypes between species indicates that they were not shared among the species through introgression, but rather that they arose before the species divergence and have been adaptively maintained as a balanced polymorphism in all three species. We additionally found that STAT-B, a retroduplicate of STAT-A, shows strong evidence of adaptive evolution that is consistent with neofunctionalization after duplication. In contrast to the striking patterns of adaptive evolution observed in these Anopheles-specific immune genes, we found no evidence of adaptive evolution in the Toll and Imd innate immune pathways that are orthologously conserved throughout insects. Genes encoding the Imd pathway exhibit high rates of amino acid divergence between Anopheles species but also display elevated amino acid diversity that is consistent with relaxed purifying selection. These results indicate that adaptive coevolution between A. coluzzii and its pathogens is more likely to involve novel or lineage-specific molecular mechanisms than the canonical humoral immune pathways.
183
Wang J, Fan C. A neutrality test for detecting selection on DNA methylation using single methylation polymorphism frequency spectrum. Genome Biol Evol 2014; 7:154-71. [PMID: 25539727 PMCID: PMC4316624 DOI: 10.1093/gbe/evu271] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Indexed: 01/06/2023]
Abstract
Inheritable epigenetic mutations (epimutations) can contribute to transmittable phenotypic variation. Thus, epimutations can be subject to natural selection and impact the fitness and evolution of organisms. Based on the framework of the modified Tajima’s D test for DNA mutations, we developed a neutrality test with the statistic “Dm” to detect selection forces on DNA methylation mutations using single methylation polymorphisms. With computer simulation and empirical data analysis, we compared the Dm test with the original and modified Tajima’s D tests and demonstrated that the Dm test is suitable for detecting selection on epimutations and outperforms original/modified Tajima’s D tests. Due to the higher resetting rate of epimutations, the interpretation of Dm on epimutations and Tajima’s D test on DNA mutations could be different in inferring natural selection. Analyses using simulated and empirical genome-wide polymorphism data suggested that genes under genetic and epigenetic selections behaved differently. We applied the Dm test to recently originated Arabidopsis and human genes, and showed that newly evolved genes contain higher level of rare epialleles, suggesting that epimutation may play a role in origination and evolution of genes and genomes. Overall, we demonstrate the utility of the Dm test to detect whether the loci are under selection regarding DNA methylation. Our analytical metrics and methodology could contribute to our understanding of evolutionary processes of genes and genomes in the field of epigenetics. The Perl script for the “Dm” test is available at http://fanlab.wayne.edu/ (last accessed December 18, 2014).
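For orientation, the classic Tajima's D statistic that the Dm test is described as building on can be sketched as follows. This is a minimal illustration of the standard formula for DNA sequence alignments, not the authors' Dm statistic or their Perl script; the function name `tajimas_d` and the input format (a list of aligned strings) are assumptions made for the example.

```python
from collections import Counter
from math import sqrt


def tajimas_d(seqs):
    """Classic Tajima's D for a list of aligned, equal-length sequences.

    Hypothetical helper for illustration only. Returns None when there
    are no segregating sites, in which case D is undefined.
    """
    n = len(seqs)
    if n < 4:
        # With fewer than four sequences the variance term degenerates.
        raise ValueError("need at least four sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to equal length")

    seg_sites = 0  # S: number of segregating (polymorphic) sites
    pi = 0.0       # mean number of pairwise differences
    for column in zip(*seqs):
        counts = Counter(column)
        if len(counts) > 1:
            seg_sites += 1
            # Fraction of the n*(n-1)/2 sequence pairs that differ here.
            pi += (n * n - sum(c * c for c in counts.values())) / (n * (n - 1))

    if seg_sites == 0:
        return None

    # Constants from Tajima (1989), functions of sample size only.
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / (i * i) for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / (a1 * a1)
    e1 = c1 / a1
    e2 = c2 / (a1 * a1 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    # D compares pairwise diversity with Watterson's estimate S / a1.
    return (pi - seg_sites / a1) / sqrt(variance)
```

An excess of rare variants (for example, only singletons) gives a negative D, while variants at intermediate frequency give a positive D, which is the contrast the abstract refers to when it discusses rare epialleles.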
Affiliation(s)
- Jun Wang
- Department of Biological Sciences, Wayne State University
- Chuanzhu Fan
- Department of Biological Sciences, Wayne State University
184
McManus KF, Kelley JL, Song S, Veeramah KR, Woerner AE, Stevison LS, Ryder OA, Ape Genome Project G, Kidd JM, Wall JD, Bustamante CD, Hammer MF. Inference of gorilla demographic and selective history from whole-genome sequence data. Mol Biol Evol 2014; 32:600-12. [PMID: 25534031 PMCID: PMC4327160 DOI: 10.1093/molbev/msu394] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Indexed: 12/25/2022]
Abstract
Although population-level genomic sequence data have been gathered extensively for humans, similar data from our closest living relatives are just beginning to emerge. Examination of genomic variation within great apes offers many opportunities to increase our understanding of the forces that have differentially shaped the evolutionary history of hominid taxa. Here, we expand upon the work of the Great Ape Genome Project by analyzing medium to high coverage whole-genome sequences from 14 western lowland gorillas (Gorilla gorilla gorilla), 2 eastern lowland gorillas (G. beringei graueri), and a single Cross River individual (G. gorilla diehli). We infer that the ancestors of western and eastern lowland gorillas diverged from a common ancestor approximately 261 ka, and that the ancestors of the Cross River population diverged from the western lowland gorilla lineage approximately 68 ka. Using a diffusion approximation approach to model the genome-wide site frequency spectrum, we infer a history of western lowland gorillas that includes an ancestral population expansion of 1.4-fold around 970 ka and a recent 5.6-fold contraction in population size 23 ka. The latter may correspond to a major reduction in African equatorial forests around the Last Glacial Maximum. We also analyze patterns of variation among western lowland gorillas to identify several genomic regions with strong signatures of recent selective sweeps. We find that processes related to taste, pancreatic and saliva secretion, sodium ion transmembrane transport, and cardiac muscle function are overrepresented in genomic regions predicted to have experienced recent positive selection.
Affiliation(s)
- Kimberly F McManus
- Department of Biology, Stanford University; Department of Biomedical Informatics, Stanford University
- Joanna L Kelley
- Department of Genetics, Stanford University; School of Biological Sciences, Washington State University
- Shiya Song
- Department of Computational Medicine & Bioinformatics, University of Michigan
- Laurie S Stevison
- Institute for Human Genetics, University of California San Francisco
- Oliver A Ryder
- San Diego Zoo Institute for Conservation Research, San Diego Zoo Global, Escondido, CA
- Jeffrey M Kidd
- Department of Computational Medicine & Bioinformatics, University of Michigan; Department of Human Genetics, University of Michigan
- Jeffrey D Wall
- Institute for Human Genetics, University of California San Francisco
185
Hedrick PW. Heterozygote Advantage: The Effect of Artificial Selection in Livestock and Pets. J Hered 2014; 106:141-54. [DOI: 10.1093/jhered/esu070] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Indexed: 11/13/2022]
186
Interspecific introgressive origin of genomic diversity in the house mouse. Proc Natl Acad Sci U S A 2014; 112:196-201. [PMID: 25512534 DOI: 10.1073/pnas.1406298111] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Indexed: 01/16/2023]
Abstract
We report on a genome-wide scan for introgression between the house mouse (Mus musculus domesticus) and the Algerian mouse (Mus spretus), using samples from the ranges of sympatry and allopatry in Africa and Europe. Our analysis reveals wide variability in introgression signatures along the genomes, as well as across the samples. We find that fewer than half of the autosomes in each genome harbor all detectable introgression, whereas the X chromosome has none. Further, European mice carry more M. spretus alleles than the sympatric African ones. Using the length distribution and sharing patterns of introgressed genomic tracts across the samples, we infer, first, that at least three distinct hybridization events involving M. spretus have occurred, one of which is ancient, and the other two are recent (one presumably due to warfarin rodenticide selection). Second, several of the inferred introgressed tracts contain genes that are likely to confer adaptive advantage. Third, introgressed tracts might contain driver genes that determine the evolutionary fate of those tracts. Further, functional analysis revealed introgressed genes that are essential to fitness, including the Vkorc1 gene, which is implicated in rodenticide resistance, and olfactory receptor genes. Our findings highlight the extent and role of introgression in nature and call for careful analysis and interpretation of house mouse data in evolutionary and genetic studies.
187
Jordan CY, Connallon T. Sexually antagonistic polymorphism in simultaneous hermaphrodites. Evolution 2014; 68:3555-69. [PMID: 25311368 DOI: 10.1111/evo.12536] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Received: 06/10/2014] [Accepted: 09/09/2014] [Indexed: 12/22/2022]
Abstract
In hermaphrodites, pleiotropic genetic trade-offs between female and male reproductive functions can lead to sexually antagonistic (SA) selection, where individual alleles have conflicting fitness effects on each sex function. Although an extensive theory of SA selection exists for dioecious species, these results have not been generalized to hermaphrodites. We develop population genetic models of SA selection in simultaneous hermaphrodites, and evaluate effects of dominance, selection on each sex function, self-fertilization, and population size on the maintenance of polymorphism. Under obligate outcrossing, hermaphrodite model predictions converge exactly with those of dioecious populations. Self-fertilization in hermaphrodites generates three points of divergence with dioecious theory. First, opportunities for stable polymorphism decline sharply and become less sensitive to dominance with increased selfing. Second, selfing introduces an asymmetry in the relative importance of selection through male versus female reproductive functions, expands the parameter space favorable for the evolutionary invasion of female-beneficial alleles, and restricts invasion criteria for male-beneficial alleles. Finally, contrary to models of unconditionally beneficial alleles, selfing decreases genetic hitchhiking effects of invading SA alleles, and should therefore decrease these population genetic signals of SA polymorphisms. We discuss implications of SA selection in hermaphrodites, including its potential role in the evolution of "selfing syndromes."
Affiliation(s)
- Crispin Y Jordan
- Ashworth Laboratories, Institute of Evolutionary Biology, The University of Edinburgh, Kings Buildings, West Mains Road, Edinburgh, EH9 3JT, United Kingdom.
188
Bergland AO, Behrman EL, O'Brien KR, Schmidt PS, Petrov DA. Genomic evidence of rapid and stable adaptive oscillations over seasonal time scales in Drosophila. PLoS Genet 2014; 10:e1004775. [PMID: 25375361 PMCID: PMC4222749 DOI: 10.1371/journal.pgen.1004775] [Citation(s) in RCA: 355] [Impact Index Per Article: 32.3] [Received: 02/24/2014] [Accepted: 09/24/2014] [Indexed: 01/06/2023]
Abstract
In many species, genomic data have revealed pervasive adaptive evolution indicated by the fixation of beneficial alleles. However, when selection pressures are highly variable along a species' range or through time adaptive alleles may persist at intermediate frequencies for long periods. So called “balanced polymorphisms” have long been understood to be an important component of standing genetic variation, yet direct evidence of the strength of balancing selection and the stability and prevalence of balanced polymorphisms has remained elusive. We hypothesized that environmental fluctuations among seasons in a North American orchard would impose temporally variable selection on Drosophila melanogaster that would drive repeatable adaptive oscillations at balanced polymorphisms. We identified hundreds of polymorphisms whose frequency oscillates among seasons and argue that these loci are subject to strong, temporally variable selection. We show that these polymorphisms respond to acute and persistent changes in climate and are associated in predictable ways with seasonally variable phenotypes. In addition, our results suggest that adaptively oscillating polymorphisms are likely millions of years old, with some possibly predating the divergence between D. melanogaster and D. simulans. Taken together, our results are consistent with a model of balancing selection wherein rapid temporal fluctuations in climate over generational time promotes adaptive genetic diversity at loci underlying polygenic variation in fitness related phenotypes. Herein, we investigate the genomic basis of rapid adaptive evolution in response to seasonal fluctuations in the environment. We identify hundreds of polymorphisms (seasonal SNPs) that undergo dramatic shifts in allele frequency – on average between 40 and 60% – and oscillate between seasons repeatedly over multiple years, likely inducing high levels of genome-wide genetic differentiation. 
We provide evidence that seasonal SNPs are functional, being both sensitive to an acute frost event and associated with two stress tolerance traits. Finally, we show that some seasonal SNPs are possibly ancient balanced polymorphisms. Taken together, our results suggest that environmental heterogeneity can promote the long-term persistence of functional polymorphisms within populations that fuels fast directional adaptive response at any one time.
Affiliation(s)
- Alan O. Bergland
- Department of Biology, Stanford University, Stanford, California, United States of America
- Emily L. Behrman
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Katherine R. O'Brien
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Paul S. Schmidt
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
189
Gatesy J, Springer MS. Phylogenetic analysis at deep timescales: Unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum. Mol Phylogenet Evol 2014; 80:231-66. [DOI: 10.1016/j.ympev.2014.08.013] [Citation(s) in RCA: 239] [Impact Index Per Article: 21.7] [Received: 05/10/2014] [Revised: 07/26/2014] [Accepted: 08/10/2014] [Indexed: 11/16/2022]
190
Terekhanova NV, Logacheva MD, Penin AA, Neretina TV, Barmintseva AE, Bazykin GA, Kondrashov AS, Mugue NS. Fast evolution from precast bricks: genomics of young freshwater populations of threespine stickleback Gasterosteus aculeatus. PLoS Genet 2014; 10:e1004696. [PMID: 25299485 PMCID: PMC4191950 DOI: 10.1371/journal.pgen.1004696] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Received: 11/08/2013] [Accepted: 08/22/2014] [Indexed: 12/03/2022]
Abstract
Adaptation is driven by natural selection; however, many adaptations are caused by weak selection acting over large timescales, complicating its study. Therefore, it is rarely possible to study selection comprehensively in natural environments. The threespine stickleback (Gasterosteus aculeatus) is a well-studied model organism with a short generation time, small genome size, and many genetic and genomic tools available. Within this originally marine species, populations have recurrently adapted to freshwater all over its range. This evolution involved extensive parallelism: pre-existing alleles that adapt sticklebacks to freshwater habitats, but are also present at low frequencies in marine populations, have been recruited repeatedly. While a number of genomic regions responsible for this adaptation have been identified, the details of selection remain poorly understood. Using whole-genome resequencing, we compare pooled genomic samples from marine and freshwater populations of the White Sea basin, and identify 19 short genomic regions that are highly divergent between them, including three known inversions. 17 of these regions overlap protein-coding genes, including a number of genes with predicted functions that are relevant for adaptation to the freshwater environment. We then analyze four additional independently derived young freshwater populations of known ages, two natural and two artificially established, and use the observed shifts of allelic frequencies to estimate the strength of positive selection. Adaptation turns out to be quite rapid, indicating strong selection acting simultaneously at multiple regions of the genome, with selection coefficients of up to 0.27. High divergence between marine and freshwater genotypes, lack of reduction in polymorphism in regions responsible for adaptation, and high frequencies of freshwater alleles observed even in young freshwater populations are all consistent with rapid assembly of G. aculeatus freshwater genotypes from pre-existing genomic regions of adaptive variation, with strong selection that favors this assembly acting simultaneously at multiple loci. Adaptation to novel environments is a keystone of evolution. There is only a handful of natural and experimental systems in which the process of adaptation has been studied in detail, and each studied system brings its own surprises with regard to the number of loci involved, dynamics of adaptation, extent of interactions between loci and of parallelism between different adapting populations. The threespine stickleback is an excellent model organism for evolutionary studies. Marine-derived freshwater populations of this species have consistently acquired a specific set of morphological, physiological and behavioral traits allowing them to reside in freshwater for their whole lifespan. Previous studies identified several genomic regions responsible for this adaptation. Here, using whole-genome sequencing, we compare the allele frequencies at such regions in four derived freshwater populations of known ages: two natural, and two artificially established in 1978. Knowledge of population ages allows us to infer the strength of selection that acted at these loci. Adaptation of threespine stickleback to freshwater is typically fast, and is driven by strong selection favoring pre-existing alleles that are likely present in the ancestral marine population at low frequencies; however, some of the adaptation may also be due to young population-specific alleles.
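The idea of inferring selection strength from an allele-frequency shift in a population of known age can be illustrated with a small sketch. It assumes a deterministic haploid (genic) selection model in which the odds of the favoured allele grow by a factor of (1 + s) per generation, and it ignores drift, dominance and migration. This is not necessarily the estimator used in the study; the function names and the example frequencies are hypothetical.

```python
def estimate_s(p0, pt, generations):
    """Selection coefficient implied by a frequency shift from p0 to pt.

    Assumption: deterministic haploid model, so that
    odds(t) = odds(0) * (1 + s) ** t for the favoured allele.
    """
    if not (0.0 < p0 < 1.0 and 0.0 < pt < 1.0):
        raise ValueError("frequencies must lie strictly between 0 and 1")
    if generations <= 0:
        raise ValueError("generations must be positive")
    odds_ratio = (pt / (1.0 - pt)) / (p0 / (1.0 - p0))
    return odds_ratio ** (1.0 / generations) - 1.0


def project_frequency(p0, s, generations):
    """Forward recursion of the same model, useful as a consistency check."""
    p = p0
    for _ in range(generations):
        # Favoured allele has relative fitness 1 + s; mean fitness is 1 + s * p.
        p = p * (1.0 + s) / (1.0 + s * p)
    return p


# Hypothetical example: a rare allele (5%) under s = 0.2 for 30 generations.
p_end = project_frequency(0.05, 0.2, 30)
s_hat = estimate_s(0.05, p_end, 30)
```

Under this model a starting frequency of a few percent and a coefficient of a few tenths is enough to bring an allele to high frequency within tens of generations, which is the qualitative pattern the abstract describes for young freshwater populations.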
Affiliation(s)
- Nadezhda V. Terekhanova
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - * E-mail: (NVT); (NSM)
- Maria D. Logacheva
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - A. N. Belozersky Institute of Physico-Chemical Biology, M. V. Lomonosov Moscow State University, Moscow, Russia
- Aleksey A. Penin
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - Department of Genetics, Biological faculty, M. V. Lomonosov Moscow State University, Moscow, Russia
- Tatiana V. Neretina
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - White Sea Biological Station, Biological faculty, M. V. Lomonosov Moscow State University, Moscow, Russia
- Anna E. Barmintseva
  - Laboratory of Molecular genetics, Russian Institute of Fisheries and Oceanology, Russian Federal Research Institute of Fisheries and Oceanography, Moscow, Russia
- Georgii A. Bazykin
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - Sector for Molecular Evolution, Institute for Information Transmission Problems of the RAS (Kharkevich Institute), Moscow, Russia
- Alexey S. Kondrashov
  - Department of Bioinformatics and Bioengineering, M. V. Lomonosov Moscow State University, Moscow, Russia
  - Department of Ecology and Evolutionary Biology and Life Sciences Institute, University of Michigan, Ann Arbor, Michigan, United States of America
- Nikolai S. Mugue
  - Laboratory of Molecular genetics, Russian Institute of Fisheries and Oceanology, Russian Federal Research Institute of Fisheries and Oceanography, Moscow, Russia
  - N. K. Koltsov Institute of Developmental Biology RAS, Moscow, Russia
  - * E-mail: (NVT); (NSM)
|
191
|
Ségurel L, Quintana-Murci L. Preserving immune diversity through ancient inheritance and admixture. Curr Opin Immunol 2014; 30:79-84. [DOI: 10.1016/j.coi.2014.08.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Received: 07/05/2014] [Revised: 08/11/2014] [Accepted: 08/12/2014] [Indexed: 10/24/2022]
|
192
|
Sadee W, Hartmann K, Seweryn M, Pietrzak M, Handelman SK, Rempala GA. Missing heritability of common diseases and treatments outside the protein-coding exome. Hum Genet 2014; 133:1199-1215. [PMID: 25107510 PMCID: PMC4169001 DOI: 10.1007/s00439-014-1476-7] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Received: 03/27/2014] [Accepted: 07/23/2014] [Indexed: 02/07/2023]
Abstract
Genetic factors strongly influence risk of common human diseases and treatment outcomes but the causative variants remain largely unknown; this gap has been called the 'missing heritability'. We propose several hypotheses that in combination have the potential to narrow the gap. First, given a multi-stage path from wellness to disease, we propose that common variants under positive evolutionary selection represent normal variation and gate the transition between wellness and an 'off-well' state, revealing adaptations to changing environmental conditions. In contrast, genome-wide association studies (GWAS) focus on deleterious variants conveying disease risk, accelerating the path from off-well to illness and finally specific diseases, while common 'normal' variants remain hidden in the noise. Second, epistasis (dynamic gene-gene interactions) likely assumes a central role in adaptations and evolution; yet, GWAS analyses currently are poorly designed to reveal epistasis. As gene regulation is germane to adaptation, we propose that epistasis among common normal regulatory variants, or between common variants and less frequent deleterious variants, can have strong protective or deleterious phenotypic effects. These gene-gene interactions can be highly sensitive to environmental stimuli and could account for large differences in drug response between individuals. Residing largely outside the protein-coding exome, common regulatory variants affect either transcription of coding and non-coding RNAs (regulatory SNPs, or rSNPs) or RNA functions and processing (structural RNA SNPs, or srSNPs). Third, with the vast majority of causative variants yet to be discovered, GWAS rely on surrogate markers, a confounding factor aggravated by the presence of more than one causative variant per gene and by epistasis. We propose that the confluence of these factors may be responsible to a large extent for the observed heritability gap.
Affiliation(s)
- Wolfgang Sadee
  - Department of Pharmacology, Center for Pharmacogenomics, College of Medicine, The Ohio State University Wexner Medical Center, 5184A Graves Hall, 333 West 10th Avenue, Columbus, OH, 43210, USA
|
193
|
Abstract
Recombination allows different parts of the genome to have different genealogical histories. When a species splits in two, allelic lineages sort into the two descendant species, and this lineage sorting varies along the genome. If speciation events are close in time, the lineage sorting process may be incomplete at the second speciation event and lead to gene genealogies that do not match the species phylogeny. We review different recent approaches to model lineage sorting along the genome and show how it is possible to learn about population sizes, natural selection, and recombination rates in ancestral species from application of these models to genome alignments of great ape species.
Affiliation(s)
- Thomas Mailund
  - Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
|
194
|
New frontiers in the study of human cultural and genetic evolution. Curr Opin Genet Dev 2014; 29:103-9. [PMID: 25218864 DOI: 10.1016/j.gde.2014.08.014] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Received: 06/01/2014] [Revised: 08/25/2014] [Accepted: 08/27/2014] [Indexed: 02/01/2023]
Abstract
In this review, we discuss the dynamic linkages between culture and the genetic evolution of the human species. We begin by briefly describing the framework of gene-culture coevolutionary (or dual-inheritance) models for human evolutionary change. Until recently, the literature on gene-culture coevolution was composed primarily of mathematical models and formalized theory describing the complex dynamics underlying human behavior, adaptation, and technological evolution, but had little empirical support concerning genetics. The rapid progress in the fields of molecular genetics and genomics, however, is now providing the kinds of data needed to produce rich empirical support for gene-culture coevolutionary models. We briefly outline how theoretical and methodological progress in genome sciences has provided ways for the strength of selection on genes to be evaluated, and then outline how evidence of selection on several key genes can be directly linked to human cultural practices. We then describe some exciting new directions in the empirical study of gene-culture coevolution, and conclude with a discussion of the role of gene-culture evolutionary models in the future integration of medical, biological, and social sciences.
|
195
|
Ségurel L, Wyman MJ, Przeworski M. Determinants of Mutation Rate Variation in the Human Germline. Annu Rev Genomics Hum Genet 2014; 15:47-70. [DOI: 10.1146/annurev-genom-031714-125740] [Citation(s) in RCA: 232] [Impact Index Per Article: 21.1] [Indexed: 11/09/2022]
Affiliation(s)
- Laure Ségurel
  - Laboratoire Éco-Anthropologie et Ethnobiologie, UMR 7206, Muséum National d'Histoire Naturelle–Centre National de la Recherche Scientifique–Université Paris 7 Diderot, Paris 75231, France
- Minyoung J. Wyman
  - Department of Biological Sciences, Columbia University, New York, NY 10027
- Molly Przeworski
  - Department of Human Genetics and Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois 60637
|
196
|
Key FM, Teixeira JC, de Filippo C, Andrés AM. Advantageous diversity maintained by balancing selection in humans. Curr Opin Genet Dev 2014; 29:45-51. [PMID: 25173959 DOI: 10.1016/j.gde.2014.08.001] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Received: 05/31/2014] [Revised: 07/30/2014] [Accepted: 08/02/2014] [Indexed: 11/16/2022]
Abstract
Most human polymorphisms are neutral or slightly deleterious, but some genetic variation is advantageous and maintained in populations by balancing selection. Considered a rarity and overlooked for years, balanced polymorphisms have recently received renewed attention with several lines of evidence showing their relevance in human evolution. From theoretical work on its role in adaptation to empirical studies that identify its targets, recent developments have shown that balancing selection is more prevalent than previously thought. Here we review these developments and discuss their implications for our understanding of the influence of balancing selection in human evolution. We also review existing evidence on the biological functions that benefit most from advantageous diversity, and the functional consequences of these variants. Overall, we argue that balancing selection must be considered an important selective force in human evolution.
Affiliation(s)
- Felix M Key
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- João C Teixeira
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Cesare de Filippo
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Aida M Andrés
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
|
197
|
DeGiorgio M, Lohmueller KE, Nielsen R. A model-based approach for identifying signatures of ancient balancing selection in genetic data. PLoS Genet 2014; 10:e1004561. [PMID: 25144706 PMCID: PMC4140648 DOI: 10.1371/journal.pgen.1004561] [Citation(s) in RCA: 112] [Impact Index Per Article: 10.2] [Received: 05/31/2013] [Accepted: 06/26/2014] [Indexed: 01/19/2023] Open
Abstract
While much effort has focused on detecting positive and negative directional selection in the human genome, relatively little work has been devoted to balancing selection. This lack of attention is likely due to the paucity of sophisticated methods for identifying sites under balancing selection. Here we develop two composite likelihood ratio tests for detecting balancing selection. Using simulations, we show that these methods outperform competing methods under a variety of assumptions and demographic models. We apply the new methods to whole-genome human data, and find a number of previously-identified loci with strong evidence of balancing selection, including several HLA genes. Additionally, we find evidence for many novel candidates, the strongest of which is FANK1, an imprinted gene that suppresses apoptosis, is expressed during meiosis in males, and displays marginal signs of segregation distortion. We hypothesize that balancing selection acts on this locus to stabilize the segregation distortion and negative fitness effects of the distorter allele. Thus, our methods are able to reproduce many previously-hypothesized signals of balancing selection, as well as discover novel interesting candidates.
In the past, balancing selection was a topic of great theoretical interest that received much attention. However, there has been little focus toward developing methods to identify regions of the genome that are under balancing selection. In this article, we present the first set of likelihood-based methods that explicitly model the spatial distribution of polymorphism expected near a site under long-term balancing selection. Simulation results show that our methods outperform commonly-used summary statistics for identifying regions under balancing selection. Finally, we performed a scan for balancing selection in Africans and Europeans using our new methods and identified a gene called FANK1 as our top candidate outside the HLA region. We hypothesize that the maintenance of polymorphism at FANK1 is the result of segregation distortion.
Affiliation(s)
- Michael DeGiorgio
  - Department of Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Kirk E. Lohmueller
  - Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
- Rasmus Nielsen
  - Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
  - Department of Statistics, University of California, Berkeley, Berkeley, California, United States of America
  - Department of Biology, University of Copenhagen, Copenhagen, Denmark
|
198
|
Wang M, Huang X, Li R, Xu H, Jin L, He Y. Detecting recent positive selection with high accuracy and reliability by conditional coalescent tree. Mol Biol Evol 2014; 31:3068-80. [PMID: 25135945 DOI: 10.1093/molbev/msu244] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Indexed: 12/14/2022] Open
Abstract
Studies of natural selection, followed by functional validation, are shedding light on understanding of genetic mechanisms underlying human evolution and adaptation. Classic methods for detecting selection, such as the integrated haplotype score (iHS) and Fay and Wu's H statistic, are useful for candidate gene searching underlying positive selection. These methods, however, have limited capability to localize causal variants in selection target regions. In this study, we developed a novel method based on conditional coalescent tree to detect recent positive selection by counting unbalanced mutations on coalescent gene genealogies. Extensive simulation studies revealed that our method is more robust than many other approaches against biases due to various demographic effects, including population bottleneck, expansion, or stratification, while not sacrificing its power. Furthermore, our method demonstrated its superiority in localizing causal variants from massive linked genetic variants. The rate of successful localization was about 20-40% higher than that of other state-of-the-art methods on simulated data sets. On empirical data, validated functional causal variants of four well-known positive selected genes were all successfully localized by our method, such as ADH1B, MCM6, APOL1, and HBB. Finally, the computational efficiency of this new method was much higher than that of iHS implementations, that is, 24-66 times faster than the REHH package, and more than 10,000 times faster than the original iHS implementation. These magnitudes make our method suitable for applying on large sequencing data sets. Software can be downloaded from https://github.com/wavefancy/scct.
Affiliation(s)
- Minxian Wang
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
- Xin Huang
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
- Ran Li
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
- Hongyang Xu
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
- Li Jin
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
  - State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Yungang He
  - Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
  - Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
|
199
|
Abstract
About 2% of human genetic polymorphisms have been hypothesized to arise via multinucleotide mutations (MNMs), complex events that generate SNPs at multiple sites in a single generation. MNMs have the potential to accelerate the pace at which single genes evolve and to confound studies of demography and selection that assume all SNPs arise independently. In this paper, we examine clustered mutations that are segregating in a set of 1092 human genomes, demonstrating that the signature of MNM becomes enriched as large numbers of individuals are sampled. We estimate the percentage of linked SNP pairs that were generated by simultaneous mutation as a function of the distance between affected sites and show that MNMs exhibit a high percentage of transversions relative to transitions, findings that are reproducible in data from multiple sequencing platforms and cannot be attributed to sequencing error. Among tandem mutations that occur simultaneously at adjacent sites, we find an especially skewed distribution of ancestral and derived alleles, with GC → AA, GA → TT, and their reverse complements making up 27% of the total. These mutations have been previously shown to dominate the spectrum of the error-prone polymerase Pol ζ, suggesting that low-fidelity DNA replication by Pol ζ is at least partly responsible for the MNMs that are segregating in the human population. We develop statistical estimates of MNM prevalence that can be used to correct phylogenetic and population genetic inferences for the presence of complex mutations.
Affiliation(s)
- Kelley Harris
  - Department of Mathematics, University of California Berkeley, Berkeley, California 94703, USA
- Rasmus Nielsen
  - Department of Integrative Biology, University of California Berkeley, Berkeley, California 94703, USA
  - Department of Statistics, University of California Berkeley, Berkeley, California 94703, USA
  - Center for Bioinformatics, University of Copenhagen, 2200 Copenhagen, Denmark
|
200
|
Abstract
The great ape families are the species most closely related to our own, comprising chimpanzees, bonobos, gorillas, and orangutans. They live exclusively in tropical rainforests in Central Africa and the islands of Southeast Asia. Due to their close evolutionary relationship with humans, great apes share many cognitive, physiological, and morphological similarities with humans. The members of the great ape family make obvious models to facilitate the further understanding about humans' biology and history. This review will discuss how the recent addition of genome-wide data from great apes has furthered humans' understanding of these species and humanity, especially in the realm of evolutionary genetics.
|