151
|
Onuki R, Yamaguchi R, Shibuya T, Kanehisa M, Goto S. Revealing phenotype-associated functional differences by genome-wide scan of ancient haplotype blocks. PLoS One 2017; 12:e0176530. [PMID: 28445522 PMCID: PMC5406033 DOI: 10.1371/journal.pone.0176530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 04/12/2017] [Indexed: 11/18/2022] Open
Abstract
Genome-wide scans for positive selection have become important for genomic medicine, and many studies aim to find genomic regions affected by positive selection that are associated with risk allele variations among populations. Most such studies are designed to detect recent positive selection. However, we hypothesize that ancient positive selection is also important for adaptation to pathogens, and has affected current immune-mediated common diseases. Based on this hypothesis, we developed a novel linkage disequilibrium-based pipeline, which aims to detect regions associated with ancient positive selection across populations from single nucleotide polymorphism (SNP) data. By applying this pipeline to the genotypes in the International HapMap project database, we show that genes in the detected regions are enriched in pathways related to the immune system and infectious diseases. The detected regions also contain SNPs reported to be associated with cancers and metabolic diseases, obesity-related traits, type 2 diabetes, and allergic sensitization. These SNPs were further mapped to biological pathways to determine the associations between phenotypes and molecular functions. Assessments of candidate regions to identify functions associated with variations in incidence rates of these diseases are needed in the future.
Collapse
Affiliation(s)
- Ritsuko Onuki
- Bioinformatics Team, Advanced Analysis Center, National Agriculture and Food Research Organization (NARO), 2-1-2 Kannondai, Tsukuba, Ibaraki, Japan
| | - Rui Yamaguchi
- Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, Japan
| | - Tetsuo Shibuya
- Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, Japan
| | - Minoru Kanehisa
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| | - Susumu Goto
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
- * E-mail:
| |
Collapse
|
152
|
Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 2017; 100:635-649. [PMID: 28366442 DOI: 10.1016/j.ajhg.2017.03.004] [Citation(s) in RCA: 889] [Impact Index Per Article: 111.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2016] [Accepted: 03/10/2017] [Indexed: 01/10/2023] Open
Abstract
The vast majority of genome-wide association studies (GWASs) are performed in Europeans, and their transferability to other populations is dependent on many factors (e.g., linkage disequilibrium, allele frequencies, genetic architecture). As medical genomics studies become increasingly large and diverse, gaining insights into population history and consequently the transferability of disease risk measurement is critical. Here, we disentangle recent population history in the widely used 1000 Genomes Project reference panel, with an emphasis on populations underrepresented in medical studies. To examine the transferability of single-ancestry GWASs, we used published summary statistics to calculate polygenic risk scores for eight well-studied phenotypes. We identify directional inconsistencies in all scores; for example, height is predicted to decrease with genetic distance from Europeans, despite robust anthropological evidence that West Africans are as tall as Europeans on average. To gain deeper quantitative insights into GWAS transferability, we developed a complex trait coalescent-based simulation framework considering effects of polygenicity, causal allele frequency divergence, and heritability. As expected, correlations between true and inferred risk are typically highest in the population from which summary statistics were derived. We demonstrate that scores inferred from European GWASs are biased by genetic drift in other populations even when choosing the same causal variants and that biases in any direction are possible and unpredictable. This work cautions that summarizing findings from large-scale GWASs may have limited portability to other populations using standard approaches and highlights the need for generalized risk prediction methods and the inclusion of more diverse individuals in medical genomics.
Collapse
|
153
|
Quach H, Quintana-Murci L. Living in an adaptive world: Genomic dissection of the genus Homo and its immune response. J Exp Med 2017; 214:877-894. [PMID: 28351985 PMCID: PMC5379985 DOI: 10.1084/jem.20161942] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2016] [Revised: 02/14/2017] [Accepted: 03/06/2017] [Indexed: 12/14/2022] Open
Abstract
More than a decade after the sequencing of the human genome, a deluge of genome-wide population data are generating a portrait of human genetic diversity at an unprecedented level of resolution. Genomic studies have provided new insight into the demographic and adaptive history of our species, Homo sapiens, including its interbreeding with other hominins, such as Neanderthals, and the ways in which natural selection, in its various guises, has shaped genome diversity. These studies, combined with functional genomic approaches, such as the mapping of expression quantitative trait loci, have helped to identify genes, functions, and mechanisms of prime importance for host survival and involved in phenotypic variation and differences in disease risk. This review summarizes new findings in this rapidly developing field, focusing on the human immune response. We discuss the importance of defining the genetic and evolutionary determinants driving immune response variation, and highlight the added value of population genomic approaches in settings relevant to immunity and infection.
Collapse
Affiliation(s)
- Hélène Quach
- Human Evolutionary Genetics Unit, Department of Genomes and Genetics, Institut Pasteur, 75015 Paris, France.,Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, 75015 Paris, France.,Centre National de la Recherche Scientifique, URA3012, 75015 Paris, France
| | - Lluis Quintana-Murci
- Human Evolutionary Genetics Unit, Department of Genomes and Genetics, Institut Pasteur, 75015 Paris, France .,Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, 75015 Paris, France.,Centre National de la Recherche Scientifique, URA3012, 75015 Paris, France
| |
Collapse
|
154
|
Fan S, Hansen MEB, Lo Y, Tishkoff SA. Going global by adapting local: A review of recent human adaptation. Science 2017; 354:54-59. [PMID: 27846491 DOI: 10.1126/science.aaf5098] [Citation(s) in RCA: 210] [Impact Index Per Article: 26.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The spread of modern humans across the globe has led to genetic adaptations to diverse local environments. Recent developments in genomic technologies, statistical analyses, and expanded sampled populations have led to improved identification and fine-mapping of genetic variants associated with adaptations to regional living conditions and dietary practices. Ongoing efforts in sequencing genomes of indigenous populations, accompanied by the growing availability of "-omics" and ancient DNA data, promises a new era in our understanding of recent human evolution and the origins of variable traits and disease risks.
Collapse
Affiliation(s)
- Shaohua Fan
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Matthew E B Hansen
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Yancy Lo
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA. .,Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
155
|
Powers RE, Gaudet R, Sotomayor M. A Partial Calcium-Free Linker Confers Flexibility to Inner-Ear Protocadherin-15. Structure 2017; 25:482-495. [PMID: 28238533 DOI: 10.1016/j.str.2017.01.014] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 01/16/2017] [Accepted: 01/30/2017] [Indexed: 12/30/2022]
Abstract
Tip links of the inner ear are protein filaments essential for hearing and balance. Two atypical cadherins, cadherin-23 and protocadherin-15, interact in a Ca2+-dependent manner to form tip links. The largely unknown structure and mechanics of these proteins are integral to understanding how tip links pull on ion channels to initiate sensory perception. Protocadherin-15 has 11 extracellular cadherin (EC) repeats. Its EC3-4 linker lacks several of the canonical Ca2+-binding residues, and contains an aspartate-to-alanine polymorphism (D414A) under positive selection in East Asian populations. We present structures of protocadherin-15 EC3-5 featuring two Ca2+-binding linker regions: canonical EC4-5 linker binding three Ca2+ ions, and non-canonical EC3-4 linker binding only two Ca2+ ions. Our structures and biochemical assays reveal little difference between the D414 and D414A variants. Simulations predict that the partial Ca2+-free EC3-4 linker exhibits increased flexural flexibility without compromised mechanical strength, providing insight into the dynamics of tip links and other atypical cadherins.
Collapse
Affiliation(s)
- Robert E Powers
- Department of Molecular and Cellular Biology, Harvard University, 52 Oxford Street, Cambridge, MA 02138, USA; Biophysics Graduate Program, Harvard University, Boston, MA 02115, USA
| | - Rachelle Gaudet
- Department of Molecular and Cellular Biology, Harvard University, 52 Oxford Street, Cambridge, MA 02138, USA.
| | - Marcos Sotomayor
- Department of Chemistry and Biochemistry, The Ohio State University, 484 West 12(th) Avenue, Columbus, OH 43210, USA.
| |
Collapse
|
156
|
Park L. Evidence of Recent Intricate Adaptation in Human Populations. PLoS One 2016; 11:e0165870. [PMID: 27992444 PMCID: PMC5167553 DOI: 10.1371/journal.pone.0165870] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 10/19/2016] [Indexed: 11/18/2022] Open
Abstract
Recent human adaptations have shaped population differentiation in genomic regions containing putative functional variants, mostly located in predicted regulatory elements. However, their actual functionalities and the underlying mechanism of recent adaptation remain poorly understood. In the current study, regions of genes and repeats were investigated for functionality depending on the degree of population differentiation, FST or ΔDAF (a difference in derived allele frequency). The high FST in the 5´ or 3´ untranslated regions (UTRs), in particular, confirmed that population differences arose mainly from differences in regulation. Expression quantitative trait loci (eQTL) analyses using lymphoblastoid cell lines indicated that the majority of the highly population-specific regions represented cis- and/or trans-eQTL. However, groups having the highest ΔDAFs did not necessarily have higher proportions of eQTL variants; in these groups, the patterns were complex, indicating recent intricate adaptations. The results indicated that East Asian (EAS) and European populations (EUR) experienced mutual selection pressures. The mean derived allele frequency of the high ΔDAF groups suggested that EAS and EUR underwent strong adaptation; however, the African population in Africa (AFR) experienced slight, yet broad, adaptation. The DAF distributions of variants in the gene regions showed clear selective pressure in each population, which implies the existence of more recent regulatory adaptations in cells other than lymphoblastoid cell lines. In-depth analysis of population-differentiated regions indicated that the coding gene, RNF135, represented a trans-regulation hotspot via cis-regulation by the population-specific variants in the region of selective sweep. Together, the results provide strong evidence of actual intricate adaptation of human populations via regulatory manipulation.
Collapse
Affiliation(s)
- Leeyoung Park
- Natural Science Research Institute, Yonsei University, Seoul, Korea
- * E-mail:
| |
Collapse
|
157
|
Genome-Wide Detection of Selective Signatures in Chicken through High Density SNPs. PLoS One 2016; 11:e0166146. [PMID: 27820849 PMCID: PMC5098818 DOI: 10.1371/journal.pone.0166146] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2016] [Accepted: 10/24/2016] [Indexed: 11/25/2022] Open
Abstract
Chicken is recognized as an excellent model for studies of genetic mechanism of phenotypic and genomic evolution, with large effective population size and strong human-driven selection. In the present study, we performed Extended Haplotype Homozygosity (EHH) tests to identify significant core regions employing 600K SNP Chicken chip in an F2 population of 1,534 hens, which was derived from reciprocal crosses between White Leghorn and Dongxiang chicken. Results indicated that a total of 49,151 core regions with an average length of 9.79 Kb were identified, which occupied approximately 52.15% of genome across all autosomes, and 806 significant core regions attracted us mostly. Genes in candidate regions may experience positive selection and were considered to have possible influence on beneficial economic traits. A panel of genes including AASDHPPT, GDPD5, PAR3, SOX6, GPC1 and a signal pathway of AKT1 were detected with the most extreme P-values. Further enrichment analyses indicated that these genes were associated with immune function, sensory organ development and neurogenesis, and may have experienced positive selection in chicken. Moreover, some of core regions exactly overlapped with genes excavated in our previous GWAS, suggesting that these genes have undergone positive selection may affect egg production. Findings in our study could draw a comparatively integrate genome-wide map of selection signature in the chicken genome, and would be worthy for explicating the genetic mechanisms of phenotypic diversity in poultry breeding.
Collapse
|
158
|
Yuan Z, Liu E, Liu Z, Kijas JW, Zhu C, Hu S, Ma X, Zhang L, Du L, Wang H, Wei C. Selection signature analysis reveals genes associated with tail type in Chinese indigenous sheep. Anim Genet 2016; 48:55-66. [PMID: 27807880 DOI: 10.1111/age.12477] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/09/2016] [Indexed: 01/19/2023]
Abstract
Fat-tailed sheep have commercial value because consumers prefer high-protein and low-fat food and producers care about feed conversion rate. However, fat-tailed sheep still have some scientific significance, as the fat tail is commonly regarded as a characteristic of environmental adaptability. Finding the candidate genes associated with fat tail formation is essential for breeding and conservation. To identify these candidate genes, we applied FST and hapFLK approaches in fat- and thin-tailed sheep with available 50K SNP genotype data. These two methods found 6.24 Mb of overlapped regions and 43 genes that may associated with fat tail development. Gene annotation showed that HOXA11, BMP2, PPP1CC, SP3, SP9, WDR92, PROKR1 and ETAA1 may play important roles in fat tail formation. These findings provide insight into tail fat development and a guide for molecular breeding and conservation.
Collapse
Affiliation(s)
- Z Yuan
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - E Liu
- School of Life Sciences, Capital Normal University, Beijing, China
| | - Z Liu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - J W Kijas
- CSIRO Agriculture Flagship, Brisbane, Australia
| | - C Zhu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - S Hu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - X Ma
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - L Zhang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - L Du
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - H Wang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.,Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - C Wei
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| |
Collapse
|
159
|
González-Rodríguez A, Munilla S, Mouresan EF, Cañas-Álvarez JJ, Díaz C, Piedrafita J, Altarriba J, Baro JÁ, Molina A, Varona L. On the performance of tests for the detection of signatures of selection: a case study with the Spanish autochthonous beef cattle populations. Genet Sel Evol 2016; 48:81. [PMID: 27793093 PMCID: PMC5084421 DOI: 10.1186/s12711-016-0258-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 10/18/2016] [Indexed: 01/05/2023] Open
Abstract
Background Procedures for the detection of signatures of selection can be classified according to the source of information they use to reject the null hypothesis of absence of selection. Three main groups of tests can be identified that are based on: (1) the analysis of the site frequency spectrum, (2) the study of the extension of the linkage disequilibrium across the length of the haplotypes that surround the polymorphism, and (3) the differentiation among populations. The aim of this study was to compare the performance of a subset of these procedures by using a dataset on seven Spanish autochthonous beef cattle populations. Results Analysis of the correlations between the logarithms of the statistics that were obtained by 11 tests for detecting signatures of selection at each single nucleotide polymorphism confirmed that they can be clustered into the three main groups mentioned above. A factor analysis summarized the results of the 11 tests into three canonical axes that were each associated with one of the three groups. Moreover, the signatures of selection identified with the first and second groups of tests were shared across populations, whereas those with the third group were more breed-specific. Nevertheless, an enrichment analysis identified the metabolic pathways that were associated with each group; they coincided with canonical axes and were related to immune response, muscle development, protein biosynthesis, skin and pigmentation, glucose metabolism, fat metabolism, embryogenesis and morphology, heart and uterine metabolism, regulation of the hypothalamic–pituitary–thyroid axis, hormonal, cellular cycle, cell signaling and extracellular receptors. Conclusions We show that the results of the procedures used to identify signals of selection differed substantially between the three groups of tests. However, they can be classified using a factor analysis. Moreover, each canonical factor that coincided with a group of tests identified different signals of selection, which could be attributed to processes of selection that occurred at different evolutionary times. Nevertheless, the metabolic pathways that were associated with each group of tests were similar, which suggests that the selection events that occurred during the evolutionary history of the populations probably affected the same group of traits. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0258-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Sebastián Munilla
- Departamento de Anatomía, Embriología y Genética, Universidad de Zaragoza, 50013, Saragossa, Spain.,Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, 1417, Buenos Aires, Argentina
| | - Elena F Mouresan
- Departamento de Anatomía, Embriología y Genética, Universidad de Zaragoza, 50013, Saragossa, Spain
| | - Jhon J Cañas-Álvarez
- Grup de Recerca en Remugants, Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, 08193, Bellaterra, Barcelona, Spain
| | - Clara Díaz
- Departamento de Mejora Genética Animal, INIA, 28040, Madrid, Spain
| | - Jesús Piedrafita
- Grup de Recerca en Remugants, Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, 08193, Bellaterra, Barcelona, Spain
| | - Juan Altarriba
- Departamento de Anatomía, Embriología y Genética, Universidad de Zaragoza, 50013, Saragossa, Spain.,Instituto Agroalimentario de Aragón (IA2), 50013, Saragossa, Spain
| | - Jesús Á Baro
- Departamento de Ciencias Agroforestales, Universidad de Valladolid, 34004, Palencia, Spain
| | | | - Luis Varona
- Departamento de Anatomía, Embriología y Genética, Universidad de Zaragoza, 50013, Saragossa, Spain. .,Instituto Agroalimentario de Aragón (IA2), 50013, Saragossa, Spain.
| |
Collapse
|
160
|
Kita R, Fraser HB. Local Adaptation of Sun-Exposure-Dependent Gene Expression Regulation in Human Skin. PLoS Genet 2016; 12:e1006382. [PMID: 27760139 PMCID: PMC5070784 DOI: 10.1371/journal.pgen.1006382] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Accepted: 09/23/2016] [Indexed: 12/29/2022] Open
Abstract
Sun-exposure is a key environmental variable in the study of human evolution. Several skin-pigmentation genes serve as classical examples of positive selection, suggesting that sun-exposure has significantly shaped worldwide genomic variation. Here we investigate the interaction between genetic variation and sun-exposure, and how this impacts gene expression regulation. Using RNA-Seq data from 607 human skin samples, we identified thousands of transcripts that are differentially expressed between sun-exposed skin and non-sun-exposed skin. We then tested whether genetic variants may influence each individual’s gene expression response to sun-exposure. Our analysis revealed 10 sun-exposure-dependent gene expression quantitative trait loci (se-eQTLs), including genes involved in skin pigmentation (SLC45A2) and epidermal differentiation (RASSF9). The allele frequencies of the RASSF9 se-eQTL across diverse populations correlate with the magnitude of solar radiation experienced by these populations, suggesting local adaptation to varying levels of sunlight. These results provide the first examples of sun-exposure-dependent regulatory variation and suggest that this variation has contributed to recent human adaptation. Varying levels of sun-exposure across the world have significantly shaped human evolution. Previous analyses have found several skin pigmentation genes with evidence of strong evolutionary pressures throughout human history, manifesting as large differences in the frequency of genomic variants across populations. But even within populations, individuals respond differently to sun-exposure, suggesting variation in addition to the major differences in skin pigmentation across populations. Here we investigated whether genetic variants associate with response to sun-exposure within Europeans. To measure the response we analyzed gene expression in sun-exposed and non-sun-exposed skin, and identified ten genetic variants that associated with the sun-exposure response of nearby genes. One of these genetic variants, which associated with the sun-exposure response of the gene RASSF9, showed evidence of adaptation in humans in response to solar radiation. Together this evidence suggests that the regulation of gene expression is influenced by sun-exposure and that the sun-exposure dependent effect on RASSF9 expression may have had an effect on human fitness. To our knowledge, this is the first example of an environment-dependent regulatory variant with evidence of adaptation in humans.
Collapse
Affiliation(s)
- Ryosuke Kita
- Department of Biology, Stanford University, Stanford California
| | - Hunter B. Fraser
- Department of Biology, Stanford University, Stanford California
- * E-mail:
| |
Collapse
|
161
|
Identification of selective sweeps reveals divergent selection between Chinese Holstein and Simmental cattle populations. Genet Sel Evol 2016; 48:76. [PMID: 27716022 PMCID: PMC5054554 DOI: 10.1186/s12711-016-0254-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 09/26/2016] [Indexed: 12/23/2022] Open
Abstract
Background The identification of signals left by recent positive selection provides a feasible approach for targeting genomic variants that underlie complex traits and fitness. A better understanding of the selection mechanisms that occurred during the evolution of species can also be gained. In this study, we simultaneously detected the genome-wide footprints of recent positive selection that occurred within and between Chinese Holstein and Simmental populations, which have been subjected to artificial selection for distinct purposes. We conducted analyses using various complementary approaches, including LRH, XP-EHH and FST, based on the Illumina 770K high-density single nucleotide polymorphism (SNP) array, to enable more comprehensive detection. Results We successfully constructed profiles of selective signals in both cattle populations. To further annotate these regions, we identified a set of novel functional genes related to growth, reproduction, immune response and milk production. There were no overlapping candidate windows between the two breeds. Finally, we investigated the distribution of SNPs that had low FST values across five distinct functional regions in the genome. In the low-minor allele frequency bin, we found a higher proportion of low-FST SNPs in the exons of the bovine genome, which indicates strong purifying selection of the exons. Conclusions The selection signatures identified in these two populations demonstrated positive selection pressure on a set of important genes with potential functions that are involved in many biological processes. We also demonstrated that in the bovine genome, exons were under strong purifying selection. Our findings provide insight into the mechanisms of artificial selection and will facilitate follow-up functional studies of potential candidate genes that are related to various economically important traits in cattle. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0254-5) contains supplementary material, which is available to authorized users.
Collapse
|
162
|
Sulovari A, Chen YH, Hudziak JJ, Li D. Atlas of human diseases influenced by genetic variants with extreme allele frequency differences. Hum Genet 2016; 136:39-54. [PMID: 27699474 DOI: 10.1007/s00439-016-1734-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Accepted: 09/27/2016] [Indexed: 12/22/2022]
Abstract
Genetic variants with extreme allele frequency differences (EAFD) may underlie some human health disparities across populations. To identify EAFD loci, we systematically analyzed and characterized 81 million genomic variants from 2504 unrelated individuals of 26 world populations (phase III of the 1000 Genomes Project). Our analyses revealed a total of 434 genes, 15 pathways, and 18 diseases and traits influenced by EAFD variants from five continental populations. They included known EAFD genes, such as LCT (lactose tolerance), SLC24A5 (skin pigmentation), and EDAR (hair morphology). We found many novel EAFD genes, including TBC1D2B (autophagy mediator), TRIM40 (gastrointestinal inflammatory regulator), KRT71, KRT75, KRT83, and KRTAP10-1 (hair and epithelial keratin synthesis), PIK3R3 (insulin receptor interaction), DARS (neurological disorders), and NACA2 (skin inflammatory response). Our results also showed four complex diseases significantly associated with EAFD loci, including asthma (adjusted enrichment P = 4 × 10-8), type I diabetes (P = 6 × 10-9), alcohol consumption (P = 0.0002), and attention deficit/hyperactivity disorder (P = 0.003). This study provides a comprehensive atlas of genes, pathways, and human diseases significantly influenced by EAFD variants.
Collapse
Affiliation(s)
- Arvis Sulovari
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT, 05405, USA
| | - Yolanda H Chen
- Department of Plant and Soil Science, University of Vermont, Burlington, VT, 05405, USA
| | - James J Hudziak
- Vermont Center for Children, Youth, and Families, Department of Psychiatry, University of Vermont, Burlington, VT, 05405, USA
| | - Dawei Li
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT, 05405, USA. .,Department of Computer Science, University of Vermont, Burlington, VT, 05405, USA. .,Neuroscience, Behavior, and Health Initiative, University of Vermont, Burlington, VT, 05405, USA.
| |
Collapse
|
163
|
Xiang-Yu J, Yang Z, Tang K, Li H. Revisiting the false positive rate in detecting recent positive selection. QUANTITATIVE BIOLOGY 2016. [DOI: 10.1007/s40484-016-0077-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
164
|
Epstein B, Jones M, Hamede R, Hendricks S, McCallum H, Murchison EP, Schönfeld B, Wiench C, Hohenlohe P, Storfer A. Rapid evolutionary response to a transmissible cancer in Tasmanian devils. Nat Commun 2016; 7:12684. [PMID: 27575253 PMCID: PMC5013612 DOI: 10.1038/ncomms12684] [Citation(s) in RCA: 140] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 07/20/2016] [Indexed: 01/16/2023] Open
Abstract
Although cancer rarely acts as an infectious disease, a recently emerged transmissible cancer in Tasmanian devils (Sarcophilus harrisii) is virtually 100% fatal. Devil facial tumour disease (DFTD) has swept across nearly the entire species' range, resulting in localized declines exceeding 90% and an overall species decline of more than 80% in less than 20 years. Despite epidemiological models that predict extinction, populations in long-diseased sites persist. Here we report rare genomic evidence of a rapid, parallel evolutionary response to strong selection imposed by a wildlife disease. We identify two genomic regions that contain genes related to immune function or cancer risk in humans that exhibit concordant signatures of selection across three populations. DFTD spreads between hosts by suppressing and evading the immune system, and our results suggest that hosts are evolving immune-modulated resistance that could aid in species persistence in the face of this devastating disease.
Collapse
Affiliation(s)
- Brendan Epstein
- School of Biological Sciences, Washington State University, Pullman, Washington 99164-4236, USA
| | - Menna Jones
- School of Zoology, University of Tasmania, Private Bag 5, Hobart, Tasmania 7001, Australia
| | - Rodrigo Hamede
- School of Zoology, University of Tasmania, Private Bag 5, Hobart, Tasmania 7001, Australia
| | - Sarah Hendricks
- Department of Biological Sciences, Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive, Moscow, Idaho 83844-3051, USA
| | - Hamish McCallum
- School of Environment, Griffith University, Nathan Campus, 170 Kessels Road, Nathan, Queensland 4111, Australia
| | - Elizabeth P. Murchison
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Barbara Schönfeld
- School of Zoology, University of Tasmania, Private Bag 5, Hobart, Tasmania 7001, Australia
| | - Cody Wiench
- Department of Biological Sciences, Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive, Moscow, Idaho 83844-3051, USA
| | - Paul Hohenlohe
- Department of Biological Sciences, Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive, Moscow, Idaho 83844-3051, USA
| | - Andrew Storfer
- School of Biological Sciences, Washington State University, Pullman, Washington 99164-4236, USA
| |
Collapse
|
165
|
Torruella-Loran I, Laayouni H, Dobon B, Gallego A, Balcells I, Garcia-Ramallo E, Espinosa-Parrilla Y. MicroRNA Genetic Variation: From Population Analysis to Functional Implications of Three Allele Variants Associated with Cancer. Hum Mutat 2016; 37:1060-73. [PMID: 27397105 DOI: 10.1002/humu.23045] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Revised: 05/25/2016] [Indexed: 12/31/2022]
Abstract
Nucleotide variants in microRNA regions have been associated with disease; nevertheless, few studies still have addressed the allele-dependent effect of these changes. We studied microRNA genetic variation in human populations and found that while low-frequency variants accumulate indistinctly in microRNA regions, the mature and seed regions tend to be depleted of high-frequency variants, probably as a result of purifying selection. Comparison of pairwise population fixation indexes among regions showed that the seed had higher population fixation indexes than the other regions, suggesting the existence of local adaptation in the seed region. We further performed functional studies of three microRNA variants associated with cancer (rs2910164:C > G in MIR146A, rs11614913:C > T in MIR196A2, and rs3746444:A > G in both MIR499A and MIR499B). We found differences in the expression between alleles and in the regulation of several genes involved in cancer, such as TP53, KIT, CDH1, CLH, and TERT, which may result in changes in regulatory networks related to tumorigenesis. Furthermore, luciferase-based assays showed that MIR499A could be regulating the cadherin CDH1 and the cell adhesion molecule CLH1 in an allele-dependent fashion. A better understanding of the effect of microRNA variants associated with disease could be key in our way to a more personalized medicine.
Collapse
Affiliation(s)
- Ignasi Torruella-Loran
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain.,Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva (GBE), Universitat Autonòma de Barcelona, Bellaterra, Barcelona, Spain
| | - Begoña Dobon
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain
| | - Alicia Gallego
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain
| | - Ingrid Balcells
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain
| | - Eva Garcia-Ramallo
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain
| | - Yolanda Espinosa-Parrilla
- Department of Experimental and Health Sciences, IBE, Institute of Evolutionary Biology, (Universitat Pompeu Fabra-CSIC), Barcelona, Catalonia, Spain. .,School of Medicine, University of Magallanes, Punta Arenas, Chile.
| |
Collapse
|
166
|
Genome-wide scans reveal variants at EDAR predominantly affecting hair straightness in Han Chinese and Uyghur populations. Hum Genet 2016; 135:1279-1286. [PMID: 27487801 DOI: 10.1007/s00439-016-1718-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Accepted: 07/23/2016] [Indexed: 10/21/2022]
Abstract
Hair straightness/curliness is one of the most conspicuous features of human variation and is particularly diverse among populations. A recent genome-wide scan found common variants in the Trichohyalin (TCHH) gene that are associated with hair straightness in Europeans, but different genes might affect this phenotype in other populations. By sampling 2899 Han Chinese, we performed the first genome-wide scan of hair straightness in East Asians, and found EDAR (rs3827760) as the predominant gene (P = 4.67 × 10-16), accounting for 3.66 % of the total variance. The candidate gene approach did not find further significant associations, suggesting that hair straightness may be affected by a large number of genes with subtle effects. Notably, genetic variants associated with hair straightness in Europeans are generally low in frequency in Han Chinese, and vice versa. To evaluate the relative contribution of these variants, we performed a second genome-wide scan in 709 samples from the Uyghur, an admixed population with both eastern and western Eurasian ancestries. In Uyghurs, both EDAR (rs3827760: P = 1.92 × 10-12) and TCHH (rs11803731: P = 1.46 × 10-3) are associated with hair straightness, but EDAR (OR 0.415) has a greater effect than TCHH (OR 0.575). We found no significant interaction between EDAR and TCHH (P = 0.645), suggesting that these two genes affect hair straightness through different mechanisms. Furthermore, haplotype analysis indicates that TCHH is not subject to selection. While EDAR is under strong selection in East Asia, it does not appear to be subject to selection after the admixture in Uyghurs. These suggest that hair straightness is unlikely a trait under selection.
Collapse
|
167
|
Choi J, Shooshtari P, Samocha KE, Daly MJ, Cotsapas C. Network Analysis of Genome-Wide Selective Constraint Reveals a Gene Network Active in Early Fetal Brain Intolerant of Mutation. PLoS Genet 2016; 12:e1006121. [PMID: 27305007 PMCID: PMC4909280 DOI: 10.1371/journal.pgen.1006121] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Accepted: 05/20/2016] [Indexed: 11/22/2022] Open
Abstract
Using robust, integrated analysis of multiple genomic datasets, we show that genes depleted for non-synonymous de novo mutations form a subnetwork of 72 members under strong selective constraint. We further show this subnetwork is preferentially expressed in the early development of the human hippocampus and is enriched for genes mutated in neurological Mendelian disorders. We thus conclude that carefully orchestrated developmental processes are under strong constraint in early brain development, and perturbations caused by mutation have adverse outcomes subject to strong purifying selection. Our findings demonstrate that selective forces can act on groups of genes involved in the same process, supporting the notion that purifying selection can act coordinately on multiple genes. Our approach provides a statistically robust, interpretable way to identify the tissues and developmental times where groups of disease genes are active. Some genes are extremely intolerant of mutations that alter their amino acid sequence. Such mutations are highly likely to drive disease, and previous reports have implicated these genes in multiple diseases. To better understand the function of these constrained genes and their place in cellular organization, we developed a framework to ask if these genes form biochemical networks expressed in specific tissues and developmental timepoints. Using clustering analysis over protein-protein interaction maps, we show that 72/107 such genes form a densely connected network. Using another new method, we found that these 72 genes are coordinately expressed in fetal brain and early blood cell precursors, but not other tissues, in the Roadmap Epigenomic Project, and then show that this gene module is active in very early developmental time points of the hippocampus included in the Brainspan Atlas. We also show that these genes, when mutated, tend to cause genetic diseases. Thus we demonstrate that evolution constrains mutation of key mechanisms that must therefore require careful control in both time and space for development to occur normally.
Collapse
Affiliation(s)
- Jinmyung Choi
- Department of Neurology, Yale School of Medicine, New Haven Connecticut, United States of America
| | - Parisa Shooshtari
- Department of Neurology, Yale School of Medicine, New Haven Connecticut, United States of America
| | - Kaitlin E. Samocha
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Program in Genetics and Genomics, Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Mark J. Daly
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| | - Chris Cotsapas
- Department of Neurology, Yale School of Medicine, New Haven Connecticut, United States of America
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Department of Genetics, Yale School of Medicine, New Haven CT, United States of America
- * E-mail:
| |
Collapse
|
168
|
Wallberg A, Pirk CW, Allsopp MH, Webster MT. Identification of Multiple Loci Associated with Social Parasitism in Honeybees. PLoS Genet 2016; 12:e1006097. [PMID: 27280405 PMCID: PMC4900560 DOI: 10.1371/journal.pgen.1006097] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Accepted: 05/10/2016] [Indexed: 12/20/2022] Open
Abstract
In colonies of the honeybee Apis mellifera, the queen is usually the only reproductive female, which produces new females (queens and workers) by laying fertilized eggs. However, in one subspecies of A. mellifera, known as the Cape bee (A. m. capensis), worker bees reproduce asexually by thelytoky, an abnormal form of meiosis where two daughter nucleii fuse to form single diploid eggs, which develop into females without being fertilized. The Cape bee also exhibits a suite of phenotypes that facilitate social parasitism whereby workers lay such eggs in foreign colonies so their offspring can exploit their resources. The genetic basis of this switch to social parasitism in the Cape bee is unknown. To address this, we compared genome variation in a sample of Cape bees with other African populations. We find genetic divergence between these populations to be very low on average but identify several regions of the genome with extreme differentiation. The regions are strongly enriched for signals of selection in Cape bees, indicating that increased levels of positive selection have produced the unique set of derived phenotypic traits in this subspecies. Genetic variation within these regions allows unambiguous genetic identification of Cape bees and likely underlies the genetic basis of social parasitism. The candidate loci include genes involved in ecdysteroid signaling and juvenile hormone and dopamine biosynthesis, which may regulate worker ovary activation and others whose products localize at the centrosome and are implicated in chromosomal segregation during meiosis. Functional analysis of these loci will yield insights into the processes of reproduction and chemical signaling in both parasitic and non-parasitic populations and advance understanding of the process of normal and atypical meiosis.
Collapse
Affiliation(s)
- Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- * E-mail: (AW); (MTW)
| | - Christian W. Pirk
- Department of Zoology and Entomology, University of Pretoria, Pretoria, South Africa
| | - Mike H. Allsopp
- Plant Protection Research Institute, Agricultural Research Council, Stellenbosch, South Africa
| | - Matthew T. Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- * E-mail: (AW); (MTW)
| |
Collapse
|
169
|
Li MJ, Pan Z, Liu Z, Wu J, Wang P, Zhu Y, Xu F, Xia Z, Sham PC, Kocher JPA, Li M, Liu JS, Wang J. Predicting regulatory variants with composite statistic. Bioinformatics 2016; 32:2729-36. [PMID: 27273672 DOI: 10.1093/bioinformatics/btw288] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 04/29/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Prediction and prioritization of human non-coding regulatory variants is critical for understanding the regulatory mechanisms of disease pathogenesis and promoting personalized medicine. Existing tools utilize functional genomics data and evolutionary information to evaluate the pathogenicity or regulatory functions of non-coding variants. However, different algorithms lead to inconsistent and even conflicting predictions. Combining multiple methods may increase accuracy in regulatory variant prediction. RESULTS Here, we compiled an integrative resource for predictions from eight different tools on functional annotation of non-coding variants. We further developed a composite strategy to integrate multiple predictions and computed the composite likelihood of a given variant being regulatory variant. Benchmarked by multiple independent causal variants datasets, we demonstrated that our composite model significantly improves the prediction performance. AVAILABILITY AND IMPLEMENTATION We implemented our model and scoring procedure as a tool, named PRVCS, which is freely available to academic and non-profit usage at http://jjwanglab.org/PRVCS CONTACT: wang.junwen@mayo.edu, jliu@stat.harvard.edu, or limx54@gmail.com SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mulin Jun Li
- Department of Statistics, Harvard University, Cambridge, Boston, 02138-2901 MA, USA, Centre for Genomic Sciences
| | - Zhicheng Pan
- Centre for Genomic Sciences, Department of Psychiatry
| | - Zipeng Liu
- Centre for Genomic Sciences, Department of Anaesthesiology
| | - Jiexing Wu
- Department of Statistics, Harvard University, Cambridge, Boston, 02138-2901 MA, USA
| | | | - Yun Zhu
- Centre for Genomic Sciences, School of Biomedical Sciences
| | | | | | | | - Jean-Pierre A Kocher
- Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Scottsdale, AZ 85259, USA and
| | - Miaoxin Li
- Centre for Genomic Sciences, Department of Psychiatry, Centre for Reproduction, Development and Growth, LKS Faculty of Medicine, the University of Hong Kong, Hong Kong SAR, China
| | - Jun S Liu
- Department of Statistics, Harvard University, Cambridge, Boston, 02138-2901 MA, USA, Center for Statistical Science, Tsinghua University, Beijing 100084, China and
| | - Junwen Wang
- Centre for Genomic Sciences, Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Scottsdale, AZ 85259, USA and Department of Biomedical Informatics, Arizona State University, Scottsdale, AZ 85259, USA
| |
Collapse
|
170
|
Randhawa IAS, Khatkar MS, Thomson PC, Raadsma HW. A Meta-Assembly of Selection Signatures in Cattle. PLoS One 2016; 11:e0153013. [PMID: 27045296 PMCID: PMC4821596 DOI: 10.1371/journal.pone.0153013] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 03/22/2016] [Indexed: 12/31/2022] Open
Abstract
Since domestication, significant genetic improvement has been achieved for many traits of commercial importance in cattle, including adaptation, appearance and production. In response to such intense selection pressures, the bovine genome has undergone changes at the underlying regions of functional genetic variants, which are termed “selection signatures”. This article reviews 64 recent (2009–2015) investigations testing genomic diversity for departure from neutrality in worldwide cattle populations. In particular, we constructed a meta-assembly of 16,158 selection signatures for individual breeds and their archetype groups (European, African, Zebu and composite) from 56 genome-wide scans representing 70,743 animals of 90 pure and crossbred cattle breeds. Meta-selection-scores (MSS) were computed by combining published results at every given locus, within a sliding window span. MSS were adjusted for common samples across studies and were weighted for significance thresholds across and within studies. Published selection signatures show extensive coverage across the bovine genome, however, the meta-assembly provides a consensus profile of 263 genomic regions of which 141 were unique (113 were breed-specific) and 122 were shared across cattle archetypes. The most prominent peaks of MSS represent regions under selection across multiple populations and harboured genes of known major effects (coat color, polledness and muscle hypertrophy) and genes known to influence polygenic traits (stature, adaptation, feed efficiency, immunity, behaviour, reproduction, beef and dairy production). As the first meta-assembly of selection signatures, it offers novel insights about the hotspots of selective sweeps in the bovine genome, and this method could equally be applied to other species.
Collapse
Affiliation(s)
- Imtiaz A. S. Randhawa
- Reprogen - Animal Bioscience Group, Faculty of Veterinary Science, The University of Sydney, 425 Werombi Road, Camden, 2570, NSW, Australia
- * E-mail:
| | - Mehar S. Khatkar
- Reprogen - Animal Bioscience Group, Faculty of Veterinary Science, The University of Sydney, 425 Werombi Road, Camden, 2570, NSW, Australia
| | - Peter C. Thomson
- Reprogen - Animal Bioscience Group, Faculty of Veterinary Science, The University of Sydney, 425 Werombi Road, Camden, 2570, NSW, Australia
| | - Herman W. Raadsma
- Reprogen - Animal Bioscience Group, Faculty of Veterinary Science, The University of Sydney, 425 Werombi Road, Camden, 2570, NSW, Australia
| |
Collapse
|
171
|
Key FM, Fu Q, Romagné F, Lachmann M, Andrés AM. Human adaptation and population differentiation in the light of ancient genomes. Nat Commun 2016; 7:10775. [PMID: 26988143 PMCID: PMC4802047 DOI: 10.1038/ncomms10775] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 01/18/2016] [Indexed: 01/24/2023] Open
Abstract
The influence of positive selection sweeps in human evolution is increasingly debated, although our ability to detect them is hampered by inherent uncertainties in the timing of past events. Ancient genomes provide snapshots of allele frequencies in the past and can help address this question. We combine modern and ancient genomic data in a simple statistic (DAnc) to time allele frequency changes, and investigate the role of drift and adaptation in population differentiation. Only 30% of the most strongly differentiated alleles between Africans and Eurasians changed in frequency during the colonization of Eurasia, but in Europe these alleles are enriched in genic and putatively functional alleles to an extent only compatible with local adaptation. Adaptive alleles—especially those associated with pigmentation—are mostly of hunter-gatherer origin, although lactose persistence arose in a haplotype present in farmers. These results provide evidence for a role of local adaptation in human population differentiation. Detecting the targets of positive selection in the human genome is challenging. Here, the authors combine modern and ancient genomes to show that alleles strongly differentiated between Africans and Europeans mediated local adaptation in European populations, and were mostly contributed by ancient hunter-gatherers.
Collapse
Affiliation(s)
- Felix M Key
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Qiaomei Fu
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.,Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, IVPP, CAS, Beijing 100044, China.,Department of Genetics, Harvard Medical School, Boston 02115, Massachusetts, USA
| | - Frédéric Romagné
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Michael Lachmann
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.,Santa Fe Institute, Santa Fe, New Mexico 87501, USA
| | - Aida M Andrés
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| |
Collapse
|
172
|
Engelken J, Espadas G, Mancuso FM, Bonet N, Scherr AL, Jímenez-Álvarez V, Codina-Solà M, Medina-Stacey D, Spataro N, Stoneking M, Calafell F, Sabidó E, Bosch E. Signatures of Evolutionary Adaptation in Quantitative Trait Loci Influencing Trace Element Homeostasis in Liver. Mol Biol Evol 2016; 33:738-54. [PMID: 26582562 PMCID: PMC4760079 DOI: 10.1093/molbev/msv267] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Essential trace elements possess vital functions at molecular, cellular, and physiological levels in health and disease, and they are tightly regulated in the human body. In order to assess variability and potential adaptive evolution of trace element homeostasis, we quantified 18 trace elements in 150 liver samples, together with the expression levels of 90 genes and abundances of 40 proteins involved in their homeostasis. Additionally, we genotyped 169 single nucleotide polymorphism (SNPs) in the same sample set. We detected significant associations for 8 protein quantitative trait loci (pQTL), 10 expression quantitative trait loci (eQTLs), and 15 micronutrient quantitative trait loci (nutriQTL). Six of these exceeded the false discovery rate cutoff and were related to essential trace elements: 1) one pQTL for GPX2 (rs10133290); 2) two previously described eQTLs for HFE (rs12346) and SELO (rs4838862) expression; and 3) three nutriQTLs: The pathogenic C282Y mutation at HFE affecting iron (rs1800562), and two SNPs within several clustered metallothionein genes determining selenium concentration (rs1811322 and rs904773). Within the complete set of significant QTLs (which involved 30 SNPs and 20 gene regions), we identified 12 SNPs with extreme patterns of population differentiation (FST values in the top 5% percentile in at least one HapMap population pair) and significant evidence for selective sweeps involving QTLs at GPX1, SELENBP1, GPX3, SLC30A9, and SLC39A8. Overall, this detailed study of various molecular phenotypes illustrates the role of regulatory variants in explaining differences in trace element homeostasis among populations and in the human adaptive response to environmental pressures related to micronutrients.
Collapse
Affiliation(s)
- Johannes Engelken
- †These authors contributed equally to this work. ‡Deceased October 23, 2015. Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Guadalupe Espadas
- †These authors contributed equally to this work. Proteomics Unit, Center of Genomics Regulation, Barcelona, Spain Proteomics Unit, Universitat Pompeu Fabra, Barcelona, Spain
| | - Francesco M Mancuso
- Proteomics Unit, Center of Genomics Regulation, Barcelona, Spain Proteomics Unit, Universitat Pompeu Fabra, Barcelona, Spain
| | - Nuria Bonet
- Genomics Core Facility, Universitat Pompeu Fabra, Barcelona Biomedical Research Park, Barcelona, Spain
| | - Anna-Lena Scherr
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Victoria Jímenez-Álvarez
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Marta Codina-Solà
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Daniel Medina-Stacey
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Nino Spataro
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Francesc Calafell
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Eduard Sabidó
- Proteomics Unit, Center of Genomics Regulation, Barcelona, Spain Proteomics Unit, Universitat Pompeu Fabra, Barcelona, Spain
| | - Elena Bosch
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
173
|
Abstract
Simonson, Tatum S. Altitude adaptation: A glimpse through various lenses. High Alt Med Biol 16:125-137, 2015.--Recent availability of genome-wide data from highland populations has enabled the identification of adaptive genomic signals. Some of the genomic signals reported thus far among Tibetan, Andean, and Ethiopian are the same, while others appear unique to each population. These genomic findings parallel observations conveyed by decades of physiological research: different continental populations, resident at high altitude for hundreds of generations, exhibit a distinct composite of traits at altitude. The most commonly reported signatures of selection emanate from genomic segments containing hypoxia-inducible factor (HIF) pathway genes. Corroborative evidence for adaptive significance stems from associations between putatively adaptive gene copies and sea-level ranges of hemoglobin concentration in Tibetan and Amhara Ethiopians, birth weights and metabolic factors in Andeans and Tibetans, maternal uterine artery diameter in Andeans, and protection from chronic mountain sickness in Andean males at altitude. While limited reports provide mechanistic insights thus far, efforts to identify and link precise genetic variants to molecular, physiological, and developmental functions are underway, and progress on the genomics front continues to provide unprecedented movement towards these goals. This combination of multiple perspectives is necessary to maximize our understanding of orchestrated biological and evolutionary processes in native highland populations, which will advance our understanding of both adaptive and non-adaptive responses to hypoxia.
Collapse
Affiliation(s)
- Tatum S Simonson
- Department of Medicine, Division of Physiology, University of California , San Diego, La Jolla, California
| |
Collapse
|
174
|
Adhikari K, Fontanil T, Cal S, Mendoza-Revilla J, Fuentes-Guajardo M, Chacón-Duque JC, Al-Saadi F, Johansson JA, Quinto-Sanchez M, Acuña-Alonzo V, Jaramillo C, Arias W, Barquera Lozano R, Macín Pérez G, Gómez-Valdés J, Villamil-Ramírez H, Hunemeier T, Ramallo V, Silva de Cerqueira CC, Hurtado M, Villegas V, Granja V, Gallo C, Poletti G, Schuler-Faccini L, Salzano FM, Bortolini MC, Canizales-Quinteros S, Rothhammer F, Bedoya G, Gonzalez-José R, Headon D, López-Otín C, Tobin DJ, Balding D, Ruiz-Linares A. A genome-wide association scan in admixed Latin Americans identifies loci influencing facial and scalp hair features. Nat Commun 2016; 7:10815. [PMID: 26926045 PMCID: PMC4773514 DOI: 10.1038/ncomms10815] [Citation(s) in RCA: 127] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Accepted: 01/25/2016] [Indexed: 12/20/2022] Open
Abstract
We report a genome-wide association scan in over 6,000 Latin Americans for features of scalp hair (shape, colour, greying, balding) and facial hair (beard thickness, monobrow, eyebrow thickness). We found 18 signals of association reaching genome-wide significance (P values 5 × 10(-8) to 3 × 10(-119)), including 10 novel associations. These include novel loci for scalp hair shape and balding, and the first reported loci for hair greying, monobrow, eyebrow and beard thickness. A newly identified locus influencing hair shape includes a Q30R substitution in the Protease Serine S1 family member 53 (PRSS53). We demonstrate that this enzyme is highly expressed in the hair follicle, especially the inner root sheath, and that the Q30R substitution affects enzyme processing and secretion. The genome regions associated with hair features are enriched for signals of selection, consistent with proposals regarding the evolution of human hair.
Collapse
Affiliation(s)
- Kaustubh Adhikari
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
| | - Tania Fontanil
- Departamento de Bioquímica y Biología Molecular, IUOPA, Universidad de Oviedo, Oviedo 33006, Spain
| | - Santiago Cal
- Departamento de Bioquímica y Biología Molecular, IUOPA, Universidad de Oviedo, Oviedo 33006, Spain
| | - Javier Mendoza-Revilla
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Macarena Fuentes-Guajardo
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
- Departamento de Tecnología Médica, Facultad de Ciencias de la Salud, Universidad de Tarapacá, Arica 1000009, Chile
| | - Juan-Camilo Chacón-Duque
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
| | - Farah Al-Saadi
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
| | - Jeanette A. Johansson
- Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian EH25 9RG, UK
| | | | - Victor Acuña-Alonzo
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
- National Institute of Anthropology and History, México 4510, México
| | - Claudia Jaramillo
- GENMOL (Genética Molecular), Universidad de Antioquia, Medellín 5001000, Colombia
| | - William Arias
- GENMOL (Genética Molecular), Universidad de Antioquia, Medellín 5001000, Colombia
| | - Rodrigo Barquera Lozano
- National Institute of Anthropology and History, México 4510, México
- Unidad de Genómica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, México 4510, México
| | - Gastón Macín Pérez
- National Institute of Anthropology and History, México 4510, México
- Unidad de Genómica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, México 4510, México
| | | | - Hugo Villamil-Ramírez
- Unidad de Genómica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, México 4510, México
| | - Tábita Hunemeier
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Virginia Ramallo
- Centro Nacional Patagónico, CONICET, Puerto Madryn U9129ACD, Argentina
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Caio C. Silva de Cerqueira
- Centro Nacional Patagónico, CONICET, Puerto Madryn U9129ACD, Argentina
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Malena Hurtado
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Valeria Villegas
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Vanessa Granja
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Carla Gallo
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Giovanni Poletti
- Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
| | - Lavinia Schuler-Faccini
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Francisco M. Salzano
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Maria-Cátira Bortolini
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brasil
| | - Samuel Canizales-Quinteros
- Unidad de Genómica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, México 4510, México
| | | | - Gabriel Bedoya
- GENMOL (Genética Molecular), Universidad de Antioquia, Medellín 5001000, Colombia
| | | | - Denis Headon
- Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian EH25 9RG, UK
| | - Carlos López-Otín
- Departamento de Bioquímica y Biología Molecular, IUOPA, Universidad de Oviedo, Oviedo 33006, Spain
| | - Desmond J. Tobin
- Centre for Skin Sciences, Faculty of Life Sciences, University of Bradford, Bradford BD7 1DP, Victoria, UK
| | - David Balding
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
- Schools of BioSciences and Mathematics and Statistics, University of Melbourne, Melbourne 3010, Australia
| | - Andrés Ruiz-Linares
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
| |
Collapse
|
175
|
Freedman AH, Schweizer RM, Ortega-Del Vecchyo D, Han E, Davis BW, Gronau I, Silva PM, Galaverni M, Fan Z, Marx P, Lorente-Galdos B, Ramirez O, Hormozdiari F, Alkan C, Vilà C, Squire K, Geffen E, Kusak J, Boyko AR, Parker HG, Lee C, Tadigotla V, Siepel A, Bustamante CD, Harkins TT, Nelson SF, Marques-Bonet T, Ostrander EA, Wayne RK, Novembre J. Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs. PLoS Genet 2016; 12:e1005851. [PMID: 26943675 PMCID: PMC4778760 DOI: 10.1371/journal.pgen.1005851] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 01/18/2016] [Indexed: 12/31/2022] Open
Abstract
Controlling for background demographic effects is important for accurately identifying loci that have recently undergone positive selection. To date, the effects of demography have not yet been explicitly considered when identifying loci under selection during dog domestication. To investigate positive selection on the dog lineage early in the domestication, we examined patterns of polymorphism in six canid genomes that were previously used to infer a demographic model of dog domestication. Using an inferred demographic model, we computed false discovery rates (FDR) and identified 349 outlier regions consistent with positive selection at a low FDR. The signals in the top 100 regions were frequently centered on candidate genes related to brain function and behavior, including LHFPL3, CADM2, GRIK3, SH3GL2, MBP, PDE7B, NTAN1, and GLRA1. These regions contained significant enrichments in behavioral ontology categories. The 3rd top hit, CCRN4L, plays a major role in lipid metabolism, that is supported by additional metabolism related candidates revealed in our scan, including SCP2D1 and PDXC1. Comparing our method to an empirical outlier approach that does not directly account for demography, we found only modest overlaps between the two methods, with 60% of empirical outliers having no overlap with our demography-based outlier detection approach. Demography-aware approaches have lower-rates of false discovery. Our top candidates for selection, in addition to expanding the set of neurobehavioral candidate genes, include genes related to lipid metabolism, suggesting a dietary target of selection that was important during the period when proto-dogs hunted and fed alongside hunter-gatherers.
Collapse
Affiliation(s)
- Adam H. Freedman
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Rena M. Schweizer
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Diego Ortega-Del Vecchyo
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Eunjung Han
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Brian W. Davis
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Ilan Gronau
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | | | | | - Zhenxin Fan
- Key Laboratory of Bioresources and Ecoenvironment, Sichuan University, Chengdu, China
| | - Peter Marx
- Department of Measurement and Information Systems, Budapest University of Technology and Economics, Budapest, Hungary
| | - Belen Lorente-Galdos
- ICREA at Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Oscar Ramirez
- ICREA at Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Farhad Hormozdiari
- Department of Computer Science, University of California, Los Angeles, Los Angeles, California, United States of America
| | | | - Carles Vilà
- Estación Biológia de Doñana EBD-CSIC, Sevilla, Spain
| | - Kevin Squire
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Eli Geffen
- Department of Zoology, Tel Aviv University, Tel Aviv, Israel
| | - Josip Kusak
- Department of Biology, University of Zagreb, Zagreb, Croatia
| | - Adam R. Boyko
- Department of Biomedical Sciences, Cornell University, Ithaca, New York, United States of America
| | - Heidi G. Parker
- ICREA at Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Clarence Lee
- Life Technologies, Foster City, California, United States of America
| | - Vasisht Tadigotla
- Life Technologies, Foster City, California, United States of America
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | | | | | - Stanley F. Nelson
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Tomas Marques-Bonet
- ICREA at Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- Centro Nacional de Analisis Genomico (CNAG/PCB), Baldiri Reixach 4–8, Barcelona, Spain
| | - Elaine A. Ostrander
- National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Robert K. Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - John Novembre
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
176
|
Inferring the Dynamics of Effective Population Size Using Autosomal Genomes. Sci Rep 2016; 6:20079. [PMID: 26832887 PMCID: PMC4735516 DOI: 10.1038/srep20079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 08/20/2015] [Indexed: 01/25/2023] Open
Abstract
Next-generation sequencing technology has provided a great opportunity for inferring human demographic history by investigating changes in the effective population size (Ne). In this report, we introduce a strategy for estimating Ne dynamics, allowing the exploration of large multi-locus SNP datasets. We applied this strategy to the Phase 1 Han Chinese samples from the 1000 Genomes Project. The Han Chinese population has undergone a continuous expansion since 25,000 years ago, at first slowly from about 7,300 to 9,800 (at the end of the last glacial maximum about 15,000 YBP), then more quickly to about 46,000 (at the beginning of the Neolithic about 8,000 YBP), and then even more quickly to reach a population size of about 140,000 (recently).
Collapse
|
177
|
Deschamps M, Laval G, Fagny M, Itan Y, Abel L, Casanova JL, Patin E, Quintana-Murci L. Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes. Am J Hum Genet 2016; 98:5-21. [PMID: 26748513 DOI: 10.1016/j.ajhg.2015.11.014] [Citation(s) in RCA: 174] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 11/06/2015] [Indexed: 01/25/2023] Open
Abstract
Human genes governing innate immunity provide a valuable tool for the study of the selective pressure imposed by microorganisms on host genomes. A comprehensive, genome-wide study of how selective constraints and adaptations have driven the evolution of innate immunity genes is missing. Using full-genome sequence variation from the 1000 Genomes Project, we first show that innate immunity genes have globally evolved under stronger purifying selection than the remainder of protein-coding genes. We identify a gene set under the strongest selective constraints, mutations in which are likely to predispose individuals to life-threatening disease, as illustrated by STAT1 and TRAF3. We then evaluate the occurrence of local adaptation and detect 57 high-scoring signals of positive selection at innate immunity genes, variation in which has been associated with susceptibility to common infectious or autoimmune diseases. Furthermore, we show that most adaptations targeting coding variation have occurred in the last 6,000-13,000 years, the period at which populations shifted from hunting and gathering to farming. Finally, we show that innate immunity genes present higher Neandertal introgression than the remainder of the coding genome. Notably, among the genes presenting the highest Neandertal ancestry, we find the TLR6-TLR1-TLR10 cluster, which also contains functional adaptive variation in Europeans. This study identifies highly constrained genes that fulfill essential, non-redundant functions in host survival and reveals others that are more permissive to change-containing variation acquired from archaic hominins or adaptive variants in specific populations-improving our understanding of the relative biological importance of innate immunity pathways in natural conditions.
Collapse
Affiliation(s)
- Matthieu Deschamps
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
| | - Guillaume Laval
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
| | - Maud Fagny
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
| | - Yuval Itan
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA
| | - Laurent Abel
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France
| | - Jean-Laurent Casanova
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France; Howard Hughes Medical Institute, New York, NY 10065, USA; Pediatric Hematology-Immunology Unit, Necker Hospital for Sick Children, 75015 Paris, France
| | - Etienne Patin
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
| | - Lluis Quintana-Murci
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France.
| |
Collapse
|
178
|
Khan A, Tian L, Zhang C, Yuan K, Xu S. Genetic diversity and natural selection footprints of the glycine amidinotransferase gene in various human populations. Sci Rep 2016; 6:18755. [PMID: 26729229 PMCID: PMC4700420 DOI: 10.1038/srep18755] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 11/23/2015] [Indexed: 12/02/2022] Open
Abstract
The glycine amidinotransferase gene (GATM) plays a vital role in energy metabolism in muscle tissues and is associated with multiple clinically important phenotypes. However, the genetic diversity of the GATM gene remains poorly understood within and between human populations. Here we analyzed the 1,000 Genomes Project data through population genetics approaches and observed significant genetic diversity across the GATM gene among various continental human populations. We observed considerable variations in GATM allele frequencies and haplotype composition among different populations. Substantial genetic differences were observed between East Asian and European populations (FST = 0.56). In addition, the frequency of a distinct major GATM haplotype in these groups was congruent with population-wide diversity at this locus. Furthermore, we identified GATM as the top differentiated gene compared to the other statin drug response-associated genes. Composite multiple analyses identified signatures of positive selection at the GATM locus, which was estimated to have occurred around 850 generations ago in European populations. As GATM catalyzes the key step of creatine biosynthesis involved in energy metabolism, we speculate that the European prehistorical demographic transition from hunter-gatherer to farming cultures was the driving force of selection that fulfilled creatine-based metabolic requirement of the populations.
Collapse
Affiliation(s)
- Asifullah Khan
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese academy of Sciences, Shanghai 200031, China.,Department of Biochemistry, Abdul Wali Khan University Mardan (AWKUM), Mardan, Khyber Pakhthunkhwa, Pakistan
| | - Lei Tian
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese academy of Sciences, Shanghai 200031, China
| | - Chao Zhang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese academy of Sciences, Shanghai 200031, China
| | - Kai Yuan
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese academy of Sciences, Shanghai 200031, China
| | - Shuhua Xu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese academy of Sciences, Shanghai 200031, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai 200031, China.,Collaborative Innovation Center of Genetics and Development, Shanghai 200438, China
| |
Collapse
|
179
|
Peng Q, Li J, Tan J, Yang Y, Zhang M, Wu S, Liu Y, Zhang J, Qin P, Guan Y, Jiao Y, Zhang Z, Sabeti PC, Tang K, Xu S, Jin L, Wang S. EDARV370A associated facial characteristics in Uyghur population revealing further pleiotropic effects. Hum Genet 2016; 135:99-108. [DOI: 10.1007/s00439-015-1618-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2015] [Accepted: 11/14/2015] [Indexed: 12/16/2022]
|
180
|
Schweizer RM, Robinson J, Harrigan R, Silva P, Galverni M, Musiani M, Green RE, Novembre J, Wayne RK. Targeted capture and resequencing of 1040 genes reveal environmentally driven functional variation in grey wolves. Mol Ecol 2015; 25:357-79. [DOI: 10.1111/mec.13467] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Revised: 11/04/2015] [Accepted: 11/06/2015] [Indexed: 12/29/2022]
Affiliation(s)
- Rena M. Schweizer
- Department of Ecology and Evolutionary Biology University of California, Los Angeles 610 Charles E Young Dr East Los Angeles CA 90095 USA
| | - Jacqueline Robinson
- Department of Ecology and Evolutionary Biology University of California, Los Angeles 610 Charles E Young Dr East Los Angeles CA 90095 USA
| | - Ryan Harrigan
- Center for Tropical Research Institute of the Environment and Sustainability University of California 619 Charles E. Young Drive East Los Angeles CA 90095 USA
| | - Pedro Silva
- CIBIO/InBio – Centro de Investigação em Biodiversidade e Recursos Genéticos Universidade do Porto Campus Agrário de Vairão 4485‐661 Vairão Portugal
- Departamento de Biologia Faculdade de Ciências Universidade do Porto Rua do Campo Alegre s/n. 4169‐007 Porto Portugal
| | - Marco Galverni
- Laboratory of Genetics ISPRA (Istituto Superiore per la Protezione e Ricerca Ambientale) Via Cà Fornacetta 9 40064 Ozzano dell'Emilia BO Italy
| | - Marco Musiani
- Faculties of Environmental Design and Veterinary Medicine (Joint Appointment) EVDS University of Calgary 2500 University Dr NW Calgary Alberta Canada T2N 1N4
| | - Richard E. Green
- Department of Biomolecular Engineering University of California Santa Cruz CA 95060 USA
| | - John Novembre
- Department of Human Genetics University of Chicago 920 E. 58th Street Chicago IL 60637 USA
| | - Robert K. Wayne
- Department of Ecology and Evolutionary Biology University of California, Los Angeles 610 Charles E Young Dr East Los Angeles CA 90095 USA
| |
Collapse
|
181
|
Hirbo J, Eidem H, Rokas A, Abbot P. Integrating Diverse Types of Genomic Data to Identify Genes that Underlie Adverse Pregnancy Phenotypes. PLoS One 2015; 10:e0144155. [PMID: 26641094 PMCID: PMC4671692 DOI: 10.1371/journal.pone.0144155] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Accepted: 11/14/2015] [Indexed: 11/18/2022] Open
Abstract
Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB), a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23–34%) are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.
Collapse
Affiliation(s)
- Jibril Hirbo
- Department of Biological Sciences, Vanderbilt University, Box 35164 Station B, Nashville, TN, 37235–1634, United States of America
| | - Haley Eidem
- Department of Biological Sciences, Vanderbilt University, Box 35164 Station B, Nashville, TN, 37235–1634, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Box 35164 Station B, Nashville, TN, 37235–1634, United States of America
| | - Patrick Abbot
- Department of Biological Sciences, Vanderbilt University, Box 35164 Station B, Nashville, TN, 37235–1634, United States of America
- * E-mail:
| |
Collapse
|
182
|
High level of inbreeding in final phase of 1000 Genomes Project. Sci Rep 2015; 5:17453. [PMID: 26625947 PMCID: PMC4667178 DOI: 10.1038/srep17453] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Accepted: 10/19/2015] [Indexed: 12/18/2022] Open
Abstract
The 1000 Genomes Project provides a unique source of whole genome sequencing data for studies of human population genetics and human diseases. The last release of this project includes more than 2,500 sequenced individuals from 26 populations. Although relationships among individuals have been investigated in some of the populations, inbreeding has never been studied. In this article, we estimated the genomic inbreeding coefficient of each individual and found an unexpected high level of inbreeding in 1000 Genomes data: nearly a quarter of the individuals were inbred and around 4% of them had inbreeding coefficients similar or greater than the ones expected for first-cousin offspring. Inbred individuals were found in each of the 26 populations, with some populations showing proportions of inbred individuals above 50%. We also detected 227 previously unreported pairs of close relatives (up to and including first-cousins). Thus, we propose subsets of unrelated and outbred individuals, for use by the scientific community. In addition, because admixed populations are present in the 1000 Genomes Project, we performed simulations to study the robustness of inbreeding coefficient estimates in the presence of admixture. We found that our multi-point approach (FSuite) was quite robust to admixture, unlike single-point methods (PLINK).
Collapse
|
183
|
He Y, Wang M, Huang X, Li R, Xu H, Xu S, Jin L. A probabilistic method for testing and estimating selection differences between populations. Genome Res 2015; 25:1903-1909. [PMID: 26463656 PMCID: PMC4665011 DOI: 10.1101/gr.192336.115] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 10/13/2015] [Indexed: 01/18/2023]
Abstract
Human populations around the world encounter various environmental challenges and, consequently, develop genetic adaptations to different selection forces. Identifying the differences in natural selection between populations is critical for understanding the roles of specific genetic variants in evolutionary adaptation. Although numerous methods have been developed to detect genetic loci under recent directional selection, a probabilistic solution for testing and quantifying selection differences between populations is lacking. Here we report the development of a probabilistic method for testing and estimating selection differences between populations. By use of a probabilistic model of genetic drift and selection, we showed that logarithm odds ratios of allele frequencies provide estimates of the differences in selection coefficients between populations. The estimates approximate a normal distribution, and variance can be estimated using genome-wide variants. This allows us to quantify differences in selection coefficients and to determine the confidence intervals of the estimate. Our work also revealed the link between genetic association testing and hypothesis testing of selection differences. It therefore supplies a solution for hypothesis testing of selection differences. This method was applied to a genome-wide data analysis of Han and Tibetan populations. The results confirmed that both the EPAS1 and EGLN1 genes are under statistically different selection in Han and Tibetan populations. We further estimated differences in the selection coefficients for genetic variants involved in melanin formation and determined their confidence intervals between continental population groups. Application of the method to empirical data demonstrated the outstanding capability of this novel approach for testing and quantifying differences in natural selection.
Collapse
Affiliation(s)
- Yungang He
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Minxian Wang
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xin Huang
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Ran Li
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Hongyang Xu
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Shuhua Xu
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Li Jin
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China; State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200433, China
| |
Collapse
|
184
|
Hu Y, Ding Q, He Y, Xu S, Jin L. Reintroduction of a Homocysteine Level-Associated Allele into East Asians by Neanderthal Introgression. Mol Biol Evol 2015; 32:3108-3113. [PMID: 26392408 DOI: 10.1093/molbev/msv176] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
In this study, we present an analysis of Neanderthal introgression at the dipeptidase 1 gene, DPEP1. A Neanderthal origin for the putative introgressive haplotypes was demonstrated using an established three-step approach. This introgression was under positive natural selection, reached a frequency of >50%, and introduced a homocysteine level- and pigmentation-associated allele (rs460879-T) into East Asians. However, the same allele was also found in non-East Asians, but not from Neanderthal introgression. It is likely that rs460879-T was lost in East Asians and was reintroduced subsequently through Neanderthal introgression. Our findings suggest that Neanderthal introgression could reintroduce an important previously existing allele into populations where the allele had been lost. This study sheds new light on understanding the contribution of Neanderthal introgression to the adaptation of non-Africans.
Collapse
Affiliation(s)
- Ya Hu
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China
| | - Qiliang Ding
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China Department of Molecular Biology and Genetics, Cornell University
| | - Yungang He
- CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China
| | - Shuhua Xu
- CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, China
| |
Collapse
|
185
|
Testing for Ancient Selection Using Cross-population Allele Frequency Differentiation. Genetics 2015; 202:733-50. [PMID: 26596347 DOI: 10.1534/genetics.115.178095] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 11/18/2015] [Indexed: 12/18/2022] Open
Abstract
A powerful way to detect selection in a population is by modeling local allele frequency changes in a particular region of the genome under scenarios of selection and neutrality and finding which model is most compatible with the data. A previous method based on a cross-population composite likelihood ratio (XP-CLR) uses an outgroup population to detect departures from neutrality that could be compatible with hard or soft sweeps, at linked sites near a beneficial allele. However, this method is most sensitive to recent selection and may miss selective events that happened a long time ago. To overcome this, we developed an extension of XP-CLR that jointly models the behavior of a selected allele in a three-population tree. Our method - called "3-population composite likelihood ratio" (3P-CLR) - outperforms XP-CLR when testing for selection that occurred before two populations split from each other and can distinguish between those events and events that occurred specifically in each of the populations after the split. We applied our new test to population genomic data from the 1000 Genomes Project, to search for selective sweeps that occurred before the split of Yoruba and Eurasians, but after their split from Neanderthals, and that could have led to the spread of modern-human-specific phenotypes. We also searched for sweep events that occurred in East Asians, Europeans, and the ancestors of both populations, after their split from Yoruba. In both cases, we are able to confirm a number of regions identified by previous methods and find several new candidates for selection in recent and ancient times. For some of these, we also find suggestive functional mutations that may have driven the selective events.
Collapse
|
186
|
Nakagome S, Alkorta-Aranburu G, Amato R, Howie B, Peter BM, Hudson RR, Di Rienzo A. Estimating the Ages of Selection Signals from Different Epochs in Human History. Mol Biol Evol 2015; 33:657-69. [PMID: 26545921 DOI: 10.1093/molbev/msv256] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Genetic variation harbors signatures of natural selection driven by selective pressures that are often unknown. Estimating the ages of selection signals may allow reconstructing the history of environmental changes that shaped human phenotypes and diseases. We have developed an approximate Bayesian computation (ABC) approach to estimate allele ages under a model of selection on new mutations and under demographic models appropriate for human populations. We have applied it to two resequencing data sets: An ultra-high depth data set from a relatively small sample of unrelated individuals and a lower depth data set in a larger sample with transmission information. In addition to evaluating the accuracy of our method based on simulations, for each SNP, we assessed the consistency between the posterior probabilities estimated by the ABC approach and the ancient DNA record, finding good agreement between the two types of data and methods. Applying this ABC approach to data for eight single nucleotide polymorphisms (SNPs), we were able to rule out an onset of selection prior to the dispersal out-of-Africa for three of them and more recent than the spread of agriculture for an additional three SNPs.
Collapse
Affiliation(s)
| | | | - Roberto Amato
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, United Kingdom
| | - Bryan Howie
- Department of Human Genetics, University of Chicago
| | | | - Richard R Hudson
- Department of Human Genetics, University of Chicago Department of Ecology and Evolution, University of Chicago
| | | |
Collapse
|
187
|
Luo XJ, Mattheisen M, Li M, Huang L, Rietschel M, Børglum AD, Als TD, van den Oord EJ, Aberg KA, Mors O, Mortensen PB, Luo Z, Degenhardt F, Cichon S, Schulze TG, Nöthen MM, iPSYCH-GEMS SCZ working group, MooDS SCZ Consortium, Su B, Zhao Z, Gan L, Yao YG. Systematic Integration of Brain eQTL and GWAS Identifies ZNF323 as a Novel Schizophrenia Risk Gene and Suggests Recent Positive Selection Based on Compensatory Advantage on Pulmonary Function. Schizophr Bull 2015; 41:1294-308. [PMID: 25759474 PMCID: PMC4601704 DOI: 10.1093/schbul/sbv017] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Genome-wide association studies have identified multiple risk variants and loci that show robust association with schizophrenia. Nevertheless, it remains unclear how these variants confer risk to schizophrenia. In addition, the driving force that maintains the schizophrenia risk variants in human gene pool is poorly understood. To investigate whether expression-associated genetic variants contribute to schizophrenia susceptibility, we systematically integrated brain expression quantitative trait loci and genome-wide association data of schizophrenia using Sherlock, a Bayesian statistical framework. Our analyses identified ZNF323 as a schizophrenia risk gene (P = 2.22×10(-6)). Subsequent analyses confirmed the association of the ZNF323 and its expression-associated single nucleotide polymorphism rs1150711 in independent samples (gene-expression: P = 1.40×10(-6); single-marker meta-analysis in the combined discovery and replication sample comprising 44123 individuals: P = 6.85×10(-10)). We found that the ZNF323 was significantly downregulated in hippocampus and frontal cortex of schizophrenia patients (P = .0038 and P = .0233, respectively). Evidence for pleiotropic effects was detected (association of rs1150711 with lung function and gene expression of ZNF323 in lung: P = 6.62×10(-5) and P = 9.00×10(-5), respectively) with the risk allele (T allele) for schizophrenia acting as protective allele for lung function. Subsequent population genetics analyses suggest that the risk allele (T) of rs1150711 might have undergone recent positive selection in human population. Our findings suggest that the ZNF323 is a schizophrenia susceptibility gene whose expression may influence schizophrenia risk. Our study also illustrates a possible mechanism for maintaining schizophrenia risk variants in the human gene pool.
Collapse
Affiliation(s)
- Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Kunming, Yunnan, China; These authors contributed equally to this work.
| | - Manuel Mattheisen
- Department of Biomedicine and Centre for Integrative Sequencing (iSEQ), Aarhus University, 8000 Aarhus C, Denmark;,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark;,Department of Genomics, Life & Brain Center, and Institute of Human Genetics, University of Bonn, Bonn, Germany;,These authors contributed equally to this work
| | - Ming Li
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD
| | - Liang Huang
- First Affiliated Hospital of Gannan Medical University, Ganzhou, Jiangxi, China
| | - Marcella Rietschel
- Department of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health, Medical Faculty of Mannheim, University of Heidelberg, Mannheim, Germany
| | - Anders D. Børglum
- Department of Biomedicine and Centre for Integrative Sequencing (iSEQ), Aarhus University, 8000 Aarhus C, Denmark;,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark;,Research Department, Psychiatric Hospital, Aarhus University Hospital, Aarhus, Denmark
| | - Thomas D. Als
- Department of Biomedicine and Centre for Integrative Sequencing (iSEQ), Aarhus University, 8000 Aarhus C, Denmark;,The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark
| | - Edwin J. van den Oord
- Center for Biomarker Research and Personalized Medicine, Virginia Commonwealth University
| | - Karolina A. Aberg
- Center for Biomarker Research and Personalized Medicine, Virginia Commonwealth University
| | - Ole Mors
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark;,Centre for Psychiatric Research, Aarhus University Hospital, Risskov, Denmark
| | - Preben Bo Mortensen
- The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus and Copenhagen, Denmark;,National Centre for Register-based Research, Aarhus University, Aarhus, Denmark
| | - Zhenwu Luo
- Department of Microbiology and Immunology, Medical University of South Carolina, Charleston, SC
| | - Franziska Degenhardt
- Department of Genomics, Life & Brain Center, and Institute of Human Genetics, University of Bonn, Bonn, Germany
| | - Sven Cichon
- Division of Medical Genetics, Department of Biomedicine, University Basel, Basel, Switzerland;,Institute of Neuroscience and Medicine (INM-1), Research Center Juelich, Juelich, Germany
| | - Thomas G. Schulze
- Department of Psychiatry and Psychotherapy, University Medical Center Georg-August-Universität, 37075 Goettingen, Germany;,Institute of Psychiatric Phenomics and Genomics (IPPG), Ludwig-Maximilians-University Munich
| | - Markus M. Nöthen
- Department of Genomics, Life & Brain Center, and Institute of Human Genetics, University of Bonn, Bonn, Germany
| | | | | | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Zhongming Zhao
- Departments of Biomedical Informatics and Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Lin Gan
- Departments of Biomedical Informatics and Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
| | - Yong-Gang Yao
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Kunming, Yunnan, China;,CAS Center for Excellence in Brain Science, Chinese Academy of Sciences, Shanghai, 200031, China
| |
Collapse
|
188
|
Genetic and socioeconomic study of mate choice in Latinos reveals novel assortment patterns. Proc Natl Acad Sci U S A 2015; 112:13621-6. [PMID: 26483472 DOI: 10.1073/pnas.1501741112] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Nonrandom mating in human populations has important implications for genetics and medicine as well as for economics and sociology. In this study, we performed an integrative analysis of a large cohort of Mexican and Puerto Rican couples using detailed socioeconomic attributes and genotypes. We found that in ethnically homogeneous Latino communities, partners are significantly more similar in their genomic ancestries than expected by chance. Consistent with this, we also found that partners are more closely related--equivalent to between third and fourth cousins in Mexicans and Puerto Ricans--than matched random male-female pairs. Our analysis showed that this genomic ancestry similarity cannot be explained by the standard socioeconomic measurables alone. Strikingly, the assortment of genomic ancestry in couples was consistently stronger than even the assortment of education. We found enriched correlation of partners' genotypes at genes known to be involved in facial development. We replicated our results across multiple geographic locations. We discuss the implications of assortment and assortment-specific loci on disease dynamics and disease mapping methods in Latinos.
Collapse
|
189
|
Vatsiou AI, Bazin E, Gaggiotti OE. Detection of selective sweeps in structured populations: a comparison of recent methods. Mol Ecol 2015; 25:89-103. [DOI: 10.1111/mec.13360] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 07/27/2015] [Accepted: 08/25/2015] [Indexed: 12/01/2022]
Affiliation(s)
- Alexandra I. Vatsiou
- Laboratoire d'Ecologie Alpine UMR CNRS 5553 Université Joseph Fourier Grenoble France
- Scottish Oceans Institute East Sands University of St Andrews St Andrews KY16 8LB UK
| | - Eric Bazin
- Laboratoire d'Ecologie Alpine UMR CNRS 5553 Université Joseph Fourier Grenoble France
| | - Oscar E. Gaggiotti
- Laboratoire d'Ecologie Alpine UMR CNRS 5553 Université Joseph Fourier Grenoble France
- Scottish Oceans Institute East Sands University of St Andrews St Andrews KY16 8LB UK
| |
Collapse
|
190
|
Marques PI, Fonseca F, Sousa T, Santos P, Camilo V, Ferreira Z, Quesada V, Seixas S. Adaptive Evolution Favoring KLK4 Downregulation in East Asians. Mol Biol Evol 2015; 33:93-108. [PMID: 26420451 DOI: 10.1093/molbev/msv199] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The human kallikrein (KLK) cluster, located at chromosome 19q13.3-13.4, encodes 15 serine proteases, including neighboring genes (KLK3, KLK2, KLK4, and KLK5) with key roles in the cascades of semen liquefaction, tooth enamel maturation, and skin desquamation. KLK2 and KLK3 were previously identified as targets of adaptive evolution in primates through different mechanisms linked to reproductive biology and, in humans, genome-wide scans of positive selection captured, a yet unexplored, evidence for KLK neutrality departure in East Asians. We perform a detailed evaluation of KLK3-KLK5 variability in the 1000 Genomes samples from East Asia, Europe, and Africa, which was sustained by our own sequencing. In East Asians, we singled out a 70-kb region surrounding KLK4 that combined unusual low levels of diversity, high frequency variants with significant levels of population differentiation (FST > 0.5) and fairly homogenous haplotypes given the large local recombination rates. Among these variants, rs1654556_G, rs198968_T, and rs17800874_A stand out for their location on putative regulatory regions and predicted functional effects, namely the introduction of several microRNA binding sites and a repressor motif. Our functional assays carried out in different cellular models showed that rs198968_T and rs17800874_A operate synergistically to reduce KLK4 expression and could be further assisted by rs1654556_G. Considering the previous findings that KLK4 inactivation causes enamel malformations in humans and mice, and that this gene is coexpressed in epidermal layers along with several substrates involved in either cell adhesion or keratinocyte differentiation, we propose KLK4 as another target of selection in East Asians correlated to tooth and epidermal morphological traits.
Collapse
Affiliation(s)
- Patrícia Isabel Marques
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal Department of Biochemistry and Molecular Biology-IUOPA, University of Oviedo, Oviedo, Spain Institute of Biomedical Sciences Abel Salazar (ICBAS), University of Porto, Porto, Portugal
| | - Filipa Fonseca
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | - Tânia Sousa
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | - Paulo Santos
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | - Vânia Camilo
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | - Zélia Ferreira
- Department of Computational and Systems Biology, University of Pittsburgh
| | - Victor Quesada
- Department of Biochemistry and Molecular Biology-IUOPA, University of Oviedo, Oviedo, Spain
| | - Susana Seixas
- Instituto de Investigação e Inovação em Saúde, Universidade do Porto (I3S), Porto, Portugal Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| |
Collapse
|
191
|
Ronen R, Tesler G, Akbari A, Zakov S, Rosenberg NA, Bafna V. Predicting Carriers of Ongoing Selective Sweeps without Knowledge of the Favored Allele. PLoS Genet 2015; 11:e1005527. [PMID: 26402243 PMCID: PMC4581834 DOI: 10.1371/journal.pgen.1005527] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2015] [Accepted: 08/24/2015] [Indexed: 11/19/2022] Open
Abstract
Methods for detecting the genomic signatures of natural selection have been heavily studied, and they have been successful in identifying many selective sweeps. For most of these sweeps, the favored allele remains unknown, making it difficult to distinguish carriers of the sweep from non-carriers. In an ongoing selective sweep, carriers of the favored allele are likely to contain a future most recent common ancestor. Therefore, identifying them may prove useful in predicting the evolutionary trajectory—for example, in contexts involving drug-resistant pathogen strains or cancer subclones. The main contribution of this paper is the development and analysis of a new statistic, the Haplotype Allele Frequency (HAF) score. The HAF score, assigned to individual haplotypes in a sample, naturally captures many of the properties shared by haplotypes carrying a favored allele. We provide a theoretical framework for computing expected HAF scores under different evolutionary scenarios, and we validate the theoretical predictions with simulations. As an application of HAF score computations, we develop an algorithm (PreCIOSS: Predicting Carriers of Ongoing Selective Sweeps) to identify carriers of the favored allele in selective sweeps, and we demonstrate its power on simulations of both hard and soft sweeps, as well as on data from well-known sweeps in human populations. Methods for detecting the genomic signatures of natural selection have been heavily studied, and they have been successful in identifying genomic regions under positive selection. However, methods that detect positive selective sweeps do not typically identify the favored allele, or even the haplotypes carrying the favored allele. The main contribution of this paper is the development and analysis of a new statistic (the HAF score), assigned to individual haplotypes. Using both theoretical analyses and simulations, we describe how the HAF scores differ for carriers and non-carriers of the favored allele, and how they change dynamically during a selective sweep. We also develop an algorithm, PreCIOSS, for separating carriers and non-carriers. Our tool has broad applicability as carriers of the favored allele are likely to contain a future most recent common ancestor. Therefore, identifying them may prove useful in predicting the evolutionary trajectory—for example, in contexts involving drug-resistant pathogen strains or cancer subclones.
Collapse
Affiliation(s)
- Roy Ronen
- Bioinformatics Graduate Program, University of California, San Diego, La Jolla, California, United States of America
| | - Glenn Tesler
- Department of Mathematics, University of California, San Diego, La Jolla, California, United States of America
| | - Ali Akbari
- Department of Electrical & Computer Engineering, University of California, San Diego, La Jolla, California, United States of America
| | - Shay Zakov
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, California, United States of America
| | - Noah A. Rosenberg
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Vineet Bafna
- Department of Computer Science & Engineering, University of California, San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
192
|
Pontremoli C, Mozzi A, Forni D, Cagliani R, Pozzoli U, Menozzi G, Vertemara J, Bresolin N, Clerici M, Sironi M. Natural Selection at the Brush-Border: Adaptations to Carbohydrate Diets in Humans and Other Mammals. Genome Biol Evol 2015; 7:2569-84. [PMID: 26319403 PMCID: PMC4607523 DOI: 10.1093/gbe/evv166] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Dietary shifts can drive molecular evolution in mammals and a major transition in human history, the agricultural revolution, favored carbohydrate consumption. We investigated the evolutionary history of nine genes encoding brush-border proteins involved in carbohydrate digestion/absorption. Results indicated widespread adaptive evolution in mammals, with several branches experiencing episodic selection, particularly strong in bats. Many positively selected sites map to functional protein regions (e.g., within glucosidase catalytic crevices), with parallel evolution at SI (sucrase-isomaltase) and MGAM (maltase-glucoamylase). In human populations, five genes were targeted by positive selection acting on noncoding variants within regulatory elements. Analysis of ancient DNA samples indicated that most derived alleles were already present in the Paleolithic. Positively selected variants at SLC2A5 (fructose transporter) were an exception and possibly spread following the domestication of specific fruit crops. We conclude that agriculture determined no major selective event at carbohydrate metabolism genes in humans, with implications for susceptibility to metabolic disorders.
Collapse
Affiliation(s)
- Chiara Pontremoli
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Alessandra Mozzi
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Diego Forni
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Rachele Cagliani
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Uberto Pozzoli
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Giorgia Menozzi
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Jacopo Vertemara
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| | - Nereo Bresolin
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy Dino Ferrari Centre, Department of Physiopathology and Transplantation, University of Milan, Fondazione Ca' Granda IRCCS Ospedale Maggiore Policlinico, Italy
| | - Mario Clerici
- Department of Physiopathology and Transplantation, University of Milan, Italy Don C. Gnocchi Foundation ONLUS, IRCCS, Milan, Italy
| | - Manuela Sironi
- Bioinformatics, Scientific Institute IRCCS E.MEDEA, Bosisio Parini, Italy
| |
Collapse
|
193
|
Pybus M, Luisi P, Dall'Olio GM, Uzkudun M, Laayouni H, Bertranpetit J, Engelken J. Hierarchical boosting: a machine-learning framework to detect and classify hard selective sweeps in human populations. Bioinformatics 2015; 31:3946-52. [PMID: 26315912 DOI: 10.1093/bioinformatics/btv493] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 08/17/2015] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION Detecting positive selection in genomic regions is a recurrent topic in natural population genetic studies. However, there is little consistency among the regions detected in several genome-wide scans using different tests and/or populations. Furthermore, few methods address the challenge of classifying selective events according to specific features such as age, intensity or state (completeness). RESULTS We have developed a machine-learning classification framework that exploits the combined ability of some selection tests to uncover different polymorphism features expected under the hard sweep model, while controlling for population-specific demography. As a result, we achieve high sensitivity toward hard selective sweeps while adding insights about their completeness (whether a selected variant is fixed or not) and age of onset. Our method also determines the relevance of the individual methods implemented so far to detect positive selection under specific selective scenarios. We calibrated and applied the method to three reference human populations from The 1000 Genome Project to generate a genome-wide classification map of hard selective sweeps. This study improves detection of selective sweep by overcoming the classical selection versus no-selection classification strategy, and offers an explanation to the lack of consistency observed among selection tests when applied to real data. Very few signals were observed in the African population studied, while our method presents higher sensitivity in this population demography. AVAILABILITY AND IMPLEMENTATION The genome-wide results for three human populations from The 1000 Genomes Project and an R-package implementing the 'Hierarchical Boosting' framework are available at http://hsb.upf.edu/.
Collapse
Affiliation(s)
- Marc Pybus
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Pierre Luisi
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain, Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Giovanni Marco Dall'Olio
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain, Division of Cancer Studies, King's College of London, London SE1 1UL, UK and
| | - Manu Uzkudun
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Hafid Laayouni
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain, Departament de Genètica i de Microbiologia, Universitat Autonòma de Barcelona, Bellaterra 8193, Spain
| | - Jaume Bertranpetit
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Johannes Engelken
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona 08003, Spain
| |
Collapse
|
194
|
Abstract
Domestic animals represent an extremely useful model for linking genotypic and phenotypic variation. One approach involves identifying allele frequency differences between populations, using F(ST), to detect selective sweeps. While simple to calculate, FST may generate false positives due to aspects of population history. This prompted the development of hapFLK, a metric that measures haplotype differentiation while accounting for the genetic relationship between populations. The focus of this paper was to apply hapFLK in sheep with available SNP50 genotypes. The hapFLK approach identified a known selective sweep on chromosome 10 with high precision. Further, five regions were identified centered on genes with strong evidence for positive selection (COL1A2, NCAPG, LCORL, and RXFP2). Estimation of global F(ST) revealed many more genomic regions, providing empirical data in support of published simulation-based results concerning elevated type I error associated with F(ST) when it is being used to characterize sweep regions. The findings, while conducted using sheep SNP data, are likely to be applicable across those domestic animal species that have undergone artificial selection for desirable phenotypic traits.
Collapse
|
195
|
Chen H. Population genetic studies in the genomic sequencing era. DONG WU XUE YAN JIU = ZOOLOGICAL RESEARCH 2015; 36:223-32. [PMID: 26228473 DOI: 10.13918/j.issn.2095-8137.2015.4.223] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Abstract
Recent advances in high-throughput sequencing technologies have revolutionized the field of population genetics. Data now routinely contain genomic level polymorphism information, and the low cost of DNA sequencing enables researchers to investigate tens of thousands of subjects at a time. This provides an unprecedented opportunity to address fundamental evolutionary questions, while posing challenges on traditional population genetic theories and methods. This review provides an overview of the recent methodological developments in the field of population genetics, specifically methods used to infer ancient population history and investigate natural selection using large-sample, large-scale genetic data. Several open questions are also discussed at the end of the review.
Collapse
Affiliation(s)
- Hua Chen
- Center for Computational Genomics, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101,
| |
Collapse
|
196
|
Hunter-Zinck H, Clark AG. Aberrant Time to Most Recent Common Ancestor as a Signature of Natural Selection. Mol Biol Evol 2015; 32:2784-97. [PMID: 26093129 DOI: 10.1093/molbev/msv142] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Natural selection inference methods often target one mode of selection of a particular age and strength. However, detecting multiple modes simultaneously, or with atypical representations, would be advantageous for understanding a population's evolutionary history. We have developed an anomaly detection algorithm using distributions of pairwise time to most recent common ancestor (TMRCA) to simultaneously detect multiple modes of natural selection in whole-genome sequences. As natural selection distorts local genealogies in distinct ways, the method uses pairwise TMRCA distributions, which approximate genealogies at a nonrecombining locus, to detect distortions without targeting a specific mode of selection. We evaluate the performance of our method, TSel, for both positive and balancing selection over different time-scales and selection strengths and compare TSel's performance with that of other methods. We then apply TSel to the Complete Genomics diversity panel, a set of human whole-genome sequences, and recover loci previously inferred to be under positive or balancing selection.
Collapse
Affiliation(s)
- Haley Hunter-Zinck
- Department of Biological Statistics and Computational Biology, Cornell University
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University
| |
Collapse
|
197
|
Zhao F, McParland S, Kearney F, Du L, Berry DP. Detection of selection signatures in dairy and beef cattle using high-density genomic information. Genet Sel Evol 2015; 47:49. [PMID: 26089079 PMCID: PMC4472243 DOI: 10.1186/s12711-015-0127-3] [Citation(s) in RCA: 125] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2014] [Accepted: 05/19/2015] [Indexed: 01/05/2023] Open
Abstract
Background Artificial selection for economically important traits in cattle is expected to have left distinctive selection signatures on the genome. Access to high-density genotypes facilitates the accurate identification of genomic regions that have undergone positive selection. These findings help to better elucidate the mechanisms of selection and to identify candidate genes of interest to breeding programs. Results Information on 705 243 autosomal single nucleotide polymorphisms (SNPs) in 3122 dairy and beef male animals from seven cattle breeds (Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental) were used to detect selection signatures by applying two complementary methods, integrated haplotype score (iHS) and global fixation index (FST). To control for false positive results, we used false discovery rate (FDR) adjustment to calculate adjusted iHS within each breed and the genome-wide significance level was about 0.003. Using the iHS method, 83, 92, 91, 101, 85, 101 and 86 significant genomic regions were detected for Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental cattle, respectively. None of these regions was common to all seven breeds. Using the FST approach, 704 individual SNPs were detected across breeds. Annotation of the regions of the genome that showed selection signatures revealed several interesting candidate genes i.e. DGAT1, ABCG2, MSTN, CAPN3, FABP3, CHCHD7, PLAG1, JAZF1, PRKG2, ACTC1, TBC1D1, GHR, BMP2, TSG1, LYN, KIT and MC1R that play a role in milk production, reproduction, body size, muscle formation or coat color. Fifty-seven common candidate genes were found by both the iHS and global FST methods across the seven breeds. Moreover, many novel genomic regions and genes were detected within the regions that showed selection signatures; for some candidate genes, signatures of positive selection exist in the human genome. Multilevel bioinformatic analyses of the detected candidate genes suggested that the PPAR pathway may have been subjected to positive selection. Conclusions This study provides a high-resolution bovine genomic map of positive selection signatures that are either specific to one breed or common to a subset of the seven breeds analyzed. Our results will contribute to the detection of functional candidate genes that have undergone positive selection in future studies. Electronic supplementary material The online version of this article (doi:10.1186/s12711-015-0127-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fuping Zhao
- National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Sinead McParland
- Animal and Grassland Research and Innovation Centre, Teagasc, Moorpark, Co., Cork, Ireland.
| | - Francis Kearney
- Irish Cattle Breeding Federation, Highfield House, Bandon, Co., Cork, Ireland.
| | - Lixin Du
- National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Donagh P Berry
- Animal and Grassland Research and Innovation Centre, Teagasc, Moorpark, Co., Cork, Ireland.
| |
Collapse
|
198
|
Handelman SK, Seweryn M, Smith RM, Hartmann K, Wang D, Pietrzak M, Johnson AD, Kloczkowski A, Sadee W. Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs). BMC Genomics 2015; 16 Suppl 8:S8. [PMID: 26111110 PMCID: PMC4480832 DOI: 10.1186/1471-2164-16-s8-s8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outcome. Because evolutionary processes tend to act on gene regulation, we test whether regulatory variants are under positive selection. We introduce a new approach to enhance detection of genetic markers undergoing positive selection, using conditional entropy to capture recent local selection signals. RESULTS We use conditional logistic regression to compare our Adjusted Haplotype Conditional Entropy (H|H) measure of positive selection to existing positive selection measures. H|H and existing measures were applied to published regulatory variants acting in cis (cis-eQTLs), with conditional logistic regression testing whether regulatory variants undergo stronger positive selection than the surrounding gene. These cis-eQTLs were drawn from six independent studies of genotype and RNA expression. The conditional logistic regression shows that, overall, H|H is substantially more powerful than existing positive-selection methods in identifying cis-eQTLs against other Single Nucleotide Polymorphisms (SNPs) in the same genes. When broken down by Gene Ontology, H|H predictions are particularly strong in some biological process categories, where regulatory variants are under strong positive selection compared to the bulk of the gene, distinct from those GO categories under overall positive selection. . However, cis-eQTLs in a second group of genes lack positive selection signatures detectable by H|H, consistent with ancient short haplotypes compared to the surrounding gene (for example, in innate immunity GO:0042742); under such other modes of selection, H|H would not be expected to be a strong predictor.. These conditional logistic regression models are adjusted for Minor allele frequency(MAF); otherwise, ascertainment bias is a huge factor in all eQTL data sets. Relationships between Gene Ontology categories, positive selection and eQTL specificity were replicated with H|H in a single larger data set. Our measure, Adjusted Haplotype Conditional Entropy (H|H), was essential in generating all of the results above because it: 1) is a stronger overall predictor for eQTLs than comparable existing approaches, and 2) shows low sequential auto-correlation, overcoming problems with convergence of these conditional regression statistical models. CONCLUSIONS Our new method, H|H, provides a consistently more robust signal associated with cis-eQTLs compared to existing methods. We interpret this to indicate that some cis-eQTLs are under positive selection compared to their surrounding genes. Conditional entropy indicative of a selective sweep is an especially strong predictor of eQTLs for genes in several biological processes of medical interest. Where conditional entropy is a weak or negative predictor of eQTLs, such as innate immune genes, this would be consistent with balancing selection acting on such eQTLs over long time periods. Different measures of selection may be needed for variant prioritization under other modes of evolutionary selection.
Collapse
Affiliation(s)
- Samuel K Handelman
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
| | - Michal Seweryn
- Mathematical Biosciences Institute, Jennings Hall 3rd Floor, 1735 Neil Ave., Columbus, OH 43210, USA
- Faculty of Mathematics and Computer Science, Łódź University, Narutowicza 65, 90-131 Łódź, Poland
- Division of Biostatistics, The Ohio State University College of Public Health, Cunz Hall, 1841 Neil Avenue, Columbus, OH 43210-1240, USA
| | - Ryan M Smith
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
| | - Katherine Hartmann
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
| | - Danxin Wang
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
| | - Maciej Pietrzak
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
- Division of Biostatistics, The Ohio State University College of Public Health, Cunz Hall, 1841 Neil Avenue, Columbus, OH 43210-1240, USA
| | - Andrew D Johnson
- Division of Intramural Research, National Heart, Lung and Blood Institute, Cardiovascular Epidemiology and Human Genomics Branch, The Framingham Heart Study, 73 Mt. Wayte Ave., Suite #2, Framingham, MA, USA
| | - Andrzej Kloczkowski
- Battelle Center for Mathematical Medicine, Nationwide Children's Hospital, 700 Children's Drive, Columbus OH 43205, USA
- Department of Pediatrics, The Ohio State University College of Medicine, 700 Children's Drive, Columbus OH 43205, USA
- Kavli Institute for Theoretical Physics China, Chinese Academy of Sciences, Beijing 100190, China
| | - Wolfgang Sadee
- Center for Pharmacogenomics, The Ohio State University College of Medicine, Graves Hall, 330 W. 10th Ave., Columbus, OH 43210, USA
| |
Collapse
|
199
|
Fijarczyk A, Babik W. Detecting balancing selection in genomes: limits and prospects. Mol Ecol 2015; 24:3529-45. [DOI: 10.1111/mec.13226] [Citation(s) in RCA: 144] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Revised: 04/27/2015] [Accepted: 04/30/2015] [Indexed: 12/17/2022]
Affiliation(s)
- Anna Fijarczyk
- Institute of Environmental Sciences; Jagiellonian University; Gronostajowa 7 30-387 Kraków Poland
| | - Wiesław Babik
- Institute of Environmental Sciences; Jagiellonian University; Gronostajowa 7 30-387 Kraków Poland
| |
Collapse
|
200
|
Racimo F, Sankararaman S, Nielsen R, Huerta-Sánchez E. Evidence for archaic adaptive introgression in humans. Nat Rev Genet 2015; 16:359-71. [PMID: 25963373 PMCID: PMC4478293 DOI: 10.1038/nrg3936] [Citation(s) in RCA: 336] [Impact Index Per Article: 33.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
As modern and ancient DNA sequence data from diverse human populations accumulate, evidence is increasing in support of the existence of beneficial variants acquired from archaic humans that may have accelerated adaptation and improved survival in new environments - a process known as adaptive introgression. Within the past few years, a series of studies have identified genomic regions that show strong evidence for archaic adaptive introgression. Here, we provide an overview of the statistical methods developed to identify archaic introgressed fragments in the genome sequences of modern humans and to determine whether positive selection has acted on these fragments. We review recently reported examples of adaptive introgression, grouped by selection pressure, and consider the level of supporting evidence for each. Finally, we discuss challenges and recommendations for inferring selection on introgressed regions.
Collapse
Affiliation(s)
- Fernando Racimo
- Department of Integrative Biology, UC Berkeley, Berkeley CA 97420
| | | | - Rasmus Nielsen
- Department of Integrative Biology, UC Berkeley, Berkeley CA 97420
- Department of Statistics, UC Berkeley, Berkeley CA 97420
| | | |
Collapse
|