1
|
Wang C, Masala M, Fiorillo E, Devoto M, Cucca F, Ionita-Laza I. Quantile-specific confounding: correction for subtle population stratification via quantile regression. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.18.638253. [PMID: 40166133 PMCID: PMC11957037 DOI: 10.1101/2025.03.18.638253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Subtle population structure remains a significant concern in genome-wide association studies. Using human height as an example, we show how quantile regression, a natural extension of linear regression, can better correct for subtle population structure due to its inherent ability to adjust for quantile-specific effects of covariates such as principal components. We utilize data from the UK biobank and the SardiNIA/ProgeNIA project for demonstration.
Collapse
Affiliation(s)
- Chen Wang
- Department of Biostatistics, Columbia University, New York, USA
| | - Marco Masala
- Institute for Genetic and Biomedical Research, National Research Council, Italy
| | - Edoardo Fiorillo
- Institute for Genetic and Biomedical Research, National Research Council, Italy
| | - Marcella Devoto
- Institute for Genetic and Biomedical Research, National Research Council, Italy
| | - Francesco Cucca
- Institute for Genetic and Biomedical Research, National Research Council, Italy
- Department of Biomedical Sciences, University of Sassari, Sassari, Italy
| | - Iuliana Ionita-Laza
- Department of Biostatistics, Columbia University, New York, USA
- Department of Statistics, Lund University, Lund, Sweden
| |
Collapse
|
2
|
Anderson NW, Kirk L, Schraiber JG, Ragsdale AP. A path integral approach for allele frequency dynamics under polygenic selection. Genetics 2025; 229:1-63. [PMID: 39531638 PMCID: PMC12086674 DOI: 10.1093/genetics/iyae182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 10/11/2024] [Accepted: 10/16/2024] [Indexed: 11/16/2024] Open
Abstract
Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence (E&R) experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a role in a given allele frequency change (AFC). Predicting AFCs under drift and selection, even for alleles contributing to simple, monogenic traits, has remained a challenging problem. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here, we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. We derive analytic expressions for the transition probability (i.e. the probability that an allele will change in frequency from x to y in time t) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of AFC to test for selection, as well as explore optimal design choices for E&R experiments to uncover the genetic architecture of polygenic traits under selection.
Collapse
Affiliation(s)
- Nathan W Anderson
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Lloyd Kirk
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Joshua G Schraiber
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
3
|
Cho HW, Jin HS, Kim SS, Eom YB. Forensic height estimation using polygenic score in Korean population. Mol Genet Genomics 2024; 299:78. [PMID: 39120737 DOI: 10.1007/s00438-024-02172-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 07/30/2024] [Indexed: 08/10/2024]
Abstract
Height is known to be a classically heritable trait controlled by complex polygenic factors. Numerous height-associated genetic variants across the genome have been identified so far. It is also a representative of externally visible characteristics (EVC) for predicting appearance in forensic science. When biological evidence at a crime scene is deficient in identifying an individual, the examination of forensic DNA phenotyping using some genetic variants could be considered. In this study, we aimed to predict 'height', a representative forensic phenotype, by using a small number of genetic variants when short tandem repeat (STR) analysis is hard with insufficient biological samples. Our results not only replicated previous genetic signals but also indicated an upward trend in polygenic score (PGS) with increasing height in the validation and replication stages for both genders. These results demonstrate that the established SNP sets in this study could be used for height estimation in the Korean population. Specifically, since the PGS model constructed in this study targets only a small number of SNPs, it contributes to enabling forensic DNA phenotyping even at crime scenes with a minimal amount of biological evidence. To the best of our knowledge, this was the first study to evaluate a PGS model for height estimation in the Korean population using GWAS signals. Our study offers insight into the polygenic effect of height in East Asians, incorporating genetic variants from non-Asian populations.
Collapse
Affiliation(s)
- Hye-Won Cho
- Department of Medical Sciences, Graduate School, Soonchunhyang University, Asan, 31538, Chungnam, Republic of Korea
| | - Hyun-Seok Jin
- Department of Biomedical Laboratory Science, College of Life and Health Sciences, Hoseo University, Asan, 31499, Chungnam, Republic of Korea
| | - Sung-Soo Kim
- Department of Biomedical Laboratory Science, College of Life and Health Sciences, Hoseo University, Asan, 31499, Chungnam, Republic of Korea
| | - Yong-Bin Eom
- Department of Medical Sciences, Graduate School, Soonchunhyang University, Asan, 31538, Chungnam, Republic of Korea.
- Department of Biomedical Laboratory Science, College of Medical Sciences, Soonchunhyang University, 22 Soonchunhyang-ro, Sinchang-myeon, Asan-si, 31538, Chungcheongnam-do, Republic of Korea.
| |
Collapse
|
4
|
Blanc J, Berg JJ. Testing for differences in polygenic scores in the presence of confounding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.12.532301. [PMID: 36993707 PMCID: PMC10055004 DOI: 10.1101/2023.03.12.532301] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals' phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into the evolutionary forces acting on the trait in question, and is important for understanding health disparities. However, because most polygenic scores are computed using effect estimates from population samples, they are susceptible to confounding by both genetic and environmental effects that are correlated with ancestry. The extent to which this confounding drives patterns in the distribution of polygenic scores depends on patterns of population structure in both the original estimation panel and in the prediction/test panel. Here, we use theory from population and statistical genetics, together with simulations, to study the procedure of testing for an association between polygenic scores and axes of ancestry variation in the presence of confounding. We use a general model of genetic relatedness to describe how confounding in the estimation panel biases the distribution of polygenic scores in a way that depends on the degree of overlap in population structure between panels. We then show how this confounding can bias tests for associations between polygenic scores and important axes of ancestry variation in the test panel. Specifically, for any given test, there exists a single axis of population structure in the GWAS panel that needs to be controlled for in order to protect the test. Based on this result, we propose a new approach for directly estimating this axis of population structure in the GWAS panel. We then use simulations to compare the performance of this approach to the standard approach in which the principal components of the GWAS panel genotypes are used to control for stratification.
Collapse
Affiliation(s)
- Jennifer Blanc
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Jeremy J. Berg
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
5
|
Anderson NW, Kirk L, Schraiber JG, Ragsdale AP. A Path Integral Approach for Allele Frequency Dynamics Under Polygenic Selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.14.599114. [PMID: 38915613 PMCID: PMC11195211 DOI: 10.1101/2024.06.14.599114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Many phenotypic traits have a polygenic genetic basis, making it challenging to learn their genetic architectures and predict individual phenotypes. One promising avenue to resolve the genetic basis of complex traits is through evolve-and-resequence experiments, in which laboratory populations are exposed to some selective pressure and trait-contributing loci are identified by extreme frequency changes over the course of the experiment. However, small laboratory populations will experience substantial random genetic drift, and it is difficult to determine whether selection played a roll in a given allele frequency change. Predicting how much allele frequencies change under drift and selection had remained an open problem well into the 21st century, even those contributing to simple, monogenic traits. Recently, there have been efforts to apply the path integral, a method borrowed from physics, to solve this problem. So far, this approach has been limited to genic selection, and is therefore inadequate to capture the complexity of quantitative, highly polygenic traits that are commonly studied. Here we extend one of these path integral methods, the perturbation approximation, to selection scenarios that are of interest to quantitative genetics. In particular, we derive analytic expressions for the transition probability (i.e., the probability that an allele will change in frequency from x , to y in time t ) of an allele contributing to a trait subject to stabilizing selection, as well as that of an allele contributing to a trait rapidly adapting to a new phenotypic optimum. We use these expressions to characterize the use of allele frequency change to test for selection, as well as explore optimal design choices for evolve-and-resequence experiments to uncover the genetic architecture of polygenic traits under selection.
Collapse
Affiliation(s)
- Nathan W. Anderson
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Lloyd Kirk
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Joshua G. Schraiber
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA
| | - Aaron P. Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
6
|
She H, Hao Y, Song G, Luo X, Lei F, Zhai W, Qu Y. Gene expression plasticity followed by genetic change during colonization in a high-elevation environment. eLife 2024; 12:RP86687. [PMID: 38470231 DOI: 10.7554/elife.86687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024] Open
Abstract
Phenotypic plasticity facilitates organismal invasion of novel environments, and the resultant phenotypic change may later be modified by genetic change, so called 'plasticity first.' Herein, we quantify gene expression plasticity and regulatory adaptation in a wild bird (Eurasian Tree Sparrow) from its original lowland (ancestral stage), experimentally implemented hypoxia acclimation (plastic stage), and colonized highland (colonized stage). Using a group of co-expressed genes from the cardiac and flight muscles, respectively, we demonstrate that gene expression plasticity to hypoxia tolerance is more often reversed than reinforced at the colonized stage. By correlating gene expression change with muscle phenotypes, we show that colonized tree sparrows reduce maladaptive plasticity that largely associated with decreased hypoxia tolerance. Conversely, adaptive plasticity that is congruent with increased hypoxia tolerance is often reinforced in the colonized tree sparrows. Genes displaying large levels of reinforcement or reversion plasticity (i.e. 200% of original level) show greater genetic divergence between ancestral and colonized populations. Overall, our work demonstrates that gene expression plasticity at the initial stage of high-elevation colonization can be reversed or reinforced through selection-driven adaptive modification.
Collapse
Affiliation(s)
- Huishang She
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Yan Hao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Gang Song
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Xu Luo
- Faculty of Biodiversity and Conservation, Southwest Forestry University, Kunming, China
| | - Fumin Lei
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Weiwei Zhai
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Yanhua Qu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
7
|
Gao Z. Unveiling recent and ongoing adaptive selection in human populations. PLoS Biol 2024; 22:e3002469. [PMID: 38236800 PMCID: PMC10796035 DOI: 10.1371/journal.pbio.3002469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2024] Open
Abstract
Genome-wide scans for signals of selection have become a routine part of the analysis of population genomic variation datasets and have resulted in compelling evidence of selection during recent human evolution. This Essay spotlights methodological innovations that have enabled the detection of selection over very recent timescales, even in contemporary human populations. By harnessing large-scale genomic and phenotypic datasets, these new methods use different strategies to uncover connections between genotype, phenotype, and fitness. This Essay outlines the rationale and key findings of each strategy, discusses challenges in interpretation, and describes opportunities to improve detection and understanding of ongoing selection in human populations.
Collapse
Affiliation(s)
- Ziyue Gao
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
8
|
Sear R, Townsend C. 'Dysgenic fertility' is an ideological, not a scientific, concept. A Comment on: 'Stability and change in male fertility patterns by cognitive ability across 32 birth cohorts' (2023), by Bratsberg & Rogeberg. Biol Lett 2023; 19:20230390. [PMID: 37909106 PMCID: PMC10618866 DOI: 10.1098/rsbl.2023.0390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 10/10/2023] [Indexed: 11/02/2023] Open
Abstract
Recently Bratsberg & Rogeberg (2023) presented an analysis in Biology Letters of how cognitive ability is associated with fertility in Norwegian men. Our concern relates to the theoretical framework of this paper. The analysis is framed around the concept of 'dysgenic fertility', which is treated throughout as a scientific theory, but 'dysgenic fertility' is not science, it is an ideological concept.
Collapse
Affiliation(s)
- Rebecca Sear
- Population Health, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
| | | |
Collapse
|
9
|
Ben Jemaa S, Tolone M, Sardina MT, Di Gerlando R, Chessari G, Criscione A, Persichilli C, Portolano B, Mastrangelo S. A genome-wide comparison between selected and unselected Valle del Belice sheep reveals differences in population structure and footprints of recent selection. J Anim Breed Genet 2023; 140:558-567. [PMID: 37226373 DOI: 10.1111/jbg.12779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 05/05/2023] [Accepted: 05/11/2023] [Indexed: 05/26/2023]
Abstract
About three decades of breeding and selection in the Valle del Belìce sheep are expected to have left several genomic footprints related to milk production traits. In this study, we have assembled a dataset with 451 individuals of the Valle del Belìce sheep breed: 184 animals that underwent directional selection for milk production and 267 unselected animals, genotyped for 40,660 single-nucleotide polymorphisms (SNPs). Three different statistical approaches, both within (iHS and ROH) and between (Rsb) groups, were used to identify genomic regions potentially under selection. Population structure analyses separated all individuals according to their belonging to the two groups. A total of four genomic regions on two chromosomes were jointly identified by at least two statistical approaches. Several candidate genes for milk production were identified, corroborating the polygenic nature of this trait and which may provide clues to potential new selection targets. We also found candidate genes for growth and reproductive traits. Overall, the identified genes may explain the effect of selection to improve the performances related to milk production traits in the breed. Further studies using high-density array data, would be particularly relevant to refine and validate these results.
Collapse
Affiliation(s)
- Slim Ben Jemaa
- Laboratoire des Productions Animales et Fourragères, Institut National de la Recherche Agronomique de Tunisie, Université de Carthage, Ariana, Tunisia
| | - Marco Tolone
- Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, Palermo, Italy
| | - Maria Teresa Sardina
- Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, Palermo, Italy
| | - Rosalia Di Gerlando
- Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, Palermo, Italy
| | - Giorgio Chessari
- Dipartimento Agricoltura, Alimentazione e Ambiente, University of Catania, Catania, Italy
| | - Andrea Criscione
- Dipartimento Agricoltura, Alimentazione e Ambiente, University of Catania, Catania, Italy
| | - Christian Persichilli
- Dipartimento di Agraria, Ambientale e Scienze dell'alimentazione, University of Molise, Campobasso, Italy
| | - Baldassare Portolano
- Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, Palermo, Italy
| | - Salvatore Mastrangelo
- Dipartimento Scienze Agrarie, Alimentari e Forestali, University of Palermo, Palermo, Italy
| |
Collapse
|
10
|
Majara L, Kalungi A, Koen N, Tsuo K, Wang Y, Gupta R, Nkambule LL, Zar H, Stein DJ, Kinyanda E, Atkinson EG, Martin AR. Low and differential polygenic score generalizability among African populations due largely to genetic diversity. HGG ADVANCES 2023; 4:100184. [PMID: 36873096 PMCID: PMC9982687 DOI: 10.1016/j.xhgg.2023.100184] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 02/04/2023] [Indexed: 02/15/2023] Open
Abstract
African populations are vastly underrepresented in genetic studies but have the most genetic variation and face wide-ranging environmental exposures globally. Because systematic evaluations of genetic prediction had not yet been conducted in ancestries that span African diversity, we calculated polygenic risk scores (PRSs) in simulations across Africa and in empirical data from South Africa, Uganda, and the United Kingdom to better understand the generalizability of genetic studies. PRS accuracy improves with ancestry-matched discovery cohorts more than from ancestry-mismatched studies. Within ancestrally and ethnically diverse South African individuals, we find that PRS accuracy is low for all traits but varies across groups. Differences in African ancestries contribute more to variability in PRS accuracy than other large cohort differences considered between individuals in the United Kingdom versus Uganda. We computed PRS in African ancestry populations using existing European-only versus ancestrally diverse genetic studies; the increased diversity produced the largest accuracy gains for hemoglobin concentration and white blood cell count, reflecting large-effect ancestry-enriched variants in genes known to influence sickle cell anemia and the allergic response, respectively. Differences in PRS accuracy across African ancestries originating from diverse regions are as large as across out-of-Africa continental ancestries, requiring commensurate nuance.
Collapse
Affiliation(s)
- Lerato Majara
- Global Initiative for Neuropsychiatric Genetics Education in Research (GINGER), Harvard T.H. Chan School of Public Health, Department of Epidemiology, Boston, MA, USA
- MRC Human Genetics Research Unit, Division of Human Genetics, Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Observatory 7925, South Africa
- Department of Psychiatry and Neuroscience Institute, University of Cape Town, Cape Town, South Africa
| | - Allan Kalungi
- Global Initiative for Neuropsychiatric Genetics Education in Research (GINGER), Harvard T.H. Chan School of Public Health, Department of Epidemiology, Boston, MA, USA
- Department of Psychiatry, College of Health Sciences, Makerere University, Kampala, Uganda
- Department of Psychiatry, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- Mental Health Project, Medical Research Council/Uganda Virus Research Institute (MRC/UVRI) & London School of Hygiene and Tropical Medicine (LSHTM), Uganda Research Unit, Entebbe, Uganda
| | - Nastassja Koen
- Global Initiative for Neuropsychiatric Genetics Education in Research (GINGER), Harvard T.H. Chan School of Public Health, Department of Epidemiology, Boston, MA, USA
- Department of Psychiatry and Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Cape Town, South Africa
| | - Kristin Tsuo
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Ying Wang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Rahul Gupta
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA 02115, USA
| | - Lethukuthula L. Nkambule
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Heather Zar
- Department of Paediatrics and Child Health, Red Cross Children’s Hospital and Medical Research Council Unit on Child and Adolescent Health, University of Cape Town, Cape Town, South Africa
| | - Dan J. Stein
- Department of Psychiatry and Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- South African Medical Research Council (SAMRC) Unit on Risk and Resilience in Mental Disorders, Cape Town, South Africa
| | - Eugene Kinyanda
- Mental Health Project, Medical Research Council/Uganda Virus Research Institute (MRC/UVRI) & London School of Hygiene and Tropical Medicine (LSHTM), Uganda Research Unit, Entebbe, Uganda
| | - Elizabeth G. Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Alicia R. Martin
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|
11
|
Shipilina D, Pal A, Stankowski S, Chan YF, Barton NH. On the origin and structure of haplotype blocks. Mol Ecol 2023; 32:1441-1457. [PMID: 36433653 PMCID: PMC10946714 DOI: 10.1111/mec.16793] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 11/16/2022] [Accepted: 11/18/2022] [Indexed: 11/27/2022]
Abstract
The term "haplotype block" is commonly used in the developing field of haplotype-based inference methods. We argue that the term should be defined based on the structure of the Ancestral Recombination Graph (ARG), which contains complete information on the ancestry of a sample. We use simulated examples to demonstrate key features of the relationship between haplotype blocks and ancestral structure, emphasizing the stochasticity of the processes that generate them. Even the simplest cases of neutrality or of a "hard" selective sweep produce a rich structure, often missed by commonly used statistics. We highlight a number of novel methods for inferring haplotype structure, based on the full ARG, or on a sequence of trees, and illustrate how they can be used to define haplotype blocks using an empirical data set. While the advent of new, computationally efficient methods makes it possible to apply these concepts broadly, they (and additional new methods) could benefit from adding features to explore haplotype blocks, as we define them. Understanding and applying the concept of the haplotype block will be essential to fully exploit long and linked-read sequencing technologies.
Collapse
Affiliation(s)
- Daria Shipilina
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG)Uppsala UniversityUppsalaSweden
- Institute of Science and Technology AustriaKlosterneuburgAustria
- Swedish Collegium for Advanced StudyUppsalaSweden
| | - Arka Pal
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | - Sean Stankowski
- Institute of Science and Technology AustriaKlosterneuburgAustria
| | | | | |
Collapse
|
12
|
Abdellaoui A, Yengo L, Verweij KJH, Visscher PM. 15 years of GWAS discovery: Realizing the promise. Am J Hum Genet 2023; 110:179-194. [PMID: 36634672 PMCID: PMC9943775 DOI: 10.1016/j.ajhg.2022.12.011] [Citation(s) in RCA: 218] [Impact Index Per Article: 109.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
It has been 15 years since the advent of the genome-wide association study (GWAS) era. Here, we review how this experimental design has realized its promise by facilitating an impressive range of discoveries with remarkable impact on multiple fields, including population genetics, complex trait genetics, epidemiology, social science, and medicine. We predict that the emergence of large-scale biobanks will continue to expand to more diverse populations and capture more of the allele frequency spectrum through whole-genome sequencing, which will further improve our ability to investigate the causes and consequences of human genetic variation for complex traits and diseases.
Collapse
Affiliation(s)
- Abdel Abdellaoui
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands.
| | - Loic Yengo
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Karin J H Verweij
- Department of Psychiatry, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Peter M Visscher
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
13
|
Novembre J, Stein C, Asgari S, Gonzaga-Jauregui C, Landstrom A, Lemke A, Li J, Mighton C, Taylor M, Tishkoff S. Addressing the challenges of polygenic scores in human genetic research. Am J Hum Genet 2022; 109:2095-2100. [PMID: 36459976 PMCID: PMC9808501 DOI: 10.1016/j.ajhg.2022.10.012] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
The genotyping of millions of human samples has made it possible to evaluate variants across the human genome for their possible association with risks for numerous diseases and other traits by using genome-wide association studies (GWASs). The associations between phenotype and genotype found in GWASs make possible the construction of polygenic scores (PGSs), which aim to predict a trait or disease outcome in an individual on the basis of their genotype (in the disease case, the term polygenic risk score [PRS] is often used). PGSs have shown promise for studying the biology of complex traits and as a tool for evaluating individual disease risks in clinical settings. Although the quantity and quality of data to compute PGSs are increasing, challenges remain in the technical aspects of developing PGSs and in the ethical and social issues that might arise from their use. This ASHG Guidance emphasizes three major themes for researchers working with or interested in the application of PGSs in their own research: (1) developing diverse research cohorts; (2) fostering robustness in the development, application, and interpretation of PGSs; and (3) improving the communication of PGS results and their implications to broad audiences.
Collapse
Affiliation(s)
- John Novembre
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Chicago, Chicago, IL, USA,Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA,Corresponding author
| | - Catherine Stein
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA,Corresponding author
| | - Samira Asgari
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Claudia Gonzaga-Jauregui
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,International Laboratory for Human Genome Research, Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México
| | - Andrew Landstrom
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Pediatrics, Division of Cardiology, Duke University School of Medicine, Durham, NC, USA
| | - Amy Lemke
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Norton Children’s Research Institute, affiliated with the University of Louisville School of Medicine, Louisville, KY, USA
| | - Jun Li
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Chloe Mighton
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Genomics Health Services Research Program, St. Michael’s Hospital, Unity Health Toronto, Toronto, ON, Canada,Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Matthew Taylor
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Adult Medical Genetics Program, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Sarah Tishkoff
- Professional Practice and Social Implications Committee Polygenic Scores Guidance Writing Group, American Society of Human Genetics, Rockville MD, USA,Department of Genetics, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA,Department of Biology, Center for Global Genomics and Health Equity, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
14
|
Abraham A, LaBella AL, Capra JA, Rokas A. Mosaic patterns of selection in genomic regions associated with diverse human traits. PLoS Genet 2022; 18:e1010494. [PMID: 36342969 PMCID: PMC9671423 DOI: 10.1371/journal.pgen.1010494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/17/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open
Abstract
Natural selection shapes the genetic architecture of many human traits. However, the prevalence of different modes of selection on genomic regions associated with variation in traits remains poorly understood. To address this, we developed an efficient computational framework to calculate positive and negative enrichment of different evolutionary measures among regions associated with complex traits. We applied the framework to summary statistics from >900 genome-wide association studies (GWASs) and 11 evolutionary measures of sequence constraint, population differentiation, and allele age while accounting for linkage disequilibrium, allele frequency, and other potential confounders. We demonstrate that this framework yields consistent results across GWASs with variable sample sizes, numbers of trait-associated SNPs, and analytical approaches. The resulting evolutionary atlas maps diverse signatures of selection on genomic regions associated with complex human traits on an unprecedented scale. We detected positive enrichment for sequence conservation among trait-associated regions for the majority of traits (>77% of 290 high power GWASs), which included reproductive traits. Many traits also exhibited substantial positive enrichment for population differentiation, especially among hair, skin, and pigmentation traits. In contrast, we detected widespread negative enrichment for signatures of balancing selection (51% of GWASs) and absence of enrichment for evolutionary signals in regions associated with late-onset Alzheimer's disease. These results support a pervasive role for negative selection on regions of the human genome that contribute to variation in complex traits, but also demonstrate that diverse modes of evolution are likely to have shaped trait-associated loci. This atlas of evolutionary signatures across the diversity of available GWASs will enable exploration of the relationship between the genetic architecture and evolutionary processes in the human genome.
Collapse
Affiliation(s)
- Abin Abraham
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Abigail L. LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, United States of America
- North Carolina Research Center, Kannapolis, North Carolina, United States of America
| | - John A. Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, United States of America
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
15
|
Kullo IJ, Lewis CM, Inouye M, Martin AR, Ripatti S, Chatterjee N. Polygenic scores in biomedical research. Nat Rev Genet 2022; 23:524-532. [PMID: 35354965 PMCID: PMC9391275 DOI: 10.1038/s41576-022-00470-z] [Citation(s) in RCA: 90] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/03/2022] [Indexed: 12/12/2022]
Abstract
Public health strategies aimed at disease prevention or early detection and intervention have the potential to advance human health worldwide. However, their success depends on the identification of risk factors that underlie disease burden in the general population. Genome-wide association studies (GWAS) have implicated thousands of single-nucleotide polymorphisms (SNPs) in common complex diseases or traits. By calculating a weighted sum of the number of trait-associated alleles harboured by an individual, a polygenic score (PGS), also called a polygenic risk score (PRS), can be constructed that reflects an individual’s estimated genetic predisposition for a given phenotype. Here, we ask six experts to give their opinions on the utility of these probabilistic tools, their strengths and limitations, and the remaining barriers that need to be overcome for their equitable use.
Collapse
Affiliation(s)
- Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA.
| | - Cathryn M Lewis
- Social, Genetic and Developmental Psychiatry Centre & Department of Medical & Molecular, King's College London, London, UK.
| | - Michael Inouye
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
- Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.
| | - Alicia R Martin
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| | - Samuli Ripatti
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland.
- Department of Public Health, University of Helsinki, Helsinki, Finland.
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
16
|
Caro-Consuegra R, Nieves-Colón MA, Rawls E, Rubin-de-Celis V, Lizárraga B, Vidaurre T, Sandoval K, Fejerman L, Stone AC, Moreno-Estrada A, Bosch E. Uncovering signals of positive selection in Peruvian populations from three ecological regions. Mol Biol Evol 2022; 39:6647595. [PMID: 35860855 PMCID: PMC9356722 DOI: 10.1093/molbev/msac158] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Perú hosts extremely diverse ecosystems which can be broadly classified into three major ecoregions: the Pacific desert coast, the Andean highlands, and the Amazon rainforest. Since its initial peopling approximately 12,000 years ago, the populations inhabiting such ecoregions might have differentially adapted to their contrasting environmental pressures. Previous studies have described several candidate genes underlying adaptation to hypobaric hypoxia among Andean highlanders. However, the adaptive genetic diversity of coastal and rainforest populations has been less studied. Here, we gathered genome-wide SNP-array data from 286 Peruvians living across the three ecoregions and analysed signals of recent positive selection through population differentiation and haplotype-based selection scans. Among highland populations, we identify candidate genes related to cardiovascular function (TLL1, DUSP27, TBX5, PLXNA4, SGCD), to the Hypoxia-Inducible Factor pathway (TGFA, APIP), to skin pigmentation (MITF), as well as to glucose (GLIS3) and glycogen metabolism (PPP1R3C, GANC). In contrast, most signatures of adaptation in coastal and rainforest populations comprise candidate genes related to the immune system (including SIGLEC8, TRIM21, CD44 and ICAM1 in the coast; CBLB and PRDM1 in rainforest and the BRD2- HLA-DOA- HLA-DPA1 region in both), possibly as a result of strong pathogen-driven selection. This study identifies candidate genes related to human adaptation to the diverse environments of South America.
Collapse
Affiliation(s)
- Rocio Caro-Consuegra
- Institute of Evolutionary Biology (UPF-CSIC), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
| | - Maria A Nieves-Colón
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada (UGA-LANGEBIO), CINVESTAV, Irapuato, Guanajuato, Mexico.,School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, USA.,Department of Anthropology, University of Minnesota Twin Cities, Minneapolis, MN, USA
| | - Erin Rawls
- School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, USA
| | - Verónica Rubin-de-Celis
- Laboratorio de Genómica Molecular Evolutiva, Instituto de Ciencia y Tecnología, Universidad Ricardo Palma, Lima, Perú
| | - Beatriz Lizárraga
- Emeritus Professor, Facultad de Ciencias Biológicas, Universidad Nacional Mayor de San Marcos, Lima, Perú
| | | | - Karla Sandoval
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada (UGA-LANGEBIO), CINVESTAV, Irapuato, Guanajuato, Mexico
| | - Laura Fejerman
- Department of Public Health Sciences, University of California Davis, Davis, CA, USA
| | - Anne C Stone
- School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, USA.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, USA
| | - Andrés Moreno-Estrada
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada (UGA-LANGEBIO), CINVESTAV, Irapuato, Guanajuato, Mexico
| | - Elena Bosch
- Institute of Evolutionary Biology (UPF-CSIC), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain.,Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Reus, Spain
| |
Collapse
|
17
|
Novembre J. The background and legacy of Lewontin's apportionment of human genetic diversity. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200406. [PMID: 35430890 PMCID: PMC9014184 DOI: 10.1098/rstb.2020.0406] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 03/18/2022] [Indexed: 12/18/2022] Open
Abstract
Lewontin's 1972 article 'The apportionment of human diversity' described a key feature of human genetic diversity that would have profound impacts on conversations regarding genetics and race: the typical genetic locus varies much less between classical human race groupings than one might infer from inspecting the features historically used to define those races, like skin pigmentation. From this, Lewontin concluded: 'Human racial classification … is now seen to be of virtually no genetic or taxonomic significance' (p. 397). Here, 50 years after the paper's publication, the goal is to understand the origins and legacy of the paper. Aided by insights from published papers and interviews with several of Lewontin's contemporaries, I review the 1972 paper, asking about the intellectual background that led to the publication of the paper, the development of its impact, the critiques of the work and the work's application and limitations today. The hope is that by gaining a clearer understanding of the origin and reasoning of the paper, we might dispel various confusions about the result and sharpen an understanding of the enduring value and insight the result provides. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.
Collapse
Affiliation(s)
- John Novembre
- Department of Human Genetics, University of Chicago, Chicago, 60637, IL
- Department of Ecology and Evolution, University of Chicago, Chicago, 60637, IL
| |
Collapse
|
18
|
Yair S, Coop G. Population differentiation of polygenic score predictions under stabilizing selection. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200416. [PMID: 35430887 PMCID: PMC9014188 DOI: 10.1098/rstb.2020.0416] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 03/08/2022] [Indexed: 12/15/2022] Open
Abstract
Given the many small-effect loci uncovered by genome-wide association studies (GWAS), polygenic scores have become central to genomic medicine, and have found application in diverse settings including evolutionary studies of adaptation. Despite their promise, polygenic scores have been found to suffer from limited portability across human populations. This at first seems in conflict with the observation that most common genetic variation is shared among populations. We investigate one potential cause of this discrepancy: stabilizing selection on complex traits. Counterintuitively, while stabilizing selection constrains phenotypic evolution, it accelerates the loss and fixation of alleles underlying trait variation within populations (GWAS loci). Thus even when populations share an optimum phenotype, stabilizing selection erodes the variance contributed by their shared GWAS loci, such that predictions from GWAS in one population explain less of the phenotypic variation in another. We develop theory to quantify how stabilizing selection is expected to reduce the prediction accuracy of polygenic scores in populations not represented in GWAS samples. In addition, we find that polygenic scores can substantially overstate average genetic differences of phenotypes among populations. We emphasize stabilizing selection around a common optimum as a useful null model to connect patterns of allele frequency and polygenic score differentiation. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.
Collapse
Affiliation(s)
- Sivan Yair
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Graham Coop
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| |
Collapse
|
19
|
Gopalan S, Smith SP, Korunes K, Hamid I, Ramachandran S, Goldberg A. Human genetic admixture through the lens of population genomics. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200410. [PMID: 35430881 PMCID: PMC9014191 DOI: 10.1098/rstb.2020.0410] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 03/24/2022] [Indexed: 12/13/2022] Open
Abstract
Over the past 50 years, geneticists have made great strides in understanding how our species' evolutionary history gave rise to current patterns of human genetic diversity classically summarized by Lewontin in his 1972 paper, 'The Apportionment of Human Diversity'. One evolutionary process that requires special attention in both population genetics and statistical genetics is admixture: gene flow between two or more previously separated source populations to form a new admixed population. The admixture process introduces ancestry-based structure into patterns of genetic variation within and between populations, which in turn influences the inference of demographic histories, identification of genetic targets of selection and prediction of complex traits. In this review, we outline some challenges for admixture population genetics, including limitations of applying methods designed for populations without recent admixture to the study of admixed populations. We highlight recent studies and methodological advances that aim to overcome such challenges, leveraging genomic signatures of admixture that occurred in the past tens of generations to gain insights into human history, natural selection and complex trait architecture. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.
Collapse
Affiliation(s)
- Shyamalika Gopalan
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
| | - Samuel Pattillo Smith
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
- Department of Ecology, Evolution and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Katharine Korunes
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
| | - Iman Hamid
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
- Department of Ecology, Evolution and Organismal Biology, Brown University, Providence, RI 02912, USA
- Data Science Initiative, Brown University, Providence, RI 02912, USA
| | - Amy Goldberg
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
| |
Collapse
|
20
|
Burt CH. Challenging the utility of polygenic scores for social science: Environmental confounding, downward causation, and unknown biology. Behav Brain Sci 2022; 46:e207. [PMID: 35551690 PMCID: PMC9653522 DOI: 10.1017/s0140525x22001145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The sociogenomics revolution is upon us, we are told. Whether revolutionary or not, sociogenomics is poised to flourish given the ease of incorporating polygenic scores (or PGSs) as "genetic propensities" for complex traits into social science research. Pointing to evidence of ubiquitous heritability and the accessibility of genetic data, scholars have argued that social scientists not only have an opportunity but a duty to add PGSs to social science research. Social science research that ignores genetics is, some proponents argue, at best partial and likely scientifically flawed, misleading, and wasteful. Here, I challenge arguments about the value of genetics for social science and with it the claimed necessity of incorporating PGSs into social science models as measures of genetic influences. In so doing, I discuss the impracticability of distinguishing genetic influences from environmental influences because of non-causal gene-environment correlations, especially population stratification, familial confounding, and downward causation. I explain how environmental effects masquerade as genetic influences in PGSs, which undermines their raison d'être as measures of genetic propensity, especially for complex socially contingent behaviors that are the subject of sociogenomics. Additionally, I draw attention to the partial, unknown biology, while highlighting the persistence of an implicit, unavoidable reductionist genes versus environments approach. Leaving sociopolitical and ethical concerns aside, I argue that the potential scientific rewards of adding PGSs to social science are few and greatly overstated and the scientific costs, which include obscuring structural disadvantages and cultural influences, outweigh these meager benefits for most social science applications.
Collapse
Affiliation(s)
- Callie H Burt
- Department of Criminal Justice & Criminology, Center for Research on Interpersonal Violence (CRIV), Georgia State University, Atlanta, GA, USA ; www.callieburt.org
| |
Collapse
|
21
|
Smith SP, Shahamatdar S, Cheng W, Zhang S, Paik J, Graff M, Haiman C, Matise TC, North KE, Peters U, Kenny E, Gignoux C, Wojcik G, Crawford L, Ramachandran S. Enrichment analyses identify shared associations for 25 quantitative traits in over 600,000 individuals from seven diverse ancestries. Am J Hum Genet 2022; 109:871-884. [PMID: 35349783 PMCID: PMC9118115 DOI: 10.1016/j.ajhg.2022.03.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 03/02/2022] [Indexed: 12/12/2022] Open
Abstract
Since 2005, genome-wide association (GWA) datasets have been largely biased toward sampling European ancestry individuals, and recent studies have shown that GWA results estimated from self-identified European individuals are not transferable to non-European individuals because of various confounding challenges. Here, we demonstrate that enrichment analyses that aggregate SNP-level association statistics at multiple genomic scales-from genes to genomic regions and pathways-have been underutilized in the GWA era and can generate biologically interpretable hypotheses regarding the genetic basis of complex trait architecture. We illustrate examples of the robust associations generated by enrichment analyses while studying 25 continuous traits assayed in 566,786 individuals from seven diverse self-identified human ancestries in the UK Biobank and the Biobank Japan as well as 44,348 admixed individuals from the PAGE consortium including cohorts of African American, Hispanic and Latin American, Native Hawaiian, and American Indian/Alaska Native individuals. We identify 1,000 gene-level associations that are genome-wide significant in at least two ancestry cohorts across these 25 traits as well as highly conserved pathway associations with triglyceride levels in European, East Asian, and Native Hawaiian cohorts.
Collapse
Affiliation(s)
- Samuel Pattillo Smith
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Sahar Shahamatdar
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Wei Cheng
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA
| | - Selena Zhang
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
| | - Joseph Paik
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA
| | - Misa Graff
- Department of Epidemiology, University of North Carolina, Chapel Hill, Chapel Hill, NC 27599, USA
| | - Christopher Haiman
- Department of Preventative Medicine, University of Southern California, Los Angeles, CA 90089, USA
| | - T C Matise
- Department of Genetics, Rutgers University, Piscataway, NJ 08854, USA
| | - Kari E North
- Department of Epidemiology, University of North Carolina, Chapel Hill, Chapel Hill, NC 27599, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Eimear Kenny
- The Center for Genomic Health, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; Department of Medicine, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York City, NY 10029, USA
| | - Chris Gignoux
- Division of Biomedical Informatics and Personalized Medicine, University of Colorado, Denver, CO 80204, USA
| | - Genevieve Wojcik
- Department of Epidemiology, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Lorin Crawford
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Biostatistics, Brown University, Providence, RI 02906, USA; Microsoft Research New England, Cambridge, MA 02142, USA
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown University, Providence, RI 02912, USA; Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, RI 02912, USA; Data Science Initiative, Brown University, Providence, RI 02912, USA.
| |
Collapse
|
22
|
Evidence for continent-wide convergent evolution and stasis throughout 150 y of a biological invasion. Proc Natl Acad Sci U S A 2022; 119:e2107584119. [PMID: 35476511 PMCID: PMC9170017 DOI: 10.1073/pnas.2107584119] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Adaptive evolution can help species to persist and spread in new environments, but it is unclear how the rate and duration of adaptive evolution vary throughout species ranges and on the decadal timescales most relevant to managing biodiversity for the 21st century. Using herbarium records, we reconstruct 150 y of evolution in an invasive plant as it spread across North America. Flowering phenology evolves to adapt to local growing seasons throughout the range but stalls after about a century. This punctuated, convergent evolution recapitulates long-term dynamics in the fossil record, implicating limits to evolutionary rates that are not evident for the first century of spread. The extent to which evolution can rescue a species from extinction, or facilitate range expansion, depends critically on the rate, duration, and geographical extent of the evolutionary response to natural selection. Adaptive evolution can occur quickly, but the duration and geographical extent of contemporary evolution in natural systems remain poorly studied. This is particularly true for species with large geographical ranges and for timescales that lie between “long-term” field experiments and the fossil record. Here, we introduce the Virtual Common Garden (VCG) to investigate phenotypic evolution in natural history collections while controlling for phenotypic plasticity in response to local growing conditions. Reconstructing 150 y of evolution in Lythrum salicaria (purple loosestrife) as it invaded North America, we analyze phenology measurements of 3,429 herbarium records, reconstruct growing conditions from more than 12 million local temperature records, and validate predictions across three common gardens spanning 10° of latitude. We find that phenological clines have evolved repeatedly throughout the range, during the first century of evolution. Thereafter, the rate of microevolution stalls, recapitulating macroevolutionary stasis observed in the fossil record. Our study demonstrates that preserved specimens are a critical resource for investigating limits to evolution in natural populations. Our results show how natural selection and trade-offs measured in field studies predict adaptive divergence observable in herbarium specimens over 15 decades at a continental scale.
Collapse
|
23
|
Whiteman NK. Evolution in small steps and giant leaps. Evolution 2022; 76:67-77. [PMID: 35040122 PMCID: PMC9387839 DOI: 10.1111/evo.14432] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 12/28/2021] [Accepted: 01/03/2022] [Indexed: 02/03/2023]
Abstract
The first Editor of Evolution was Ernst Mayr. His foreword to the first issue of Evolution published in 1947 framed evolution as a "problem of interaction" that was just beginning to be studied in this broad context. First, I explore progress and prospects on understanding the subsidiary interactions identified by Mayr, including interactions between parts of organisms, between individuals and populations, between species, and between the organism and its abiotic environment. Mayr's overall "problem of interaction" framework is examined in the context of coevolution within and among levels of biological organization. This leads to a comparison in the relative roles of biotic versus abiotic agents of selection and fluctuating versus directional selection, followed by stabilizing selection in shaping the genomic architecture of adaptation. Oligogenic architectures may be typical for traits shaped more by fluctuating selection and biotic selection. Conversely, polygenic architectures may be typical for traits shaped more by directional followed by stabilizing selection and abiotic selection. The distribution of effect sizes and turnover dynamics of adaptive alleles in these scenarios deserves further study. Second, I review two case studies on the evolution of acquired toxicity in animals, one involving cardiac glycosides obtained from plants and one involving bacterial virulence factors horizontally transferred to animals. The approaches used in these studies and the results gained directly flow from Mayr's vision of an evolutionary biology that revolves around the "problem of interaction."
Collapse
Affiliation(s)
- Noah K. Whiteman
- Department of Integrative Biology, University of California, Berkeley, California 94720
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720
| |
Collapse
|
24
|
Hartfield M, Poulsen NA, Guldbrandtsen B, Bataillon T. Using singleton densities to detect recent selection in Bos taurus. Evol Lett 2021; 5:595-606. [PMID: 34917399 PMCID: PMC8645200 DOI: 10.1002/evl3.263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 10/07/2021] [Accepted: 10/08/2021] [Indexed: 11/05/2022] Open
Abstract
Many quantitative traits are subject to polygenic selection, where several genomic regions undergo small, simultaneous changes in allele frequency that collectively alter a phenotype. The widespread availability of genome data, along with novel statistical techniques, has made it easier to detect these changes. We apply one such method, the "Singleton Density Score" (SDS), to the Holstein breed of Bos taurus to detect recent selection (arising up to around 740 years ago). We identify several genes as candidates for targets of recent selection, including some relating to cell regulation, catabolic processes, neural-cell adhesion and immunity. We do not find strong evidence that three traits that are important to humans-milk protein content, milk fat content, and stature-have been subject to directional selection. Simulations demonstrate that because B. taurus recently experienced a population bottleneck, singletons are depleted so the power of SDS methods is reduced. These results inform on which genes underlie recent genetic change in B. taurus, while providing information on how polygenic selection can be best investigated in future studies.
Collapse
Affiliation(s)
- Matthew Hartfield
- Bioinformatics Research CentreAarhus UniversityAarhusDK‐8000Denmark
- Institute of Evolutionary BiologyUniversity of EdinburghEdinburghEH9 3FLUnited Kingdom
| | | | - Bernt Guldbrandtsen
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and GeneticsAarhus UniversityTjeleDK‐8830Denmark
- Rheinische Friedrich‐Wilhelms‐Universität BonnInstitut für TierwissenschaftenBonnDE‐53115Germany
- Department of Veterinary SciencesCopenhagen UniversityFrederiksberg CDK‐1870Denmark
| | - Thomas Bataillon
- Bioinformatics Research CentreAarhus UniversityAarhusDK‐8000Denmark
| |
Collapse
|
25
|
Sohail M, Izarraras-Gomez A, Ortega-Del Vecchyo D. Populations, Traits, and Their Spatial Structure in Humans. Genome Biol Evol 2021; 13:evab272. [PMID: 34894236 PMCID: PMC8715524 DOI: 10.1093/gbe/evab272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2021] [Indexed: 11/16/2022] Open
Abstract
The spatial distribution of genetic variants is jointly determined by geography, past demographic processes, natural selection, and its interplay with environmental variation. A fraction of these genetic variants are "causal alleles" that affect the manifestation of a complex trait. The effect exerted by these causal alleles on complex traits can be independent or dependent on the environment. Understanding the evolutionary processes that shape the spatial structure of causal alleles is key to comprehend the spatial distribution of complex traits. Natural selection, past population size changes, range expansions, consanguinity, assortative mating, archaic introgression, admixture, and the environment can alter the frequencies, effect sizes, and heterozygosities of causal alleles. This provides a genetic axis along which complex traits can vary. However, complex traits also vary along biogeographical and sociocultural axes which are often correlated with genetic axes in complex ways. The purpose of this review is to consider these genetic and environmental axes in concert and examine the ways they can help us decipher the variation in complex traits that is visible in humans today. This initiative necessarily implies a discussion of populations, traits, the ability to infer and interpret "genetic" components of complex traits, and how these have been impacted by adaptive events. In this review, we provide a history-aware discussion on these topics using both the recent and more distant past of our academic discipline and its relevant contexts.
Collapse
Affiliation(s)
- Mashaal Sohail
- Department of Human Genetics, University of Chicago, USA
- Centro de Ciencias Genómicas (CCG), Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Morelos, México
| | - Alan Izarraras-Gomez
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH), Universidad Nacional Autónoma de México (UNAM), Juriquilla, Querétaro, México
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH), Universidad Nacional Autónoma de México (UNAM), Juriquilla, Querétaro, México
| |
Collapse
|
26
|
Kreiner JM, Tranel PJ, Weigel D, Stinchcombe JR, Wright SI. The genetic architecture and population genomic signatures of glyphosate resistance in Amaranthus tuberculatus. Mol Ecol 2021; 30:5373-5389. [PMID: 33853196 DOI: 10.1111/mec.15920] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Revised: 03/15/2021] [Accepted: 04/06/2021] [Indexed: 01/04/2023]
Abstract
Much of what we know about the genetic basis of herbicide resistance has come from detailed investigations of monogenic adaptation at known target-sites, despite the increasingly recognized importance of polygenic resistance. Little work has been done to characterize the broader genomic basis of herbicide resistance, including the number and distribution of genes involved, their effect sizes, allele frequencies and signatures of selection. In this work, we implemented genome-wide association (GWA) and population genomic approaches to examine the genetic architecture of glyphosate (Round-up) resistance in the problematic agricultural weed Amaranthus tuberculatus. A GWA was able to correctly identify the known target-gene but statistically controlling for two causal target-site mechanisms revealed an additional 250 genes across all 16 chromosomes associated with non-target-site resistance (NTSR). The encoded proteins had functions that have been linked to NTSR, the most significant of which is response to chemicals, but also showed pleiotropic roles in reproduction and growth. Compared to an empirical null that accounts for complex population structure, the architecture of NTSR was enriched for large effect sizes and low allele frequencies, suggesting the role of pleiotropic constraints on its evolution. The enrichment of rare alleles also suggested that the genetic architecture of NTSR may be population-specific and heterogeneous across the range. Despite their rarity, we found signals of recent positive selection on NTSR-alleles by both window- and haplotype-based statistics, and an enrichment of amino acid changing variants. In our samples, genome-wide single nucleotide polymorphisms explain a comparable amount of the total variation in glyphosate resistance to monogenic mechanisms, even in a collection of individuals where 80% of resistant individuals have large-effect TSR mutations, indicating an underappreciated polygenic contribution to the evolution of herbicide resistance in weed populations.
Collapse
Affiliation(s)
- Julia M Kreiner
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Patrick J Tranel
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - John R Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
- Koffler Scientific Reserve, University of Toronto, King City, ON, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
- Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
27
|
Prakrithi P, Lakra P, Sundar D, Kapoor M, Mukerji M, Gupta I, The Indian Genome Variation Consortium. Genetic Risk Prediction of COVID-19 Susceptibility and Severity in the Indian Population. Front Genet 2021; 12:714185. [PMID: 34707636 PMCID: PMC8543005 DOI: 10.3389/fgene.2021.714185] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 09/08/2021] [Indexed: 01/09/2023] Open
Abstract
Host genetic variants can determine their susceptibility to COVID-19 infection and severity as noted in a recent Genome-wide Association Study (GWAS). Given the prominent genetic differences in Indian sub-populations as well as differential prevalence of COVID-19, here, we compute genetic risk scores in diverse Indian sub-populations that may predict differences in the severity of COVID-19 outcomes. We utilized the top 100 most significantly associated single-nucleotide polymorphisms (SNPs) from a GWAS by Pairo-Castineira et al. determining the genetic susceptibility to severe COVID-19 infection, to compute population-wise polygenic risk scores (PRS) for populations represented in the Indian Genome Variation Consortium (IGVC) database. Using a generalized linear model accounting for confounding variables, we found that median PRS was significantly associated (p < 2 x 10−16) with COVID-19 mortality in each district corresponding to the population studied and had the largest effect on mortality (regression coefficient = 10.25). As a control we repeated our analysis on randomly selected 100 non-associated SNPs several times and did not find significant association. Therefore, we conclude that genetic susceptibility may play a major role in determining the differences in COVID-19 outcomes and mortality across the Indian sub-continent. We suggest that combining PRS with other observed risk-factors in a Bayesian framework may provide a better prediction model for ascertaining high COVID-19 risk groups and to design more effective public health resource allocation and vaccine distribution schemes.
Collapse
Affiliation(s)
- P Prakrithi
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology, New Delhi, India.,Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi, India
| | - Priya Lakra
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi, India
| | - Durai Sundar
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi, India
| | - Manav Kapoor
- Department of Neuroscience, Icahn School of Medicine at Mt. Sinai, New York, NY, United States
| | - Mitali Mukerji
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology, New Delhi, India
| | - Ishaan Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi, India
| | - The Indian Genome Variation Consortium
- Genomics and Molecular Medicine, CSIR Institute of Genomics and Integrative Biology, New Delhi, India.,Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi, India.,Department of Neuroscience, Icahn School of Medicine at Mt. Sinai, New York, NY, United States
| |
Collapse
|
28
|
Allele frequency differentiation at height-associated SNPs among continental human populations. Eur J Hum Genet 2021; 29:1542-1548. [PMID: 34267339 PMCID: PMC8484658 DOI: 10.1038/s41431-021-00938-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 06/20/2021] [Accepted: 06/28/2021] [Indexed: 02/07/2023] Open
Abstract
Methods to detect polygenic adaptation have recently been shown to be sensitive to uncorrected stratification in GWAS, thereby casting doubts on whether polygenic adaptation is prevalent among humans. Consistent with a signal of adaptation at human height loci, the mean FST among African, East Asian, and European populations was shown to be significantly higher at height-associated SNPs than that at non-associated SNPs. This conclusion was reached, however, using height-associated SNPs ascertained from a GWAS design impacted by residual confounding due to uncorrected stratification. Specifically, we show here that the estimated effect sizes are significantly correlated with population structure across continents, potentially explaining the elevated differentiation previously reported. We alleviated these concerns of confounding by ascertaining height-associated SNPs from two biobank GWAS (UK Biobank, UKB, and Biobank Japan, BBJ), where measures to control for confounding in GWAS are more effective. Consistent with a global signature of polygenic adaptation, we found that compared to non-associated SNPs, frequencies of height-associated SNPs are indeed significantly more differentiated among continental populations from both the 1000 Genomes Project (p = 0.0012 for UKB and p = 0.0265 for BBJ), and the Human Genome Diversity Project (p = 0.0225 for UKB and p = 0.0032 for BBJ). However, we found no significant difference among continental populations in polygenic height scores. Through simulations, we found that polygenic score-based statistics could lose power in detecting polygenic adaptation in presence of independent converging selections, thereby potentially explaining the inconsistent results based on FST and polygenic scores.
Collapse
|
29
|
Mathieson I. The omnigenic model and polygenic prediction of complex traits. Am J Hum Genet 2021; 108:1558-1563. [PMID: 34331855 PMCID: PMC8456163 DOI: 10.1016/j.ajhg.2021.07.003] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 07/08/2021] [Indexed: 12/16/2022] Open
Abstract
The omnigenic model was proposed as a framework to understand the highly polygenic architecture of complex traits revealed by genome-wide association studies (GWASs). I argue that this model also explains recent observations about cross-population genetic effects, specifically the low transferability of polygenic scores and the lack of clear evidence for polygenic selection. In particular, the omnigenic model explains why the effects of most GWAS variants vary between populations. This interpretation has several consequences for the evolutionary interpretation and practical use of GWAS summary statistics and polygenic scores. First, some polygenic scores may be applicable only in populations of the same ancestry and environment as the discovery population. Second, most GWAS associations will have differing effects between populations and are unlikely to be robust clinical targets. Finally, it may not always be possible to detect polygenic selection from population genetic data. These considerations make it difficult to interpret the clinical and evolutionary meanings of polygenic scores without an explicit model of genetic architecture.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
30
|
Irving-Pease EK, Muktupavela R, Dannemann M, Racimo F. Quantitative Human Paleogenetics: What can Ancient DNA Tell us About Complex Trait Evolution? Front Genet 2021; 12:703541. [PMID: 34422004 PMCID: PMC8371751 DOI: 10.3389/fgene.2021.703541] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 07/08/2021] [Indexed: 12/13/2022] Open
Abstract
Genetic association data from national biobanks and large-scale association studies have provided new prospects for understanding the genetic evolution of complex traits and diseases in humans. In turn, genomes from ancient human archaeological remains are now easier than ever to obtain, and provide a direct window into changes in frequencies of trait-associated alleles in the past. This has generated a new wave of studies aiming to analyse the genetic component of traits in historic and prehistoric times using ancient DNA, and to determine whether any such traits were subject to natural selection. In humans, however, issues about the portability and robustness of complex trait inference across different populations are particularly concerning when predictions are extended to individuals that died thousands of years ago, and for which little, if any, phenotypic validation is possible. In this review, we discuss the advantages of incorporating ancient genomes into studies of trait-associated variants, the need for models that can better accommodate ancient genomes into quantitative genetic frameworks, and the existing limits to inferences about complex trait evolution, particularly with respect to past populations.
Collapse
Affiliation(s)
- Evan K. Irving-Pease
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| | - Rasa Muktupavela
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| | - Michael Dannemann
- Center for Genomics, Evolution and Medicine, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Fernando Racimo
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
31
|
Chande AT, Rishishwar L, Ban D, Nagar SD, Conley AB, Rowell J, Valderrama-Aguirre AE, Medina-Rivas MA, Jordan IK. The Phenotypic Consequences of Genetic Divergence between Admixed Latin American Populations: Antioquia and Chocó, Colombia. Genome Biol Evol 2021; 12:1516-1527. [PMID: 32681795 PMCID: PMC7513793 DOI: 10.1093/gbe/evaa154] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/12/2020] [Indexed: 12/11/2022] Open
Abstract
Genome-wide association studies have uncovered thousands of genetic variants that are associated with a wide variety of human traits. Knowledge of how trait-associated variants are distributed within and between populations can provide insight into the genetic basis of group-specific phenotypic differences, particularly for health-related traits. We analyzed the genetic divergence levels for 1) individual trait-associated variants and 2) collections of variants that function together to encode polygenic traits, between two neighboring populations in Colombia that have distinct demographic profiles: Antioquia (Mestizo) and Chocó (Afro-Colombian). Genetic ancestry analysis showed 62% European, 32% Native American, and 6% African ancestry for Antioquia compared with 76% African, 10% European, and 14% Native American ancestry for Chocó, consistent with demography and previous results. Ancestry differences can confound cross-population comparison of polygenic risk scores (PRS); however, we did not find any systematic bias in PRS distributions for the two populations studied here, and population-specific differences in PRS were, for the most part, small and symmetrically distributed around zero. Both genetic differentiation at individual trait-associated single nucleotide polymorphisms and population-specific PRS differences between Antioquia and Chocó largely reflected anthropometric phenotypic differences that can be readily observed between the populations along with reported disease prevalence differences. Cases where population-specific differences in genetic risk did not align with observed trait (disease) prevalence point to the importance of environmental contributions to phenotypic variance, for both infectious and complex, common disease. The results reported here are distributed via a web-based platform for searching trait-associated variants and PRS divergence levels at http://map.chocogen.com (last accessed August 12, 2020).
Collapse
Affiliation(s)
- Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Lavanya Rishishwar
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Dongjo Ban
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Shashwat D Nagar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Andrew B Conley
- IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| | - Jessica Rowell
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | - Augusto E Valderrama-Aguirre
- PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia.,Biomedical Research Institute (COL0082529), Cali, Colombia.,Universidad Santiago de Cali, Colombia
| | - Miguel A Medina-Rivas
- PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia.,Centro de Investigación en Biodiversidad y Hábitat, Universidad Tecnológica del Chocó, Quibdó, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia.,IHRC-Georgia Tech Applied Bioinformatics Laboratory, Atlanta, Georgia.,PanAmerican Bioinformatics Institute, Valle del Cauca, Cali, Colombia
| |
Collapse
|
32
|
Panofsky A, Dasgupta K, Iturriaga N. How White nationalists mobilize genetics: From genetic ancestry and human biodiversity to counterscience and metapolitics. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2021; 175:387-398. [PMID: 32986847 PMCID: PMC9909835 DOI: 10.1002/ajpa.24150] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 07/25/2020] [Accepted: 09/03/2020] [Indexed: 01/15/2023]
Abstract
OBJECTIVES Our aim in this study was to understand how genetics ideas are appropriated and mobilized online toward the political projects of White nationalism and the alt right. Studying three different online venues, we investigated how genetics is used to support racial realism, hereditarianism, and racial hierarchy. We analyzed how these ideas are connected to political and metapolitical projects. In addition, we examined the strategies used to build authority for these interpretations. METHODS We analyze three online venues in which genetics has been mobilized to advance racial realism and hereditarian explanations of racial differences. These were (a) the use of genetic ancestry tests in online nationalist discussions, (b) blogs and other venues in which the human biodiversity ideas are articulated, (c) activities surrounding the OpenPsych collection of online journals. Ethnographic and interpretive methods were applied to investigate scientific and political meanings of efforts to mobilize genetic ideas. RESULTS We found that White nationalists use genetic ancestry tests to align White identity with ideas of racial purity and diversity, educating each other about genetics, and debating the boundaries of Whiteness. "Human biodiversity" has been mobilized as a movement to catalog and create hereditarian ideas about racial differences and to distribute them as "red pills" to transform online discourse. The OpenPsych journals have allowed amateur hereditarian psychologists to publish papers, coordinate activity, and legitimate their project at the academic margins. CONCLUSIONS These various appropriations of genetics aim to further racial realism and hereditarian explanations of racial social and behavioral differences. Beyond these substantive aims, on a "metapolitical" level, they serve to reframe concepts and standards for political and scientific discussion of race, challenge structures of academic legitimacy and expertise, and build a cadre of ideological foot soldiers armed with an argumentative toolkit. As professional anthropologists and geneticists aim to accurately communicate their science and its implications for understanding human differences to the public, they must contend with these substantive claims and metapolitical contexts.
Collapse
Affiliation(s)
- Aaron Panofsky
- UCLA Institute for Society and Genetics, Los Angeles, California
| | - Kushan Dasgupta
- UCLA Institute for Society and Genetics, Los Angeles, California
| | - Nicole Iturriaga
- Max Planck Institute for the Study of Religious and Ethnic Diversity, Gottingen, Niedersachsen, Germany
| |
Collapse
|
33
|
Lin M, Park DS, Zaitlen NA, Henn BM, Gignoux CR. Admixed Populations Improve Power for Variant Discovery and Portability in Genome-Wide Association Studies. Front Genet 2021; 12:673167. [PMID: 34108994 PMCID: PMC8181458 DOI: 10.3389/fgene.2021.673167] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 04/27/2021] [Indexed: 11/13/2022] Open
Abstract
Genome-wide association studies (GWAS) are primarily conducted in single-ancestry settings. The low transferability of results has limited our understanding of human genetic architecture across a range of complex traits. In contrast to homogeneous populations, admixed populations provide an opportunity to capture genetic architecture contributed from multiple source populations and thus improve statistical power. Here, we provide a mechanistic simulation framework to investigate the statistical power and transferability of GWAS under directional polygenic selection or varying divergence. We focus on a two-way admixed population and show that GWAS in admixed populations can be enriched for power in discovery by up to 2-fold compared to the ancestral populations under similar sample size. Moreover, higher accuracy of cross-population polygenic score estimates is also observed if variants and weights are trained in the admixed group rather than in the ancestral groups. Common variant associations are also more likely to replicate if first discovered in the admixed group and then transferred to an ancestral population, than the other way around (across 50 iterations with 1,000 causal SNPs, training on 10,000 individuals, testing on 1,000 in each population, p = 3.78e-6, 6.19e-101, ∼0 for FST = 0.2, 0.5, 0.8, respectively). While some of these FST values may appear extreme, we demonstrate that they are found across the entire phenome in the GWAS catalog. This framework demonstrates that investigation of admixed populations harbors significant advantages over GWAS in single-ancestry cohorts for uncovering the genetic architecture of traits and will improve downstream applications such as personalized medicine across diverse populations.
Collapse
Affiliation(s)
- Meng Lin
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Danny S Park
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, San Francisco, CA, United States
| | - Noah A Zaitlen
- Department of Neurology and Computational Medicine, University of California, Los Angeles, Los Angeles, CA, United States
| | - Brenna M Henn
- Department of Anthropology, Center for Population Biology and the Genome Center, University of California, Davis, Davis, CA, United States
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| |
Collapse
|
34
|
Feng Y, McQuillan MA, Tishkoff SA. Evolutionary genetics of skin pigmentation in African populations. Hum Mol Genet 2021; 30:R88-R97. [PMID: 33438000 PMCID: PMC8117430 DOI: 10.1093/hmg/ddab007] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 01/07/2021] [Accepted: 01/07/2021] [Indexed: 12/14/2022] Open
Abstract
Skin color is a highly heritable human trait, and global variation in skin pigmentation has been shaped by natural selection, migration and admixture. Ethnically diverse African populations harbor extremely high levels of genetic and phenotypic diversity, and skin pigmentation varies widely across Africa. Recent genome-wide genetic studies of skin pigmentation in African populations have advanced our understanding of pigmentation biology and human evolutionary history. For example, novel roles in skin pigmentation for loci near MFSD12 and DDB1 have recently been identified in African populations. However, due to an underrepresentation of Africans in human genetic studies, there is still much to learn about the evolutionary genetics of skin pigmentation. Here, we summarize recent progress in skin pigmentation genetics in Africans and discuss the importance of including more ethnically diverse African populations in future genetic studies. In addition, we discuss methods for functional validation of adaptive variants related to skin pigmentation.
Collapse
Affiliation(s)
- Yuanqing Feng
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Michael A McQuillan
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
35
|
Fagny M, Austerlitz F. Polygenic Adaptation: Integrating Population Genetics and Gene Regulatory Networks. Trends Genet 2021; 37:631-638. [PMID: 33892958 DOI: 10.1016/j.tig.2021.03.005] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/15/2021] [Accepted: 03/16/2021] [Indexed: 12/13/2022]
Abstract
The adaptation of populations to local environments often relies on the selection of optimal values for polygenic traits. Here, we first summarize the results obtained from different quantitative genetics and population genetics models, about the genetic architecture of polygenic traits and their response to directional selection. We then highlight the contribution of systems biology to the understanding of the molecular bases of polygenic traits and the evolution of gene regulatory networks involved in these traits. Finally, we discuss the need for a unifying framework merging the fields of population genetics, quantitative genetics and systems biology to better understand the molecular bases of polygenic traits adaptation.
Collapse
Affiliation(s)
- Maud Fagny
- UMR7206 Eco-Anthropologie, Muséum National d'Histoire Naturelle, Centre National de la Recherche Scientifique, Université de Paris, Paris, France.
| | - Frédéric Austerlitz
- UMR7206 Eco-Anthropologie, Muséum National d'Histoire Naturelle, Centre National de la Recherche Scientifique, Université de Paris, Paris, France
| |
Collapse
|
36
|
Durvasula A, Lohmueller KE. Negative selection on complex traits limits phenotype prediction accuracy between populations. Am J Hum Genet 2021; 108:620-631. [PMID: 33691092 DOI: 10.1016/j.ajhg.2021.02.013] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 02/17/2021] [Indexed: 12/22/2022] Open
Abstract
Phenotype prediction is a key goal for medical genetics. Unfortunately, most genome-wide association studies are done in European populations, which reduces the accuracy of predictions via polygenic scores in non-European populations. Here, we use population genetic models to show that human demographic history and negative selection on complex traits can result in population-specific genetic architectures. For traits where alleles with the largest effect on the trait are under the strongest negative selection, approximately half of the heritability can be accounted for by variants in Europe that are absent from Africa, leading to poor performance in phenotype prediction across these populations. Further, under such a model, individuals in the tails of the genetic risk distribution may not be identified via polygenic scores generated in another population. We empirically test these predictions by building a model to stratify heritability between European-specific and shared variants and applied it to 37 traits and diseases in the UK Biobank. Across these phenotypes, ∼30% of the heritability comes from European-specific variants. We conclude that genetic association studies need to include more diverse populations to enable the utility of phenotype prediction in all populations.
Collapse
Affiliation(s)
- Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Kirk E Lohmueller
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA 90095, USA; Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA 90095, USA.
| |
Collapse
|
37
|
Yasumizu Y, Sakaue S, Konuma T, Suzuki K, Matsuda K, Murakami Y, Kubo M, Palamara PF, Kamatani Y, Okada Y. Genome-Wide Natural Selection Signatures Are Linked to Genetic Risk of Modern Phenotypes in the Japanese Population. Mol Biol Evol 2021; 37:1306-1316. [PMID: 31957793 PMCID: PMC7182208 DOI: 10.1093/molbev/msaa005] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Elucidation of natural selection signatures and relationships with phenotype spectra is important to understand adaptive evolution of modern humans. Here, we conducted a genome-wide scan of selection signatures of the Japanese population by estimating locus-specific time to the most recent common ancestor using the ascertained sequentially Markovian coalescent (ASMC), from the biobank-based large-scale genome-wide association study data of 170,882 subjects. We identified 29 genetic loci with selection signatures satisfying the genome-wide significance. The signatures were most evident at the alcohol dehydrogenase (ADH) gene cluster locus at 4q23 (PASMC = 2.2 × 10−36), followed by relatively strong selection at the FAM96A (15q22), MYOF (10q23), 13q21, GRIA2 (4q32), and ASAP2 (2p25) loci (PASMC < 1.0 × 10−10). The additional analysis interrogating extended haplotypes (integrated haplotype score) showed robust concordance of the detected signatures, contributing to fine-mapping of the genes, and provided allelic directional insights into selection pressure (e.g., positive selection for ADH1B-Arg48His and HLA-DPB1*04:01). The phenome-wide selection enrichment analysis with the trait-associated variants identified a variety of the modern human phenotypes involved in the adaptation of Japanese. We observed population-specific evidence of enrichment with the alcohol-related phenotypes, anthropometric and biochemical clinical measurements, and immune-related diseases, differently from the findings in Europeans using the UK Biobank resource. Our study demonstrated population-specific features of the selection signatures in Japanese, highlighting a value of the natural selection study using the nation-wide biobank-scale genome and phenotype data.
Collapse
Affiliation(s)
| | - Saori Sakaue
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan.,Department of Allergy and Rheumatology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Takahiro Konuma
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Ken Suzuki
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Koichi Matsuda
- Department of Computational Biology and Medical Science, Graduate school of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Yoshinori Murakami
- Division of Molecular Pathology, The Institute of Medical Sciences, The University of Tokyo, Tokyo, Japan
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan.,Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita, Japan.,Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita, Japan
| |
Collapse
|
38
|
Ekoru K, Adeyemo AA, Chen G, Doumatey AP, Zhou J, Bentley AR, Shriner D, Rotimi CN. Genetic risk scores for cardiometabolic traits in sub-Saharan African populations. Int J Epidemiol 2021; 50:1283-1296. [PMID: 33729508 DOI: 10.1093/ije/dyab046] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 02/25/2021] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND There is growing support for the use of genetic risk scores (GRS) in routine clinical settings. Due to the limited diversity of current genomic discovery samples, there are concerns that the predictive power of GRS will be limited in non-European ancestry populations. GRS for cardiometabolic traits were evaluated in sub-Saharan Africans in comparison with African Americans and European Americans. METHODS We evaluated the predictive utility of GRS for 12 cardiometabolic traits in sub-Saharan Africans (AF; n = 5200), African Americans (AA; n = 9139) and European Americans (EUR; n = 9594). GRS were constructed as weighted sums of the number of risk alleles. Predictive utility was assessed using the additional phenotypic variance explained and the increase in discriminatory ability over traditional risk factors [age, sex and body mass index (BMI)], with adjustment for ancestry-derived principal components. RESULTS Across all traits, GRS showed up to a 5-fold and 20-fold greater predictive utility in EUR relative to AA and AF, respectively. Predictive utility was most consistent for lipid traits, with percentage increase in explained variation attributable to GRS ranging from 10.6% to 127.1% among EUR, 26.6% to 65.8% among AA and 2.4% to 37.5% among AF. These differences were recapitulated in the discriminatory power, whereby the predictive utility of GRS was 4-fold greater in EUR relative to AA and up to 44-fold greater in EUR relative to AF. Obesity and blood pressure traits showed a similar pattern of greater predictive utility among EUR. CONCLUSIONS This work demonstrates the poorer performance of GRS in AF and highlights the need to improve representation of multiple ethnic populations in genomic studies to ensure equitable clinical translation of GRS.
Collapse
Affiliation(s)
- Kenneth Ekoru
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Adebowale A Adeyemo
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Guanjie Chen
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ayo P Doumatey
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jie Zhou
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Amy R Bentley
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Daniel Shriner
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Charles N Rotimi
- Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
39
|
Tilot AK, Khramtsova EA, Liang D, Grasby KL, Jahanshad N, Painter J, Colodro-Conde L, Bralten J, Hibar DP, Lind PA, Liu S, Brotman SM, Thompson PM, Medland SE, Macciardi F, Stranger BE, Davis LK, Fisher SE, Stein JL. The Evolutionary History of Common Genetic Variants Influencing Human Cortical Surface Area. Cereb Cortex 2021; 31:1873-1887. [PMID: 33290510 PMCID: PMC7945014 DOI: 10.1093/cercor/bhaa327] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 10/09/2020] [Accepted: 10/09/2020] [Indexed: 12/15/2022] Open
Abstract
Structural brain changes along the lineage leading to modern Homo sapiens contributed to our distinctive cognitive and social abilities. However, the evolutionarily relevant molecular variants impacting key aspects of neuroanatomy are largely unknown. Here, we integrate evolutionary annotations of the genome at diverse timescales with common variant associations from large-scale neuroimaging genetic screens. We find that alleles with evidence of recent positive polygenic selection over the past 2000-3000 years are associated with increased surface area (SA) of the entire cortex, as well as specific regions, including those involved in spoken language and visual processing. Therefore, polygenic selective pressures impact the structure of specific cortical areas even over relatively recent timescales. Moreover, common sequence variation within human gained enhancers active in the prenatal cortex is associated with postnatal global SA. We show that such variation modulates the function of a regulatory element of the developmentally relevant transcription factor HEY2 in human neural progenitor cells and is associated with structural changes in the inferior frontal cortex. These results indicate that non-coding genomic regions active during prenatal cortical development are involved in the evolution of human brain structure and identify novel regulatory elements and genes impacting modern human brain structure.
Collapse
Affiliation(s)
- Amanda K Tilot
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, 6500 AH, Netherlands
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Ekaterina A Khramtsova
- Department of Medicine, Section of Genetic Medicine & Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
- Computational Sciences, Janssen Pharmaceuticals, Spring House, PA 19477, USA
| | - Dan Liang
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Katrina L Grasby
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Neda Jahanshad
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Jodie Painter
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Lucía Colodro-Conde
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Janita Bralten
- Radboud University Medical Center, 6525 XZ Nijmegen, Netherlands
| | | | - Penelope A Lind
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Siyao Liu
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Sarah M Brotman
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Paul M Thompson
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Sarah E Medland
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Fabio Macciardi
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA 92697, USA
| | - Barbara E Stranger
- Department of Medicine, Section of Genetic Medicine & Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
- Department of Pharmacology, Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Lea K Davis
- Department of Medicine, Division of Medical Genetics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Vanderbilt University Medical Center, Vanderbilt Genetics Institute, Nashville, TN 37232, USA
| | - Simon E Fisher
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, 6500 AH, Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, 6500 HB, Netherlands
| | - Jason L Stein
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
40
|
Hernandez M, Perry GH. Scanning the human genome for "signatures" of positive selection: Transformative opportunities and ethical obligations. Evol Anthropol 2021; 30:113-121. [PMID: 33788352 DOI: 10.1002/evan.21893] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/25/2021] [Accepted: 03/11/2021] [Indexed: 12/15/2022]
Abstract
The relationship history of evolutionary anthropology and genetics is complex. At best, genetics is a beautifully integrative part of the discipline. Yet this integration has also been fraught, with punctuated, disruptive challenges to dogma, periodic reluctance by some members of the field to embrace results from analyses of genetic data, and occasional over-assertions of genetic definitiveness by geneticists. At worst, evolutionary genetics has been a tool for reinforcing racism and colonialism. While a number of genetics/genomics papers have disproportionately impacted evolutionary anthropology, here we highlight the 2002 presentation of an elegantly powerful approach for identifying "signatures" of past positive selection from haplotype-based patterns of genetic variation. Together with technological advances in genotyping methods, this article transformed our field by facilitating genome-wide "scans" for signatures of past positive selection in human populations. This approach helped researchers test longstanding evolutionary anthropology hypotheses while simultaneously providing opportunities to develop entirely new ones. Genome-wide scans for signatures of positive selection have since been conducted in diverse worldwide populations, with striking findings of local adaptation and convergent evolution. Yet there are ethical considerations with respect to the ubiquity of these studies and the cross-application of the genome-wide scan approach to existing datasets, which we also discuss.
Collapse
Affiliation(s)
- Margarita Hernandez
- Department of Anthropology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - George H Perry
- Department of Anthropology, Pennsylvania State University, University Park, Pennsylvania, USA
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, USA
| |
Collapse
|
41
|
Bird KA. No support for the hereditarian hypothesis of the Black-White achievement gap using polygenic scores and tests for divergent selection. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2021; 175:465-476. [PMID: 33529393 DOI: 10.1002/ajpa.24216] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 11/27/2020] [Accepted: 12/20/2020] [Indexed: 12/19/2022]
Abstract
OBJECTIVES Debate about the cause of IQ score gaps between Black and White populations has persisted within genetics, anthropology, and psychology. Recently, authors claimed polygenic scores provide evidence that a significant portion of differences in cognitive performance between Black and White populations are caused by genetic differences due to natural selection, the "hereditarian hypothesis." This study aims to show conceptual and methodological flaws of past studies supporting the hereditarian hypothesis. MATERIALS AND METHODS Polygenic scores for educational attainment were constructed for African and European samples of the 1000 Genomes Project. Evidence for selection was evaluated using an excess variance test. Education associated variants were further evaluated for signals of selection by testing for excess genetic differentiation (Fst ). Expected mean difference in IQ for populations was calculated under a neutral evolutionary scenario and contrasted to hereditarian claims. RESULTS Tests for selection using polygenic scores failed to find evidence of natural selection when the less biased within-family GWAS effect sizes were used. Tests for selection using Fst values did not find evidence of natural selection. Expected mean difference in IQ was substantially smaller than postulated by hereditarians, even under unrealistic assumptions that overestimate genetic contribution. CONCLUSION Given these results, hereditarian claims are not supported in the least. Cognitive performance does not appear to have been under diversifying selection in Europeans and Africans. In the absence of diversifying selection, the best case estimate for genetic contributions to group differences in cognitive performance is substantially smaller than hereditarians claim and is consistent with genetic differences contributing little to the Black-White gap.
Collapse
Affiliation(s)
- Kevin A Bird
- Department of Horticulture, Michigan State University, East Lansing, Michigan, USA.,Ecology, Evolutionary Biology and Behavior Program, Michigan State University, East Lansing, Michigan, USA
| |
Collapse
|
42
|
Abstract
The selection pressures that have shaped the evolution of complex traits in humans remain largely unknown, and in some contexts highly contentious, perhaps above all where they concern mean trait differences among groups. To date, the discussion has focused on whether such group differences have any genetic basis, and if so, whether they are without fitness consequences and arose via random genetic drift, or whether they were driven by selection for different trait optima in different environments. Here, we highlight a plausible alternative: that many complex traits evolve under stabilizing selection in the face of shifting environmental effects. Under this scenario, there will be rapid evolution at the loci that contribute to trait variation, even when the trait optimum remains the same. These considerations underscore the strong assumptions about environmental effects that are required in ascribing trait differences among groups to genetic differences.
Collapse
Affiliation(s)
- Arbel Harpak
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| | - Molly Przeworski
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
| |
Collapse
|
43
|
Barghi N, Hermisson J, Schlötterer C. Polygenic adaptation: a unifying framework to understand positive selection. Nat Rev Genet 2020; 21:769-781. [PMID: 32601318 DOI: 10.1038/s41576-020-0250-z] [Citation(s) in RCA: 178] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2020] [Indexed: 12/20/2022]
Abstract
Most adaption processes have a polygenic genetic basis, but even with the recent explosive growth of genomic data we are still lacking a unified framework describing the dynamics of selected alleles. Building on recent theoretical and empirical work we introduce the concept of adaptive architecture, which extends the genetic architecture of an adaptive trait by factors influencing its adaptive potential and population genetic principles. Because adaptation can be typically achieved by many different combinations of adaptive alleles (redundancy), we describe how two characteristics - heterogeneity among loci and non-parallelism between replicated populations - are hallmarks for the characterization of polygenic adaptation in evolving populations. We discuss how this unified framework can be applied to natural and experimental populations.
Collapse
Affiliation(s)
- Neda Barghi
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | - Joachim Hermisson
- Mathematics and BioSciences Group, Faculty of Mathematics and Max Perutz Labs, University of Vienna, Vienna, Austria.
| | | |
Collapse
|
44
|
Raffington L, Mallard T, Harden KP. Polygenic Scores in Developmental Psychology: Invite Genetics In, Leave Biodeterminism Behind. ANNUAL REVIEW OF DEVELOPMENTAL PSYCHOLOGY 2020; 2:389-411. [PMID: 38249435 PMCID: PMC10798791 DOI: 10.1146/annurev-devpsych-051820-123945] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2024]
Abstract
Polygenic scores offer developmental psychologists new methods for integrating genetic information into research on how people change and develop across the life span. Indeed, polygenic scores have correlations with developmental outcomes that rival correlations with traditional developmental psychology variables, such as family income. Yet linking people's genetics with differences between them in socially valued developmental outcomes, such as educational attainment, has historically been used to justify acts of state-sponsored violence. In this review, we emphasize that an interdisciplinary understanding of the environmental and structural determinants of social inequality, in conjunction with a transactional developmental perspective on how people interact with their environments, is critical to interpreting associations between polygenic measures and phenotypes. While there is a risk of misuse, early applications of polygenic scores to developmental psychology have already provided novel findings that identify environmental mechanisms of life course processes that can be used to diagnose inequalities in social opportunity.
Collapse
Affiliation(s)
- Laurel Raffington
- Department of Psychology, University of Texas, Austin, Texas 78712, USA
- Population Research Center, University of Texas, Austin, Texas 78712, USA
| | - Travis Mallard
- Department of Psychology, University of Texas, Austin, Texas 78712, USA
| | - K Paige Harden
- Department of Psychology, University of Texas, Austin, Texas 78712, USA
- Population Research Center, University of Texas, Austin, Texas 78712, USA
| |
Collapse
|
45
|
Abstract
Polygenic risk scores (PRS) use the results of genome-wide association studies (GWAS) to predict quantitative phenotypes or disease risk at an individual level, and provide a potential route to the use of genetic data in personalized medical care. However, a major barrier to the use of PRS is that the majority of GWAS come from cohorts of European ancestry. The predictive power of PRS constructed from these studies is substantially lower in non-European ancestry cohorts, although the reasons for this are unclear. To address this question, we investigate the performance of PRS for height in cohorts with admixed African and European ancestry, allowing us to evaluate ancestry-related differences in PRS predictive accuracy while controlling for environment and cohort differences. We first show that the predictive accuracy of height PRS increases linearly with European ancestry and is partially explained by European ancestry segments of the admixed genomes. We show that recombination rate, differences in allele frequencies, and differences in marginal effect sizes across ancestries all contribute to the decrease in predictive power, but none of these effects explain the decrease on its own. Finally, we demonstrate that prediction for admixed individuals can be improved by using a linear combination of PRS that includes ancestry-specific effect sizes, although this approach is at present limited by the small size of non-European ancestry discovery cohorts.
Collapse
|
46
|
Fuentes A. Biological anthropology's critical engagement with genomics, evolution, race/racism, and ourselves: Opportunities and challenges to making a difference in the academy and the world. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2020; 175:326-338. [PMID: 33098091 DOI: 10.1002/ajpa.24162] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 07/25/2020] [Accepted: 10/02/2020] [Indexed: 12/31/2022]
Abstract
Biological anthropology can, and should, matter in the Anthropocene. Biological anthropologists are interested in human biology and the human experience in a broader ecological, evolutionary, and phylogenetic context. We are interested in the material of the body, the history of the body, and interactions of diverse bodies, communities, ecologies, and evolutionary processes. However, the cultural realities of bodies, histories, communities, livelihoods, perceptions, and experiences are as central to the endeavor and inquiry of biological anthropology as are their material aspects. Biological anthropology is a constant dialectic between the cultural and the biological. In this essay, I argue that Biological Anthropology has much to offer, a history to contend with, and a future that matters. To illustrate this, I highlight theoretical and methodological issues in genomics, evolutionary theory and connect them to the study of Race and Racism to emphasize specific arenas where Biological Anthropology has a great capacity, and a strong obligation, to play a central role. However, Biological Anthropology also has substantive internal issues that hinder our ability to do the best possible science. If we are to live up to our potential and make a difference in the 21st century we need to ameliorate our structural shortcomings and expand our voice, and impact, in academic and public discourse. The goal of this perspective is to offer suggestions for moving us toward this goal.
Collapse
Affiliation(s)
- Agustín Fuentes
- Department of Anthropology, 123 Aaron Burr Hall, Princeton University, Princeton, New Jersey, USA
| |
Collapse
|
47
|
Martin AR, Gignoux CR, Walters RK, Wojcik GL, Neale BM, Gravel S, Daly MJ, Bustamante CD, Kenny EE. Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am J Hum Genet 2020; 107:788-789. [PMID: 33007199 PMCID: PMC7536609 DOI: 10.1016/j.ajhg.2020.08.020] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
|
48
|
Wang Y, Guo J, Ni G, Yang J, Visscher PM, Yengo L. Theoretical and empirical quantification of the accuracy of polygenic scores in ancestry divergent populations. Nat Commun 2020; 11:3865. [PMID: 32737319 PMCID: PMC7395791 DOI: 10.1038/s41467-020-17719-y] [Citation(s) in RCA: 122] [Impact Index Per Article: 24.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Accepted: 07/10/2020] [Indexed: 02/03/2023] Open
Abstract
Polygenic scores (PGS) have been widely used to predict disease risk using variants identified from genome-wide association studies (GWAS). To date, most GWAS have been conducted in populations of European ancestry, which limits the use of GWAS-derived PGS in non-European ancestry populations. Here, we derive a theoretical model of the relative accuracy (RA) of PGS across ancestries. We show through extensive simulations that the RA of PGS based on genome-wide significant SNPs can be predicted accurately from modelling linkage disequilibrium (LD), minor allele frequencies (MAF), cross-population correlations of causal SNP effects and heritability. We find that LD and MAF differences between ancestries can explain between 70 and 80% of the loss of RA of European-based PGS in African ancestry for traits like body mass index and type 2 diabetes. Our results suggest that causal variants underlying common genetic variation identified in European ancestry GWAS are mostly shared across continents.
Collapse
Affiliation(s)
- Ying Wang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jing Guo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Guiyan Ni
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang, 325027, China
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Loic Yengo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
49
|
McQuillan MA, Zhang C, Tishkoff SA, Platt A. The importance of including ethnically diverse populations in studies of quantitative trait evolution. Curr Opin Genet Dev 2020; 62:30-35. [PMID: 32604012 PMCID: PMC7942184 DOI: 10.1016/j.gde.2020.05.037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 05/26/2020] [Accepted: 05/28/2020] [Indexed: 10/24/2022]
Abstract
For many traits, human variation is less a matter of categorical differences than quantitative variation, such as height, where individuals fall along a continuum from short to tall. Most recent studies utilize large population-based samples with whole-genome sequences to study the evolution of these traits and have made significant progress implementing a broad spectrum of techniques. However, relatively few studies of quantitative trait evolution include ethnically diverse populations, which often harbor the highest levels of genetic and phenotypic diversity. Thus, our ability to draw inferences about quantitative trait adaptation has been limited. Here, we review recent studies examining human quantitative trait adaptation, and argue that including ethnically diverse populations, particularly from Africa, will be especially informative for our understanding of how humans adapt to the world around them.
Collapse
Affiliation(s)
- Michael A McQuillan
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Chao Zhang
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Tishkoff
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Alexander Platt
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
50
|
Barghi N, Schlötterer C. Distinct Patterns of Selective Sweep and Polygenic Adaptation in Evolve and Resequence Studies. Genome Biol Evol 2020; 12:890-904. [PMID: 32282913 PMCID: PMC7313669 DOI: 10.1093/gbe/evaa073] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2020] [Indexed: 12/15/2022] Open
Abstract
In molecular population genetics, adaptation is typically thought to occur via selective sweeps, where targets of selection have independent effects on the phenotype and rise to fixation, whereas in quantitative genetics, many loci contribute to the phenotype and subtle frequency changes occur at many loci during polygenic adaptation. The sweep model makes specific predictions about frequency changes of beneficial alleles and many test statistics have been developed to detect such selection signatures. Despite polygenic adaptation is probably the prevalent mode of adaptation, because of the traditional focus on the phenotype, we are lacking a solid understanding of the similarities and differences of selection signatures under the two models. Recent theoretical and empirical studies have shown that both selective sweep and polygenic adaptation models could result in a sweep-like genomic signature; therefore, additional criteria are needed to distinguish the two models. With replicated populations and time series data, experimental evolution studies have the potential to identify the underlying model of adaptation. Using the framework of experimental evolution, we performed computer simulations to study the pattern of selected alleles for two models: 1) adaptation of a trait via independent beneficial mutations that are conditioned for fixation, that is, selective sweep model and 2) trait optimum model (polygenic adaptation), that is adaptation of a quantitative trait under stabilizing selection after a sudden shift in trait optimum. We identify several distinct patterns of selective sweep and trait optimum models in populations of different sizes. These features could provide the foundation for development of quantitative approaches to differentiate the two models.
Collapse
Affiliation(s)
- Neda Barghi
- Institut für Populationsgenetik, Vetmeduni, Vienna, Austria
| | | |
Collapse
|