1
|
Patel RA, Weiß CL, Zhu H, Mostafavi H, Simons YB, Spence JP, Pritchard JK. Characterizing selection on complex traits through conditional frequency spectra. Genetics 2025; 229:iyae210. [PMID: 39691067 PMCID: PMC12005249 DOI: 10.1093/genetics/iyae210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Revised: 11/18/2024] [Accepted: 12/03/2024] [Indexed: 12/19/2024] Open
Abstract
Natural selection on complex traits is difficult to study in part due to the ascertainment inherent to genome-wide association studies (GWAS). The power to detect a trait-associated variant in GWAS is a function of its frequency and effect size - but for traits under selection, the effect size of a variant determines the strength of selection against it, constraining its frequency. Recognizing the biases inherent to GWAS ascertainment, we propose studying the joint distribution of allele frequencies across populations, conditional on the frequencies in the GWAS cohort. Before considering these conditional frequency spectra, we first characterized the impact of selection and non-equilibrium demography on allele frequency dynamics forwards and backwards in time. We then used these results to understand conditional frequency spectra under realistic human demography. Finally, we investigated empirical conditional frequency spectra for GWAS variants associated with 106 complex traits, finding compelling evidence for either stabilizing or purifying selection. Our results provide insights into polygenic score portability and other properties of variants ascertained with GWAS, highlighting the utility of conditional frequency spectra.
Collapse
Affiliation(s)
- Roshni A Patel
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Clemens L Weiß
- Stanford Cancer Institute Core, Stanford University, Stanford, CA 94305, USA
| | - Huisheng Zhu
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Hakhamanesh Mostafavi
- Center for Human Genetics and Genomics, New York University School of Medicine, New York, NY 10016, USA
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY 10016, USA
| | - Yuval B Simons
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Jeffrey P Spence
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
2
|
Du S, Chen J, Li J, Qian W, Wu S, Peng Q, Liu Y, Pan T, Li Y, Hadi SS, Tan J, Yuan Z, Wang J, Tang K, Wang Z, Wen Y, Dong X, Zhou W, Ruiz-Linares A, Shi Y, Jin L, Liu F, Zhang M, Wang S. A multi-ancestry GWAS meta-analysis of facial features and its application in predicting archaic human features. J Genet Genomics 2025; 52:513-524. [PMID: 39002897 DOI: 10.1016/j.jgg.2024.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 07/06/2024] [Accepted: 07/06/2024] [Indexed: 07/15/2024]
Abstract
Facial morphology, a complex trait influenced by genetics, holds great significance in evolutionary research. However, due to limited fossil evidence, the facial characteristics of Neanderthals and Denisovans have remained largely unknown. In this study, we conduct a large-scale multi-ethnic meta-analysis of the genome-wide association study (GWAS), including 9674 East Asians and 10,115 Europeans, quantitatively assessing 78 facial traits using 3D facial images. We identify 71 genomic loci associated with facial features, including 21 novel loci. We develop a facial polygenic score (FPS) that enables the prediction of facial features based on genetic information. Interestingly, the distribution of FPSs among populations from diverse continental groups exhibits relevant correlations with observed facial features. Furthermore, we apply the FPS to predict the facial traits of seven Neanderthals and one Denisovan using ancient DNA and align predictions with the fossil records. Our results suggest that Neanderthals and Denisovans likely share similar facial features, such as a wider but shorter nose and a wider endocanthion distance. The decreased mouth width is characterized specifically in Denisovans. The integration of genomic data and facial trait analysis provides valuable insights into the evolutionary history and adaptive changes in human facial morphology.
Collapse
Affiliation(s)
- Siyuan Du
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China; Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Jieyi Chen
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China; Center for Molecular Medicine, Pediatrics Research Institute, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Jiarui Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Wei Qian
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Sijie Wu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Qianqian Peng
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yu Liu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Ting Pan
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Yi Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Sibte Syed Hadi
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University for Security Sciences, Riyadh 11452, Kingdom of Saudi Arabia
| | - Jingze Tan
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Ziyu Yuan
- Fudan-Taizhou Institute of Health Sciences, Taizhou, Jiangsu 225326, China
| | - Jiucun Wang
- Fudan-Taizhou Institute of Health Sciences, Taizhou, Jiangsu 225326, China; Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai 200120, China; Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Shanghai 200438, China
| | - Kun Tang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Zhuo Wang
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Yanqin Wen
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Xinran Dong
- Center for Molecular Medicine, Pediatrics Research Institute, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Wenhao Zhou
- Center for Molecular Medicine, Pediatrics Research Institute, Children's Hospital of Fudan University, Shanghai 201102, China; Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, Guangdong 510623, China
| | - Andrés Ruiz-Linares
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China; Aix-Marseille Université, CNRS, EFS, ADES, Marseille 13005, France; Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London WC1E 6BT, UK
| | - Yongyong Shi
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Li Jin
- Fudan-Taizhou Institute of Health Sciences, Taizhou, Jiangsu 225326, China; Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai 200120, China; Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Shanghai 200438, China
| | - Fan Liu
- Department of Forensic Sciences, College of Criminal Justice, Naif Arab University for Security Sciences, Riyadh 11452, Kingdom of Saudi Arabia; Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, 3015 CN Rotterdam, the Netherlands
| | - Manfei Zhang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China; Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200030, China; Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai 200120, China.
| | - Sijia Wang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.
| |
Collapse
|
3
|
Piffer D, Kirkegaard EOW. Polygenic Selection and Environmental Influence on Adult Body Height: Genetic and Living Standard Contributions Across Diverse Populations. Twin Res Hum Genet 2024:1-18. [PMID: 39639460 DOI: 10.1017/thg.2024.43] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2024]
Abstract
We analyzed whole-genome sequencing (WGS) data from 51 populations and combined WGS and array data from 89 populations. Multiple types of polygenic scores (PGS) were employed, derived from multi-ancestry, between-family genome-wide association study (GWAS; MIX-Height), European-ancestry, between-family GWAS (EUR-Height), and European-ancestry siblings GWAS (SIB-Height). Our findings demonstrate that both genetic and environmental factors significantly influence adult body height between populations. Models that included both genetic and environmental predictors best explained population differences in adult body height, with the MIX-Height PGS and environmental factors (Human Development Index [HDI] + per capita caloric intake) achieving an R2 of .83. Our findings shed light on Deaton's 'African paradox', which noted the relatively tall stature of African populations despite poor nutrition and childhood health. Contrary to Deaton's hypotheses, we demonstrate that both genetic differences and environmental factors significantly influence body height in countries with high infant mortality rates. This suggests that the observed tall stature in African populations can be attributed, in part, to a high genetic predisposition for body height. Furthermore, tests of divergent selection based on the QST (i.e., standardized measure of the genetic differentiation of a quantitative trait among populations) and FST (neutral marker loci) measures exceeded neutral expectations, reaching statistical significance (p < .01) with the MIX-Height PGS but not with the SIB-Height PGS. This result indicates potential selective pressures on body height-related genetic variants across populations.
Collapse
|
4
|
Davies NM, Hemani G, Neiderhiser JM, Martin HC, Mills MC, Visscher PM, Yengo L, Young AS, Keller MC. The importance of family-based sampling for biobanks. Nature 2024; 634:795-803. [PMID: 39443775 PMCID: PMC11623399 DOI: 10.1038/s41586-024-07721-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/13/2024] [Indexed: 10/25/2024]
Abstract
Biobanks aim to improve our understanding of health and disease by collecting and analysing diverse biological and phenotypic information in large samples. So far, biobanks have largely pursued a population-based sampling strategy, where the individual is the unit of sampling, and familial relatedness occurs sporadically and by chance. This strategy has been remarkably efficient and successful, leading to thousands of scientific discoveries across multiple research domains, and plans for the next wave of biobanks are underway. In this Perspective, we discuss the strengths and limitations of a complementary sampling strategy for future biobanks based on oversampling of close genetic relatives. Such family-based samples facilitate research that clarifies causal relationships between putative risk factors and outcomes, particularly in estimates of genetic effects, because they enable analyses that reduce or eliminate confounding due to familial and demographic factors. Family-based biobank samples would also shed new light on fundamental questions across multiple fields that are often difficult to explore in population-based samples. Despite the potential for higher costs and greater analytical complexity, the many advantages of family-based samples should often outweigh their potential challenges.
Collapse
Affiliation(s)
- Neil M Davies
- Division of Psychiatry, University College London, London, UK.
- Department of Statistical Science, University College London, London, UK.
- Department of Public Health and Nursing, Norwegian University of Science and Technology, Trondheim, Norway.
| | - Gibran Hemani
- Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Jenae M Neiderhiser
- Department of Psychology, The Pennsylvania State University, University Park, PA, USA
| | - Hilary C Martin
- Human Genetics Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Melinda C Mills
- Department of Economics, Econometrics & Finance, University of Groningen, Groningen, The Netherlands
- Department of Genetics, University Medical Centre Groningen, Groningen, The Netherlands
- Leverhulme Centre for Demographic Science, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Loïc Yengo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
| | - Alexander Strudwick Young
- UCLA Anderson School of Management, Los Angeles, CA, USA
- Human Genetics Department, UCLA David Geffen School of Medicine, Los Angeles, CA, USA
| | - Matthew C Keller
- Institute for Behavioral Genetics, University of Colorado at Boulder, Boulder, CO, USA.
- Department of Psychology and Neuroscience, University of Colorado at Boulder, Boulder, CO, USA.
| |
Collapse
|
5
|
Brassington L, Arner AM, Watowich MM, Damstedt J, Ng KS, Lim YAL, Venkataraman VV, Wallace IJ, Kraft TS, Lea AJ. Integrating the Thrifty Genotype and Evolutionary Mismatch Hypotheses to understand variation in cardiometabolic disease risk. Evol Med Public Health 2024; 12:214-226. [PMID: 39484023 PMCID: PMC11525211 DOI: 10.1093/emph/eoae014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/18/2024] [Indexed: 11/03/2024] Open
Abstract
More than 60 years ago, James Neel proposed the Thrifty Genotype Hypothesis to explain the widespread prevalence of type 2 diabetes in Western, industrial contexts. This hypothesis posits that variants linked to conservative energy usage and increased fat deposition would have been favored throughout human evolution due to the advantages they could provide during periods of resource limitation. However, in industrial environments, these variants instead produce an increased risk of obesity, metabolic syndrome, type 2 diabetes, and related health issues. This hypothesis has been popular and impactful, with thousands of citations, many ongoing debates, and several spin-off theories in biomedicine, evolutionary biology, and anthropology. However, despite great attention, the applicability and utility of the Thrifty Genotype Hypothesis (TGH) to modern human health remains, in our opinion, unresolved. To move research in this area forward, we first discuss the original formulation of the TGH and its critiques. Second, we trace the TGH to updated hypotheses that are currently at the forefront of the evolutionary medicine literature-namely, the Evolutionary Mismatch Hypothesis. Third, we lay out empirical predictions for updated hypotheses and evaluate them against the current literature. Finally, we discuss study designs that could be fruitful for filling current knowledge gaps; here, we focus on partnerships with subsistence-level groups undergoing lifestyle transitions, and we present data from an ongoing study with the Orang Asli of Malaysia to illustrate this point. Overall, we hope this synthesis will guide new empirical research aimed at understanding how the human evolutionary past interacts with our modern environments to influence cardiometabolic health.
Collapse
Affiliation(s)
- Layla Brassington
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Audrey M Arner
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Marina M Watowich
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | - Jane Damstedt
- Department of Anthropology, University of Utah, Salt Lake City, Utah, USA
| | - Kee Seong Ng
- Department of Medicine, Faculty of Medicine, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Yvonne A L Lim
- Department of Parasitology, Faculty of Medicine, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Vivek V Venkataraman
- Department of Anthropology and Archaeology, University of Calgary, Calgary, Alberta, Canada
| | - Ian J Wallace
- Department of Anthropology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Thomas S Kraft
- Department of Anthropology, University of Utah, Salt Lake City, Utah, USA
| | - Amanda J Lea
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
6
|
Blanc J, Berg JJ. Testing for differences in polygenic scores in the presence of confounding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.12.532301. [PMID: 36993707 PMCID: PMC10055004 DOI: 10.1101/2023.03.12.532301] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals' phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into the evolutionary forces acting on the trait in question, and is important for understanding health disparities. However, because most polygenic scores are computed using effect estimates from population samples, they are susceptible to confounding by both genetic and environmental effects that are correlated with ancestry. The extent to which this confounding drives patterns in the distribution of polygenic scores depends on patterns of population structure in both the original estimation panel and in the prediction/test panel. Here, we use theory from population and statistical genetics, together with simulations, to study the procedure of testing for an association between polygenic scores and axes of ancestry variation in the presence of confounding. We use a general model of genetic relatedness to describe how confounding in the estimation panel biases the distribution of polygenic scores in a way that depends on the degree of overlap in population structure between panels. We then show how this confounding can bias tests for associations between polygenic scores and important axes of ancestry variation in the test panel. Specifically, for any given test, there exists a single axis of population structure in the GWAS panel that needs to be controlled for in order to protect the test. Based on this result, we propose a new approach for directly estimating this axis of population structure in the GWAS panel. We then use simulations to compare the performance of this approach to the standard approach in which the principal components of the GWAS panel genotypes are used to control for stratification.
Collapse
Affiliation(s)
- Jennifer Blanc
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Jeremy J. Berg
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
7
|
Patel RA, Weiß CL, Zhu H, Mostafavi H, Simons YB, Spence JP, Pritchard JK. Conditional frequency spectra as a tool for studying selection on complex traits in biobanks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.15.599126. [PMID: 38948697 PMCID: PMC11212903 DOI: 10.1101/2024.06.15.599126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Natural selection on complex traits is difficult to study in part due to the ascertainment inherent to genome-wide association studies (GWAS). The power to detect a trait-associated variant in GWAS is a function of frequency and effect size - but for traits under selection, the effect size of a variant determines the strength of selection against it, constraining its frequency. To account for GWAS ascertainment, we propose studying the joint distribution of allele frequencies across populations, conditional on the frequencies in the GWAS cohort. Before considering these conditional frequency spectra, we first characterized the impact of selection and non-equilibrium demography on allele frequency dynamics forwards and backwards in time. We then used these results to understand conditional frequency spectra under realistic human demography. Finally, we investigated empirical conditional frequency spectra for GWAS variants associated with 106 complex traits, finding compelling evidence for either stabilizing or purifying selection. Our results provide insight into polygenic score portability and other properties of variants ascertained with GWAS, highlighting the utility of conditional frequency spectra.
Collapse
Affiliation(s)
- Roshni A. Patel
- Department of Genetics, Stanford University School of Medicine, Stanford, CA
| | - Clemens L. Weiß
- Stanford Cancer Institute Core, Stanford University School of Medicine, Stanford, CA
| | - Huisheng Zhu
- Department of Biology, Stanford University, Stanford, CA
| | - Hakhamanesh Mostafavi
- Center for Human Genetics and Genomics, New York University School of Medicine, New York, NY
- Division of Biostatistics, Department of Population Health, New York University School of Medicine, New York, NY
| | | | - Jeffrey P. Spence
- Department of Genetics, Stanford University School of Medicine, Stanford, CA
| | - Jonathan K. Pritchard
- Department of Genetics, Stanford University School of Medicine, Stanford, CA
- Department of Biology, Stanford University, Stanford, CA
| |
Collapse
|
8
|
Chen S, Tang D, Deng L, Xu S. Asian-European differentiation of schizophrenia-associated genes driven by admixture and natural selection. iScience 2024; 27:109560. [PMID: 38638564 PMCID: PMC11024917 DOI: 10.1016/j.isci.2024.109560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 12/29/2023] [Accepted: 03/22/2024] [Indexed: 04/20/2024] Open
Abstract
The European-centered genome-wide association studies of schizophrenia (SCZ) may not be well applied to non-European populations. We analyzed 1,592 reported SCZ-associated genes using the public genome data and found an overall higher Asian-European differentiation on the SCZ-associated variants than at the genome-wide level. Notable examples included 15 missense variants, a regulatory variant SLC5A10-rs1624825, and a damaging variant TSPAN18-rs1001292. Independent local adaptations in recent 25,000 years, after the Asian-European divergence, could have contributed to such genetic differentiation, as were identified at a missense mutation LTN1-rs57646126-A in Asians, and a non-risk allele ZSWIM6-rs72761442-G in Europeans. Altai-Neanderthal-derived alleles may have opposite effects on SCZ susceptibility between ancestries. Furthermore, adaptive introgression was detected on the non-risk haplotype at 1q21.2 in Europeans, while in Asians it was observed on the SCZ risk haplotype at 3p21.31 which is also potentially ultra-violet protective. This study emphasizes the importance of including more representative Asian samples in future SCZ studies.
Collapse
Affiliation(s)
- Sihan Chen
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Die Tang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Lian Deng
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200438, China
| |
Collapse
|
9
|
Timmins IR, The PRACTICAL Consortium, Dudbridge F. Bayesian approach to assessing population differences in genetic risk of disease with application to prostate cancer. PLoS Genet 2024; 20:e1011212. [PMID: 38630784 PMCID: PMC11023298 DOI: 10.1371/journal.pgen.1011212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 03/07/2024] [Indexed: 04/19/2024] Open
Abstract
Population differences in risk of disease are common, but the potential genetic basis for these differences is not well understood. A standard approach is to compare genetic risk across populations by testing for mean differences in polygenic scores, but existing studies that use this approach do not account for statistical noise in effect estimates (i.e., the GWAS betas) that arise due to the finite sample size of GWAS training data. Here, we show using Bayesian polygenic score methods that the level of uncertainty in estimates of genetic risk differences across populations is highly dependent on the GWAS training sample size, the polygenicity (number of causal variants), and genetic distance (FST) between the populations considered. We derive a Wald test for formally assessing the difference in genetic risk across populations, which we show to have calibrated type 1 error rates under a simplified assumption that all SNPs are independent, which we achieve in practise using linkage disequilibrium (LD) pruning. We further provide closed-form expressions for assessing the uncertainty in estimates of relative genetic risk across populations under the special case of an infinitesimal genetic architecture. We suggest that for many complex traits and diseases, particularly those with more polygenic architectures, current GWAS sample sizes are insufficient to detect moderate differences in genetic risk across populations, though more substantial differences in relative genetic risk (relative risk > 1.5) can be detected. We show that conventional approaches that do not account for sampling error from the training sample, such as using a simple t-test, have very high type 1 error rates. When applying our approach to prostate cancer, we demonstrate a higher genetic risk in African Ancestry men, with lower risk in men of European followed by East Asian ancestry.
Collapse
Affiliation(s)
- Iain R. Timmins
- Department of Population Health Sciences, University of Leicester, Leicester, United Kingdom
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, United Kingdom
- Statistical Innovation, AstraZeneca, Cambridge, United Kingdom
| | | | - Frank Dudbridge
- Department of Population Health Sciences, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
10
|
Wu K, Bu F, Wu Y, Zhang G, Wang X, He S, Liu MF, Chen R, Yuan H. Exploring noncoding variants in genetic diseases: from detection to functional insights. J Genet Genomics 2024; 51:111-132. [PMID: 38181897 DOI: 10.1016/j.jgg.2024.01.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Revised: 12/26/2023] [Accepted: 01/01/2024] [Indexed: 01/07/2024]
Abstract
Previous studies on genetic diseases predominantly focused on protein-coding variations, overlooking the vast noncoding regions in the human genome. The development of high-throughput sequencing technologies and functional genomics tools has enabled the systematic identification of functional noncoding variants. These variants can impact gene expression, regulation, and chromatin conformation, thereby contributing to disease pathogenesis. Understanding the mechanisms that underlie the impact of noncoding variants on genetic diseases is indispensable for the development of precisely targeted therapies and the implementation of personalized medicine strategies. The intricacies of noncoding regions introduce a multitude of challenges and research opportunities. In this review, we introduce a spectrum of noncoding variants involved in genetic diseases, along with research strategies and advanced technologies for their precise identification and in-depth understanding of the complexity of the noncoding genome. We will delve into the research challenges and propose potential solutions for unraveling the genetic basis of rare and complex diseases.
Collapse
Affiliation(s)
- Ke Wu
- Institute of Rare Diseases, West China Hospital of Sichuan University, Chengdu, Sichuan 610041, China
| | - Fengxiao Bu
- Institute of Rare Diseases, West China Hospital of Sichuan University, Chengdu, Sichuan 610041, China
| | - Yang Wu
- Institute of Rare Diseases, West China Hospital of Sichuan University, Chengdu, Sichuan 610041, China
| | - Gen Zhang
- Institute of Rare Diseases, West China Hospital of Sichuan University, Chengdu, Sichuan 610041, China
| | - Xin Wang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, Zhejiang 310024, China
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mo-Fang Liu
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, Zhejiang 310024, China; State Key Laboratory of Molecular Biology, State Key Laboratory of Cell Biology, Shanghai Key Laboratory of Molecular Andrology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai 200031, China.
| | - Runsheng Chen
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.
| | - Huijun Yuan
- Institute of Rare Diseases, West China Hospital of Sichuan University, Chengdu, Sichuan 610041, China.
| |
Collapse
|
11
|
Goovaerts S, Hoskens H, Eller RJ, Herrick N, Musolf AM, Justice CM, Yuan M, Naqvi S, Lee MK, Vandermeulen D, Szabo-Rogers HL, Romitti PA, Boyadjiev SA, Marazita ML, Shaffer JR, Shriver MD, Wysocka J, Walsh S, Weinberg SM, Claes P. Joint multi-ancestry and admixed GWAS reveals the complex genetics behind human cranial vault shape. Nat Commun 2023; 14:7436. [PMID: 37973980 PMCID: PMC10654897 DOI: 10.1038/s41467-023-43237-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 11/01/2023] [Indexed: 11/19/2023] Open
Abstract
The cranial vault in humans is highly variable, clinically relevant, and heritable, yet its genetic architecture remains poorly understood. Here, we conduct a joint multi-ancestry and admixed multivariate genome-wide association study on 3D cranial vault shape extracted from magnetic resonance images of 6772 children from the ABCD study cohort yielding 30 genome-wide significant loci. Follow-up analyses indicate that these loci overlap with genomic risk loci for sagittal craniosynostosis, show elevated activity cranial neural crest cells, are enriched for processes related to skeletal development, and are shared with the face and brain. We present supporting evidence of regional localization for several of the identified genes based on expression patterns in the cranial vault bones of E15.5 mice. Overall, our study provides a comprehensive overview of the genetics underlying normal-range cranial vault shape and its relevance for understanding modern human craniofacial diversity and the etiology of congenital malformations.
Collapse
Affiliation(s)
- Seppe Goovaerts
- Department of Human Genetics, KU Leuven, Leuven, Belgium.
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium.
| | - Hanne Hoskens
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
| | - Ryan J Eller
- Department of Biology, Indiana University Indianapolis, Indianapolis, IN, USA
| | - Noah Herrick
- Department of Biology, Indiana University Indianapolis, Indianapolis, IN, USA
| | - Anthony M Musolf
- Statistical Genetics Section, Computational and Statistical Genomics Branch, NHGRI, NIH, MD, Baltimore, USA
| | - Cristina M Justice
- Genometrics Section, Computational and Statistical Genomics Branch, Division of Intramural Research, NHGRI, NIH, Baltimore, MD, USA
- Neurobehavioral Clinical Research Section, Social and Behavioral Research Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Meng Yuan
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
| | - Sahin Naqvi
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
- Departments of Genetics and Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Myoung Keun Lee
- Department of Oral and Craniofacial Sciences, Center for Craniofacial and Dental Genetics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Dirk Vandermeulen
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
| | - Heather L Szabo-Rogers
- Department of Anatomy, Physiology and Pharmacology, University of Saskatchewan, Saskatchewan, Canada
| | - Paul A Romitti
- Department of Epidemiology, College of Public Health, The University of Iowa, Iowa City, IA, USA
| | - Simeon A Boyadjiev
- Department of Pediatrics, University of California Davis, Sacramento, CA, USA
| | - Mary L Marazita
- Department of Oral and Craniofacial Sciences, Center for Craniofacial and Dental Genetics, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
| | - John R Shaffer
- Department of Oral and Craniofacial Sciences, Center for Craniofacial and Dental Genetics, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Mark D Shriver
- Department of Anthropology, Pennsylvania State University, State College, PA, USA
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Susan Walsh
- Department of Biology, Indiana University Indianapolis, Indianapolis, IN, USA
| | - Seth M Weinberg
- Department of Oral and Craniofacial Sciences, Center for Craniofacial and Dental Genetics, University of Pittsburgh, Pittsburgh, PA, USA.
- Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA.
- Department of Anthropology, University of Pittsburgh, Pittsburgh, PA, USA.
| | - Peter Claes
- Department of Human Genetics, KU Leuven, Leuven, Belgium.
- Medical Imaging Research Center, University Hospitals Leuven, Leuven, Belgium.
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium.
- Murdoch Children's Research Institute, Melbourne, VIC, Australia.
| |
Collapse
|
12
|
González-Peñas J, de Hoyos L, Díaz-Caneja CM, Andreu-Bernabeu Á, Stella C, Gurriarán X, Fañanás L, Bobes J, González-Pinto A, Crespo-Facorro B, Martorell L, Vilella E, Muntané G, Molto MD, Gonzalez-Piqueras JC, Parellada M, Arango C, Costas J. Recent natural selection conferred protection against schizophrenia by non-antagonistic pleiotropy. Sci Rep 2023; 13:15500. [PMID: 37726359 PMCID: PMC10509162 DOI: 10.1038/s41598-023-42578-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 09/12/2023] [Indexed: 09/21/2023] Open
Abstract
Schizophrenia is a debilitating psychiatric disorder associated with a reduced fertility and decreased life expectancy, yet common predisposing variation substantially contributes to the onset of the disorder, which poses an evolutionary paradox. Previous research has suggested balanced selection, a mechanism by which schizophrenia risk alleles could also provide advantages under certain environments, as a reliable explanation. However, recent studies have shown strong evidence against a positive selection of predisposing loci. Furthermore, evolutionary pressures on schizophrenia risk alleles could have changed throughout human history as new environments emerged. Here in this study, we used 1000 Genomes Project data to explore the relationship between schizophrenia predisposing loci and recent natural selection (RNS) signatures after the human diaspora out of Africa around 100,000 years ago on a genome-wide scale. We found evidence for significant enrichment of RNS markers in derived alleles arisen during human evolution conferring protection to schizophrenia. Moreover, both partitioned heritability and gene set enrichment analyses of mapped genes from schizophrenia predisposing loci subject to RNS revealed a lower involvement in brain and neuronal related functions compared to those not subject to RNS. Taken together, our results suggest non-antagonistic pleiotropy as a likely mechanism behind RNS that could explain the persistence of schizophrenia common predisposing variation in human populations due to its association to other non-psychiatric phenotypes.
Collapse
Affiliation(s)
- Javier González-Peñas
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain.
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain.
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain.
| | - Lucía de Hoyos
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Covadonga M Díaz-Caneja
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- School of Medicine, Universidad Complutense, Madrid, Spain
| | - Álvaro Andreu-Bernabeu
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
- School of Medicine, Universidad Complutense, Madrid, Spain
| | - Carol Stella
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
| | - Xaquín Gurriarán
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
| | - Lourdes Fañanás
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Department of Evolutionary Biology, Ecology and Environmental Sciences, Faculty of Biology, University of Barcelona, Barcelona, Spain
| | - Julio Bobes
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Faculty of Medicine and Health Sciences - Psychiatry, Universidad de Oviedo, ISPA, INEUROPA, Oviedo, Spain
| | - Ana González-Pinto
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- BIOARABA Health Research Institute, OSI Araba, University Hospital, University of the Basque Country, Vitoria, Spain
| | - Benedicto Crespo-Facorro
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Department of Psychiatry, Hospital Universitario Virgen del Rocío, Universidad de Sevilla, Seville, Spain
| | - Lourdes Martorell
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Hospital Universitari Institut Pere Mata, IISPV, Universitat Rovira I Virgili, Reus, Spain
| | - Elisabet Vilella
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Hospital Universitari Institut Pere Mata, IISPV, Universitat Rovira I Virgili, Reus, Spain
| | - Gerard Muntané
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Hospital Universitari Institut Pere Mata, IISPV, Universitat Rovira I Virgili, Reus, Spain
| | - María Dolores Molto
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Department of Genetics, University of Valencia, Campus of Burjassot, Valencia, Spain
- Department of Medicine, Universitat de València, Valencia, Spain
| | - Jose Carlos Gonzalez-Piqueras
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- Department of Medicine, Universitat de València, Valencia, Spain
- Fundación Investigación Hospital Clínico de Valencia, INCLIVA, 46010, Valencia, Spain
| | - Mara Parellada
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- School of Medicine, Universidad Complutense, Madrid, Spain
| | - Celso Arango
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Calle Ibiza, 43, 28009, Madrid, Spain
- Instituto de Investigación Sanitaria Gregorio Marañón (IiSGM), Madrid, Spain
- CIBERSAM, Centro Investigación Biomédica en Red Salud Mental, Madrid, Spain
- School of Medicine, Universidad Complutense, Madrid, Spain
| | - Javier Costas
- Instituto de Investigación Sanitaria (IDIS) de Santiago de Compostela, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), Servizo Galego de Saúde (SERGAS), Santiago de Compostela, Galicia, Spain
| |
Collapse
|
13
|
Hong Z. The value of sociogenomics in understanding genetic evolution in contemporary human populations. Behav Brain Sci 2023; 46:e217. [PMID: 37695001 DOI: 10.1017/s0140525x22002424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Burt's target article oddly misses the important intellectual contribution of sociogenomics to our understanding of genetic evolution in contemporary human populations. Although social scientists' immediate research agendas are often not evolutionary in nature, I call for a better appreciation of the role of sociogenomics in answering important evolutionary questions.
Collapse
Affiliation(s)
- Ze Hong
- Department of Sociology, Zhejiang University, Hangzhou, Zhejiang, China
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA ; https://kevinhong.home.blog/
| |
Collapse
|
14
|
Zhang J, Zhang S, Qiao J, Wang T, Zeng P. Similarity and diversity of genetic architecture for complex traits between East Asian and European populations. BMC Genomics 2023; 24:314. [PMID: 37308816 DOI: 10.1186/s12864-023-09434-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 06/07/2023] [Indexed: 06/14/2023] Open
Abstract
BACKGROUND Genome-wide association studies have detected a large number of single-nucleotide polymorphisms (SNPs) associated with complex traits in diverse ancestral groups. However, the trans-ethnic similarity and diversity of genetic architecture is not well understood currently. RESULTS By leveraging summary statistics of 37 traits from East Asian (Nmax=254,373) or European (Nmax=693,529) populations, we first evaluated the trans-ethnic genetic correlation (ρg) and found substantial evidence of shared genetic overlap underlying these traits between the two populations, with [Formula: see text] ranging from 0.53 (se = 0.11) for adult-onset asthma to 0.98 (se = 0.17) for hemoglobin A1c. However, 88.9% of the genetic correlation estimates were significantly less than one, indicating potential heterogeneity in genetic effect across populations. We next identified common associated SNPs using the conjunction conditional false discovery rate method and observed 21.7% of trait-associated SNPs can be identified simultaneously in both populations. Among these shared associated SNPs, 20.8% showed heterogeneous influence on traits between the two ancestral populations. Moreover, we demonstrated that population-common associated SNPs often exhibited more consistent linkage disequilibrium and allele frequency pattern across ancestral groups compared to population-specific or null ones. We also revealed population-specific associated SNPs were much likely to undergo natural selection compared to population-common associated SNPs. CONCLUSIONS Our study provides an in-depth understanding of similarity and diversity regarding genetic architecture for complex traits across diverse populations, and can assist in trans-ethnic association analysis, genetic risk prediction, and causal variant fine mapping.
Collapse
Affiliation(s)
- Jinhui Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Jiahao Qiao
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China
| | - Ting Wang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
- Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, Jiangsu, 221004, China.
| |
Collapse
|
15
|
Abraham A, Labella AL, Benton ML, Rokas A, Capra JA. GSEL: a fast, flexible python package for detecting signatures of diverse evolutionary forces on genomic regions. Bioinformatics 2023; 39:btad037. [PMID: 36655767 PMCID: PMC9879724 DOI: 10.1093/bioinformatics/btad037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 11/17/2022] [Accepted: 01/18/2023] [Indexed: 01/20/2023] Open
Abstract
SUMMARY GSEL is a computational framework for calculating the enrichment of signatures of diverse evolutionary forces in a set of genomic regions. GSEL can flexibly integrate any sequence-based evolutionary metric and analyze sets of human genomic regions identified by genome-wide assays (e.g. GWAS, eQTL, *-seq). The core of GSEL's approach is the generation of empirical null distributions tailored to the allele frequency and linkage disequilibrium structure of the regions of interest. We illustrate the application of GSEL to variants identified from a GWAS of body mass index, a highly polygenic trait. AVAILABILITY AND IMPLEMENTATION GSEL is implemented as a fast, flexible and user-friendly python package. It is available with demonstration data at https://github.com/abraham-abin13/gsel_vec. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Abin Abraham
- Division of General Pediatrics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Abigail L Labella
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232, USA
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37232, USA
- Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC 28223, USA
- North Carolina Research Center, Kannapolis, NC 28081, USA
| | | | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232, USA
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37232, USA
- Department of Computer Science, Baylor University, Waco, TX 76706, USA
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, USA
| | - John A Capra
- Department of Epidemiology and Biostatistics, Bakar Computational Health Sciences Institute, University of California, San Francisco, CA 94143, USA
| |
Collapse
|
16
|
Ding J, Chen X, Shi Z, Bai K, Shi S. Association of Metabolically Healthy Obesity and Risk of Cardiovascular Disease Among Adults in China: A Retrospective Cohort Study. Diabetes Metab Syndr Obes 2023; 16:151-159. [PMID: 36760599 PMCID: PMC9869897 DOI: 10.2147/dmso.s397243] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 01/10/2023] [Indexed: 01/19/2023] Open
Abstract
PURPOSE Previous studies have shown that metabolically healthy obesity (MHO) and changes in its status are connected to an increased incidence of cardiovascular disease (CVD). Yet, fewer studies have been conducted in China, especially for the middle-aged and elderly population, a high-risk group. The purpose of the study was to investigate the association between metabolic health status and CVD events. PATIENTS AND METHODS A total of 46,055 participants were categorized into 6 subgroups with different metabolic states according to the existence of metabolic syndrome and body mass index (BMI). The changes in obesity and metabolic health status were defined from baseline to follow-up outcomes with a combination of overweight and obesity. Cox proportional hazards models estimated the association of CVD events and each BMI-metabolic groups. RESULTS MHO and metabolic abnormality normal weight (MANW) subjects had a higher HR of CVD, 1.62 (95% CI, 1.36-1.92) and 1.24 (95% CI, 1.07-1.44), respectively, than their metabolically healthy normal weight (MHNW) counterparts. Then, more than 50% and 30% of the metabolically healthy overweight or obesity (MHOO) populations maintained their status and converted to a metabolically unhealthy state, respectively. Stable MANW, MHOO and metabolically abnormal obesity (MAO) were associated with a higher risk for CVD, 1.68 (95% CI, 1.37-2.05),1.26 (95% CI, 1.08-1.47) and 1.65 (95% CI, 1.45-1.88), respectively, than stable MHNW. CONCLUSION Despite being of normal weight, MANW status is in fact a risk factor for CVD, as well as MHO, especially for the Chinese middle-aged and elderly population. Furthermore, metabolic health is a transient state for partial middle-aged and elderly Chinese individuals, and MAO has the highest risk of CVD, including coronary heart disease (CHD) and stroke.
Collapse
Affiliation(s)
- Jiacheng Ding
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Xuejiao Chen
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Zhan Shi
- Department of Pharmacy, Zhengzhou People’s Hospital, Zhengzhou, Henan, People’s Republic of China
| | - Kaizhi Bai
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
| | - Songhe Shi
- Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, Zhengzhou, Henan, People’s Republic of China
- Correspondence: Songhe Shi, Department of Epidemiology and Health Statistics, College of Public Health, Zhengzhou University, No. 100 Science Avenue, Zhengzhou City, Henan Province, People’s Republic of China, Tel + 86 371 18037108985, Email
| |
Collapse
|
17
|
Abraham A, LaBella AL, Capra JA, Rokas A. Mosaic patterns of selection in genomic regions associated with diverse human traits. PLoS Genet 2022; 18:e1010494. [PMID: 36342969 PMCID: PMC9671423 DOI: 10.1371/journal.pgen.1010494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/17/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open
Abstract
Natural selection shapes the genetic architecture of many human traits. However, the prevalence of different modes of selection on genomic regions associated with variation in traits remains poorly understood. To address this, we developed an efficient computational framework to calculate positive and negative enrichment of different evolutionary measures among regions associated with complex traits. We applied the framework to summary statistics from >900 genome-wide association studies (GWASs) and 11 evolutionary measures of sequence constraint, population differentiation, and allele age while accounting for linkage disequilibrium, allele frequency, and other potential confounders. We demonstrate that this framework yields consistent results across GWASs with variable sample sizes, numbers of trait-associated SNPs, and analytical approaches. The resulting evolutionary atlas maps diverse signatures of selection on genomic regions associated with complex human traits on an unprecedented scale. We detected positive enrichment for sequence conservation among trait-associated regions for the majority of traits (>77% of 290 high power GWASs), which included reproductive traits. Many traits also exhibited substantial positive enrichment for population differentiation, especially among hair, skin, and pigmentation traits. In contrast, we detected widespread negative enrichment for signatures of balancing selection (51% of GWASs) and absence of enrichment for evolutionary signals in regions associated with late-onset Alzheimer's disease. These results support a pervasive role for negative selection on regions of the human genome that contribute to variation in complex traits, but also demonstrate that diverse modes of evolution are likely to have shaped trait-associated loci. This atlas of evolutionary signatures across the diversity of available GWASs will enable exploration of the relationship between the genetic architecture and evolutionary processes in the human genome.
Collapse
Affiliation(s)
- Abin Abraham
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Abigail L. LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, United States of America
- North Carolina Research Center, Kannapolis, North Carolina, United States of America
| | - John A. Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, United States of America
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
18
|
Fedorova L, Khrunin A, Khvorykh G, Lim J, Thornton N, Mulyar OA, Limborska S, Fedorov A. Analysis of Common SNPs across Continents Reveals Major Genomic Differences between Human Populations. Genes (Basel) 2022; 13:genes13081472. [PMID: 36011383 PMCID: PMC9408407 DOI: 10.3390/genes13081472] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 08/12/2022] [Accepted: 08/17/2022] [Indexed: 12/03/2022] Open
Abstract
Common alleles tend to be more ancient than rare alleles. These common SNPs appeared thousands of years ago and reflect intricate human evolution including various adaptations, admixtures, and migration events. Eighty-four thousand abundant region-specific alleles (ARSAs) that are common in one continent but absent in the rest of the world have been characterized by processing 3100 genomes from 230 populations. Also computed were 17,446 polymorphic sites with regional absence of common alleles (RACAs), which are widespread globally but absent in one region. A majority of these region-specific SNPs were found in Africa. America has the second greatest number of ARSAs (3348) and is even ahead of Europe (1911). Surprisingly, East Asia has the highest number of RACAs (10,524) and the lowest number of ARSAs (362). ARSAs and RACAs have distinct compositions of ancestral versus derived alleles in different geographical regions, reflecting their unique evolution. Genes associated with ARSA and RACA SNPs were identified and their functions were analyzed. The core 100 genes shared by multiple populations and associated with region-specific natural selection were examined. The largest part of them (42%) are related to the nervous system. ARSA and RACA SNPs are important for both association and human evolution studies.
Collapse
Affiliation(s)
| | - Andrey Khrunin
- Institute of Molecular Genetics of National Research Centre, “Kurchatov Institute”, 123182 Moscow, Russia
| | - Gennady Khvorykh
- Institute of Molecular Genetics of National Research Centre, “Kurchatov Institute”, 123182 Moscow, Russia
| | - Jan Lim
- CRI Genetics LLC, Santa Monica, CA 90404, USA
| | | | | | - Svetlana Limborska
- Institute of Molecular Genetics of National Research Centre, “Kurchatov Institute”, 123182 Moscow, Russia
| | - Alexei Fedorov
- CRI Genetics LLC, Santa Monica, CA 90404, USA
- Department of Medicine, University of Toledo, Toledo, OH 43606, USA
- Correspondence: ; Tel.: +1-419-383-5270
| |
Collapse
|
19
|
High heritability of ascending aortic diameter and trans-ancestry prediction of thoracic aortic disease. Nat Genet 2022; 54:772-782. [PMID: 35637384 DOI: 10.1038/s41588-022-01070-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 03/31/2022] [Indexed: 12/24/2022]
Abstract
Enlargement of the aorta is an important risk factor for aortic aneurysm and dissection, a leading cause of morbidity in the developed world. Here we performed automated extraction of ascending aortic diameter from cardiac magnetic resonance images of 36,021 individuals from the UK Biobank, followed by genome-wide association. We identified lead variants across 41 loci, including genes related to cardiovascular development (HAND2, TBX20) and Mendelian forms of thoracic aortic disease (ELN, FBN1). A polygenic score significantly predicted prevalent risk of thoracic aortic aneurysm and the need for surgical intervention for patients with thoracic aneurysm across multiple ancestries within the UK Biobank, FinnGen, the Penn Medicine Biobank and the Million Veterans Program (MVP). Additionally, we highlight the primary causal role of blood pressure in reducing aortic dilation using Mendelian randomization. Overall, our findings provide a roadmap for using genetic determinants of human anatomy to understand cardiovascular development while improving prediction of diseases of the thoracic aorta.
Collapse
|
20
|
Zhang M, Wu S, Du S, Qian W, Chen J, Qiao L, Yang Y, Tan J, Yuan Z, Peng Q, Liu Y, Navarro N, Tang K, Ruiz-Linares A, Wang J, Claes P, Jin L, Li J, Wang S. Genetic variants underlying differences in facial morphology in East Asian and European populations. Nat Genet 2022; 54:403-411. [PMID: 35393595 DOI: 10.1038/s41588-022-01038-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 01/19/2022] [Accepted: 02/25/2022] [Indexed: 11/09/2022]
Abstract
Facial morphology-a conspicuous feature of human appearance-is highly heritable. Previous studies on the genetic basis of facial morphology were performed mainly in European-ancestry cohorts (EUR). Applying a data-driven phenotyping and multivariate genome-wide scanning protocol to a large collection of three-dimensional facial images of individuals with East Asian ancestry (EAS), we identified 244 variants in 166 loci (62 new) associated with typical-range facial variation. A newly proposed polygenic shape analysis indicates that the effects of the variants on facial shape in EAS can be generalized to EUR. Based on this, we further identified 13 variants related to differences between facial shape in EUR and EAS populations. Evolutionary analyses suggest that the difference in nose shape between EUR and EAS populations is caused by a directional selection, due mainly to a local adaptation in Europeans. Our results illustrate the underlying genetic basis for facial differences across populations.
Collapse
Affiliation(s)
- Manfei Zhang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- School of Computer Science, Fudan University, Shanghai, China
| | - Sijie Wu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Siyuan Du
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Wei Qian
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- School of Computer Science, Fudan University, Shanghai, China
| | - Jieyi Chen
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Lu Qiao
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yajun Yang
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Jingze Tan
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Ziyu Yuan
- Fudan-Taizhou Institute of Health Sciences, Taizhou, China
| | - Qianqian Peng
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yu Liu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Nicolas Navarro
- Biogéosciences, UMR 6282 CNRS-EPHE, Université Bourgogne Franche-Comté, Dijon, France
- Ecole Pratique des Hautes Etudes, PSL University, Paris, France
| | - Kun Tang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Andrés Ruiz-Linares
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Aix-Marseille Université, CNRS, EFS, ADES, Marseille, France
- Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London, UK
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China
- Fudan-Taizhou Institute of Health Sciences, Taizhou, China
| | - Peter Claes
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium
- Medical Imaging Research Center, UZ Leuven, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- Murdoch Children's Research Institute, Melbourne, Victoria, Australia
| | - Li Jin
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.
- Fudan-Taizhou Institute of Health Sciences, Taizhou, China.
| | - Jiarui Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium.
- Medical Imaging Research Center, UZ Leuven, Leuven, Belgium.
| | - Sijia Wang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
| |
Collapse
|
21
|
Weissbrod O, Kanai M, Shi H, Gazal S, Peyrot WJ, Khera AV, Okada Y, Martin AR, Finucane HK, Price AL. Leveraging fine-mapping and multipopulation training data to improve cross-population polygenic risk scores. Nat Genet 2022; 54:450-458. [PMID: 35393596 PMCID: PMC9009299 DOI: 10.1038/s41588-022-01036-9] [Citation(s) in RCA: 148] [Impact Index Per Article: 49.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 02/25/2022] [Indexed: 01/25/2023]
Abstract
Polygenic risk scores suffer reduced accuracy in non-European populations, exacerbating health disparities. We propose PolyPred, a method that improves cross-population polygenic risk scores by combining two predictors: a new predictor that leverages functionally informed fine-mapping to estimate causal effects (instead of tagging effects), addressing linkage disequilibrium differences, and BOLT-LMM, a published predictor. When a large training sample is available in the non-European target population, we propose PolyPred+, which further incorporates the non-European training data. We applied PolyPred to 49 diseases/traits in four UK Biobank populations using UK Biobank British training data, and observed relative improvements versus BOLT-LMM ranging from +7% in south Asians to +32% in Africans, consistent with simulations. We applied PolyPred+ to 23 diseases/traits in UK Biobank east Asians using both UK Biobank British and Biobank Japan training data, and observed improvements of +24% versus BOLT-LMM and +12% versus PolyPred. Summary statistics-based analogs of PolyPred and PolyPred+ attained similar improvements.
Collapse
Affiliation(s)
- Omer Weissbrod
- Epidemiology Department, Harvard School of Public Health, Boston, MA, USA.
| | - Masahiro Kanai
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Huwenbo Shi
- Epidemiology Department, Harvard School of Public Health, Boston, MA, USA
- OMNI Bioinformatics, San Francisco, CA, USA
| | - Steven Gazal
- Epidemiology Department, Harvard School of Public Health, Boston, MA, USA
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Wouter J Peyrot
- Epidemiology Department, Harvard School of Public Health, Boston, MA, USA
- Department of Psychiatry, Amsterdam UMC, Vrije Universiteit, Amsterdam, the Netherlands
| | - Amit V Khera
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Verve Therapeutics, Cambridge, MA, USA
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | - Hilary K Finucane
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Alkes L Price
- Epidemiology Department, Harvard School of Public Health, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
22
|
The correctness of large scale analysis of genomic data. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES 2021. [DOI: 10.2478/fcds-2021-0024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Abstract
Implementing a large genomic project is a demanding task, also from the computer science point of view. Besides collecting many genome samples and sequencing them, there is processing of a huge amount of data at every stage of their production and analysis. Efficient transfer and storage of the data is also an important issue. During the execution of such a project, there is a need to maintain work standards and control quality of the results, which can be difficult if a part of the work is carried out externally. Here, we describe our experience with such data quality analysis on a number of levels - from an obvious check of the quality of the results obtained, to examining consistency of the data at various stages of their processing, to verifying, as far as possible, their compatibility with the data describing the sample.
Collapse
|
23
|
Campbell-Staton SC, Velotta JP, Winchell KM. Selection on adaptive and maladaptive gene expression plasticity during thermal adaptation to urban heat islands. Nat Commun 2021; 12:6195. [PMID: 34702827 PMCID: PMC8548502 DOI: 10.1038/s41467-021-26334-4] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 09/10/2021] [Indexed: 12/13/2022] Open
Abstract
Phenotypic plasticity enables a single genotype to produce multiple phenotypes in response to environmental variation. Plasticity may play a critical role in the colonization of novel environments, but its role in adaptive evolution is controversial. Here we suggest that rapid parallel regulatory adaptation of Anolis lizards to urban heat islands is due primarily to selection for reduced and/or reversed heat-induced plasticity that is maladaptive in urban thermal conditions. We identify evidence for polygenic selection across genes of the skeletal muscle transcriptome associated with heat tolerance. Forest lizards raised in common garden conditions exhibit heat-induced changes in expression of these genes that largely correlate with decreased heat tolerance, consistent with maladaptive regulatory response to high-temperature environments. In contrast, urban lizards display reduced gene expression plasticity after heat challenge in common garden and a significant increase in gene expression change that is congruent with greater heat tolerance, a putatively adaptive state in warmer urban environments. Genes displaying maladaptive heat-induced plasticity repeatedly show greater genetic divergence between urban and forest habitats than those displaying adaptive plasticity. These results highlight the role of selection against maladaptive regulatory plasticity during rapid adaptive modification of complex systems in the wild.
Collapse
Affiliation(s)
- Shane C Campbell-Staton
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, 08540, USA.
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, 90095, USA.
- Institute for Society and Genetics, University of California, Los Angeles, CA, 90095, USA.
| | - Jonathan P Velotta
- Department of Biological Sciences, University of Denver, Denver, CO, 80208, USA
| | | |
Collapse
|
24
|
Wang Y, Zhao B, Choi J, Lee EA. Genomic approaches to trace the history of human brain evolution with an emerging opportunity for transposon profiling of ancient humans. Mob DNA 2021; 12:22. [PMID: 34663455 PMCID: PMC8525043 DOI: 10.1186/s13100-021-00250-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 09/27/2021] [Indexed: 12/17/2022] Open
Abstract
Transposable elements (TEs) significantly contribute to shaping the diversity of the human genome, and lines of evidence suggest TEs as one of driving forces of human brain evolution. Existing computational approaches, including cross-species comparative genomics and population genetic modeling, can be adapted for the study of the role of TEs in evolution. In particular, diverse ancient and archaic human genome sequences are increasingly available, allowing reconstruction of past human migration events and holding the promise of identifying and tracking TEs among other evolutionarily important genetic variants at an unprecedented spatiotemporal resolution. However, highly degraded short DNA templates and other unique challenges presented by ancient human DNA call for major changes in current experimental and computational procedures to enable the identification of evolutionarily important TEs. Ancient human genomes are valuable resources for investigating TEs in the evolutionary context, and efforts to explore ancient human genomes will potentially provide a novel perspective on the genetic mechanism of human brain evolution and inspire a variety of technological and methodological advances. In this review, we summarize computational and experimental approaches that can be adapted to identify and validate evolutionarily important TEs, especially for human brain evolution. We also highlight strategies that leverage ancient genomic data and discuss unique challenges in ancient transposon genomics.
Collapse
Affiliation(s)
- Yilan Wang
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Boxun Zhao
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
| | - Jaejoon Choi
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Eunjung Alice Lee
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- The Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
| |
Collapse
|
25
|
Polygenic Risk Scores Contribute to Personalized Medicine of Parkinson's Disease. J Pers Med 2021; 11:jpm11101030. [PMID: 34683174 PMCID: PMC8539098 DOI: 10.3390/jpm11101030] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 10/11/2021] [Accepted: 10/12/2021] [Indexed: 12/18/2022] Open
Abstract
Parkinson’s disease (PD) is the second most common neurodegenerative disorder characterized by the loss of dopaminergic neurons. The vast majority of PD patients develop the disease sporadically and it is assumed that the cause lies in polygenic and environmental components. The overall polygenic risk is the result of a large number of common low-risk variants discovered by large genome-wide association studies (GWAS). Polygenic risk scores (PRS), generated by compiling genome-wide significant variants, are a useful prognostic tool that quantifies the cumulative effect of genetic risk in a patient and in this way helps to identify high-risk patients. Although there are limitations to the construction and application of PRS, such as considerations of limited genetic underpinning of diseases explained by SNPs and generalizability of PRS to other populations, this personalized risk prediction could make a promising contribution to stratified medicine and tailored therapeutic interventions in the future.
Collapse
|
26
|
Allele frequency differentiation at height-associated SNPs among continental human populations. Eur J Hum Genet 2021; 29:1542-1548. [PMID: 34267339 PMCID: PMC8484658 DOI: 10.1038/s41431-021-00938-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 06/20/2021] [Accepted: 06/28/2021] [Indexed: 02/07/2023] Open
Abstract
Methods to detect polygenic adaptation have recently been shown to be sensitive to uncorrected stratification in GWAS, thereby casting doubts on whether polygenic adaptation is prevalent among humans. Consistent with a signal of adaptation at human height loci, the mean FST among African, East Asian, and European populations was shown to be significantly higher at height-associated SNPs than that at non-associated SNPs. This conclusion was reached, however, using height-associated SNPs ascertained from a GWAS design impacted by residual confounding due to uncorrected stratification. Specifically, we show here that the estimated effect sizes are significantly correlated with population structure across continents, potentially explaining the elevated differentiation previously reported. We alleviated these concerns of confounding by ascertaining height-associated SNPs from two biobank GWAS (UK Biobank, UKB, and Biobank Japan, BBJ), where measures to control for confounding in GWAS are more effective. Consistent with a global signature of polygenic adaptation, we found that compared to non-associated SNPs, frequencies of height-associated SNPs are indeed significantly more differentiated among continental populations from both the 1000 Genomes Project (p = 0.0012 for UKB and p = 0.0265 for BBJ), and the Human Genome Diversity Project (p = 0.0225 for UKB and p = 0.0032 for BBJ). However, we found no significant difference among continental populations in polygenic height scores. Through simulations, we found that polygenic score-based statistics could lose power in detecting polygenic adaptation in presence of independent converging selections, thereby potentially explaining the inconsistent results based on FST and polygenic scores.
Collapse
|
27
|
Mathieson I. The omnigenic model and polygenic prediction of complex traits. Am J Hum Genet 2021; 108:1558-1563. [PMID: 34331855 PMCID: PMC8456163 DOI: 10.1016/j.ajhg.2021.07.003] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 07/08/2021] [Indexed: 12/16/2022] Open
Abstract
The omnigenic model was proposed as a framework to understand the highly polygenic architecture of complex traits revealed by genome-wide association studies (GWASs). I argue that this model also explains recent observations about cross-population genetic effects, specifically the low transferability of polygenic scores and the lack of clear evidence for polygenic selection. In particular, the omnigenic model explains why the effects of most GWAS variants vary between populations. This interpretation has several consequences for the evolutionary interpretation and practical use of GWAS summary statistics and polygenic scores. First, some polygenic scores may be applicable only in populations of the same ancestry and environment as the discovery population. Second, most GWAS associations will have differing effects between populations and are unlikely to be robust clinical targets. Finally, it may not always be possible to detect polygenic selection from population genetic data. These considerations make it difficult to interpret the clinical and evolutionary meanings of polygenic scores without an explicit model of genetic architecture.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
28
|
Irving-Pease EK, Muktupavela R, Dannemann M, Racimo F. Quantitative Human Paleogenetics: What can Ancient DNA Tell us About Complex Trait Evolution? Front Genet 2021; 12:703541. [PMID: 34422004 PMCID: PMC8371751 DOI: 10.3389/fgene.2021.703541] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 07/08/2021] [Indexed: 12/13/2022] Open
Abstract
Genetic association data from national biobanks and large-scale association studies have provided new prospects for understanding the genetic evolution of complex traits and diseases in humans. In turn, genomes from ancient human archaeological remains are now easier than ever to obtain, and provide a direct window into changes in frequencies of trait-associated alleles in the past. This has generated a new wave of studies aiming to analyse the genetic component of traits in historic and prehistoric times using ancient DNA, and to determine whether any such traits were subject to natural selection. In humans, however, issues about the portability and robustness of complex trait inference across different populations are particularly concerning when predictions are extended to individuals that died thousands of years ago, and for which little, if any, phenotypic validation is possible. In this review, we discuss the advantages of incorporating ancient genomes into studies of trait-associated variants, the need for models that can better accommodate ancient genomes into quantitative genetic frameworks, and the existing limits to inferences about complex trait evolution, particularly with respect to past populations.
Collapse
Affiliation(s)
- Evan K. Irving-Pease
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| | - Rasa Muktupavela
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| | - Michael Dannemann
- Center for Genomics, Evolution and Medicine, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Fernando Racimo
- Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
29
|
Zhu C, Talhelm T, Li Y, Chen G, Zhu J, Wang J. Relationship between rice farming and polygenic scores potentially linked to agriculture in China. ROYAL SOCIETY OPEN SCIENCE 2021; 8:210382. [PMID: 34457340 PMCID: PMC8371358 DOI: 10.1098/rsos.210382] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 07/26/2021] [Indexed: 06/13/2023]
Abstract
Following domestication in the lower Yangtze River valley 9400 years ago, rice farming spread throughout China and changed lifestyle patterns among Neolithic populations. Here, we report evidence that the advent of rice domestication and cultivation may have shaped humans not only culturally but also genetically. Leveraging recent findings from molecular genetics, we construct a number of polygenic scores (PGSs) of behavioural traits and examine their associations with rice cultivation based on a sample of 4101 individuals recently collected from mainland China. A total of nine polygenic traits and genotypes are investigated in this study, including PGSs of height, body mass index, depression, time discounting, reproduction, educational attainment, risk preference, ADH1B rs1229984 and ALDH2 rs671. Two-stage least-squares estimates of the county-level percentage of cultivated land devoted to paddy rice on the PGS of age at first birth (b = -0.029, p = 0.021) and ALDH2 rs671 (b = 0.182, p < 0.001) are both statistically significant and robust to a wide range of potential confounds and alternative explanations. These findings imply that rice farming may influence human evolution in relatively recent human history.
Collapse
Affiliation(s)
- Chen Zhu
- College of Economics and Management, China Agricultural University, Beijing 100081, People's Republic of China
- Academy of Global Food Economics and Policy (AGFEP), China Agricultural University, Beijing 100081, People's Republic of China
- Beijing Food Safety Policy and Strategy Research Base, China Agricultural University, Beijing 100081, People's Republic of China
| | - Thomas Talhelm
- Booth School of Business, University of Chicago, Chicago, IL 60637, USA
| | - Yingxiang Li
- WeGene, Shenzhen Zaozhidao Technology Co. Ltd, Shenzhen, People's Republic of China
| | - Gang Chen
- WeGene, Shenzhen Zaozhidao Technology Co. Ltd, Shenzhen, People's Republic of China
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, People's Republic of China
| | - Jiong Zhu
- Institute of Economics, School of Economics, Xiamen University, Xiamen, People's Republic of China
- Wang Yanan Institute for Studies in Economics (WISE), Xiamen University, Xiamen, People's Republic of China
| | - Jun Wang
- School of Public Administration and Policy, Renmin University of China, Beijing, People's Republic of China
| |
Collapse
|
30
|
Widen E, Raben TG, Lello L, Hsu SDH. Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank. Genes (Basel) 2021; 12:991. [PMID: 34209487 PMCID: PMC8308062 DOI: 10.3390/genes12070991] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 06/22/2021] [Accepted: 06/23/2021] [Indexed: 12/29/2022] Open
Abstract
We use UK Biobank data to train predictors for 65 blood and urine markers such as HDL, LDL, lipoprotein A, glycated haemoglobin, etc. from SNP genotype. For example, our Polygenic Score (PGS) predictor correlates ∼0.76 with lipoprotein A level, which is highly heritable and an independent risk factor for heart disease. This may be the most accurate genomic prediction of a quantitative trait that has yet been produced (specifically, for European ancestry groups). We also train predictors of common disease risk using blood and urine biomarkers alone (no DNA information); we call these predictors biomarker risk scores, BMRS. Individuals who are at high risk (e.g., odds ratio of >5× population average) can be identified for conditions such as coronary artery disease (AUC∼0.75), diabetes (AUC∼0.95), hypertension, liver and kidney problems, and cancer using biomarkers alone. Our atherosclerotic cardiovascular disease (ASCVD) predictor uses ∼10 biomarkers and performs in UKB evaluation as well as or better than the American College of Cardiology ASCVD Risk Estimator, which uses quite different inputs (age, diagnostic history, BMI, smoking status, statin usage, etc.). We compare polygenic risk scores (risk conditional on genotype: PRS) for common diseases to the risk predictors which result from the concatenation of learned functions BMRS and PGS, i.e., applying the BMRS predictors to the PGS output.
Collapse
Affiliation(s)
- Erik Widen
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
| | - Timothy G. Raben
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
| | - Louis Lello
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
- Genomic Prediction, Inc., 675 US Highway One, North Brunswick, NJ 08902, USA
| | - Stephen D. H. Hsu
- Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA; (T.G.R.); (S.D.H.H.)
- Genomic Prediction, Inc., 675 US Highway One, North Brunswick, NJ 08902, USA
| |
Collapse
|
31
|
Lu H, Wang T, Zhang J, Zhang S, Huang S, Zeng P. Evaluating marginal genetic correlation of associated loci for complex diseases and traits between European and East Asian populations. Hum Genet 2021; 140:1285-1297. [PMID: 34091770 DOI: 10.1007/s00439-021-02299-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 05/31/2021] [Indexed: 12/14/2022]
Abstract
Genome-wide association studies (GWASs) have successfully identified a large amount of single-nucleotide polymorphisms associated with many complex phenotypes in diverse populations. However, a comprehensive understanding of the genetic correlation of associated loci of phenotypes across populations remains lacking and the extent to which associations discovered in one population can be generalized to other populations or can be utilized for trans-ethnic genetic prediction is also unclear. By leveraging summary statistics, we proposed MAGIC to evaluate the trans-ethnic marginal genetic correlation (rm) of per-allele effect sizes for associated SNPs (P < 5E-8) under the framework of measurement error models. We confirmed the methodological advantage of MAGIC over general approaches through simulations and demonstrated its utility by analyzing 34 GWAS summary statistics of phenotypes from the East Asian (Nmax = 254,373) and European (Nmax = 1,220,901) populations. Among these phenotypes, rm was estimated to range from 0.584 (se = 0.140) for breast cancer to 0.949 (se = 0.035) for age of menarche, with an average of 0.835 (se = 0.045). We also uncovered that the trans-ethnic genetic prediction accuracy for phenotypes in the target population would substantially become low when using associated SNPs identified in non-target populations, indicating that associations discovered in the one population cannot be simply generalized to another population and that the accuracy of trans-ethnic phenotype prediction is generally dissatisfactory. Overall, our study provides in-depth insight into trans-ethnic genetic correlation and prediction for complex phenotypes across diverse populations.
Collapse
Affiliation(s)
- Haojie Lu
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ting Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Jinhui Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Shuo Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Shuiping Huang
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.,Center for Medical Statistics and Data Analysis, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Epidemiology and Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China. .,Center for Medical Statistics and Data Analysis, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
32
|
Lin YP, Mitchell-Olds T, Lee CR. The ecological, genetic and genomic architecture of local adaptation and population differentiation in Boechera stricta. Proc Biol Sci 2021; 288:20202472. [PMID: 33878927 DOI: 10.1098/rspb.2020.2472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Differential local adaptation restricts gene flow between populations inhabiting distinct environments, resulting in isolation by adaptation. In addition to the statistical inferences of genotype-environment associations, an integrative approach is needed to investigate the effect of local adaptation on population divergence at the ecological, genetic and genomic scale. Here, we combine reciprocal transplant, genome-environment association and QTL mapping to investigate local adaptation in Boechera stricta (Drummond's rockcress). With reciprocal transplant experiment, we found local genetic groups exhibit phenotypic characteristics corresponding to the distinct selection forces from different water availability. At the genetic level, the local allele of a major fitness QTL confers higher and sturdier flowering stalks, maximizing the fecundity fitness component under sufficient water supply, and its genetic variation is associated with precipitation across the landscape. At the genomewide scale, we further showed that multiple loci associated with precipitation are highly differentiated between genetic groups, suggesting that local adaptation has a widespread effect on reducing gene flow. This study provides one of the few comprehensive examples demonstrating how local adaptation facilitates population divergence at the trait, gene and genome level.
Collapse
Affiliation(s)
- Ya-Ping Lin
- Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei 10617, Taiwan
| | | | - Cheng-Ruei Lee
- Institute of Ecology and Evolutionary Biology, National Taiwan University, Taipei 10617, Taiwan.,Genome and Systems Biology Degree Program, National Taiwan University, Taipei 10617, Taiwan.,Institute of Plant Biology, National Taiwan University, Taipei 10617, Taiwan
| |
Collapse
|
33
|
Guo S, Huang S, Jiang X, Hu H, Han D, Moreno CS, Fairbrother GL, Hughes DA, Stoneking M, Khaitovich P. Variation of microRNA expression in the human placenta driven by population identity and sex of the newborn. BMC Genomics 2021; 22:286. [PMID: 33879051 PMCID: PMC8059241 DOI: 10.1186/s12864-021-07542-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 03/22/2021] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Analysis of lymphocyte cell lines revealed substantial differences in the expression of mRNA and microRNA (miRNA) among human populations. The extent of such population-associated differences in actual human tissues remains largely unexplored. The placenta is one of the few solid human tissues that can be collected in substantial numbers in a controlled manner, enabling quantitative analysis of transient biomolecules such as RNA transcripts. Here, we analyzed microRNA (miRNA) expression in human placental samples derived from 36 individuals representing four genetically distinct human populations: African Americans, European Americans, South Asians, and East Asians. All samples were collected at the same hospital following a unified protocol, thus minimizing potential biases that might influence the results. RESULTS Sequence analysis of the miRNA fraction yielded 938 annotated and 70 novel miRNA transcripts expressed in the placenta. Of them, 82 (9%) of annotated and 11 (16%) of novel miRNAs displayed quantitative expression differences among populations, generally reflecting reported genetic and mRNA-expression-based distances. Several co-expressed miRNA clusters stood out from the rest of the population-associated differences in terms of miRNA evolutionary age, tissue-specificity, and disease-association characteristics. Among three non-environmental influenced demographic parameters, the second largest contributor to miRNA expression variation after population was the sex of the newborn, with 32 miRNAs (3% of detected) exhibiting significant expression differences depending on whether the newborn was male or female. Male-associated miRNAs were evolutionarily younger and correlated inversely with the expression of target mRNA involved in neuron-related functions. In contrast, both male and female-associated miRNAs appeared to mediate different types of hormonal responses. Demographic factors further affected reported imprinted expression of 66 placental miRNAs: the imprinting strength correlated with the mother's weight, but not height. CONCLUSIONS Our results showed that among 12 assessed demographic variables, population affiliation and fetal sex had a substantial influence on miRNA expression variation among human placental samples. The effect of newborn-sex-associated miRNA differences further led to expression inhibition of the target genes clustering in specific functional pathways. By contrast, population-driven miRNA differences might mainly represent neutral changes with minimal functional impacts.
Collapse
Affiliation(s)
- Song Guo
- Skolkovo Institute of Science and Technology, 121205, Moscow, Russia
| | - Shuyun Huang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, CAS, 320 Yue Yang Road, Shanghai, 200031, China
| | - Xi Jiang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, CAS, 320 Yue Yang Road, Shanghai, 200031, China
| | - Haiyang Hu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, CAS, 320 Yue Yang Road, Shanghai, 200031, China
| | - Dingding Han
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, CAS, 320 Yue Yang Road, Shanghai, 200031, China
| | - Carlos S Moreno
- Department of Pathology and Laboratory Medicine and Department of Biomedical Informatics, Emory University, Atlanta, GA, 30322, USA
| | - Genevieve L Fairbrother
- Obstetrics and Gynecology of Atlanta, 1100 Johnson Ferry Rd NE Suite 800, Center 2, Atlanta, GA, 30342, USA
| | - David A Hughes
- MRC Integrative Epidemiology Unit at University of Bristol, Bristol, BS8 2BN, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, BS8 2BN, UK
| | - Mark Stoneking
- Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany.
| | | |
Collapse
|
34
|
Guo J, Bakshi A, Wang Y, Jiang L, Yengo L, Goddard ME, Visscher PM, Yang J. Quantifying genetic heterogeneity between continental populations for human height and body mass index. Sci Rep 2021; 11:5240. [PMID: 33664403 PMCID: PMC7933291 DOI: 10.1038/s41598-021-84739-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 02/22/2021] [Indexed: 12/26/2022] Open
Abstract
Genome-wide association studies (GWAS) in samples of European ancestry have identified thousands of genetic variants associated with complex traits in humans. However, it remains largely unclear whether these associations can be used in non-European populations. Here, we seek to quantify the proportion of genetic variation for a complex trait shared between continental populations. We estimated the between-population correlation of genetic effects at all SNPs (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$r_{g}$$\end{document}rg) or genome-wide significant SNPs (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$r_{{g\left( {GWS} \right)}}$$\end{document}rgGWS) for height and body mass index (BMI) in samples of European (EUR; \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n = 49,839$$\end{document}n=49,839) and African (AFR; \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$n = 17,426$$\end{document}n=17,426) ancestry. The \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\hat{r}_{g}$$\end{document}r^g between EUR and AFR was 0.75 (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\text{s}}.{\text{e}}. = 0.035$$\end{document}s.e.=0.035) for height and 0.68 (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\text{s}}.{\text{e}}. = 0.062$$\end{document}s.e.=0.062) for BMI, and the corresponding \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\hat{r}_{{g\left( {GWS} \right)}}$$\end{document}r^gGWS was 0.82 (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\text{s}}.{\text{e}}. = 0.030$$\end{document}s.e.=0.030) for height and 0.87 (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\text{s}}.{\text{e}}. = 0.064$$\end{document}s.e.=0.064) for BMI, suggesting that a large proportion of GWAS findings discovered in Europeans are likely applicable to non-Europeans for height and BMI. There was no evidence that \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\hat{r}_{g}$$\end{document}r^g differs in SNP groups with different levels of between-population difference in allele frequency or linkage disequilibrium, which, however, can be due to the lack of power.
Collapse
Affiliation(s)
- Jing Guo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia.,Human Genetics, Wellcome Sanger Institute, Hinxton, CB10 1SA, UK
| | - Andrew Bakshi
- Monash Partners Comprehensive Cancer Consortium, Monash Biomedicine Discovery Institute Cancer Program, Prostate Cancer Research Group, Department of Anatomy and Developmental Biology, Monash University, Clayton, VIC, 3800, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Ying Wang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Longda Jiang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Loic Yengo
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, VIC, Australia.,Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, VIC, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, 4072, Australia. .,School of Life Sciences, Westlake University, Hangzhou, 310024, Zhejiang, China. .,Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, 310024, Zhejiang, China.
| |
Collapse
|
35
|
Kassam I, Tan S, Gan FF, Saw WY, Tan LWL, Moong DKN, Soong R, Teo YY, Loh M. Genome-wide identification of cis DNA methylation quantitative trait loci in three Southeast Asian Populations. Hum Mol Genet 2021; 30:603-618. [PMID: 33547791 DOI: 10.1093/hmg/ddab038] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 01/25/2021] [Accepted: 01/28/2021] [Indexed: 12/12/2022] Open
Abstract
DNA methylation (DNAm) is an epigenetic modification that acts to regulate gene transcription, is essential for cellular processes and plays an important role in complex traits and disease. Variation in DNAm levels is influenced by both genetic and environmental factors. Several studies have examined the extent to which common genetic variation influences DNAm (i.e. mQTLs), however, an improved understanding of mQTLs across diverse human populations is needed to increase their utility in integrative genomic studies in order to further our understanding of complex trait and disease biology. Here, we systematically examine cis-mQTLs in three Southeast Asian populations in the Singapore Integrative Omics (iOmics) Study, comprised of Chinese (n = 93), Indians (n = 83) and Malays (n = 78). A total of 24 851 cis-mQTL probes were associated with at least one SNP in meta- and ethnicity-specific analyses at a stringent significance level. These cis-mQTL probes show significant differences in local SNP heritability between the ethnicities, enrichment in functionally relevant regions using data from the Roadmap Epigenomics Mapping Consortium and are associated with nearby genes and complex traits due to pleiotropy. Importantly, DNAm prediction performance and the replication of cis-mQTLs both within iOmics and between two independent mQTL studies in European and Bangladeshi individuals is best when the genetic distance between the ethnicities is small, with differences in cis-mQTLs likely due to differences in allele frequency and linkage disequilibrium. This study highlights the importance of, and opportunities from, extending investigation of the genetic control of DNAm to Southeast Asian populations.
Collapse
Affiliation(s)
- Irfahan Kassam
- Life Sciences Institute, National University of Singapore, Singapore 117456.,Saw Swee Hock School of Public Health, National University of Singapore, Singapore 117549
| | - Sili Tan
- KK Research Centre, KK Women's and Children's Hospital, Singapore 229899
| | - Fei Fei Gan
- Department of NUH Tissue Repository, National University Health System, Singapore 119228
| | - Woei-Yuh Saw
- Baker Heart and Diabetes Institute, Melbourne Victoria, Australia 3004
| | - Linda Wei-Lin Tan
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore 117549
| | - Don Kyin Nwe Moong
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore 117549
| | - Richie Soong
- Cancer Science Institute of Singapore, National University of Singapore, Singapore 117599
| | - Yik-Ying Teo
- Life Sciences Institute, National University of Singapore, Singapore 117456.,Saw Swee Hock School of Public Health, National University of Singapore, Singapore 117549
| | - Marie Loh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore 308232.,Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom W2 1PG.,Translational Laboratory in Genetic Medicine, Agency for Science, Technology and Research (ASTAR), Singapore 138648
| |
Collapse
|
36
|
Li JH, Mazur CA, Berisa T, Pickrell JK. Low-pass sequencing increases the power of GWAS and decreases measurement error of polygenic risk scores compared to genotyping arrays. Genome Res 2021; 31:529-537. [PMID: 33536225 PMCID: PMC8015847 DOI: 10.1101/gr.266486.120] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 02/01/2021] [Indexed: 01/08/2023]
Abstract
Low-pass sequencing (sequencing a genome to an average depth less than 1× coverage) combined with genotype imputation has been proposed as an alternative to genotyping arrays for trait mapping and calculation of polygenic scores. To empirically assess the relative performance of these technologies for different applications, we performed low-pass sequencing (targeting coverage levels of 0.5× and 1×) and array genotyping (using the Illumina Global Screening Array [GSA]) on 120 DNA samples derived from African- and European-ancestry individuals that are part of the 1000 Genomes Project. We then imputed both the sequencing data and the genotyping array data to the 1000 Genomes Phase 3 haplotype reference panel using a leave-one-out design. We evaluated overall imputation accuracy from these different assays as well as overall power for GWAS from imputed data and computed polygenic risk scores for coronary artery disease and breast cancer using previously derived weights. We conclude that low-pass sequencing plus imputation, in addition to providing a substantial increase in statistical power for genome-wide association studies, provides increased accuracy for polygenic risk prediction at effective coverages of ∼0.5× and higher compared to the Illumina GSA.
Collapse
Affiliation(s)
- Jeremiah H Li
- Gencove, Incorporated, New York, New York 10016, USA
| | - Chase A Mazur
- Gencove, Incorporated, New York, New York 10016, USA
| | - Tomaz Berisa
- Gencove, Incorporated, New York, New York 10016, USA
| | | |
Collapse
|
37
|
Martin CA, Armstrong C, Illera JC, Emerson BC, Richardson DS, Spurgin LG. Genomic variation, population history and within-archipelago adaptation between island bird populations. ROYAL SOCIETY OPEN SCIENCE 2021; 8:201146. [PMID: 33972847 PMCID: PMC8074581 DOI: 10.1098/rsos.201146] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 01/11/2021] [Indexed: 05/13/2023]
Abstract
Oceanic island archipelagos provide excellent models to understand evolutionary processes. Colonization events and gene flow can interact with selection to shape genetic variation at different spatial scales. Landscape-scale variation in biotic and abiotic factors may drive fine-scale selection within islands, while long-term evolutionary processes may drive divergence between distantly related populations. Here, we examine patterns of population history and selection between recently diverged populations of the Berthelot's pipit (Anthus berthelotii), a passerine endemic to three North Atlantic archipelagos. First, we use demographic trees and f3 statistics to show that genome-wide divergence across the species range is largely shaped by colonization and bottlenecks, with evidence of very weak gene flow between populations. Then, using a genome scan approach, we identify signatures of divergent selection within archipelagos at single nucleotide polymorphisms (SNPs) in genes potentially associated with craniofacial development and DNA repair. We did not detect within-archipelago selection at the same SNPs as were detected previously at broader spatial scales between archipelagos, but did identify signatures of selection at loci associated with similar biological functions. These findings suggest that similar ecological factors may repeatedly drive selection between recently separated populations, as well as at broad spatial scales across varied landscapes.
Collapse
Affiliation(s)
- Claudia A. Martin
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Claire Armstrong
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
- NERC Biomolecular Analysis Facility, Department of Animal and Plant Sciences, University of Sheffield, Alfred Denny Building, Western Bank, Sheffield S10 2TN, UK
| | - Juan Carlos Illera
- Oviedo University, Campus of Mieres, Research Unit of Biodiversity (UO-CSIC-PA), Research Building, 5th floor, c/Gonzalo Gutiérrez Quirós, s/n, 33600 Mieres, Asturias, Spain
| | - Brent C. Emerson
- Island Ecology and Evolution Research Group, Institute of Natural Products and Agrobiology (IPNA-CSIC), C/Astrofísico Francisco Sánchez 3, 38206 La Laguna, Tenerife, Canary Islands, Spain
| | - David S. Richardson
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| | - Lewis G. Spurgin
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
| |
Collapse
|
38
|
LaBella AL, Abraham A, Pichkar Y, Fong SL, Zhang G, Muglia LJ, Abbot P, Rokas A, Capra JA. Accounting for diverse evolutionary forces reveals mosaic patterns of selection on human preterm birth loci. Nat Commun 2020; 11:3731. [PMID: 32709900 PMCID: PMC7382462 DOI: 10.1038/s41467-020-17258-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 06/19/2020] [Indexed: 02/02/2023] Open
Abstract
Currently, there is no comprehensive framework to evaluate the evolutionary forces acting on genomic regions associated with human complex traits and contextualize the relationship between evolution and molecular function. Here, we develop an approach to test for signatures of diverse evolutionary forces on trait-associated genomic regions. We apply our method to regions associated with spontaneous preterm birth (sPTB), a complex disorder of global health concern. We find that sPTB-associated regions harbor diverse evolutionary signatures including conservation, excess population differentiation, accelerated evolution, and balanced polymorphism. Furthermore, we integrate evolutionary context with molecular evidence to hypothesize how these regions contribute to sPTB risk. Finally, we observe enrichment in signatures of diverse evolutionary forces in sPTB-associated regions compared to genomic background. By quantifying multiple evolutionary forces acting on sPTB-associated regions, our approach improves understanding of both functional roles and the mosaic of evolutionary forces acting on loci. Our work provides a blueprint for investigating evolutionary pressures on complex traits.
Collapse
Affiliation(s)
- Abigail L LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Abin Abraham
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, TN, 37232, USA
| | - Yakov Pichkar
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Sarah L Fong
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA
| | - Ge Zhang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, 45267, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, 45267, USA
| | - Louis J Muglia
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, 45267, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, 45267, USA
| | - Patrick Abbot
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA.
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.
| | - John A Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA.
- Departments of Biomedical Informatics and Computer Science, Vanderbilt Genetics Institute, Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA.
| |
Collapse
|
39
|
Chen M, Sidore C, Akiyama M, Ishigaki K, Kamatani Y, Schlessinger D, Cucca F, Okada Y, Chiang CWK. Evidence of Polygenic Adaptation in Sardinia at Height-Associated Loci Ascertained from the Biobank Japan. Am J Hum Genet 2020; 107:60-71. [PMID: 32533944 PMCID: PMC7332648 DOI: 10.1016/j.ajhg.2020.05.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 05/19/2020] [Indexed: 01/31/2023] Open
Abstract
Adult height is one of the earliest putative examples of polygenic adaptation in humans. However, this conclusion was recently challenged because residual uncorrected stratification from large-scale consortium studies was considered responsible for the previously noted genetic difference. It thus remains an open question whether height loci exhibit signals of polygenic adaptation in any human population. We re-examined this question, focusing on one of the shortest European populations, the Sardinians, in addition to mainland European populations. We utilized height-associated loci from the Biobank Japan (BBJ) dataset to further alleviate concerns of biased ascertainment of GWAS loci and showed that the Sardinians remain significantly shorter than expected under neutrality (∼0.22 standard deviation shorter than Utah residents with ancestry from northern and western Europe [CEU] on the basis of polygenic height scores, p = 3.89 × 10-4). We also found the trajectory of polygenic height scores between the Sardinian and the British populations diverged over at least the last 10,000 years (p = 0.0082), consistent with a signature of polygenic adaptation driven primarily by the Sardinian population. Although the polygenic score-based analysis showed a much subtler signature in mainland European populations, we found a clear and robust adaptive signature in the UK population by using a haplotype-based statistic, the trait singleton density score (tSDS), driven by the height-increasing alleles (p = 9.1 × 10-4). In summary, by ascertaining height loci in a distant East Asian population, we further supported the evidence of polygenic adaptation at height-associated loci among the Sardinians. In mainland Europeans, the adaptive signature was detected in haplotype-based analysis but not in polygenic score-based analysis.
Collapse
Affiliation(s)
- Minhui Chen
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA.
| | - Carlo Sidore
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato 09042, Cagliari, Italy
| | - Masato Akiyama
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan; Department of Ocular Pathology and Imaging Science, Graduate School of Medical Sciences, Kyushu University, Fukuoka 812-8582, Japan
| | - Kazuyoshi Ishigaki
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan; Kyoto-McGill International Collaborative School in Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto 606-8501, Japan
| | - David Schlessinger
- Laboratory of Genetics and Genomics, National Institute on Aging, US National Institutes of Health, Baltimore, MD 21224, USA
| | - Francesco Cucca
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato 09042, Cagliari, Italy
| | - Yukinori Okada
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan; Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka 565-0871, Japan
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA; Quantitative and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
40
|
Mathieson I. Human adaptation over the past 40,000 years. Curr Opin Genet Dev 2020; 62:97-104. [PMID: 32745952 PMCID: PMC7484260 DOI: 10.1016/j.gde.2020.06.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 05/10/2020] [Accepted: 06/01/2020] [Indexed: 02/07/2023]
Abstract
Over the past few years several methodological and data-driven advances have greatly improved our ability to robustly detect genomic signatures of selection in humans. New methods applied to large samples of present-day genomes provide increased power, while ancient DNA allows precise estimation of timing and tempo. However, despite these advances, we are still limited in our ability to translate these signatures into understanding about which traits were actually under selection, and why. Combining information from different populations and timescales may allow interpretation of selective sweeps. Other modes of selection have proved more difficult to detect. In particular, despite strong evidence of the polygenicity of most human traits, evidence for polygenic selection is weak, and its importance in recent human evolution remains unclear. Balancing selection and archaic introgression seem important for the maintenance of potentially adaptive immune diversity, but perhaps less so for other traits.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, United States.
| |
Collapse
|
41
|
Long-read bitter gourd ( Momordica charantia) genome and the genomic architecture of nonclassic domestication. Proc Natl Acad Sci U S A 2020; 117:14543-14551. [PMID: 32461376 DOI: 10.1073/pnas.1921016117] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The genetic architecture of quantitative traits is determined by both Mendelian and polygenic factors, yet classic examples of plant domestication focused on selective sweep of newly mutated Mendelian genes. Here we report the chromosome-level genome assembly and the genomic investigation of a nonclassic domestication example, bitter gourd (Momordica charantia), an important Asian vegetable and medicinal plant of the family Cucurbitaceae. Population resequencing revealed the divergence between wild and South Asian cultivars about 6,000 y ago, followed by the separation of the Southeast Asian cultivars about 800 y ago, with the latter exhibiting more extreme trait divergence from wild progenitors and stronger signs of selection on fruit traits. Unlike some crops where the largest phenotypic changes and traces of selection happened between wild and cultivar groups, in bitter gourd large differences exist between two regional cultivar groups, likely reflecting the distinct consumer preferences in different countries. Despite breeding efforts toward increasing female flower proportion, a gynoecy locus exhibits complex patterns of balanced polymorphism among haplogroups, with potential signs of selective sweep within haplogroups likely reflecting artificial selection and introgression from cultivars back to wild accessions. Our study highlights the importance to investigate such nonclassic example of domestication showing signs of balancing selection and polygenic trait architecture in addition to classic selective sweep in Mendelian factors.
Collapse
|
42
|
Yu C, Ni G, van der Werf J, Lee SH. Detecting Genotype-Population Interaction Effects by Ancestry Principal Components. Front Genet 2020; 11:379. [PMID: 32373165 PMCID: PMC7186421 DOI: 10.3389/fgene.2020.00379] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Accepted: 03/27/2020] [Indexed: 01/22/2023] Open
Abstract
Heterogeneity in the phenotypic mean and variance across populations is often observed for complex traits. One way to understand heterogeneous phenotypes lies in uncovering heterogeneity in genetic effects. Previous studies on genetic heterogeneity across populations were typically based on discrete groups in populations stratified by different countries or cohorts, which ignored the difference of population characteristics for the individuals within each group and resulted in loss of information. Here, we introduce a novel concept of genotype-by-population (G × P) interaction where population is defined by the first and second ancestry principal components (PCs), which are less likely to be confounded with country/cohort-specific factors. We applied a reaction norm model fitting each of 70 complex traits with significant SNP-heritability and the PCs as covariates to examine G × P interactions across diverse populations including white British and other white Europeans from the UK Biobank (N = 22,229). Our results demonstrated a significant population genetic heterogeneity for behavioral traits such as age at first sexual intercourse and academic qualification. Our approach may shed light on the latent genetic architecture of complex traits that underlies the modulation of genetic effects across different populations.
Collapse
Affiliation(s)
- Chenglong Yu
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
- College of Medicine and Public Health, Flinders University, Bedford Park, SA, Australia
- South Australian Health and Medical Research Institute, Adelaide, SA, Australia
| | - Guiyan Ni
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Julius van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - S. Hong Lee
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
- South Australian Health and Medical Research Institute, Adelaide, SA, Australia
| |
Collapse
|
43
|
Khrunin AV, Khvorykh GV, Fedorov AN, Limborska SA. Genomic landscape of the signals of positive natural selection in populations of Northern Eurasia: A view from Northern Russia. PLoS One 2020; 15:e0228778. [PMID: 32023328 PMCID: PMC7001972 DOI: 10.1371/journal.pone.0228778] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 01/23/2020] [Indexed: 12/15/2022] Open
Abstract
Natural selection of beneficial genetic variants played a critical role in human adaptation to a wide range of environmental conditions. Northern Eurasia, despite its severe climate, is home to lots of ethnically diverse populations. The genetic variants associated with the survival of these populations have hardly been analyzed. We searched for the genomic signatures of positive selection in (1) the genome-wide microarray data of 432 people from eight different northern Russian populations and (2) the whole-genome sequences of 250 people from Northern Eurasia from a public repository through testing the extended haplotype homozigosity (EHH) and direct comparison of allele frequency, respectively. The 20 loci with the strongest selection signals were characterized in detail. Among the top EHH hits were the NRG3 and NBEA genes, which are involved in the development and functioning of the neural system, the PTPRM gene, which mediates cell-cell interactions and adhesion, and a region on chromosome 4 (chr4:28.7-28.9 Mb) that contained several loci affiliated with different classes of non-coding RNAs (RN7SL101P, MIR4275, MESTP3, and LINC02364). NBEA and the region on chromosome 4 were novel selection targets that were identified for the first time in Western Siberian populations. Cross-population comparisons of EHH profiles suggested a particular role for the chr4:28.7-28.9 Mb region in the local adaptation of Western Siberians. The strongest selection signal identified in Siberian sequenced genomes was formed by six SNPs on chromosome 11 (chr11:124.9-125.2 Mb). This region included well-known genes SLC37A2 and PKNOX2. SLC37A2 is most-highly expressed in the gut. Its expression is regulated by vitamin D, which is often deficient in northern regions. The PKNOX2 gene is a transcription factor of the homeobox family that is expressed in the brain and many other tissues. This gene is associated with alcohol addiction, which is widespread in many Northern Eurasian populations.
Collapse
Affiliation(s)
- Andrey V. Khrunin
- Department of Molecular Bases of Human Genetics, Institute of Molecular Genetics of Russian Academy of Sciences, Moscow, Russia
| | - Gennady V. Khvorykh
- Department of Molecular Bases of Human Genetics, Institute of Molecular Genetics of Russian Academy of Sciences, Moscow, Russia
| | - Alexei N. Fedorov
- Department of Molecular Bases of Human Genetics, Institute of Molecular Genetics of Russian Academy of Sciences, Moscow, Russia
- Department of Medicine, University of Toledo, Toledo, Ohio, United States of America
| | - Svetlana A. Limborska
- Department of Molecular Bases of Human Genetics, Institute of Molecular Genetics of Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
44
|
Fustier MA, Martínez-Ainsworth NE, Aguirre-Liguori JA, Venon A, Corti H, Rousselet A, Dumas F, Dittberner H, Camarena MG, Grimanelli D, Ovaskainen O, Falque M, Moreau L, de Meaux J, Montes-Hernández S, Eguiarte LE, Vigouroux Y, Manicacci D, Tenaillon MI. Common gardens in teosintes reveal the establishment of a syndrome of adaptation to altitude. PLoS Genet 2019; 15:e1008512. [PMID: 31860672 PMCID: PMC6944379 DOI: 10.1371/journal.pgen.1008512] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2019] [Revised: 01/06/2020] [Accepted: 11/07/2019] [Indexed: 12/14/2022] Open
Abstract
In plants, local adaptation across species range is frequent. Yet, much has to be discovered on its environmental drivers, the underlying functional traits and their molecular determinants. Genome scans are popular to uncover outlier loci potentially involved in the genetic architecture of local adaptation, however links between outliers and phenotypic variation are rarely addressed. Here we focused on adaptation of teosinte populations along two elevation gradients in Mexico that display continuous environmental changes at a short geographical scale. We used two common gardens, and phenotyped 18 traits in 1664 plants from 11 populations of annual teosintes. In parallel, we genotyped these plants for 38 microsatellite markers as well as for 171 outlier single nucleotide polymorphisms (SNPs) that displayed excess of allele differentiation between pairs of lowland and highland populations and/or correlation with environmental variables. Our results revealed that phenotypic differentiation at 10 out of the 18 traits was driven by local selection. Trait covariation along the elevation gradient indicated that adaptation to altitude results from the assembly of multiple co-adapted traits into a complex syndrome: as elevation increases, plants flower earlier, produce less tillers, display lower stomata density and carry larger, longer and heavier grains. The proportion of outlier SNPs associating with phenotypic variation, however, largely depended on whether we considered a neutral structure with 5 genetic groups (73.7%) or 11 populations (13.5%), indicating that population stratification greatly affected our results. Finally, chromosomal inversions were enriched for both SNPs whose allele frequencies shifted along elevation as well as phenotypically-associated SNPs. Altogether, our results are consistent with the establishment of an altitudinal syndrome promoted by local selective forces in teosinte populations in spite of detectable gene flow. Because elevation mimics climate change through space, SNPs that we found underlying phenotypic variation at adaptive traits may be relevant for future maize breeding.
Collapse
Affiliation(s)
- Margaux-Alison Fustier
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Natalia E. Martínez-Ainsworth
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Jonás A. Aguirre-Liguori
- Departamento de Ecología Evolutiva, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| | - Anthony Venon
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Hélène Corti
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Agnès Rousselet
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Fabrice Dumas
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Hannes Dittberner
- Institute of Botany, University of Cologne Biocenter, Cologne, Germany
| | - María G. Camarena
- Campo Experimental Bajío, InstitutoNacional de Investigaciones Forestales, Agrícolas y Pecuarias, Celaya, Mexico
| | - Daniel Grimanelli
- UMR Diversité, Adaptation et Développement des plantes, Université de Montpellier, Institut de Recherche pour le développement, Montpellier, France
| | - Otso Ovaskainen
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
- Centre for Biodiversity Dynamics, Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
| | - Matthieu Falque
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Laurence Moreau
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Juliette de Meaux
- Institute of Botany, University of Cologne Biocenter, Cologne, Germany
| | - Salvador Montes-Hernández
- Campo Experimental Bajío, InstitutoNacional de Investigaciones Forestales, Agrícolas y Pecuarias, Celaya, Mexico
| | - Luis E. Eguiarte
- Departamento de Ecología Evolutiva, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
| | - Yves Vigouroux
- UMR Diversité, Adaptation et Développement des plantes, Université de Montpellier, Institut de Recherche pour le développement, Montpellier, France
| | - Domenica Manicacci
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| | - Maud I. Tenaillon
- Génétique Quantitative et Evolution – Le Moulon, Université Paris-Saclay, Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, Centre National de la Recherche Scientifique, AgroParisTech, Gif-sur-Yvette, France
| |
Collapse
|
45
|
Abstract
PURPOSE OF REVIEW The goal of the review is to provide a comprehensive overview of the current understanding of the mechanisms underlying variation in human stature. RECENT FINDINGS Human height is an anthropometric trait that varies considerably within human populations as well as across the globe. Historically, much research focus was placed on understanding the biology of growth plate chondrocytes and how modifications to core chondrocyte proliferation and differentiation pathways potentially shaped height attainment in normal as well as pathological contexts. Recently, much progress has been made to improve our understanding regarding the mechanisms underlying the normal and pathological range of height variation within as well as between human populations, and today, it is understood to reflect complex interactions among a myriad of genetic, environmental, and evolutionary factors. Indeed, recent improvements in genetics (e.g., GWAS) and breakthroughs in functional genomics (e.g., whole exome sequencing, DNA methylation analysis, ATAC-sequencing, and CRISPR) have shed light on previously unknown pathways/mechanisms governing pathological and common height variation. Additionally, the use of an evolutionary perspective has also revealed important mechanisms that have shaped height variation across the planet. This review provides an overview of the current knowledge of the biological mechanisms underlying height variation by highlighting new research findings on skeletal growth control with an emphasis on previously unknown pathways/mechanisms influencing pathological and common height variation. In this context, this review also discusses how evolutionary forces likely shaped the genomic architecture of height across the globe.
Collapse
Affiliation(s)
| | - Terence D Capellini
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
46
|
Xiang R, Berg IVD, MacLeod IM, Hayes BJ, Prowse-Wilkins CP, Wang M, Bolormaa S, Liu Z, Rochfort SJ, Reich CM, Mason BA, Vander Jagt CJ, Daetwyler HD, Lund MS, Chamberlain AJ, Goddard ME. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc Natl Acad Sci U S A 2019; 116:19398-19408. [PMID: 31501319 PMCID: PMC6765237 DOI: 10.1073/pnas.1904159116] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Many genome variants shaping mammalian phenotype are hypothesized to regulate gene transcription and/or to be under selection. However, most of the evidence to support this hypothesis comes from human studies. Systematic evidence for regulatory and evolutionary signals contributing to complex traits in a different mammalian model is needed. Sequence variants associated with gene expression (expression quantitative trait loci [eQTLs]) and concentration of metabolites (metabolic quantitative trait loci [mQTLs]) and under histone-modification marks in several tissues were discovered from multiomics data of over 400 cattle. Variants under selection and evolutionary constraint were identified using genome databases of multiple species. These analyses defined 30 sets of variants, and for each set, we estimated the genetic variance the set explained across 34 complex traits in 11,923 bulls and 32,347 cows with 17,669,372 imputed variants. The per-variant trait heritability of these sets across traits was highly consistent (r > 0.94) between bulls and cows. Based on the per-variant heritability, conserved sites across 100 vertebrate species and mQTLs ranked the highest, followed by eQTLs, young variants, those under histone-modification marks, and selection signatures. From these results, we defined a Functional-And-Evolutionary Trait Heritability (FAETH) score indicating the functionality and predicted heritability of each variant. In additional 7,551 cattle, the high FAETH-ranking variants had significantly increased genetic variances and genomic prediction accuracies in 3 production traits compared to the low FAETH-ranking variants. The FAETH framework combines the information of gene regulation, evolution, and trait heritability to rank variants, and the publicly available FAETH data provide a set of biological priors for cattle genomic selection worldwide.
Collapse
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia;
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Irene van den Berg
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Benjamin J Hayes
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- Centre for Animal Science, The University of Queensland, St. Lucia, QLD 4067, Australia
| | - Claire P Prowse-Wilkins
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Min Wang
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Sunduimijid Bolormaa
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Zhiqian Liu
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Simone J Rochfort
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Coralie M Reich
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Michael E Goddard
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| |
Collapse
|
47
|
Analysis of polygenic risk score usage and performance in diverse human populations. Nat Commun 2019; 10:3328. [PMID: 31346163 PMCID: PMC6658471 DOI: 10.1038/s41467-019-11112-0] [Citation(s) in RCA: 616] [Impact Index Per Article: 102.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Accepted: 06/18/2019] [Indexed: 12/11/2022] Open
Abstract
A historical tendency to use European ancestry samples hinders medical genetics research, including the use of polygenic scores, which are individual-level metrics of genetic risk. We analyze the first decade of polygenic scoring studies (2008–2017, inclusive), and find that 67% of studies included exclusively European ancestry participants and another 19% included only East Asian ancestry participants. Only 3.8% of studies were among cohorts of African, Hispanic, or Indigenous peoples. We find that predictive performance of European ancestry-derived polygenic scores is lower in non-European ancestry samples (e.g. African ancestry samples: t = −5.97, df = 24, p = 3.7 × 10−6), and we demonstrate the effects of methodological choices in polygenic score distributions for worldwide populations. These findings highlight the need for improved treatment of linkage disequilibrium and variant frequencies when applying polygenic scoring to cohorts of non-European ancestry, and bolster the rationale for large-scale GWAS in diverse human populations. Predominant participation of European-ancestry individuals in genetic studies has hindered the better understanding of genetic risk in non-European ancestry individuals. Here, Duncan et al. quantify polygenic risk score use and performance in worldwide populations.
Collapse
|
48
|
Nakayama K, Inaba Y. Genetic variants influencing obesity-related traits in Japanese population. Ann Hum Biol 2019; 46:298-304. [PMID: 31307227 DOI: 10.1080/03014460.2019.1644373] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Context: Adipose tissue is the main organ that stores energy and participates in adaptive thermogenesis of the human body. The adipose tissue content in an individual is determined by a combination of genetic factors and lifestyle related factors. While Japanese people, along with the closely related East Asians, are generally thinner than individuals of European ancestry, they are prone to accumulating visceral adipose tissues. Genome-wide discovery of loci influencing obesity-related traits, and application of the genome sequence data to assess natural selection, provides evidence that the obesity-related traits in East Asians might be shaped by natural selection. Objective: This review aims to summarise health and evolutionary implications of genetic variants influencing obesity-related traits in Japanese. Methods: This study gathered recently published papers of medical, genetic and evolutionary studies regarding obesity-related traits in the Japanese and closely related East Asians. Results and conclusion: A high susceptibility to central obesity of Japanese and closely related East Asians might have been shaped by natural selection favouring thrifty genotypes. Moreover, natural selection favouring higher thermogenic activity of brown adipose tissues would contribute to increased non-thrifty alleles in ancestors of East Asians.
Collapse
Affiliation(s)
- Kazuhiro Nakayama
- Laboratory of Evolutionary Anthropology, Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo , Chiba , Japan
| | - Yuta Inaba
- Laboratory of Evolutionary Anthropology, Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo , Chiba , Japan
| |
Collapse
|
49
|
Sohail M, Maier RM, Ganna A, Bloemendal A, Martin AR, Turchin MC, Chiang CWK, Hirschhorn J, Daly MJ, Patterson N, Neale B, Mathieson I, Reich D, Sunyaev SR. Polygenic adaptation on height is overestimated due to uncorrected stratification in genome-wide association studies. eLife 2019; 8:e39702. [PMID: 30895926 PMCID: PMC6428571 DOI: 10.7554/elife.39702] [Citation(s) in RCA: 234] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Accepted: 01/15/2019] [Indexed: 01/03/2023] Open
Abstract
Genetic predictions of height differ among human populations and these differences have been interpreted as evidence of polygenic adaptation. These differences were first detected using SNPs genome-wide significantly associated with height, and shown to grow stronger when large numbers of sub-significant SNPs were included, leading to excitement about the prospect of analyzing large fractions of the genome to detect polygenic adaptation for multiple traits. Previous studies of height have been based on SNP effect size measurements in the GIANT Consortium meta-analysis. Here we repeat the analyses in the UK Biobank, a much more homogeneously designed study. We show that polygenic adaptation signals based on large numbers of SNPs below genome-wide significance are extremely sensitive to biases due to uncorrected population stratification. More generally, our results imply that typical constructions of polygenic scores are sensitive to population stratification and that population-level differences should be interpreted with caution. Editorial note This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that all the issues have been addressed (see decision letter).
Collapse
Affiliation(s)
- Mashaal Sohail
- Division of Genetics, Department of MedicineBrigham and Women’s Hospital and Harvard Medical SchoolBostonUnited States
- Department of Biomedical InformaticsHarvard Medical SchoolBostonUnited States
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
| | - Robert M Maier
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
| | - Andrea Ganna
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
- Department of Medical Epidemiology and BiostatisticsKarolinska InstitutetStockholmSweden
- Institute for Molecular Medicine FinlandUniversity of HelsinkiHelsinkiFinland
| | - Alex Bloemendal
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
| | - Alicia R Martin
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
| | - Michael C Turchin
- Center for Computational Molecular BiologyBrown UniversityProvidenceUnited States
- Department of Ecology and Evolutionary BiologyBrown UniversityProvidenceUnited States
| | - Charleston WK Chiang
- Department of Preventive Medicine, Center for Genetic Epidemiology, Keck School of MedicineUniversity of Southern CaliforniaLos AngelesUnited States
| | - Joel Hirschhorn
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Departments of Pediatrics and GeneticsHarvard Medical SchoolBostonUnited States
- Division of Endocrinology and Center for Basic and Translational Obesity ResearchBoston Children’s HospitalBostonUnited States
| | - Mark J Daly
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
- Institute for Molecular Medicine FinlandUniversity of HelsinkiHelsinkiFinland
| | - Nick Patterson
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Department of GeneticsHarvard Medical SchoolBostonUnited States
| | - Benjamin Neale
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Stanley Center for Psychiatric ResearchBroad Institute of MIT and HarvardCambridgeUnited States
- Analytical and Translational Genetics UnitMassachusetts General HospitalBostonUnited States
| | - Iain Mathieson
- Department of Genetics, Perelman School of MedicineUniversity of PennsylvaniaPhiladelphiaUnited States
| | - David Reich
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Department of GeneticsHarvard Medical SchoolBostonUnited States
- Howard Hughes Medical Institute, Harvard Medical SchoolBostonUnited States
| | - Shamil R Sunyaev
- Department of Biomedical InformaticsHarvard Medical SchoolBostonUnited States
- Program in Medical and Population GeneticsBroad Institute of MIT and HarvardCambridgeUnited States
- Division of Genetics, Department of MedicineBrigham and Women’s Hospital and Harvard Medical SchoolBostonUnited States
| |
Collapse
|
50
|
Grinde KE, Qi Q, Thornton TA, Liu S, Shadyab AH, Chan KHK, Reiner AP, Sofer T. Generalizing polygenic risk scores from Europeans to Hispanics/Latinos. Genet Epidemiol 2019; 43:50-62. [PMID: 30368908 PMCID: PMC6330129 DOI: 10.1002/gepi.22166] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 08/12/2018] [Accepted: 08/28/2018] [Indexed: 12/17/2022]
Abstract
Polygenic risk scores (PRSs) are weighted sums of risk allele counts of single-nucleotide polymorphisms (SNPs) associated with a disease or trait. PRSs are typically constructed based on published results from Genome-Wide Association Studies (GWASs), and the majority of which has been performed in large populations of European ancestry (EA) individuals. Although many genotype-trait associations have generalized across populations, the optimal choice of SNPs and weights for PRSs may differ between populations due to different linkage disequilibrium (LD) and allele frequency patterns. We compare various approaches for PRS construction, using GWAS results from both large EA studies and a smaller study in Hispanics/Latinos: The Hispanic Community Health Study/Study of Latinos (HCHS/SOL, n = 12 , 803 ). We consider multiple approaches for selecting SNPs and for computing SNP weights. We study the performance of the resulting PRSs in an independent study of Hispanics/Latinos from the Women's Health Initiative (WHI, n = 3 , 582 ). We support our investigation with simulation studies of potential genetic architectures in a single locus. We observed that selecting variants based on EA GWASs generally performs well, except for blood pressure trait. However, the use of EA GWASs for weight estimation was suboptimal. Using non-EA GWAS results to estimate weights improved results.
Collapse
Affiliation(s)
- Kelsey E. Grinde
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Qibin Qi
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | | | - Simin Liu
- Department of Epidemiology, Brown University, Providence, RI, USA
| | - Aladdin H. Shadyab
- Department of Family Medicine and Public Health, University of California San Diego, San Diego, CA, USA
| | - Kei Hang K. Chan
- Department of Epidemiology, Brown University, Providence, RI, USA
- Departments of Biomedical Sciences and Electronic Engineering, City University of Hong Kong, HKSAR
| | - Alexander P. Reiner
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Tamar Sofer
- Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| |
Collapse
|