1
|
Ma Y, Zhao Y, Zhang JF, Bi W. Efficient and accurate framework for genome-wide gene-environment interaction analysis in large-scale biobanks. Nat Commun 2025; 16:3064. [PMID: 40157913 PMCID: PMC11955004 DOI: 10.1038/s41467-025-57887-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Accepted: 03/03/2025] [Indexed: 04/01/2025] Open
Abstract
Gene-environment interaction (G×E) analysis elucidates the interplay between genetic and environmental factors. Genome-wide association studies (GWAS) have expanded to encompass complex traits like time-to-event and ordinal traits, which provide richer phenotypic information. However, most existing scalable approaches focus only on quantitative or binary traits. Here we propose SPAGxECCT, a scalable and accurate framework for diverse trait types. SPAGxECCT fits a genotype-independent model and employs a hybrid strategy including saddlepoint approximation (SPA) for accurate p value calculation, especially for low-frequency variants and unbalanced phenotypic distributions. We extend SPAGxECCT to SPAGxEmixCCT, which accounts for population stratification and is applicable to multi-ancestry or admixed populations. SPAGxEmixCCT can further be extended to SPAGxEmixCCT-local, which identifies ancestry-specific G×E effects using local ancestry. Through extensive simulations and real data analyses of UK Biobank data, we demonstrate that SPAGxECCT and SPAGxEmixCCT are scalable to analyze large-scale study cohort, control type I error rates effectively, and maintain power.
Collapse
Affiliation(s)
- Yuzhuo Ma
- Department of Medical Genetics, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Yanlong Zhao
- State Key Laboratory of Mathematical Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Ji-Feng Zhang
- State Key Laboratory of Mathematical Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Wenjian Bi
- Department of Medical Genetics, School of Basic Medical Sciences, Peking University, Beijing, China.
- Center for Medical Genetics, School of Basic Medical Sciences, Peking University, Beijing, China.
- Medicine Innovation Center for Fundamental Research on Major Immunology-related Diseases, Peking University, Beijing, China.
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University, Beijing, China.
| |
Collapse
|
2
|
Bu L, Wang Y, Tan L, Wen Z, Hu X, Zhang Z, Zhao Y. Haplotype analysis incorporating ancestral origins identified novel genetic loci associated with chicken body weight using an advanced intercross line. Genet Sel Evol 2024; 56:78. [PMID: 39707197 DOI: 10.1186/s12711-024-00946-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 12/04/2024] [Indexed: 12/23/2024] Open
Abstract
BACKGROUND The genome-wide association study (GWAS) is a powerful method for mapping quantitative trait loci (QTL). However, standard GWAS can detect only QTL that segregate in the mapping population. Crossing populations with different characteristics increases genetic variability but F2 or back-crosses lack mapping resolution due to the limited number of recombination events. This drawback can be overcome with advanced intercross line (AIL) populations, which increase the number recombination events and provide a more accurate mapping resolution. Recent studies in humans have revealed ancestry-dependent genetic architecture and shown the effectiveness of admixture mapping in admixed populations. RESULTS Through the incorporation of line-of-origin effects and GWAS on an F9 AIL population, we identified genes that affect body weight at eight weeks of age (BW8) in chickens. The proposed ancestral-haplotype-based GWAS (testing only the origin regardless of the alleles) revealed three new QTLs on GGA12, GGA15, and GGA20. By using the concepts of ancestral homozygotes (individuals that carry two haplotypes of the same origin) and ancestral heterozygotes (carrying one haplotype of each origin), we identified 632 loci that exhibited high-parent (the heterozygote is better than both parents) and mid-parent (the heterozygote is better than the median of the parents) dominance across 12 chromosomes. Out of the 199 genes associated with BW8, EYA1, PDE1C, and MYC were identified as the best candidate genes for further validation. CONCLUSIONS In addition to the candidate genes reported in this study, our research demonstrates the effectiveness of incorporating ancestral information in population genetic analyses, which can be broadly applicable for genetic mapping in populations generated by ancestors with distinct phenotypes and genetic backgrounds. Our methods can benefit both geneticists and biologists interested in the genetic determinism of complex traits.
Collapse
Affiliation(s)
- Lina Bu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yuzhe Wang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China
| | - Lizhi Tan
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zilong Wen
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
| | - Yiqiang Zhao
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China.
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China.
| |
Collapse
|
3
|
Sinnott-Armstrong N, Fields S, Roth F, Starita LM, Trapnell C, Villen J, Fowler DM, Queitsch C. Understanding genetic variants in context. eLife 2024; 13:e88231. [PMID: 39625477 PMCID: PMC11614383 DOI: 10.7554/elife.88231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 11/15/2024] [Indexed: 12/06/2024] Open
Abstract
Over the last three decades, human genetics has gone from dissecting high-penetrance Mendelian diseases to discovering the vast and complex genetic etiology of common human diseases. In tackling this complexity, scientists have discovered the importance of numerous genetic processes - most notably functional regulatory elements - in the development and progression of these diseases. Simultaneously, scientists have increasingly used multiplex assays of variant effect to systematically phenotype the cellular consequences of millions of genetic variants. In this article, we argue that the context of genetic variants - at all scales, from other genetic variants and gene regulation to cell biology to organismal environment - are critical components of how we can employ genomics to interpret these variants, and ultimately treat these diseases. We describe approaches to extend existing experimental assays and computational approaches to examine and quantify the importance of this context, including through causal analytic approaches. Having a unified understanding of the molecular, physiological, and environmental processes governing the interpretation of genetic variants is sorely needed for the field, and this perspective argues for feasible approaches by which the combined interpretation of cellular, animal, and epidemiological data can yield that knowledge.
Collapse
Affiliation(s)
- Nasa Sinnott-Armstrong
- Herbold Computational Biology Program, Fred Hutchinson Cancer CenterSeattleUnited States
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
| | - Stanley Fields
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Department of Medicine, University of WashingtonSeattleUnited States
| | - Frederick Roth
- Donnelly Centre and Departments of Molecular Genetics and Computer Science, University of TorontoTorontoCanada
- Lunenfeld-Tanenbaum Research Institute, Mt. Sinai HospitalTorontoCanada
- Department of Computational and Systems Biology, University of Pittsburgh School of MedicinePittsburghUnited States
| | - Lea M Starita
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
| | - Cole Trapnell
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
| | - Judit Villen
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
| | - Douglas M Fowler
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
- Department of Bioengineering, University of WashingtonSeattleUnited States
| | - Christine Queitsch
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Brotman Baty Institute for Precision MedicineSeattleUnited States
| |
Collapse
|
4
|
Herrera-Luis E, Benke K, Volk H, Ladd-Acosta C, Wojcik GL. Gene-environment interactions in human health. Nat Rev Genet 2024; 25:768-784. [PMID: 38806721 DOI: 10.1038/s41576-024-00731-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2024] [Indexed: 05/30/2024]
Abstract
Gene-environment interactions (G × E), the interplay of genetic variation with environmental factors, have a pivotal impact on human complex traits and diseases. Statistically, G × E can be assessed by determining the deviation from expectation of predictive models based solely on the phenotypic effects of genetics or environmental exposures. Despite the unprecedented, widespread and diverse use of G × E analytical frameworks, heterogeneity in their application and reporting hinders their applicability in public health. In this Review, we discuss study design considerations as well as G × E analytical frameworks to assess polygenic liability dependent on the environment, to identify specific genetic variants exhibiting G × E, and to characterize environmental context for these dynamics. We conclude with recommendations to address the most common challenges and pitfalls in the conceptualization, methodology and reporting of G × E studies, as well as future directions.
Collapse
Affiliation(s)
- Esther Herrera-Luis
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kelly Benke
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Heather Volk
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Christine Ladd-Acosta
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Genevieve L Wojcik
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
5
|
Fu B, Pazokitoroudi A, Xue A, Anand A, Anand P, Zaitlen N, Sankararaman S. A biobank-scale test of marginal epistasis reveals genome-wide signals of polygenic epistasis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.10.557084. [PMID: 37745394 PMCID: PMC10515811 DOI: 10.1101/2023.09.10.557084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
The contribution of epistasis (interactions among genes or genetic variants) to human complex trait variation remains poorly understood. Methods that aim to explicitly identify pairs of genetic variants, usually single nucleotide polymorphisms (SNPs), associated with a trait suffer from low power due to the large number of hypotheses tested while also having to deal with the computational problem of searching over a potentially large number of candidate pairs. An alternate approach involves testing whether a single SNP modulates variation in a trait against a polygenic background. While overcoming the limitation of low power, such tests of polygenic or marginal epistasis (ME) are infeasible on Biobank-scale data where hundreds of thousands of individuals are genotyped over millions of SNPs. We present a method to test for ME of a SNP on a trait that is applicable to biobank-scale data. We performed extensive simulations to show that our method provides calibrated tests of ME. We applied our method to test for ME at SNPs that are associated with 53 quantitative traits across ≈ 300 K unrelated white British individuals in the UK Biobank (UKBB). Testing 15, 601 trait-loci associations that were significant in GWAS, we identified 16 trait-loci pairs across 12 traits that demonstrate strong evidence of ME signals (p-value p < 5 × 10 - 8 53 ). We further partitioned the significant ME signals across the genome to identify 6 trait-loci pairs with evidence of local (within-chromosome) ME while 15 show evidence of distal (cross-chromosome) ME. Across the 16 trait-loci pairs, we document that the proportion of trait variance explained by ME is about 12x as large as that explained by the GWAS effects on average (range: 0.59 to 43.89). Our results show, for the first time, evidence of interaction effects between individual genetic variants and overall polygenic background modulating complex trait variation.
Collapse
Affiliation(s)
- Boyang Fu
- Department of Computer Science, UCLA, Los Angeles, CA, USA
| | | | - Albert Xue
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Aakarsh Anand
- Department of Computer Science, UCLA, Los Angeles, CA, USA
| | - Prateek Anand
- Department of Computer Science, UCLA, Los Angeles, CA, USA
| | - Noah Zaitlen
- Department of Neurology, UCLA, Los Angeles, CA, USA
- Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
| | - Sriram Sankararaman
- Department of Computer Science, UCLA, Los Angeles, CA, USA
- Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
| |
Collapse
|
6
|
Abstract
Admixed populations constitute a large portion of global human genetic diversity, yet they are often left out of genomics analyses. This exclusion is problematic, as it leads to disparities in the understanding of the genetic structure and history of diverse cohorts and the performance of genomic medicine across populations. Admixed populations have particular statistical challenges, as they inherit genomic segments from multiple source populations-the primary reason they have historically been excluded from genetic studies. In recent years, however, an increasing number of statistical methods and software tools have been developed to account for and leverage admixture in the context of genomics analyses. Here, we provide a survey of such computational strategies for the informed consideration of admixture to allow for the well-calibrated inclusion of mixed ancestry populations in large-scale genomics studies, and we detail persisting gaps in existing tools.
Collapse
Affiliation(s)
- Taotao Tan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA;
| |
Collapse
|
7
|
Bhatraju PK, Stanaway IB, Palmer MR, Menon R, Schaub JA, Menez S, Srivastava A, Wilson FP, Kiryluk K, Palevsky PM, Naik AS, Sakr SS, Jarvik GP, Parikh CR, Ware LB, Ikizler TA, Siew ED, Chinchilli VM, Coca SG, Garg AX, Go AS, Kaufman JS, Kimmel PL, Himmelfarb J, Wurfel MM. Genome-wide Association Study for AKI. KIDNEY360 2023; 4:870-880. [PMID: 37273234 PMCID: PMC10371295 DOI: 10.34067/kid.0000000000000175] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 04/03/2023] [Indexed: 06/06/2023]
Abstract
Key Points Two genetic variants in the DISP1-TLR5 gene locus were associated with risk of AKI. DISP1 and TLR5 were differentially regulated in kidney biopsy tissue from patients with AKI compared with no AKI. Background Although common genetic risks for CKD are well established, genetic factors influencing risk for AKI in hospitalized patients are poorly understood. Methods We conducted a genome-wide association study in 1369 participants in the Assessment, Serial Evaluation, and Subsequent Sequelae of AKI Study; a multiethnic population of hospitalized participants with and without AKI matched on demographics, comorbidities, and kidney function before hospitalization. We then completed functional annotation of top-performing variants for AKI using single-cell RNA sequencing data from kidney biopsies in 12 patients with AKI and 18 healthy living donors from the Kidney Precision Medicine Project. Results No genome-wide significant associations with AKI risk were found in Assessment, Serial Evaluation, and Subsequent Sequelae of AKI (P < 5×10 −8 ). The top two variants with the strongest association with AKI mapped to the dispatched resistance-nodulation-division (RND) transporter family member 1 (DISP1) gene and toll-like receptor 5 (TLR5) gene locus, rs17538288 (odds ratio, 1.55; 95% confidence interval, 1.32 to 182; P = 9.47×10 −8 ) and rs7546189 (odds ratio, 1.53; 95% confidence interval, 1.30 to 1.81; P = 4.60×10 −7 ). In comparison with kidney tissue from healthy living donors, kidney biopsies in patients with AKI showed differential DISP1 expression in proximal tubular epithelial cells (adjusted P = 3.9× 10−2) and thick ascending limb of the loop of Henle (adjusted P = 8.7× 10−3) and differential TLR5 gene expression in thick ascending limb of the loop of Henle (adjusted P = 4.9× 10−30). Conclusions AKI is a heterogeneous clinical syndrome with various underlying risk factors, etiologies, and pathophysiology that may limit the identification of genetic variants. Although no variants reached genome-wide significance, we report two variants in the intergenic region between DISP1 and TLR5 , suggesting this region as a novel risk for AKI susceptibility.
Collapse
Affiliation(s)
- Pavan K Bhatraju
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Ian B Stanaway
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Melody R Palmer
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington School of Medicine, Seattle, Washington
| | - Rajasree Menon
- Division of Nephrology, Department of Medicine, Michigan Medicine, Ann Arbor, Michigan
| | - Jennifer A Schaub
- Division of Nephrology, Department of Medicine, Michigan Medicine, Ann Arbor, Michigan
| | - Steven Menez
- Division of Nephrology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Anand Srivastava
- Department of Medicine, Division of Nephrology and Hypertension, Northwestern University School of Medicine, Chicago, Illinois
| | - F Perry Wilson
- Program of Applied Translational Research, Yale School of Medicine, New Haven, Connecticut
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York City, New York
| | - Paul M Palevsky
- Kidney Medicine Section, VA Pittsburgh Healthcare System, Pittsburgh, Pennsylvania
- Renal-Electrolyte Division, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania
| | - Abhijit S Naik
- Division of Nephrology, University of Michigan, Ann Arbor, Michigan
| | - Sana S Sakr
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
| | - Gail P Jarvik
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington School of Medicine, Seattle, Washington
| | - Chirag R Parikh
- Division of Nephrology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Lorraine B Ware
- Division of Allergy, Pulmonary and Critical Care, Vanderbilt University Medical Center, Nashville, Tennessee
| | - T Alp Ikizler
- Division of Nephrology and Hypertension, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Edward D Siew
- Division of Nephrology and Hypertension, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Vernon M Chinchilli
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, Pennsylvania
| | - Steve G Coca
- Section of Nephrology, Department of Internal Medicine, Mount Sinai School of Medicine, New York, New York
| | - Amit X Garg
- Division of Nephrology, Department of Medicine, Western University, London, Ontario, Canada
| | - Alan S Go
- Division of Nephrology, Department of Medicine, University of California, San Francisco, California
- Division of Research, Kaiser Permanente Northern California, Oakland, California
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California
| | - James S Kaufman
- Division of Nephrology, New York University School of Medicine, New York, New York
- Division of Nephrology, VA New York Harbor Healthcare System, New York, New York
| | - Paul L Kimmel
- Division of Renal Diseases and Hypertension, Department of Medicine, George Washington University Medical Center, Washington, DC
| | - Jonathan Himmelfarb
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| | - Mark M Wurfel
- Division of Pulmonary, Critical Care and Sleep Medicine, Department of Medicine, University of Washington, Seattle, Washington
- Kidney Research Institute, Division of Nephrology, Department of Medicine, University of Washington, Seattle, Washington
| |
Collapse
|
8
|
Batra A, Chen LM, Wang Z, Parent C, Pokhvisneva I, Patel S, Levitan RD, Meaney MJ, Silveira PP. Early Life Adversity and Polygenic Risk for High Fasting Insulin Are Associated With Childhood Impulsivity. Front Neurosci 2021; 15:704785. [PMID: 34539334 PMCID: PMC8441000 DOI: 10.3389/fnins.2021.704785] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 08/03/2021] [Indexed: 01/11/2023] Open
Abstract
While the co-morbidity between metabolic and psychiatric behaviors is well-established, the mechanisms are poorly understood, and exposure to early life adversity (ELA) is a common developmental risk factor. ELA is associated with altered insulin sensitivity and poor behavioral inhibition throughout life, which seems to contribute to the development of metabolic and psychiatric disturbances in the long term. We hypothesize that a genetic background associated with higher fasting insulin interacts with ELA to influence the development of executive functions (e.g., impulsivity in young children). We calculated the polygenic risk scores (PRSs) from the genome-wide association study (GWAS) of fasting insulin at different thresholds and identified the subset of single nucleotide polymorphisms (SNPs) that best predicted peripheral insulin levels in children from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort [N = 467; pt– initial = 0.24 (10,296 SNPs), pt– refined = 0.05 (57 SNPs)]. We then calculated the refined PRS (rPRS) for fasting insulin at this specific threshold in the children from the Maternal Adversity, Vulnerability and Neurodevelopment (MAVAN) cohort and investigated its interaction effect with adversity on an impulsivity task applied at 36 months. We found a significant effect of interaction between fasting insulin rPRS and adversity exposure predicting impulsivity measured by the Snack Delay Task at 36 months [β = −0.329, p = 0.024], such that higher PRS [β = −0.551, p = 0.009] was linked to more impulsivity in individuals exposed to more adversity. Enrichment analysis (MetaCoreTM) of the SNPs that compose the fasting insulin rPRS at this threshold was significant for certain nervous system development processes including dopamine D2 receptor signaling. Additional enrichment analysis (FUMA) of the genes mapped from the SNPs in the fasting insulin rPRS showed enrichment with the accelerated cognitive decline GWAS. Therefore, the genetic background associated with risk for adult higher fasting insulin moderates the impact of early adversity on childhood impulsivity.
Collapse
Affiliation(s)
- Aashita Batra
- Integrated Program in Neuroscience, McGill University, Montreal, QC, Canada.,Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Lawrence M Chen
- Integrated Program in Neuroscience, McGill University, Montreal, QC, Canada.,Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Zihan Wang
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Carine Parent
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Irina Pokhvisneva
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Sachin Patel
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada
| | - Robert D Levitan
- Mood and Anxiety Disorders Program, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Michael J Meaney
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada.,Ludmer Centre for Neuroinformatics and Mental Health, Douglas Research Centre, McGill University, Montreal, QC, Canada.,Translational Neuroscience Programme, Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research (A∗STAR), Singapore, Singapore
| | - Patricia Pelufo Silveira
- Douglas Mental Health University Institute, McGill University, Montreal, QC, Canada.,Ludmer Centre for Neuroinformatics and Mental Health, Douglas Research Centre, McGill University, Montreal, QC, Canada
| |
Collapse
|
9
|
Lin M, Park DS, Zaitlen NA, Henn BM, Gignoux CR. Admixed Populations Improve Power for Variant Discovery and Portability in Genome-Wide Association Studies. Front Genet 2021; 12:673167. [PMID: 34108994 PMCID: PMC8181458 DOI: 10.3389/fgene.2021.673167] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 04/27/2021] [Indexed: 11/13/2022] Open
Abstract
Genome-wide association studies (GWAS) are primarily conducted in single-ancestry settings. The low transferability of results has limited our understanding of human genetic architecture across a range of complex traits. In contrast to homogeneous populations, admixed populations provide an opportunity to capture genetic architecture contributed from multiple source populations and thus improve statistical power. Here, we provide a mechanistic simulation framework to investigate the statistical power and transferability of GWAS under directional polygenic selection or varying divergence. We focus on a two-way admixed population and show that GWAS in admixed populations can be enriched for power in discovery by up to 2-fold compared to the ancestral populations under similar sample size. Moreover, higher accuracy of cross-population polygenic score estimates is also observed if variants and weights are trained in the admixed group rather than in the ancestral groups. Common variant associations are also more likely to replicate if first discovered in the admixed group and then transferred to an ancestral population, than the other way around (across 50 iterations with 1,000 causal SNPs, training on 10,000 individuals, testing on 1,000 in each population, p = 3.78e-6, 6.19e-101, ∼0 for FST = 0.2, 0.5, 0.8, respectively). While some of these FST values may appear extreme, we demonstrate that they are found across the entire phenome in the GWAS catalog. This framework demonstrates that investigation of admixed populations harbors significant advantages over GWAS in single-ancestry cohorts for uncovering the genetic architecture of traits and will improve downstream applications such as personalized medicine across diverse populations.
Collapse
Affiliation(s)
- Meng Lin
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Danny S Park
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, San Francisco, CA, United States
| | - Noah A Zaitlen
- Department of Neurology and Computational Medicine, University of California, Los Angeles, Los Angeles, CA, United States
| | - Brenna M Henn
- Department of Anthropology, Center for Population Biology and the Genome Center, University of California, Davis, Davis, CA, United States
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| |
Collapse
|
10
|
Sheppard B, Rappoport N, Loh PR, Sanders SJ, Zaitlen N, Dahl A. A model and test for coordinated polygenic epistasis in complex traits. Proc Natl Acad Sci U S A 2021; 118:e1922305118. [PMID: 33833052 PMCID: PMC8053945 DOI: 10.1073/pnas.1922305118] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Interactions between genetic variants-epistasis-is pervasive in model systems and can profoundly impact evolutionary adaption, population disease dynamics, genetic mapping, and precision medicine efforts. In this work, we develop a model for structured polygenic epistasis, called coordinated epistasis (CE), and prove that several recent theories of genetic architecture fall under the formal umbrella of CE. Unlike standard epistasis models that assume epistasis and main effects are independent, CE captures systematic correlations between epistasis and main effects that result from pathway-level epistasis, on balance skewing the penetrance of genetic effects. To test for the existence of CE, we propose the even-odd (EO) test and prove it is calibrated in a range of realistic biological models. Applying the EO test in the UK Biobank, we find evidence of CE in 18 of 26 traits spanning disease, anthropometric, and blood categories. Finally, we extend the EO test to tissue-specific enrichment and identify several plausible tissue-trait pairs. Overall, CE is a dimension of genetic architecture that can capture structured, systemic forms of epistasis in complex human traits.
Collapse
Affiliation(s)
- Brooke Sheppard
- Department of Psychiatry and Behavioral Sciences, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA 94143
| | - Nadav Rappoport
- Department of Psychiatry and Behavioral Sciences, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA 94143
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA 94143
| | - Po-Ru Loh
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA 02142
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115
| | - Stephan J Sanders
- Department of Psychiatry and Behavioral Sciences, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA 94143
| | - Noah Zaitlen
- Department of Psychiatry and Behavioral Sciences, Weill Institute for Neurosciences, University of California San Francisco, San Francisco, CA 94143;
- Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095
| | - Andy Dahl
- Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095;
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095
- Section of Genetic Medicine, University of Chicago, Chicago, IL 60637
| |
Collapse
|
11
|
Rau CD, Gonzales NM, Bloom JS, Park D, Ayroles J, Palmer AA, Lusis AJ, Zaitlen N. Modeling epistasis in mice and yeast using the proportion of two or more distinct genetic backgrounds: Evidence for "polygenic epistasis". PLoS Genet 2020; 16:e1009165. [PMID: 33104702 PMCID: PMC7644088 DOI: 10.1371/journal.pgen.1009165] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 11/05/2020] [Accepted: 10/02/2020] [Indexed: 12/22/2022] Open
Abstract
Background The majority of quantitative genetic models used to map complex traits assume that alleles have similar effects across all individuals. Significant evidence suggests, however, that epistatic interactions modulate the impact of many alleles. Nevertheless, identifying epistatic interactions remains computationally and statistically challenging. In this work, we address some of these challenges by developing a statistical test for polygenic epistasis that determines whether the effect of an allele is altered by the global genetic ancestry proportion from distinct progenitors. Results We applied our method to data from mice and yeast. For the mice, we observed 49 significant genotype-by-ancestry interaction associations across 14 phenotypes as well as over 1,400 Bonferroni-corrected genotype-by-ancestry interaction associations for mouse gene expression data. For the yeast, we observed 92 significant genotype-by-ancestry interactions across 38 phenotypes. Given this evidence of epistasis, we test for and observe evidence of rapid selection pressure on ancestry specific polymorphisms within one of the cohorts, consistent with epistatic selection. Conclusions Unlike our prior work in human populations, we observe widespread evidence of ancestry-modified SNP effects, perhaps reflecting the greater divergence present in crosses using mice and yeast. Many statistical tests which link genetic markers in the genome to differences in traits rely on the assumption that the same polymorphism will have identical effects in different individuals. However, there is substantial evidence indicating that this is not the case. Epistasis is the phenomenon in which multiple polymorphisms interact with one another to amplify or negate each other’s effects on a trait. We hypothesized that individual SNP effects could be changed in a polygenic manner, such that the proportion of as genetic ancestry, rather than specific markers, might be used to capture epistatic interactions. Motivated by this possibility, we develop a new statistical test that allowed us to examine the genome to identify polymorphisms which have different effects depending on the ancestral makeup of each individual. We use our test in two different populations of inbred mice and a yeast panel and demonstrate that these sorts of variable effect polymorphisms exist in 14 different physical traits in mice and 38 phenotypes in yeast as well as in murine gene expression. We use the term “polygenic epistasis” to distinguish these interactions from the more conventional two- or multi-locus interactions.
Collapse
Affiliation(s)
- Christoph D. Rau
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | - Natalia M. Gonzales
- Department of Human Genetics, University of Chicago, Chicago, IL, United States of America
| | - Joshua S. Bloom
- Department of Human Genetics, UCLA, Los Angeles, CA, United States of America
| | - Danny Park
- Department of Medicine, UCSF, San Francisco, CA, United States of America
| | - Julien Ayroles
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, United States of America
| | - Abraham A. Palmer
- Department of Psychiatry, and Institute for Genomic Medicine, UCSD, San Diego, CA, United States of America
| | - Aldons J. Lusis
- Department of Human Genetics, UCLA, Los Angeles, CA, United States of America
| | - Noah Zaitlen
- Department of Neurology, UCLA, Los Angeles, CA, United States of America
- * E-mail:
| |
Collapse
|
12
|
Ziyatdinov A, Parker MM, Vaysse A, Beaty TH, Kraft P, Cho MH, Aschard H. Mixed-model admixture mapping identifies smoking-dependent loci of lung function in African Americans. Eur J Hum Genet 2020; 28:656-668. [PMID: 31836859 PMCID: PMC7171162 DOI: 10.1038/s41431-019-0545-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Revised: 10/30/2019] [Accepted: 11/01/2019] [Indexed: 11/08/2022] Open
Abstract
Admixture mapping has led to the discovery of many genes associated with differential disease risk by ancestry, highlighting the importance of ancestry-based approaches to association studies. However, the potential of admixture mapping in deciphering the interplay between genes and environment exposures has been seldom explored. Here we performed a genome-wide screening of local ancestry-smoking interactions for five spirometric lung function phenotypes in 3300 African Americans from the COPDGene study. To account for population structure and outcome heterogeneity across exposure groups, we developed a multi-component linear mixed model for mapping gene-environment interactions and empirically showed its robustness and increased power. When applied to the COPDGene study, our approach identified two 11p15.2-3 and 2q37 loci, exhibiting local ancestry-smoking interactions at genome-wide significant level, which would have been missed by standard single-nucleotide polymorphism analyses. These two loci harbor the PARVA and RAB17 genes previously recognized to be involved in smoking behavior. Overall, our study provides the first evidence for potential synergistic effects between African ancestry and smoking on pulmonary function, and underlines the importance of ethnic diversity in genetic studies.
Collapse
Affiliation(s)
- Andrey Ziyatdinov
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| | - Margaret M Parker
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Amaury Vaysse
- Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI), Institut Pasteur, Paris, France
| | - Terri H Beaty
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Peter Kraft
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Michael H Cho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Hugues Aschard
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI), Institut Pasteur, Paris, France
| |
Collapse
|
13
|
Park CS, De T, Xu Y, Zhong Y, Smithberger E, Alarcon C, Gamazon ER, Perera MA. Hepatocyte gene expression and DNA methylation as ancestry-dependent mechanisms in African Americans. NPJ Genom Med 2019; 4:29. [PMID: 31798965 PMCID: PMC6877651 DOI: 10.1038/s41525-019-0102-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Accepted: 09/27/2019] [Indexed: 12/13/2022] Open
Abstract
African Americans (AAs) are an admixed population with widely varying proportion of West African ancestry (WAA). Here we report the correlation of WAA to gene expression and DNA methylation in AA-derived hepatocytes, a cell type important in disease and drug response. We perform mediation analysis to test whether methylation is a mediator of the effect of ancestry on expression. GTEx samples and a second cohort are used as validation. One hundred and thirty-one genes are associated with WAA (FDR < 0.10), 28 of which replicate and represent 220 GWAS phenotypes. Among PharmGKB pharmacogenes, VDR, PTGIS, ALDH1A1, CYP2C19, and P2RY1 nominally associate with WAA (p < 0.05). We find 1037 WAA-associated, differentially methylated regions (FDR < 0.05), with hypomethylated genes enriched in drug-response pathways. In conclusion, WAA contributes to variability in hepatocyte expression and DNA methylation with identified genes previously implicated for diseases disproportionately affecting AAs, including cardiovascular (PTGIS, PLAT) and renal (APOL1) disease, and drug response (CYP2C19).
Collapse
Affiliation(s)
- C. S. Park
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| | - T. De
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| | - Y. Xu
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
- Center for Translational Data Science, University of Chicago, Chicago, IL USA
| | - Y. Zhong
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| | - E. Smithberger
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
- Department of Pathology and Laboratory Medicine, University of North Carolina School of Medicine, Chapel Hill, NC USA
| | - C. Alarcon
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| | - E. R. Gamazon
- Vanderbilt Genetics Institute and Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN USA
- Data Science Institute, Vanderbilt University, Nashville, TN USA
- Clare Hall, University of Cambridge, Cambridge, UK
| | - M. A. Perera
- Department of Pharmacology, Center for Pharmacogenomics, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| |
Collapse
|
14
|
Popejoy AB, Ritter DI, Crooks K, Currey E, Fullerton SM, Hindorff LA, Koenig B, Ramos EM, Sorokin EP, Wand H, Wright MW, Zou J, Gignoux CR, Bonham VL, Plon SE, Bustamante CD. The clinical imperative for inclusivity: Race, ethnicity, and ancestry (REA) in genomics. Hum Mutat 2019; 39:1713-1720. [PMID: 30311373 DOI: 10.1002/humu.23644] [Citation(s) in RCA: 95] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Revised: 08/17/2018] [Accepted: 08/30/2018] [Indexed: 12/12/2022]
Abstract
The Clinical Genome Resource (ClinGen) Ancestry and Diversity Working Group highlights the need to develop guidance on race, ethnicity, and ancestry (REA) data collection and use in clinical genomics. We present quantitative and qualitative evidence to characterize: (1) acquisition of REA data via clinical laboratory requisition forms, and (2) information disparity across populations in the Genome Aggregation Database (gnomAD) at clinically relevant sites ascertained from annotations in ClinVar. Our requisition form analysis showed substantial heterogeneity in clinical laboratory ascertainment of REA, as well as marked incongruity among terms used to define REA categories. There was also striking disparity across REA populations in the amount of information available about clinically relevant variants in gnomAD. European ancestral populations constituted the majority of observations (55.8%), allele counts (59.7%), and private alleles (56.1%) in gnomAD at 550 loci with "pathogenic" and "likely pathogenic" expert-reviewed variants in ClinVar. Our findings highlight the importance of implementing and supporting programs to increase diversity in genome sequencing and clinical genomics, as well as measuring uncertainty around population-level datasets that are used in variant interpretation. Finally, we suggest the need for a standardized REA data collection framework to be developed through partnerships and collaborations and adopted across clinical genomics.
Collapse
Affiliation(s)
- Alice B Popejoy
- Department of Biomedical Data Science, Stanford University, Standford, California
| | - Deborah I Ritter
- Department of Pediatrics, Texas Children's Hospital, Baylor College of Medicine, Houston, Texas
| | - Kristy Crooks
- Department of Pathology, University of Colorado, Anschutz Medical Campus, Aurora, Colorado.,Department of Medicine, Division of Bioinformatics and Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado
| | - Erin Currey
- Division of Genomics and Society, National Human Genome Research Institute (NHGRI), Bethesda, Maryland
| | | | - Lucia A Hindorff
- Division of Genomics and Society, National Human Genome Research Institute (NHGRI), Bethesda, Maryland
| | - Barbara Koenig
- Department of Anthropology, History, and Social Medicine, University of California, San Francisco
| | - Erin M Ramos
- Division of Genomics and Society, National Human Genome Research Institute (NHGRI), Bethesda, Maryland
| | - Elena P Sorokin
- Department of Biomedical Data Science, Stanford University, Standford, California
| | - Hannah Wand
- Department of Biomedical Data Science, Stanford University, Standford, California
| | - Mathew W Wright
- Department of Biomedical Data Science, Stanford University, Standford, California
| | - James Zou
- Department of Biomedical Data Science, Stanford University, Standford, California
| | - Christopher R Gignoux
- Department of Medicine, Division of Bioinformatics and Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, Colorado
| | - Vence L Bonham
- Social and Behavioral Research Branch, National Human Genome Research Institute (NHGRI), Bethesda, Maryland
| | - Sharon E Plon
- Department of Pediatrics, Texas Children's Hospital, Baylor College of Medicine, Houston, Texas
| | - Carlos D Bustamante
- Department of Biomedical Data Science, Stanford University, Standford, California
| | | |
Collapse
|
15
|
Prasad B, Saxena R, Goel N, Patel SR. Genetic Ancestry for Sleep Research: Leveraging Health Inequalities to Identify Causal Genetic Variants. Chest 2018; 153:1478-1496. [PMID: 29604255 DOI: 10.1016/j.chest.2018.03.024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 03/02/2018] [Accepted: 03/19/2018] [Indexed: 02/08/2023] Open
Abstract
Recent evidence has highlighted the health inequalities in sleep behaviors and sleep disorders that adversely affect outcomes in select populations, including African-American and Hispanic-American subjects. Race-related sleep health inequalities are ascribed to differences in multilevel and interlinked health determinants, such as sociodemographic factors, health behaviors, and biology. African-American and Hispanic-American subjects are admixed populations whose genetic inheritance combines two or more ancestral populations originating from different continents. Racial inequalities in admixed populations can be parsed into relevant groups of mediating factors (environmental vs genetic) with the use of measures of genetic ancestry, including the proportion of an individual's genetic makeup that comes from each of the major ancestral continental populations. This review describes sleep health inequalities in African-American and Hispanic-American subjects and considers the potential utility of ancestry studies to exploit these differences to gain insight into the genetic underpinnings of these phenotypes. The inclusion of genetic approaches in future studies of admixed populations will allow greater understanding of the potential biological basis of race-related sleep health inequalities.
Collapse
Affiliation(s)
- Bharati Prasad
- Department of Medicine, University of Illinois at Chicago, and Jesse Brown VA Medical Center, Chicago, IL.
| | - Richa Saxena
- Center for Genomic Medicine and Department of Anesthesia, Pain, and Critical Care Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA
| | - Namni Goel
- Division of Sleep and Chronobiology, Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA
| | - Sanjay R Patel
- Department of Medicine, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|