1
Sun WX, Chang XY, Chen Y, Zhao Q, Zhang YM. The integration of quantile regression with 3VmrMLM identifies more QTNs and QTN-by-environment interactions using SNP- and haplotype-based markers. Plant Communications 2025; 6:101196. [PMID: 39580620 PMCID: PMC11956104 DOI: 10.1016/j.xplc.2024.101196] [Received: 07/16/2024] [Revised: 10/11/2024] [Accepted: 11/20/2024] [Indexed: 11/26/2024]
Abstract
Current methods used in genome-wide association studies frequently lack power owing to their inability to detect heterogeneous associations and rare and multiallelic variants. To address these issues, quantile regression is integrated with a three (compressed) variance component multi-locus random-SNP-effect mixed linear model (3VmrMLM) to propose q3VmrMLM for detecting heterogeneous quantitative trait nucleotides (QTNs) and QTN-by-environment interactions (QEIs), and then design haplotype-based q3VmrMLM (q3VmrMLM-Hap) for identifying multiallelic haplotypes and rare variants. In Monte Carlo simulation studies, q3VmrMLM had higher power than 3VmrMLM, sequence kernel association test (SKAT), and integrated quantile rank test (iQRAT). In a re-analysis of 10 traits in 1439 rice hybrids, 261 known genes were identified only by q3VmrMLM and q3VmrMLM-Hap, whereas 175 known genes were detected by both the new and existing methods. Of all the significant QTNs with known genes, q3VmrMLM (179: 140 variance heterogeneity and 157 quantile effect heterogeneity) found more heterogeneous QTNs than 3VmrMLM (123), SKAT (27), and iQRAT (29); q3VmrMLM-Hap (121) mapped more low-frequency (<0.05) QTNs than q3VmrMLM (51), 3VmrMLM (43), SKAT (11), and iQRAT (12); and q3VmrMLM-Hap (12), q3VmrMLM (16), and 3VmrMLM (12) had similar power in identifying gene-by-environment interactions. All significant and suggested QTNs achieved the highest predictive accuracy (r = 0.9045). In conclusion, this study describes a new and complementary approach to mining genes and unraveling the genetic architecture of complex traits in crops.
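As background for the quantile-regression component, the following is a minimal sketch (not the q3VmrMLM implementation) of the check loss that quantile regression minimizes. The function names and the phenotype values are hypothetical; the sketch only illustrates how a marker can show no effect at the median yet a clear effect at an upper quantile, which is the kind of heterogeneous association the abstract refers to.

```python
def pinball_loss(residual, tau):
    """Check (pinball) loss for one residual at quantile level tau."""
    return residual * (tau - (1.0 if residual < 0 else 0.0))


def total_loss(values, q, tau):
    """Sum of pinball losses when q is used as the tau-th quantile estimate."""
    return sum(pinball_loss(v - q, tau) for v in values)


def best_quantile(values, tau):
    """Return the observed value that minimizes the total pinball loss."""
    return min(sorted(values), key=lambda q: total_loss(values, q, tau))


# Hypothetical phenotype values for two genotype classes (not data from the paper).
# Both groups share the same median, but the carriers are more dispersed.
non_carriers = [4.8, 4.9, 5.0, 5.1, 5.2]
carriers = [3.0, 4.0, 5.0, 6.0, 7.0]

# Genotype effect at the median and at the 0.9 quantile.
median_effect = best_quantile(carriers, 0.5) - best_quantile(non_carriers, 0.5)
upper_effect = best_quantile(carriers, 0.9) - best_quantile(non_carriers, 0.9)
```

In this toy example a mean- or median-based test would see no difference between the groups, whereas the comparison at the 0.9 quantile does.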
Affiliation(s)
- Wen-Xian Sun
  - College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Xiao-Yu Chang
  - College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Ying Chen
  - College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Qiong Zhao
  - College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Yuan-Ming Zhang
  - College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China

2
Palma-Martínez MJ, Posadas-García YS, Shaukat A, López-Ángeles BE, Sohail M. Evolution, genetic diversity, and health. Nat Med 2025; 31:751-761. [PMID: 40055519 DOI: 10.1038/s41591-025-03558-1] [Received: 10/07/2024] [Accepted: 02/03/2025] [Indexed: 03/21/2025]
Abstract
Human genetic diversity in today's world has been shaped by evolutionary history, demographic shifts and environmental exposures, influencing complex traits, disease susceptibility and drug responses. Capturing this diversity is essential for advancing precision medicine and promoting equitable healthcare. Despite the great progress achieved with initiatives such as the human Pangenome and large biobanks that aim for a better representation of human diversity, important challenges remain. In this Perspective, we discuss the importance of diversity in clinical genomics through an evolutionary lens. We highlight progress and challenges and outline key clinical applications of diverse genetic data. We argue that diversifying both datasets and methodologies, integrating ancestral and environmental factors, is crucial for fully understanding the genetic basis of human health and disease.
Affiliation(s)
- María J Palma-Martínez
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, México
- Amara Shaukat
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, México
- Brenda E López-Ángeles
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, México
- Mashaal Sohail
  - Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, México

3
Akingbuwa WA, Nivard MG. Detecting Non-linear Dependence through Genome Wide Analysis. bioRxiv: The Preprint Server for Biology 2025:2025.02.12.637804. [PMID: 39990333 PMCID: PMC11844478 DOI: 10.1101/2025.02.12.637804] [Indexed: 02/25/2025]
Abstract
In the current study, we introduce statistical methods based on trigonometry to infer the shape of a (non)linear bivariate genetic relationship. We do this based on a series of piecemeal GWASs of segments of a target (continuous) trait distribution, and the genetic correlations between those GWASs and a second trait. Simulations confirm that we are able to retrieve the shape of the relationship given certain assumptions about the nature of the relationship between the traits. We applied the method to the genetic relationship between BMI, sleep duration, and height, and psychiatric disorders (ADHD, anorexia nervosa, and depression) using data from approximately 450K individuals from UK Biobank. In the relationship between BMI and psychiatric traits, we found that the expected value of depression is a nonlinear function of BMI, i.e., there is a nonlinear genetic relationship between the two traits. We observed similar findings for the genetic relationship between BMI and anorexia, sleep duration and depression, and sleep duration and ADHD. We observed no underlying nonlinearity in the genetic relationship between height and psychiatric traits. Using a novel statistical approach, we show that nonlinear genetic relationships between traits are detectable and that genetic associations quantified using global estimators such as genetic correlations are not informative about underlying complexities in these relationships. Our findings challenge assumptions of linearity in genetic epidemiology and suggest that bivariate genetic associations are not uniform across the phenotypic spectrum, which may have implications for the development of targeted interventions.
Affiliation(s)
- Wonuola A Akingbuwa
  - Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
  - Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
- Michel G Nivard
  - Medical Research Council Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
  - Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, United Kingdom

4
Herrera-Luis E, Benke K, Volk H, Ladd-Acosta C, Wojcik GL. Gene-environment interactions in human health. Nat Rev Genet 2024; 25:768-784. [PMID: 38806721 DOI: 10.1038/s41576-024-00731-z] [Accepted: 04/03/2024] [Indexed: 05/30/2024]
Abstract
Gene-environment interactions (G × E), the interplay of genetic variation with environmental factors, have a pivotal impact on human complex traits and diseases. Statistically, G × E can be assessed by determining the deviation from expectation of predictive models based solely on the phenotypic effects of genetics or environmental exposures. Despite the unprecedented, widespread and diverse use of G × E analytical frameworks, heterogeneity in their application and reporting hinders their applicability in public health. In this Review, we discuss study design considerations as well as G × E analytical frameworks to assess polygenic liability dependent on the environment, to identify specific genetic variants exhibiting G × E, and to characterize environmental context for these dynamics. We conclude with recommendations to address the most common challenges and pitfalls in the conceptualization, methodology and reporting of G × E studies, as well as future directions.
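The statistical definition in this abstract (deviation from the expectation of a model built only from the separate genetic and environmental effects) can be illustrated with a 2 x 2 table of group means. This is a minimal sketch on the additive scale, with hypothetical numbers and a hypothetical function name; it is not a method taken from the Review, and the size of an interaction generally depends on the scale chosen.

```python
def interaction_contrast(cell_means):
    """Deviation of the doubly exposed group from the additive expectation.

    cell_means maps (genotype, exposure) pairs, each coded 0 or 1,
    to the mean phenotype of that group.
    """
    baseline = cell_means[(0, 0)]
    genetic_effect = cell_means[(1, 0)] - baseline
    exposure_effect = cell_means[(0, 1)] - baseline
    expected_additive = baseline + genetic_effect + exposure_effect
    return cell_means[(1, 1)] - expected_additive


# Hypothetical group means (illustrative numbers only).
means = {(0, 0): 10.0, (1, 0): 12.0, (0, 1): 11.0, (1, 1): 16.0}
deviation = interaction_contrast(means)  # 16 - (10 + 2 + 1)
```

A deviation of zero would mean the genetic and environmental effects simply add up; a nonzero value is what a G × E term in a regression model captures.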
Affiliation(s)
- Esther Herrera-Luis
  - Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Kelly Benke
  - Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Heather Volk
  - Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Christine Ladd-Acosta
  - Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Genevieve L Wojcik
  - Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA

5
Pazokitoroudi A, Liu Z, Dahl A, Zaitlen N, Rosset S, Sankararaman S. A scalable and robust variance components method reveals insights into the architecture of gene-environment interactions underlying complex traits. Am J Hum Genet 2024; 111:1462-1480. [PMID: 38866020 PMCID: PMC11267529 DOI: 10.1016/j.ajhg.2024.05.015] [Received: 09/20/2023] [Revised: 05/15/2024] [Accepted: 05/15/2024] [Indexed: 06/14/2024] Open
Abstract
Understanding the contribution of gene-environment interactions (GxE) to complex trait variation can provide insights into disease mechanisms, explain sources of heritability, and improve genetic risk prediction. While large biobanks with genetic and deep phenotypic data hold promise for obtaining novel insights into GxE, our understanding of GxE architecture in complex traits remains limited. We introduce a method to estimate the proportion of trait variance explained by GxE (GxE heritability) and additive genetic effects (additive heritability) across the genome and within specific genomic annotations. We show that our method is accurate in simulations and computationally efficient for biobank-scale datasets. We applied our method to common array SNPs (MAF ≥1%), fifty quantitative traits, and four environmental variables (smoking, sex, age, and statin usage) in unrelated white British individuals in the UK Biobank. We found 68 trait-E pairs with significant genome-wide GxE heritability (p<0.05/200) with a ratio of GxE to additive heritability of ≈6.8% on average. Analyzing ≈8 million imputed SNPs (MAF ≥0.1%), we documented an approximate 28% increase in genome-wide GxE heritability compared to array SNPs. We partitioned GxE heritability across minor allele frequency (MAF) and local linkage disequilibrium (LD) values, revealing that, like additive allelic effects, GxE allelic effects tend to increase with decreasing MAF and LD. Analyzing GxE heritability near genes highly expressed in specific tissues, we find significant brain-specific enrichment for body mass index (BMI) and basal metabolic rate in the context of smoking and adipose-specific enrichment for waist-hip ratio (WHR) in the context of sex.
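The significance threshold quoted in this abstract (p < 0.05/200) matches a Bonferroni correction over the 50 traits and 4 environmental variables it lists. A minimal sketch of that arithmetic follows; the variable and function names are hypothetical, and reading the 200 as 50 × 4 is an inference from the abstract rather than a statement in it.

```python
n_traits = 50        # quantitative traits named in the abstract
n_environments = 4   # smoking, sex, age, statin usage
alpha = 0.05         # family-wise error rate

n_tests = n_traits * n_environments  # number of trait-E pairs tested
threshold = alpha / n_tests          # Bonferroni-corrected per-pair threshold


def is_significant(p_value, threshold=threshold):
    """True when a trait-E pair passes the corrected threshold."""
    return p_value < threshold
```

Under this reading, a trait-E pair needs a p-value below 2.5 × 10⁻⁴ to be counted among the significant pairs.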
Affiliation(s)
- Ali Pazokitoroudi
  - Department of Computer Science, UCLA, Los Angeles, CA, USA; Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Zhengtong Liu
  - Department of Computer Science, UCLA, Los Angeles, CA, USA
- Andrew Dahl
  - Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
- Noah Zaitlen
  - Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA; Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA; Department of Neurology, UCLA, Los Angeles, CA, USA
- Saharon Rosset
  - Department of Statistics, Tel-Aviv University, Tel-Aviv, Israel
- Sriram Sankararaman
  - Department of Computer Science, UCLA, Los Angeles, CA, USA; Department of Human Genetics, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA; Department of Computational Medicine, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA

6
Boye C, Nirmalan S, Ranjbaran A, Luca F. Genotype × environment interactions in gene regulation and complex traits. Nat Genet 2024; 56:1057-1068. [PMID: 38858456 PMCID: PMC11492161 DOI: 10.1038/s41588-024-01776-w] [Received: 06/13/2023] [Accepted: 04/25/2024] [Indexed: 06/12/2024]
Abstract
Genotype × environment interactions (GxE) have long been recognized as a key mechanism underlying human phenotypic variation. Technological developments over the past 15 years have dramatically expanded our appreciation of the role of GxE in both gene regulation and complex traits. The richness and complexity of these datasets also required parallel efforts to develop robust and sensitive statistical and computational approaches. Although our understanding of the genetic architecture of molecular and complex traits has been maturing, a large proportion of complex trait heritability remains unexplained. Furthermore, there are increasing efforts to characterize the effect of environmental exposure on human health. We therefore review GxE in human gene regulation and complex traits, advocating for a comprehensive approach that jointly considers genetic and environmental factors in human health and disease.
Affiliation(s)
- Carly Boye
  - Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
- Shreya Nirmalan
  - Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
- Ali Ranjbaran
  - Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
- Francesca Luca
  - Center for Molecular Medicine and Genetics, Wayne State University, Detroit, MI, US
  - Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, US
  - Department of Biology, University of Rome "Tor Vergata", Rome, Italy

7
Dong Z, Jiang W, Li H, DeWan AT, Zhao H. LDER-GE estimates phenotypic variance component of gene-environment interactions in human complex traits accurately with GE interaction summary statistics and full LD information. Brief Bioinform 2024; 25:bbae335. [PMID: 38980374 PMCID: PMC11232466 DOI: 10.1093/bib/bbae335] [Received: 04/08/2024] [Revised: 06/05/2024] [Accepted: 06/26/2024] [Indexed: 07/10/2024] Open
Abstract
Gene-environment (GE) interactions are essential in understanding human complex traits. Identifying these interactions is necessary for deciphering the biological basis of such traits. In this study, we review state-of-the-art methods for estimating the proportion of phenotypic variance explained by genome-wide GE interactions and introduce a novel statistical method, Linkage-Disequilibrium Eigenvalue Regression for Gene-Environment interactions (LDER-GE). LDER-GE improves the accuracy of estimating the phenotypic variance component explained by genome-wide GE interactions using large-scale biobank association summary statistics. LDER-GE leverages the complete Linkage Disequilibrium (LD) matrix, as opposed to only the diagonal squared LD matrix utilized by LDSC (Linkage Disequilibrium Score)-based methods. Our extensive simulation studies demonstrate that LDER-GE performs better than LDSC-based approaches by enhancing statistical efficiency by ~23%. This improvement is equivalent to a sample size increase of around 51%. Additionally, LDER-GE effectively controls the type-I error rate and produces unbiased results. We conducted an analysis using UK Biobank data, comprising 307,259 unrelated European-ancestry subjects and 966,766 variants, across 217 environmental covariate-phenotype (E-Y) pairs. LDER-GE identified 34 significant E-Y pairs, whereas the LDSC-based method identified only 23, of which 22 overlapped with those identified by LDER-GE. Furthermore, we employed LDER-GE to estimate the aggregated variance component attributed to multiple GE interactions, leading to an increase in the explained phenotypic variance with GE interactions compared to considering main genetic effects only. Our results suggest that GE interactions have an important impact on human complex traits.
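The two efficiency figures in this abstract can be related by a short calculation. This is one possible reading, not a derivation given in the abstract: if the ~23% gain is taken as a ratio of standard errors, and the sampling variance of the estimator is assumed to scale as 1/n, then the equivalent sample-size ratio is the square of the standard-error ratio. The function name below is hypothetical.

```python
def equivalent_sample_size_gain(se_ratio):
    """Relative sample-size gain implied by a ratio of standard errors.

    Assumes the estimator's sampling variance is proportional to 1/n,
    so the required sample size scales with the squared standard-error ratio.
    """
    return se_ratio ** 2 - 1.0


# Assumed interpretation: competing method's SE is 1.23 times that of LDER-GE.
gain = equivalent_sample_size_gain(1.23)  # 1.23**2 - 1, about 0.51
```

Under these assumptions, the 23% figure and the "around 51%" sample-size figure describe the same improvement on two different scales.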
Affiliation(s)
- Zihan Dong
  - Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06510, United States
  - Center for Perinatal, Pediatric and Environmental Epidemiology, 60 College Street, Yale School of Public Health, New Haven, CT 06510, United States
- Wei Jiang
  - Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06510, United States
- Hongyu Li
  - Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06510, United States
- Andrew T DeWan
  - Center for Perinatal, Pediatric and Environmental Epidemiology, 60 College Street, Yale School of Public Health, New Haven, CT 06510, United States
  - Department of Chronic Disease Epidemiology, Yale School of Public Health, 60 College Street, New Haven, CT 06510, United States
- Hongyu Zhao
  - Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, CT 06510, United States

8
Kemper KE, Sidorenko J, Wang H, Hayes BJ, Wray NR, Yengo L, Keller MC, Goddard M, Visscher PM. Genetic influence on within-person longitudinal change in anthropometric traits in the UK Biobank. Nat Commun 2024; 15:3776. [PMID: 38710707 PMCID: PMC11074304 DOI: 10.1038/s41467-024-47802-7] [Received: 05/11/2023] [Accepted: 04/10/2024] [Indexed: 05/08/2024] Open
Abstract
The causes of temporal fluctuations in adult traits are poorly understood. Here, we investigate the genetic determinants of within-person trait variability of 8 repeatedly measured anthropometric traits in 50,117 individuals from the UK Biobank. We found that within-person (non-directional) variability had a SNP-based heritability of 2-5% for height, sitting height, body mass index (BMI) and weight (P ≤ 2.4 × 10⁻³). We also analysed longitudinal trait change and show a loss of both average height and weight beyond about 70 years of age. A variant tracking the Alzheimer's risk APOE-ε4 allele (rs429358) was significantly associated with weight loss (β = -0.047 kg per yr, s.e. 0.007, P = 2.2 × 10⁻¹¹), and using 2-sample Mendelian Randomisation we detected a relationship consistent with causality between decreased lumbar spine bone mineral density and height loss (b_xy = 0.011, s.e. 0.003, P = 3.5 × 10⁻⁴). Finally, population-level variance quantitative trait loci (vQTL) were consistent with within-person variability for several traits, indicating an overlap between trait variability assessed at the population or individual level. Our findings help elucidate the genetic influence on trait-change within an individual and highlight disease risks associated with these changes.
Affiliation(s)
- Kathryn E Kemper
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
- Julia Sidorenko
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
- Huanwei Wang
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
- Ben J Hayes
  - Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD, Australia
- Naomi R Wray
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
  - Department of Psychiatry, University of Oxford, Oxford, UK
- Loic Yengo
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
- Matthew C Keller
  - Institute for Behavioral Genetics, University of Colorado, Boulder, CO, USA
- Michael Goddard
  - Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, VIC, Australia
  - Biosciences Research Division, Agriculture Victoria, Bundoora, VIC, Australia
- Peter M Visscher
  - Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
  - Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Population Health, University of Oxford, Oxford, UK

9
Yu Z, Hu G, Wang J, Li Z. Association between hepatitis A seropositivity and bone mineral density in adolescents and adults: a cross-sectional study using NHANES data. Sao Paulo Med J 2024; 142:e2023266. [PMID: 38655984 PMCID: PMC11034701 DOI: 10.1590/1516-3180.2023.0266.r1.08022024] [Received: 08/06/2023] [Revised: 01/25/2024] [Accepted: 02/08/2024] [Indexed: 04/26/2024] Open
Abstract
BACKGROUND: Osteoporosis, characterized by decreased bone density and increased fracture risk, imposes significant physical, psychosocial, and financial burdens. Early detection and prevention are crucial for managing osteoporosis and reducing the risk of fractures. OBJECTIVES: To investigate the relationship between hepatitis A seropositivity and bone mineral density (BMD) in adolescents and adults and to explore the potential link between hepatitis A infection and osteoporosis risk. DESIGN AND SETTING: This cross-sectional study used data from the National Health and Nutrition Examination Survey (NHANES) from 2011 to 2018 to evaluate the association between hepatitis A seropositivity and BMD in 15,693 participants. METHODS: Multivariable regression analysis was used to calculate the mean BMD and standard error for adolescents and adults, followed by an independent z-test to determine whether there was a significant difference between the seropositive and seronegative groups. RESULTS: Hepatitis A seropositive adolescents and adults had lower BMD than their seronegative counterparts, with significant differences in lumbar spine (mean difference = -0.03 g/cm², P < 0.01 for both age groups) and pelvis BMDs (mean difference = -0.02 g/cm², P < 0.01 for the adult age group), after adjusting for various covariates. CONCLUSIONS: This study confirmed that individuals seropositive for hepatitis A antibodies had reduced BMD among both adolescents and adults, especially in the adult group. This finding suggests a possible link between hepatitis A infection and risk of osteoporosis.
Affiliation(s)
- Zhuowen Yu
  - Doctoral student, Physician. Department of Orthopedics, Second Xiangya Hospital of Central South University, Changsha, China; Hunan Key Laboratory of Tumor Models and Individualized Medicine, Second Xiangya Hospital of Central South University, Changsha, China
- Gunchu Hu
  - Doctoral student, Physician. Department of General Surgery, Second Xiangya Hospital of Central South University, Changsha, China
- Jiajie Wang
  - Master’s student, Physician. Department of Orthopedics, Second Xiangya Hospital of Central South University, Changsha, China; Hunan Key Laboratory of Tumor Models and Individualized Medicine, Second Xiangya Hospital of Central South University, Changsha, China
- Zhihong Li
  - PhD. Physician, Professor, Department of Orthopedics, Second Xiangya Hospital of Central South University, Changsha, China; Hunan Key Laboratory of Tumor Models and Individualized Medicine, Second Xiangya Hospital of Central South University, Changsha, China

10
Jayasinghe D, Momin MM, Beckmann K, Hyppönen E, Benyamin B, Lee SH. Mitigating type 1 error inflation and power loss in GxE PRS: Genotype-environment interaction in polygenic risk score models. Genet Epidemiol 2024; 48:85-100. [PMID: 38303123 DOI: 10.1002/gepi.22546] [Received: 07/25/2023] [Revised: 01/03/2024] [Accepted: 01/08/2024] [Indexed: 02/03/2024]
Abstract
The use of polygenic risk score (PRS) models has transformed the field of genetics by enabling the prediction of complex traits and diseases based on an individual's genetic profile. However, the impact of genotype-environment interaction (GxE) on the performance and applicability of PRS models remains a crucial aspect to be explored. Currently, existing genotype-environment interaction polygenic risk score (GxE PRS) models are often inappropriately used, which can result in inflated type 1 error rates and compromised results. In this study, we propose novel GxE PRS models that jointly incorporate additive and interaction genetic effects while also including an additional quadratic term for nongenetic covariates, enhancing their robustness against model misspecification. Through extensive simulations, we demonstrate that our proposed models outperform existing models in terms of controlling type 1 error rates and enhancing statistical power. Furthermore, we apply the proposed models to real data, and report significant GxE effects. Specifically, we highlight the impact of our models on both quantitative and binary traits. For quantitative traits, we uncover the GxE modulation of genetic effects on body mass index by alcohol intake frequency. In the case of binary traits, we identify the GxE modulation of genetic effects on hypertension by waist-to-hip ratio. These findings underscore the importance of employing a robust model that effectively controls type 1 error rates, thus preventing the occurrence of spurious GxE signals. To facilitate the implementation of our approach, we have developed an innovative R software package called GxEprs, specifically designed to detect and estimate GxE effects. Overall, our study highlights the importance of accurate GxE modeling and its implications for genetic risk prediction, while providing a practical tool to support further research in this area.
Affiliation(s)
- Dovini Jayasinghe
  - Australian Centre for Precision Health, University of South Australia, Adelaide, South Australia, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
  - South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, South Australia, Australia
- Md Moksedul Momin
  - Australian Centre for Precision Health, University of South Australia, Adelaide, South Australia, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
  - South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, South Australia, Australia
  - Department of Genetics and Animal Breeding, Faculty of Veterinary Medicine, Chattogram Veterinary and Animal Sciences University (CVASU), Khulshi, Chattogram, Bangladesh
- Kerri Beckmann
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
- Elina Hyppönen
  - Australian Centre for Precision Health, University of South Australia, Adelaide, South Australia, Australia
  - South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, South Australia, Australia
  - UniSA Clinical and Health Sciences, University of South Australia, Adelaide, South Australia, Australia
- Beben Benyamin
  - Australian Centre for Precision Health, University of South Australia, Adelaide, South Australia, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
  - South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, South Australia, Australia
- S Hong Lee
  - Australian Centre for Precision Health, University of South Australia, Adelaide, South Australia, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, South Australia, Australia
  - South Australian Health and Medical Research Institute (SAHMRI), University of South Australia, Adelaide, South Australia, Australia

11
Amente LD, Mills NT, Le TD, Hyppönen E, Lee SH. Unraveling phenotypic variance in metabolic syndrome through multi-omics. Hum Genet 2024; 143:35-47. [PMID: 38095720 DOI: 10.1007/s00439-023-02619-0] [Received: 08/30/2023] [Accepted: 11/18/2023] [Indexed: 01/19/2024]
Abstract
Complex multi-omics effects drive the clustering of cardiometabolic risk factors, underscoring the imperative to comprehend how individual and combined omics shape phenotypic variation. Our study partitions phenotypic variance in metabolic syndrome (MetS), blood glucose (GLU), triglycerides (TG), high-density lipoprotein cholesterol (HDL-C), and blood pressure through genome, transcriptome, metabolome, and exposome (i.e., lifestyle exposome) analyses. Our analysis included a cohort of 62,822 unrelated individuals with white British ancestry, sourced from the UK Biobank. We employed linear mixed models to partition phenotypic variance using the restricted maximum likelihood (REML) method, implemented in MTG2 (v2.22). We initiated the analysis by individually modeling omics, followed by subsequent integration of pairwise omics in a joint model that also accounted for the covariance and interaction between omics layers. Finally, we estimated the correlations of various omics effects between the phenotypes using bivariate REML. Significant proportions of the MetS variance were attributed to distinct data sources: genome (9.47%), transcriptome (4.24%), metabolome (14.34%), and exposome (3.77%). The phenotypic variances explained by the genome, transcriptome, metabolome, and exposome ranged from 3.28% for GLU to 25.35% for HDL-C, 0% for GLU to 19.34% for HDL-C, 4.29% for systolic blood pressure (SBP) to 35.75% for TG, and 0.89% for GLU to 10.17% for HDL-C, respectively. Significant correlations were found between genomic and transcriptomic effects for TG and HDL-C. Furthermore, significant interaction effects between omics data were detected for both MetS and its components. Interestingly, significant correlations of omics effects between the phenotypes were found. This study underscores omics' roles, interaction effects, and random-effects covariance in unveiling phenotypic variation in multi-omics domains.
Collapse
Affiliation(s)
- Lamessa Dube Amente
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.
- South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia.
- Natalie T Mills
- Discipline of Psychiatry, University of Adelaide, Adelaide, SA, 5000, Australia
- Thuc Duy Le
- UniSA STEM, University of South Australia, Mawson Lakes, SA, 5095, Australia
- Elina Hyppönen
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
- South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
- UniSA Clinical and Health Sciences, University of South Australia, Adelaide, SA, 5000, Australia
- S Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia.
- UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia.
- South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia.
12
Miao J, Wu Y, Lu Q. Statistical methods for gene-environment interaction analysis. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL STATISTICS 2024; 16:e1635. [PMID: 38699459 PMCID: PMC11064894 DOI: 10.1002/wics.1635] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/08/2023] [Accepted: 09/12/2023] [Indexed: 05/05/2024]
Abstract
Most human complex phenotypes result from multiple genetic and environmental factors and their interactions. Understanding the mechanisms by which genetic and environmental factors interact offers valuable insights into the genetic architecture of complex traits and holds great potential for advancing precision medicine. The emergence of large population biobanks has led to the development of numerous statistical methods aiming at identifying gene-environment interactions (G × E). In this review, we present state-of-the-art statistical methodologies for G × E analysis. We will survey a spectrum of approaches for single-variant G × E mapping, followed by various techniques for polygenic G × E analysis. We conclude this review with a discussion on the future directions and challenges in G × E research.
Affiliation(s)
- Jiacheng Miao
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, Wisconsin, USA
- Yixuan Wu
- University of Wisconsin–Madison, Madison, Wisconsin, USA
- Qiongshi Lu
- Department of Biostatistics and Medical Informatics, University of Wisconsin–Madison, Madison, Wisconsin, USA
- Department of Statistics, University of Wisconsin–Madison, Madison, Wisconsin, USA
- Center for Demography of Health and Aging, University of Wisconsin–Madison, Madison, Wisconsin, USA
13
Di Scipio M, Khan M, Mao S, Chong M, Judge C, Pathan N, Perrot N, Nelson W, Lali R, Di S, Morton R, Petch J, Paré G. A versatile, fast and unbiased method for estimation of gene-by-environment interaction effects on biobank-scale datasets. Nat Commun 2023; 14:5196. [PMID: 37626057 PMCID: PMC10457310 DOI: 10.1038/s41467-023-40913-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 04/23/2021] [Accepted: 08/16/2023] [Indexed: 08/27/2023] Open
Abstract
Identification of gene-by-environment interactions (GxE) is crucial to understand the interplay of environmental effects on complex traits. However, current methods evaluating GxE on biobank-scale datasets have limitations. We introduce MonsterLM, a multiple linear regression method that does not rely on model specification and provides unbiased estimates of variance explained by GxE. We demonstrate robustness of MonsterLM through comprehensive genome-wide simulations using real genetic data from 325,989 individuals. We estimate GxE using waist-to-hip-ratio, smoking, and exercise as the environmental variables on 13 outcomes (N = 297,529-325,989) in the UK Biobank. GxE variance is significant for 8 environment-outcome pairs, ranging from 0.009 to 0.071. The majority of GxE variance involves SNPs without strong marginal or interaction associations. We observe modest improvements in polygenic score prediction when incorporating GxE. Our results imply a significant contribution of GxE to complex trait variance and we show MonsterLM to be well-purposed to handle this with biobank-scale data.
Affiliation(s)
- Matteo Di Scipio
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
- Mohammad Khan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
- Shihong Mao
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Michael Chong
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada
- Conor Judge
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Nazia Pathan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
- Nicolas Perrot
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Walter Nelson
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
- Ricky Lali
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
- Shuang Di
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- Robert Morton
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada
- Jeremy Petch
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, ON, Canada
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada
- Guillaume Paré
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, ON, Canada.
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, ON, Canada.
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, ON, Canada.
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada.
14
Kuznetsov IA, Tsepilov YA, Freidin MB, Williams FMK, Suri P, Aulchenko YS. Genotype-by-environment interactions in chronic back pain. Spine J 2023; 23:1108-1114. [PMID: 37080360 DOI: 10.1016/j.spinee.2023.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/19/2022] [Revised: 04/06/2023] [Accepted: 04/11/2023] [Indexed: 04/22/2023]
Abstract
BACKGROUND CONTEXT: Chronic back pain (CBP) is a common debilitating condition with substantial societal impact. While understanding genotype-by-environment (GxE) interactions may be crucial to achieving the goals of personalized medicine, there are few large-scale studies investigating this topic for CBP. None of them systematically explore multiple CBP risk factors. PURPOSE: To estimate the extent to which genetic effects on CBP are modified by known demographic and clinical risk factors. RESEARCH DESIGN: Case-control study, genome-wide GxE interaction study. PATIENT SAMPLE: Data on up to 331,610 unrelated participants (57,881 CBP cases and 273,729 controls) from the UK Biobank cohort were used. UK Biobank is a prospective cohort with collected deep genetic and phenotypic data on approximately 500,000 individuals across the UK. OUTCOME MEASURES: Self-reported chronic back pain. METHODS: We applied a whole-genome approach to estimate the proportion of phenotypic variance explained by interactions between genotype and 12 known risk factors. We also analyzed if effects of common single-nucleotide polymorphisms on CBP are changed in presence of known risk factors. RESULTS: The results indicate a modest, if any, modification of genetic effects by examined risk factors in CBP. Our estimates suggest that detecting such weak effects would require a sample size of millions of individuals. CONCLUSIONS: The GxE interactions with examined common risk factors for CBP are either weak or absent. Interactions of such magnitude are unlikely to have the potential to inform and influence treatment strategies. Risk estimation models may use common genetic variation and the considered risk factors as independent predictors, without accounting for GxE.
Affiliation(s)
- Ivan A Kuznetsov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, 30 bld.1 Bolshoy Boulevard, Moscow 121205, Russia
- Yakov A Tsepilov
- Laboratory of Recombination and Segregation Analysis, Institute of Cytology and Genetics, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia; Laboratory of Theoretical and Applied Functional Genomics, Novosibirsk State University, 1 Pirogova St, Novosibirsk, 630090, Russia; Kurchatov genomics center of the Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia
- Maxim B Freidin
- Department of Biology, School of Biological and Behavioural Sciences, Queen Mary University of London, Mile End Rd, Bethnal Green, London E1 4DQ, UK
- Frances M K Williams
- Department of Twin Research and Genetic Epidemiology, King's College London, Westminster Bridge Rd, London SE1 7EH, UK
- Pradeep Suri
- Seattle Epidemiologic Research and Information Center, VA Puget Sound Health Care System, 1660 S. Columbian Way, Seattle, WA 98108, USA; Division of Rehabilitation Care Services, 1660 S. Columbian Way, Seattle, WA 98108, USA; Clinical Learning, Evidence, and Research Center, University of Washington, 325 Ninth Ave, Box 359612, Seattle, WA 98104, USA
- Yurii S Aulchenko
- Laboratory of Recombination and Segregation Analysis, Institute of Cytology and Genetics, 10 Lavrentiev Ave, Novosibirsk, 630090, Russia; PolyOmica, Het Vlaggeschip 61, 's-Hertogenbosch, PA 5237, The Netherlands.
15
Waters DL, van der Werf JHJ, Robinson H, Hickey LT, Clark SA. Partitioning the forms of genotype-by-environment interaction in the reaction norm analysis of stability. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:99. [PMID: 37027025 PMCID: PMC10082108 DOI: 10.1007/s00122-023-04319-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/31/2022] [Accepted: 02/07/2023] [Indexed: 05/13/2023]
Abstract
KEY MESSAGE: The reaction norm analysis of stability can be enhanced by partitioning the contribution of different types of G × E to the variation in slope. The slope of regression in a reaction norm model, where the performance of a genotype is regressed over an environmental covariable, is often used as a measure of stability of genotype performance. This method could be developed further by partitioning variation in the slope of regression into the two sources of genotype-by-environment interaction (G × E) which cause it: scale-type G × E (heterogeneity of variance) and rank-type G × E (heterogeneity of correlation). Because the two types of G × E have very different properties, separating their effect would enable a clearer understanding of stability. The aim of this paper was to demonstrate two methods which seek to achieve this in reaction norm models. Reaction norm models were fit to yield data from a multi-environment trial in barley (Hordeum vulgare), with the adjusted mean yield from each environment used as the environmental covariable. Stability estimated from factor-analytic models, which can disentangle the two types of G × E and estimate stability based on rank-type G × E, was used for comparison. Adjusting the reaction norm slope to account for scale-type G × E using a genetic regression more than tripled the correlation with factor-analytic estimates of stability (0.24-0.26 to 0.80-0.85), indicating that it removed variation in the reaction norm slope that originated from scale-type G × E. A standardisation procedure gave a more modest increase (0.55-0.59) but could be useful when curvilinear reaction norms are required. Analyses which use reaction norms to explore the stability of genotypes could gain additional insight into the mechanisms of stability by applying the methods outlined in this study.
Affiliation(s)
- Dominic L Waters
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
- Julius H J van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
- Hannah Robinson
- InterGrain Pty Ltd, Perth, WA, Australia
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
- Lee T Hickey
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
- Sam A Clark
- School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
16
Zhong W, Chhibber A, Luo L, Mehrotra DV, Shen J. A fast and powerful linear mixed model approach for genotype-environment interaction tests in large-scale GWAS. Brief Bioinform 2023; 24:6955097. [PMID: 36545787 DOI: 10.1093/bib/bbac547] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 07/07/2022] [Revised: 10/26/2022] [Accepted: 11/12/2022] [Indexed: 12/24/2022] Open
Abstract
Genotype-by-environment interaction (GEI or GxE) plays an important role in understanding complex human traits. However, it is usually challenging to detect GEI signals efficiently and accurately while adjusting for population stratification and sample relatedness in large-scale genome-wide association studies (GWAS). Here we propose a fast and powerful linear mixed model-based approach, fastGWA-GE, to test for GEI effect and G + GxE joint effect. Our extensive simulations show that fastGWA-GE outperforms other existing GEI test methods by controlling genomic inflation better, providing larger power and running hundreds to thousands of times faster. We performed a fastGWA-GE analysis of ~7.27 million variants on 452 249 individuals of European ancestry for 13 quantitative traits and five environment variables in the UK Biobank GWAS data and identified 96 significant signals (72 variants across 57 loci) with GEI test P-values < 1 × 10⁻⁹, including 27 novel GEI associations, which highlights the effectiveness of fastGWA-GE in GEI signal discovery in large-scale GWAS.
Affiliation(s)
- Wujuan Zhong
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
- Aparna Chhibber
- Translational Bioinformatics, Bristol Myers Squibb, Lawrenceville, NJ 08540, USA
- Lan Luo
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, PA 19454, USA
- Devan V Mehrotra
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., North Wales, PA 19454, USA
- Judong Shen
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., Rahway, NJ 07065, USA
17
Gillett AC, Jermy BS, Lee SH, Pain O, Howard DM, Hagenaars SP, Hanscombe KB, Coleman JRI, Lewis CM. Exploring polygenic-environment and residual-environment interactions for depressive symptoms within the UK Biobank. Genet Epidemiol 2022; 46:219-233. [PMID: 35438196 PMCID: PMC9541465 DOI: 10.1002/gepi.22449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/06/2021] [Revised: 02/04/2022] [Accepted: 03/15/2022] [Indexed: 11/10/2022]
Abstract
Substantial advances have been made in identifying genetic contributions to depression, but little is known about how the effect of genes can be modulated by the environment, creating a gene-environment interaction. Using multivariate reaction norm models (MRNMs) within the UK Biobank (N = 61294-91644), we investigate whether the polygenic and residual variance components of depressive symptoms are modulated by 17 a priori selected covariate traits (12 environmental variables and 5 biomarkers). MRNMs, a mixed-effects modelling approach, provide unbiased polygenic-covariate interaction estimates for a quantitative trait by controlling for outcome-covariate correlations and residual-covariate interactions. A continuous depressive symptom variable was the outcome in 17 MRNMs, one for each covariate trait. Each MRNM had a fixed-effects model (fixed effects included the covariate trait, demographic variables, and principal components) and a random effects model (where polygenic-covariate and residual-covariate interactions are modelled). Of the 17 selected covariates, 11 significantly modulate deviations in depressive symptoms through the modelled interactions, but no single interaction explains a large proportion of phenotypic variation. Results are dominated by residual-covariate interactions, suggesting that covariate traits (including neuroticism, childhood trauma, and BMI) typically interact with unmodelled variables, rather than a genome-wide polygenic component, to influence depressive symptoms. Only average sleep duration has a polygenic-covariate interaction explaining a demonstrably nonzero proportion of the variability in depressive symptoms. This effect is small, accounting for only 1.22% (95% confidence interval: [0.54, 1.89]) of variation. The presence of an interaction highlights a specific focus for intervention, but the negative results here indicate a limited contribution from polygenic-environment interactions.
Affiliation(s)
- Alexandra C Gillett
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
- Bradley S Jermy
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
- Sang Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
- Oliver Pain
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
- David M Howard
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; Division of Psychiatry, Royal Edinburgh Hospital, University of Edinburgh, Edinburgh, UK
- Saskia P Hagenaars
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Ken B Hanscombe
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Jonathan R I Coleman
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK
- Cathryn M Lewis
- Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK; NIHR Maudsley Biomedical Research Centre, South London and Maudsley NHS Trust, London, UK; Department of Medical and Molecular Genetics, Faculty of Life Sciences and Medicine, King's College London, London, UK
18
From Mendel to quantitative genetics in the genome era: the scientific legacy of W. G. Hill. Nat Genet 2022; 54:934-939. [PMID: 35817969 DOI: 10.1038/s41588-022-01103-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2022] [Accepted: 05/18/2022] [Indexed: 11/08/2022]
Abstract
The quantitative geneticist W. G. ('Bill') Hill, awardee of the 2018 Darwin Medal of the Royal Society and the 2019 Mendel Medal of the Genetics Society (United Kingdom), died on 17 December 2021 at the age of 81 years. Here, we pay tribute to his multiple key scientific contributions, which span population and evolutionary genetics, animal and plant breeding and human genetics. We discuss his theoretical research on the role of linkage disequilibrium (LD) and mutational variance in the response to selection, the origin of the widely used LD metric r² in genomic association studies, the genetic architecture of complex traits, the quantification of the variation in realized relationships given a pedigree relationship and much more. We demonstrate that basic theoretical research in quantitative and statistical genetics has led to profound insights into the genetics and evolution of complex traits and made predictions that were subsequently empirically validated, often decades later.
19
Ahmed M, Mäkinen VP, Mulugeta A, Shin J, Boyle T, Hyppönen E, Lee SH. Considering hormone-sensitive cancers as a single disease in the UK biobank reveals shared aetiology. Commun Biol 2022; 5:614. [PMID: 35729236 PMCID: PMC9213416 DOI: 10.1038/s42003-022-03554-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 10/03/2021] [Accepted: 06/02/2022] [Indexed: 11/09/2022] Open
Abstract
Hormone-related cancers, including cancers of the breast, prostate, ovaries, uterus, and thyroid, globally contribute to the majority of cancer incidence. We hypothesize that hormone-sensitive cancers share common genetic risk factors that have rarely been investigated by previous genomic studies of site-specific cancers. Here, we show that considering hormone-sensitive cancers as a single disease in the UK Biobank reveals shared genetic aetiology. We observe that a significant proportion of variance in disease liability is explained by the genome-wide single nucleotide polymorphisms (SNPs), i.e., SNP-based heritability on the liability scale is estimated as 10.06% (SE 0.70%). Moreover, we find 55 genome-wide significant SNPs for the disease, using a genome-wide association study. Pair-wise analysis also estimates positive genetic correlations between some pairs of hormone-sensitive cancers although they are not statistically significant. Our finding suggests that heritable genetic factors may be a key driver in the mechanism of carcinogenesis shared by hormone-sensitive cancers.
Affiliation(s)
- Muktar Ahmed
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; Department of Epidemiology, Faculty of Public Health, Jimma University Institute of Health, Jimma, Ethiopia; UniSA Clinical and Health Sciences, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia.
- Ville-Petteri Mäkinen
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; Computational Systems Biology Program, Precision Medicine Theme, South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- Anwar Mulugeta
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; UniSA Clinical and Health Sciences, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- Jisu Shin
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health & Human Performance, University of South Australia, Adelaide, SA, Australia
- Terry Boyle
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia; UniSA Allied Health & Human Performance, University of South Australia, Adelaide, SA, Australia
- Elina Hyppönen
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; UniSA Clinical and Health Sciences, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- Sang Hong Lee
- Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia; UniSA Allied Health & Human Performance, University of South Australia, Adelaide, SA, Australia.
20
Waters DL, Clark SA, Moghaddar N, van der Werf JH. Genomic analysis of the slope of the reaction norm for body weight in Australian sheep. Genet Sel Evol 2022; 54:40. [PMID: 35659541 PMCID: PMC9164502 DOI: 10.1186/s12711-022-00734-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 02/22/2022] [Accepted: 05/20/2022] [Indexed: 11/23/2022] Open
Abstract
Background: Selection of livestock based on their robustness or sensitivity to environmental variation could help improve the efficiency of production systems, particularly in the light of climate change. Genetic variation in robustness arises from genotype-by-environment (G × E) interactions, with genotypes performing differently when animals are raised in contrasted environments. Understanding the nature of this genetic variation is essential to implement strategies to improve robustness. In this study, our aim was to explore the genetics of robustness in Australian sheep to different growth environments using linear reaction norm models (RNM), with post-weaning weight records of 22,513 lambs and 60 k single nucleotide polymorphisms (SNPs). The use of scale-corrected genomic estimated breeding values (GEBV) for the slope to account for scale-type G × E interactions was also investigated.
Results: Additive genetic variance was observed for the slope of the RNM, with genetic correlations between low- and high-growth environments indicating substantial re-ranking of genotypes (0.44–0.49). The genetic variance increased from low- to high-growth environments. The heritability of post-weaning body weight ranged from 0.28 to 0.39. The genetic correlation between intercept and slope of the reaction norm for post-weaning body weight was low to moderate when based on the estimated (co)variance components but was much higher when based on back-solved SNP effects. An initial analysis suggested that a region on chromosome 11 affected both the intercept and the slope, but when the GEBV for the slope were conditioned on the GEBV for the intercept to remove the effect of scale-type G × E interactions on SNP effects for robustness, a single genomic region on chromosome 7 was found to be associated with robustness. This region included genes previously associated with growth traits and disease susceptibility in livestock.
Conclusions: This study shows a significant genetic variation in the slope of RNM that could be used for selecting for increased robustness of sheep. Both scale-type and rank-type G × E interactions contributed to variation in the slope. The correction for scale effects of GEBV for the slope should be considered when analysing robustness using RNM. Overall, robustness appears to be a highly polygenic trait.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12711-022-00734-6.
Affiliation(s)
- Dominic L Waters
- School of Environmental & Rural Science, University of New England, Armidale, NSW, 2351, Australia.
- Sam A Clark
- School of Environmental & Rural Science, University of New England, Armidale, NSW, 2351, Australia
- Nasir Moghaddar
- School of Environmental & Rural Science, University of New England, Armidale, NSW, 2351, Australia
- Julius H van der Werf
- School of Environmental & Rural Science, University of New England, Armidale, NSW, 2351, Australia
21
Gershon ES, Lee SH, Zhou X, Sweeney JA, Tamminga C, Pearlson GA, Clementz BA, Keshavan MS, Alliey-Rodriguez N, Hudgens-Haney M, Keedy SK, Glahn DC, Asif H, Lencer R, Hill SK. An opportunity for primary prevention research in psychotic disorders. Schizophr Res 2022; 243:433-439. [PMID: 34315649 PMCID: PMC8784565 DOI: 10.1016/j.schres.2021.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/21/2021] [Revised: 04/29/2021] [Accepted: 07/01/2021] [Indexed: 10/20/2022]
Abstract
An opportunity has opened for research into primary prevention of psychotic disorders, based on progress in endophenotypes, genetics, and genomics. Primary prevention requires reliable prediction of susceptibility before any symptoms are present. We studied a battery of measures for which published data support abnormalities prior to the appearance of initial psychosis symptoms. These neurobiological and behavioral measurements included cognition, eye movement tracking, event-related potentials, and polygenic risk scores. They generated an acceptably precise separation of healthy controls from outpatients with a psychotic disorder. METHODS: The Bipolar and Schizophrenia Network on Intermediate Phenotypes (B-SNIP) measured this battery in an ancestry-diverse series of consecutively recruited adult outpatients with a psychotic disorder and healthy controls. Participants included all genders, were 16 to 50 years of age, and comprised 261 patients with psychotic disorders (schizophrenia (SZ) 109, bipolar disorder with psychosis (BPP) 92, schizoaffective disorder (SAD) 60) and 110 healthy controls. Logistic regression, and an extension of the linear mixed model to include analysis of pairwise interactions between measures (environmental kernel relationship matrices (ERM)) with multiple iterations, were performed to predict case-control status. Each regression analysis was validated with four-fold cross-validation. RESULTS AND CONCLUSIONS: Sensitivity, specificity, and area under the receiver operating characteristic curve of 85%, 62%, and 86%, respectively, were obtained for both analytic methods. These prediction metrics demonstrate a promising diagnostic distinction based on premorbid risk variables. There were also statistically significant pairwise interactions between measures in the ERM model. The strong prediction metrics of both types of analytic model provide proof-of-principle for biologically based laboratory tests as a first step toward primary prevention studies. Prospective studies of adolescents at elevated risk, vs. healthy adolescent controls, would be a next step toward development of primary prevention strategies.
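As a rough illustration of the metrics reported in this abstract, the following Python sketch shows how sensitivity and specificity follow from predicted and true case-control labels, and how a four-fold split holds each participant out exactly once. The function names and the toy labels are hypothetical and are not taken from the B-SNIP analysis.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = case, 0 = control)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # cases called cases
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # cases missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # controls called controls
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # controls called cases
    return tp / (tp + fn), tn / (tn + fp)


def k_fold_indices(n, k=4):
    """Assign indices 0..n-1 to k folds so each index is held out exactly once."""
    return [list(range(fold, n, k)) for fold in range(k)]


# Toy labels, invented for illustration only.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 1, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(sens, spec)  # 0.75 0.5
```

In a cross-validated analysis, the model would be fitted on three folds and these metrics computed on the held-out fold, repeating for each fold.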
Affiliation(s)
- Elliot S Gershon
  - University of Chicago, Department of Psychiatry, United States of America; University of Chicago, Department of Human Genetics, United States of America.
- S Hong Lee
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA 5000, Australia; UniSA: Allied Health and Human Performance, University of South Australia, Adelaide, SA 5000, Australia; South Australian Health and Medical Research Institute, Adelaide, South Australia 5000, Australia.
- Xuan Zhou
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA 5000, Australia; UniSA: Allied Health and Human Performance, University of South Australia, Adelaide, SA 5000, Australia; South Australian Health and Medical Research Institute, Adelaide, South Australia 5000, Australia.
- John A Sweeney
  - University of Cincinnati, Department of Psychiatry, United States of America; Sichuan University, Huaxi Center for MR Research, China.
- Carol Tamminga
  - University of Texas Southwestern, United States of America.
- David C Glahn
  - Harvard Medical School, Boston Children's Hospital, United States of America.
- Huma Asif
  - University of Chicago, United States of America.
- Rebekka Lencer
  - University of Muenster, Muenster, Germany; Department of Psychiatry and Psychotherapy, University of Luebeck, Luebeck, Germany.
- S Kristian Hill
  - Rosalind Franklin University of Medicine and Science, United States of America.
22
Li M, Zhang YW, Zhang ZC, Xiang Y, Liu MH, Zhou YH, Zuo JF, Zhang HQ, Chen Y, Zhang YM. A compressed variance component mixed model for detecting QTNs and QTN-by-environment and QTN-by-QTN interactions in genome-wide association studies. MOLECULAR PLANT 2022; 15:630-650. [PMID: 35202864 DOI: 10.1016/j.molp.2022.02.012] [Citation(s) in RCA: 65] [Impact Index Per Article: 21.7] [Received: 09/04/2021] [Revised: 01/26/2022] [Accepted: 02/19/2022] [Indexed: 05/25/2023]
Abstract
Although genome-wide association studies are widely used to mine genes for quantitative traits, the effects to be estimated are confounded, and the methodologies for detecting interactions are imperfect. To address these issues, the mixed model proposed here first estimates the genotypic effects for AA, Aa, and aa, and the genotypic polygenic background replaces additive and dominance polygenic backgrounds. Then, the estimated genotypic effects are partitioned into additive and dominance effects using a one-way analysis of variance model. This strategy was further expanded to cover QTN-by-environment interactions (QEIs) and QTN-by-QTN interactions (QQIs) using the same mixed-model framework. Thus, a three-variance-component mixed model was integrated with our multi-locus random-SNP-effect mixed linear model (mrMLM) method to establish a new methodological framework, 3VmrMLM, that detects all types of loci and estimates their effects. In Monte Carlo studies, 3VmrMLM correctly detected all types of loci and almost unbiasedly estimated their effects, with high powers and accuracies and a low false positive rate. In re-analyses of 10 traits in 1439 rice hybrids, detection of 269 known genes, 45 known gene-by-environment interactions, and 20 known gene-by-gene interactions strongly validated 3VmrMLM. Further analyses of known genes showed more small (67.49%), minor-allele-frequency (35.52%), and pleiotropic (30.54%) genes, with higher repeatability across datasets (54.36%) and more dominance loci. In addition, a heteroscedasticity mixed model in multiple environments and dimension reduction methods in quite a number of environments were developed to detect QEIs, and variable selection under a polygenic background was proposed for QQI detection. This study provides a new approach for revealing the genetic architecture of quantitative traits.
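The partition of genotypic effects into additive and dominance effects mentioned in this abstract can be illustrated with the classical quantitative-genetics parameterization, in which the additive effect is half the difference between the two homozygote means and the dominance effect is the deviation of the heterozygote from the homozygote midpoint. The Python sketch below uses that textbook definition; it is an assumption that this matches the one-way analysis of variance model in the paper, and the function name and input values are hypothetical.

```python
def additive_dominance(mean_AA, mean_Aa, mean_aa):
    """Split three genotypic means into additive (a) and dominance (d) effects.

    Textbook parameterization (assumed here, not taken from the paper):
      a = (mean_AA - mean_aa) / 2
      d = mean_Aa - (mean_AA + mean_aa) / 2
    """
    midpoint = (mean_AA + mean_aa) / 2.0  # midpoint of the two homozygotes
    a = (mean_AA - mean_aa) / 2.0         # additive effect
    d = mean_Aa - midpoint                # dominance deviation
    return a, d


# Invented genotypic means for AA, Aa, and aa.
a, d = additive_dominance(12.0, 11.0, 8.0)
print(a, d)  # 2.0 1.0
```

A locus with d equal to zero is purely additive, while d different from zero indicates dominance.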
Affiliation(s)
- Mei Li
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Ya-Wen Zhang
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China; State Key Laboratory of Cotton Biology, Anyang 455000, China
- Ze-Chang Zhang
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Yu Xiang
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Ming-Hui Liu
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Ya-Hui Zhou
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Jian-Fang Zuo
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Han-Qing Zhang
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Ying Chen
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Yuan-Ming Zhang
  - Crop Information Center, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
23
Shin J, Zhou X, Tan JTM, Hyppönen E, Benyamin B, Lee SH. Lifestyle Modifies the Diabetes-Related Metabolic Risk, Conditional on Individual Genetic Differences. Front Genet 2022; 13:759309. [PMID: 35356427 PMCID: PMC8959634 DOI: 10.3389/fgene.2022.759309] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/17/2021] [Accepted: 01/10/2022] [Indexed: 12/26/2022] Open
Abstract
Metabolic syndrome is a group of heritable metabolic traits that are highly associated with type 2 diabetes (T2DM). Classical interventions for T2DM include individual self-management of environmental risk factors, such as improving diet quality, increasing physical activity, and reducing smoking and alcohol consumption, which decreases the risk of developing metabolic syndrome. However, it is poorly understood how the phenotypes of diabetes-related metabolic traits change with respect to lifestyle modifications at the individual level. In the analysis, we used 12 diabetes-related metabolic traits and eight lifestyle covariates from the UK Biobank comprising 288,837 white British participants genotyped for 1,133,273 genome-wide single nucleotide polymorphisms. We found 16 GxE interactions. Modulation of genetic effects by physical activity was seen for four traits (glucose, HbA1c, C-reactive protein, systolic blood pressure), by alcohol for three (BMI, glucose, waist-hip ratio), and by smoking for three (BMI, diastolic blood pressure, systolic blood pressure). We also found a number of significant phenotypic modulations by the lifestyle covariates, which were not attributed to the genetic effects in the model. Overall, modulation in the metabolic risk in response to the level of lifestyle covariates was clearly observed, and its direction and magnitude varied depending on individual differences. We also showed that the metabolic risk inferred by our model was notably higher in T2DM prospective cases than controls. Our findings highlight the importance of individual genetic differences in the prevention and management of diabetes and suggest that the one-size-fits-all approach may not benefit all.
Affiliation(s)
- Jisu Shin
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia; National Cancer Center, Goyang-si, South Korea
- Xuan Zhou
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
- Joanne T M Tan
  - Vascular Research Centre, Heart and Vascular Health Program, Lifelong Health Theme, South Australian Health and Medical Research Institute, Adelaide, SA, Australia; Adelaide Medical School, University of Adelaide, Adelaide, SA, Australia
- Elina Hyppönen
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia; UniSA Clinical and Health Sciences, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- Beben Benyamin
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- S Hong Lee
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia; South Australian Health and Medical Research Institute, Adelaide, SA, Australia
24
Adiposity and cancer: a Mendelian randomization analysis in the UK biobank. Int J Obes (Lond) 2021; 45:2657-2665. [PMID: 34453097 DOI: 10.1038/s41366-021-00942-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Received: 01/13/2021] [Revised: 07/21/2021] [Accepted: 08/11/2021] [Indexed: 02/07/2023]
Abstract
BACKGROUND: Observational and Mendelian randomization (MR) studies link obesity and cancer, but it remains unclear whether these associations depend upon related metabolic abnormalities. METHODS: We used information from 321,472 participants in the UK biobank, including 30,561 cases of obesity-related cancer. We constructed three genetic instruments reflecting higher adiposity together with either an "unfavourable" (82 SNPs), "favourable" (24 SNPs) or "neutral" metabolic profile (25 SNPs). We looked at associations with 14 types of cancer previously suggested to be associated with obesity. RESULTS: All genetic instruments had a strong association with BMI (p < 1 × 10^-300 for all). The instrument reflecting unfavourable adiposity was also associated with higher CRP, higher HbA1c and an adverse lipid profile, while the instrument reflecting metabolically favourable adiposity was associated with lower HbA1c and a favourable lipid profile. In MR inverse-variance weighted analysis, unfavourable adiposity was associated with an increased risk of non-hormonal cancers (OR = 1.22, 95% confidence interval [CI]: 1.08, 1.38), but a lower risk of hormonal cancers (OR = 0.80, 95% CI: 0.72, 0.89). For individual cancers, MR analyses suggested that greater genetically instrumented unfavourable adiposity causally increases the risk of multiple myeloma (OR = 1.36, 95% CI: 1.09, 1.70) and endometrial cancer (OR = 1.77, 95% CI: 1.16, 2.68) but lowers the risks of breast and prostate cancer (OR = 0.72, 95% CI: 0.61, 0.83 and OR = 0.81, 95% CI: 0.68, 0.97, respectively). Favourable or neutral adiposity were not associated with the odds of any individual cancer. CONCLUSIONS: Higher adiposity was associated with a higher risk of non-hormonal cancer but a lower risk of some hormone-related cancers. Presence of metabolic abnormalities might aggravate the adverse effects of higher adiposity on cancer. Further studies are warranted to investigate whether interventions on adverse metabolic health may help to alleviate obesity-related cancer risk.
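For readers unfamiliar with the inverse-variance weighted (IVW) estimator named in this abstract, the Python sketch below shows the standard fixed-effect form: each SNP contributes a ratio estimate (SNP-outcome effect divided by SNP-exposure effect), weighted by the squared SNP-exposure effect over the squared standard error of the SNP-outcome effect. The function name and the three-SNP summary statistics are invented for illustration; the study's actual instruments and software are not reproduced here.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW estimate and standard error from per-SNP summary statistics."""
    # First-order weights: beta_X^2 / se_Y^2 for each SNP.
    weights = [bx * bx / (se * se) for bx, se in zip(beta_exposure, se_outcome)]
    # Per-SNP ratio (Wald) estimates of the causal effect.
    ratios = [by / bx for bx, by in zip(beta_exposure, beta_outcome)]
    total = sum(weights)
    beta = sum(w * r for w, r in zip(weights, ratios)) / total
    se = math.sqrt(1.0 / total)
    return beta, se


# Hypothetical summary statistics for three SNPs (log-odds scale for the outcome).
bx = [0.10, 0.20, 0.05]
by = [0.02, 0.04, 0.01]
se_y = [0.01, 0.02, 0.01]

beta, se_beta = ivw_estimate(bx, by, se_y)
odds_ratio = math.exp(beta)
ci_low = math.exp(beta - 1.96 * se_beta)
ci_high = math.exp(beta + 1.96 * se_beta)
print(round(odds_ratio, 3), round(ci_low, 3), round(ci_high, 3))
```

Exponentiating the estimate and its confidence limits gives an odds ratio with a 95% confidence interval in the same form as those quoted in the abstract.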
25
Zhou X, Lee SH. An integrative analysis of genomic and exposomic data for complex traits and phenotypic prediction. Sci Rep 2021; 11:21495. [PMID: 34728654 PMCID: PMC8564528 DOI: 10.1038/s41598-021-00427-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 03/11/2021] [Accepted: 10/12/2021] [Indexed: 12/18/2022] Open
Abstract
Complementary to the genome, the concept of exposome has been proposed to capture the totality of human environmental exposures. While there has been some recent progress on the construction of the exposome, few tools exist that can integrate the genome and exposome for complex trait analyses. Here we propose a linear mixed model approach to bridge this gap, which jointly models the random effects of the two omics layers on phenotypes of complex traits. We illustrate our approach using traits from the UK Biobank (e.g., BMI and height for N ~ 35,000) with a small fraction of the exposome that comprises 28 lifestyle factors. The joint model of the genome and exposome explains substantially more phenotypic variance and significantly improves phenotypic prediction accuracy, compared to the model based on the genome alone. The additional phenotypic variance captured by the exposome includes its additive effects as well as non-additive effects such as genome-exposome (gxe) and exposome-exposome (exe) interactions. For example, 19% of variation in BMI is explained by additive effects of the genome, while an additional 7.2% is explained by additive effects of the exposome, 1.9% by exe interactions and 4.5% by gxe interactions. Correspondingly, the prediction accuracy for BMI, computed using Pearson's correlation between the observed and predicted phenotypes, improves from 0.15 (based on the genome alone) to 0.35 (based on the genome and exposome). We also show, using established theories, that integrating genomic and exposomic data can be an effective way of attaining a clinically meaningful level of prediction accuracy for disease traits. In conclusion, the genomic and exposomic effects can contribute to phenotypic variation via their latent relationships, i.e. genome-exposome correlation, and gxe and exe interactions, and modelling these effects has the potential to improve phenotypic prediction accuracy and thus holds great promise for future clinical practice.
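Two quantities in this abstract lend themselves to a small worked example: the BMI variance shares and the prediction accuracy measured as Pearson's correlation. The Python sketch below sums the four quoted shares, which is meaningful only under the assumption that they are non-overlapping fractions of the same phenotypic variance, and implements Pearson's correlation directly. The dictionary keys, function name, and toy phenotype values are hypothetical.

```python
import math

# Variance shares for BMI as quoted in the abstract (percent of phenotypic variance).
bmi_variance_shares = {
    "genome_additive": 19.0,
    "exposome_additive": 7.2,
    "exe_interaction": 1.9,
    "gxe_interaction": 4.5,
}
# Assumes the shares do not overlap; rounded to avoid floating-point noise.
total_explained = round(sum(bmi_variance_shares.values()), 1)


def pearson_r(observed, predicted):
    """Pearson correlation between observed and predicted phenotypes."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


# Invented observed and predicted values, for illustration only.
observed = [22.1, 27.4, 31.0, 24.8, 29.3]
predicted = [23.0, 26.1, 29.5, 25.2, 27.9]
print(total_explained, round(pearson_r(observed, predicted), 3))
```

Under that non-overlap assumption, the quoted shares add up to 32.6% of the variation in BMI, compared with 19% for the genome alone.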
Affiliation(s)
- Xuan Zhou
  - Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
  - South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
- S Hong Lee
  - Australian Centre for Precision Health, University of South Australia, Adelaide, SA, 5000, Australia
  - UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
  - South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
26
Kendall KM, Van Assche E, Andlauer TFM, Choi KW, Luykx JJ, Schulte EC, Lu Y. The genetic basis of major depression. Psychol Med 2021; 51:2217-2230. [PMID: 33682643 DOI: 10.1017/s0033291721000441] [Citation(s) in RCA: 87] [Impact Index Per Article: 21.8] [Indexed: 12/12/2022]
Abstract
Major depressive disorder (MDD) is a common, debilitating, phenotypically heterogeneous disorder with heritability ranging from 30% to 50%. Compared to other psychiatric disorders, its high prevalence, moderate heritability, and strong polygenicity have posed major challenges for gene-mapping in MDD. Studies of common genetic variation in MDD, driven by large international collaborations such as the Psychiatric Genomics Consortium, have confirmed the highly polygenic nature of the disorder and implicated over 100 genetic risk loci to date. Rare copy number variants associated with MDD risk were also recently identified. The goal of this review is to present a broad picture of our current understanding of the epidemiology, genetic epidemiology, molecular genetics, and gene-environment interplay in MDD. Insights into the impact of genetic factors on the aetiology of this complex disorder hold great promise for improving clinical care.
Affiliation(s)
- K M Kendall
  - MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University, Cardiff, UK
- E Van Assche
  - Department of Psychiatry, University of Muenster, Muenster, Germany
- T F M Andlauer
  - Department of Neurology, Klinikum rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
- K W Choi
  - Department of Psychiatry, Massachusetts General Hospital, Boston, MA 02114, USA
  - Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
  - Department of Epidemiology, Harvard TH Chan School of Public Health, Boston, MA 02115, USA
- J J Luykx
  - Department of Psychiatry, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
  - Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
  - Outpatient Second Opinion Clinic, GGNet Mental Health, Warnsveld, The Netherlands
- E C Schulte
  - Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
  - Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
- Y Lu
  - Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
27
Amador C, Zeng Y, Barber M, Walker RM, Campbell A, McIntosh AM, Evans KL, Porteous DJ, Hayward C, Wilson JF, Navarro P, Haley CS. Genome-wide methylation data improves dissection of the effect of smoking on body mass index. PLoS Genet 2021; 17:e1009750. [PMID: 34499657 PMCID: PMC8428545 DOI: 10.1371/journal.pgen.1009750] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 05/25/2021] [Accepted: 07/28/2021] [Indexed: 11/18/2022] Open
Abstract
Variation in obesity-related traits has a genetic basis with heritabilities between 40 and 70%. While the global obesity pandemic is usually associated with environmental changes related to lifestyle and socioeconomic changes, most genetic studies do not include all relevant environmental covariates, so the genetic contribution to variation in obesity-related traits cannot be accurately assessed. Some studies have described interactions between a few individual genes linked to obesity and environmental variables but there is no agreement on their total contribution to differences between individuals. Here we compared self-reported smoking data and a methylation-based proxy to explore the effect of smoking and genome-by-smoking interactions on obesity related traits from a genome-wide perspective to estimate the amount of variance they explain. Our results indicate that exploiting omic measures can improve models for complex traits such as obesity and can be used as a substitute for, or jointly with, environmental records to better understand causes of disease.
Affiliation(s)
- Carmen Amador
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- Yanni Zeng
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
  - Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, China
- Michael Barber
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- Rosie M. Walker
  - Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
  - Centre for Clinical Brain Sciences, Chancellor’s Building, 49 Little France Crescent, Edinburgh BioQuarter, Edinburgh, United Kingdom
- Archie Campbell
  - Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- Andrew M. McIntosh
  - Division of Psychiatry, University of Edinburgh, Edinburgh, United Kingdom
- Kathryn L. Evans
  - Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- David J. Porteous
  - Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- Caroline Hayward
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- James F. Wilson
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
  - Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, United Kingdom
- Pau Navarro
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
- Chris S. Haley
  - MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, United Kingdom
  - Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
28
Shin J, Lee SH. GxEsum: a novel approach to estimate the phenotypic variance explained by genome-wide GxE interaction based on GWAS summary statistics for biobank-scale data. Genome Biol 2021; 22:183. [PMID: 34154633 PMCID: PMC8218431 DOI: 10.1186/s13059-021-02403-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 07/01/2020] [Accepted: 06/04/2021] [Indexed: 12/14/2022] Open
Abstract
Genetic variation in response to the environment, that is, genotype-by-environment interaction (GxE), is fundamental in the biology of complex traits and diseases. However, existing methods are computationally demanding and infeasible to handle biobank-scale data. Here, we introduce GxEsum, a method for estimating the phenotypic variance explained by genome-wide GxE based on GWAS summary statistics. Through comprehensive simulations and analysis of UK Biobank with 288,837 individuals, we show that GxEsum can handle a large-scale biobank dataset with controlled type I error rates and unbiased GxE estimates, and its computational efficiency can be hundreds of times higher than existing GxE methods.
Affiliation(s)
- Jisu Shin
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
  - UniSA: Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
- Sang Hong Lee
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
  - UniSA: Allied Health and Human Performance, University of South Australia, Adelaide, SA, 5000, Australia
29
Akimova ET, Breen R, Brazel DM, Mills MC. Gene-environment dependencies lead to collider bias in models with polygenic scores. Sci Rep 2021; 11:9457. [PMID: 33947934 PMCID: PMC8097011 DOI: 10.1038/s41598-021-89020-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Received: 12/07/2020] [Accepted: 04/20/2021] [Indexed: 11/09/2022] Open
Abstract
The application of polygenic scores has transformed our ability to investigate whether and how genetic and environmental factors jointly contribute to the variation of complex traits. Modelling the complex interplay between genes and environment, however, raises serious methodological challenges. Here we illustrate the largely unrecognised impact of gene-environment dependencies on the identification of the effects of genes and their variation across environments. We show that controlling for heritable covariates in regression models that include polygenic scores as independent variables introduces endogenous selection bias when one or more of these covariates depends on unmeasured factors that also affect the outcome. This results in the problem of conditioning on a collider, which in turn leads to spurious associations and effect sizes. Using graphical and simulation methods we demonstrate that the degree of bias depends on the strength of the gene-covariate correlation and of hidden heterogeneity linking covariates with outcomes, regardless of whether the main analytic focus is mediation, confounding, or gene × covariate (commonly gene × environment) interactions. We offer potential solutions, highlighting the importance of causal inference. We also urge further caution when fitting and interpreting models with polygenic scores and non-exogenous environments or phenotypes and demonstrate how spurious associations are likely to arise, advancing our understanding of such results.
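The collider mechanism described in this abstract can be reproduced in a few lines of simulation. The Python sketch below is a toy model of our own, not the authors' simulation design: a polygenic score g has no effect on the outcome y, a heritable covariate c depends on g and on an unmeasured factor u, and u also affects y. All variable names, unit variances, the sample size, and the seed are assumptions chosen for illustration.

```python
import random


def cov(x, y):
    """Sample covariance of two equal-length lists."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (len(x) - 1)


def simulate(n=20000, seed=1):
    """Return the slope of y on g without and with adjustment for the collider c."""
    rng = random.Random(seed)
    g = [rng.gauss(0, 1) for _ in range(n)]                   # polygenic score, no effect on y
    u = [rng.gauss(0, 1) for _ in range(n)]                   # unmeasured factor
    c = [gi + ui + rng.gauss(0, 1) for gi, ui in zip(g, u)]   # heritable covariate (collider)
    y = [ui + rng.gauss(0, 1) for ui in u]                    # outcome depends on u only
    # Simple regression of y on g.
    marginal = cov(g, y) / cov(g, g)
    # Coefficient of g in the two-predictor regression of y on g and c.
    adjusted = (cov(g, y) * cov(c, c) - cov(c, y) * cov(g, c)) / (
        cov(g, g) * cov(c, c) - cov(g, c) ** 2
    )
    return marginal, adjusted


marginal, adjusted = simulate()
print(round(marginal, 3), round(adjusted, 3))
```

With these unit variances, the unadjusted slope is zero in expectation, whereas adjusting for c yields an expected coefficient of -0.5 on g even though g has no causal effect on y, which is the spurious association the abstract warns about.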
Affiliation(s)
- Evelina T Akimova
  - Department of Sociology, University of Oxford, Oxford, OX1 1JD, UK; Leverhulme Centre for Demographic Science, University of Oxford, Oxford, OX1 1JD, UK
- Richard Breen
  - Department of Sociology, University of Oxford, Oxford, OX1 1JD, UK; Nuffield College, University of Oxford, Oxford, OX1 1NF, UK
- David M Brazel
  - Leverhulme Centre for Demographic Science, University of Oxford, Oxford, OX1 1JD, UK; Nuffield College, University of Oxford, Oxford, OX1 1NF, UK
- Melinda C Mills
  - Leverhulme Centre for Demographic Science, University of Oxford, Oxford, OX1 1JD, UK; Nuffield College, University of Oxford, Oxford, OX1 1NF, UK
30
Chung Y, Lee SH, Lee HK, Lim D, van der Werf J, Lee SH. THI Modulation of Genetic and Non-genetic Variance Components for Carcass Traits in Hanwoo Cattle. Front Genet 2021; 11:576377. [PMID: 33424920 PMCID: PMC7786192 DOI: 10.3389/fgene.2020.576377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2020] [Accepted: 11/25/2020] [Indexed: 11/15/2022] Open
Abstract
The phenotypes of carcass traits in beef cattle are affected by random genetic and non-genetic effects, both of which can be modulated by an environmental variable such as the Temperature-Humidity Index (THI), a key environmental factor in cattle production. In this study, a multivariate reaction norm model (MRNM) was used to assess whether the random genetic and non-genetic (i.e., residual) effects of carcass weight (CW), back fat thickness (BFT), eye muscle area (EMA), and marbling score (MS) were modulated by THI, using 9,318 Hanwoo steers (N = 8,964) and cows (N = 354) that were genotyped on the Illumina Bovine SNP50 BeadChip (50K). THI was measured over the period of 15–45 days before slaughter. Both the correlation and the interaction between THI and random genetic and non-genetic effects were accounted for in the model. The analyses showed that the genetic effects of EMA and the non-genetic effects of CW and MS were significantly modulated by THI. No significant THI modulation of such effects was found for BFT. These results highlight the relevance of THI changes for the genetic and non-genetic variation of CW, EMA, and MS in Hanwoo beef cattle. Importantly, heritability estimates for CW, EMA, and MS from additive models that did not consider THI interactions were underestimated. Moreover, the significance of the interaction can be biased if the correlation between THI and genetic and non-genetic effects is not properly accounted for. Thus, we argue that the estimation of genetic parameters should be based on appropriate models to avoid any potential bias of estimates. Our findings should serve as a basis for future studies aiming at revealing genotype by environment interaction in estimation and genomic prediction of breeding values.
Affiliation(s)
- Yoonji Chung
  - Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
- Seung Hwan Lee
  - Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
- Hak-Kyo Lee
  - Department of Animal Biotechnology, Chonbuk National University, Jeonju, South Korea
- Dajeong Lim
  - Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
- Julius van der Werf
  - School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
- S Hong Lee
  - Australian Centre for Precision Health, University of South Australia, Adelaide, SA, Australia; UniSA Allied Health and Human Performance, University of South Australia, Adelaide, SA, Australia
31
Arjas A, Hauptmann A, Sillanpää MJ. Estimation of dynamic SNP-heritability with Bayesian Gaussian process models. Bioinformatics 2020; 36:3795-3802. [PMID: 32186692 PMCID: PMC7672693 DOI: 10.1093/bioinformatics/btaa199] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 10/16/2019] [Revised: 03/10/2020] [Accepted: 03/17/2020] [Indexed: 11/23/2022] Open
Abstract
Motivation: Improved DNA technology has made it practical to estimate single-nucleotide polymorphism (SNP)-heritability among distantly related individuals with unknown relationships. For growth- and development-related traits, it is meaningful to base SNP-heritability estimation on longitudinal data due to the time-dependency of the process. However, only a few statistical methods have been developed so far for estimating dynamic SNP-heritability and quantifying its full uncertainty. Results: We introduce a completely tuning-free Bayesian Gaussian process (GP)-based approach for estimating dynamic variance components and heritability as their function. For parameter estimation, we use a modern Markov Chain Monte Carlo method which allows full uncertainty quantification. Several datasets are analysed and our results clearly illustrate that the 95% credible intervals of the proposed joint estimation method (which ‘borrows strength’ from adjacent time points) are significantly narrower than those of a two-stage baseline method that first estimates the variance components at each time point independently and then performs smoothing. We compare the method with a random regression model using MTG2 and BLUPF90 software, and quantitative measures indicate superior performance of our method. Results are presented for simulated and real data with up to 1000 time points. Finally, we demonstrate scalability of the proposed method for simulated data with tens of thousands of individuals. Availability and implementation: The C++ implementation dynBGP and simulated data are available in GitHub: https://github.com/aarjas/dynBGP. The programmes can be run in R. Real datasets are available in QTL archive: https://phenome.jax.org/centers/QTLA. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Arttu Arjas
  - Research Unit of Mathematical Sciences, University of Oulu, Oulu FI-90014, Finland
- Andreas Hauptmann
  - Research Unit of Mathematical Sciences, University of Oulu, Oulu FI-90014, Finland; Department of Computer Science, University College London, London WC1E 6BT, UK
- Mikko J Sillanpää
  - Research Unit of Mathematical Sciences, University of Oulu, Oulu FI-90014, Finland; Infotech Oulu, University of Oulu, Oulu FI-90014, Finland
|
32
|
de Souza MH, Pereira Júnior JD, Steckling SDM, Mencalha J, Dias FDS, Rocha JRDASDC, Carneiro PCS, Carneiro JEDS. Adaptability and stability analyses of plants using random regression models. PLoS One 2020; 15:e0233200. [PMID: 33264283 PMCID: PMC7710123 DOI: 10.1371/journal.pone.0233200] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 04/29/2020] [Accepted: 11/14/2020] [Indexed: 11/26/2022]
Abstract
The evaluation of cultivars using multi-environment trials (MET) is an important step in plant breeding programs. One of the objectives of these evaluations is to understand the genotype by environment interaction (GEI). A method of determining the effect of GEI on the performance of cultivars is based on studies of adaptability and stability. Initial studies were based on linear regression; however, these methodologies have limitations, mainly in trials with genetic or statistical imbalance, heterogeneity of residual variances, and genetic covariance. An alternative would be the use of random regression models (RRM), in which the behavior of the genotypes is characterized as a reaction norm using longitudinal data or repeated measurements and information regarding a covariance function. The objective of this work was the application of RRM in the study of the behavior of common bean cultivars using a MET, based on Legendre polynomials and genotype-ideotype distances. We used a set of 13 trials, which were classified as unfavorable or favorable environments. The results revealed that RRM enables the prediction, with high accuracy, of the genotypic values of cultivars in environments where they were not evaluated, thereby circumventing the imbalance of the experiments. From these values, it was possible to measure genotypic adaptability relative to ideotypes, based on their reaction norms. In addition, the stability of the cultivars can be interpreted as variation in the behavior of the ideotype. The use of ideotypes based on real data allowed a better comparison of the performance of cultivars across environments. The use of RRM in plant breeding is a good alternative for understanding the behavior of cultivars in a MET, especially when the aim is to quantify the adaptability and stability of genotypes.
Affiliation(s)
- Jussara Mencalha
  - Departamento de Agronomia, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
|
33
|
Yu C, Ni G, van der Werf J, Lee SH. Detecting Genotype-Population Interaction Effects by Ancestry Principal Components. Front Genet 2020; 11:379. [PMID: 32373165 PMCID: PMC7186421 DOI: 10.3389/fgene.2020.00379] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 11/04/2019] [Accepted: 03/27/2020] [Indexed: 01/22/2023]
Abstract
Heterogeneity in the phenotypic mean and variance across populations is often observed for complex traits. One way to understand heterogeneous phenotypes lies in uncovering heterogeneity in genetic effects. Previous studies on genetic heterogeneity across populations were typically based on discrete groups in populations stratified by country or cohort, which ignored differences in population characteristics among the individuals within each group and resulted in loss of information. Here, we introduce a novel concept of genotype-by-population (G × P) interaction where population is defined by the first and second ancestry principal components (PCs), which are less likely to be confounded with country/cohort-specific factors. We applied a reaction norm model fitting each of 70 complex traits with significant SNP-heritability and the PCs as covariates to examine G × P interactions across diverse populations including white British and other white Europeans from the UK Biobank (N = 22,229). Our results demonstrated significant population genetic heterogeneity for behavioral traits such as age at first sexual intercourse and academic qualification. Our approach may shed light on the latent genetic architecture of complex traits that underlies the modulation of genetic effects across different populations.
Affiliation(s)
- Chenglong Yu
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
  - College of Medicine and Public Health, Flinders University, Bedford Park, SA, Australia
  - South Australian Health and Medical Research Institute, Adelaide, SA, Australia
- Guiyan Ni
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
  - Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
  - School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
- Julius van der Werf
  - School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
- S. Hong Lee
  - Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, Australia
  - South Australian Health and Medical Research Institute, Adelaide, SA, Australia
|
34
|
Zhou X, van der Werf J, Carson-Chahhoud K, Ni G, McGrath J, Hyppönen E, Lee SH. Whole-Genome Approach Discovers Novel Genetic and Nongenetic Variance Components Modulated by Lifestyle for Cardiovascular Health. J Am Heart Assoc 2020; 9:e015661. [PMID: 32308100 PMCID: PMC7428517 DOI: 10.1161/jaha.119.015661] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Indexed: 02/06/2023]
Abstract
Background: Both genetic and nongenetic factors can predispose individuals to cardiovascular risk. Finding ways to alter these predispositions is important for cardiovascular disease prevention. Methods and Results: We used a novel whole-genome approach to estimate the genetic and nongenetic effects on (and hence their predispositions to) cardiovascular risk and determined whether they vary with respect to lifestyle factors such as physical activity, smoking, alcohol consumption, and dietary intake. We performed analyses on the ARIC (Atherosclerosis Risk in Communities) Study (N=6896–7180) and validated findings using the UKBB (UK Biobank, N=14 076–34 538). Lifestyle modulation was evident for many cardiovascular traits such as body mass index and resting heart rate. For example, alcohol consumption modulated both genetic and nongenetic effects on body mass index, whereas smoking modulated nongenetic effects on heart rate, pulse pressure, and white blood cell count. We also stratified individuals according to estimated genetic and nongenetic effects that are modulated by lifestyle factors and showed distinct phenotype–lifestyle relationships across the stratified groups. Finally, we showed that neglecting lifestyle modulations of cardiovascular traits would on average reduce single nucleotide polymorphism heritability estimates of these traits by a small yet significant amount, primarily owing to the overestimation of residual variance. Conclusions: Lifestyle changes are relevant to cardiovascular disease prevention. Individual differences in the genetic and nongenetic effects that are modulated by lifestyle factors, as shown by the stratified group analyses, imply a need for personalized lifestyle interventions. In addition, single nucleotide polymorphism-based heritability of cardiovascular traits could be underestimated if lifestyle modulations are not accounted for.
Affiliation(s)
- Xuan Zhou
  - Australian Centre for Precision Health University of South Australia Adelaide South Australia Australia; South Australian Health and Medical Research Institute Adelaide South Australia Australia
- Julius van der Werf
  - School of Environmental and Rural Science University of New England Armidale New South Wales Australia
- Kristin Carson-Chahhoud
  - Australian Centre for Precision Health University of South Australia Adelaide South Australia Australia
- Guiyan Ni
  - School of Environmental and Rural Science University of New England Armidale New South Wales Australia; Institute for Molecular Bioscience University of Queensland Brisbane Queensland Australia
- John McGrath
  - Queensland Brain Institute University of Queensland Brisbane Queensland Australia; Queensland Centre for Mental Health Research The Park Centre for Mental Health Wacol Queensland Australia
- Elina Hyppönen
  - Australian Centre for Precision Health University of South Australia Adelaide South Australia Australia; South Australian Health and Medical Research Institute Adelaide South Australia Australia
- S Hong Lee
  - Australian Centre for Precision Health University of South Australia Adelaide South Australia Australia; South Australian Health and Medical Research Institute Adelaide South Australia Australia
|
35
|
Dahl A, Nguyen K, Cai N, Gandal MJ, Flint J, Zaitlen N. A Robust Method Uncovers Significant Context-Specific Heritability in Diverse Complex Traits. Am J Hum Genet 2020; 106:71-91. [PMID: 31901249 PMCID: PMC7042488 DOI: 10.1016/j.ajhg.2019.11.015] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Received: 06/20/2019] [Accepted: 11/26/2019] [Indexed: 02/08/2023]
Abstract
Gene-environment interactions (GxE) can be fundamental in applications ranging from functional genomics to precision medicine and are a conjectured source of substantial heritability. However, unbiased methods to profile GxE genome-wide are nascent and, as we show, cannot accommodate general environment variables, modest sample sizes, heterogeneous noise, and binary traits. To address this gap, we propose a simple, unifying mixed model for gene-environment interaction (GxEMM). In simulations and theory, we show that GxEMM can dramatically improve estimates and eliminate false positives when the assumptions of existing methods fail. We apply GxEMM to a range of human and model organism datasets and find broad evidence of context-specific genetic effects, including GxSex, GxAdversity, and GxDisease interactions across thousands of clinical and molecular phenotypes. Overall, GxEMM is broadly applicable for testing and quantifying polygenic interactions, which can be useful for explaining heritability and invaluable for determining biologically relevant environments.
Affiliation(s)
- Andy Dahl
  - Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Medicine, University of California San Francisco, San Francisco, CA 94158, USA
- Khiem Nguyen
  - Department of Medicine, University of California San Francisco, San Francisco, CA 94158, USA
- Na Cai
  - Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Michael J Gandal
  - Department of Psychiatry, Semel Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Jonathan Flint
  - Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Noah Zaitlen
  - Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Medicine, University of California San Francisco, San Francisco, CA 94158, USA
|
36
|
GWEHS: A Genome-Wide Effect Sizes and Heritability Screener. Genes (Basel) 2019; 10:genes10080558. [PMID: 31344961 PMCID: PMC6723621 DOI: 10.3390/genes10080558] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 06/10/2019] [Revised: 07/16/2019] [Accepted: 07/18/2019] [Indexed: 11/17/2022]
Abstract
During the last decade, Genome-Wide Association Studies (GWAS) have expanded enormously, and thousands of loci associated with complex traits have been detected. These efforts have led to the creation of public databases of GWAS results, making a huge source of information available on the genetic background of many diverse traits. Here we present GWEHS (Genome-Wide Effect size and Heritability Screener), an open-source online application to screen loci associated with human complex traits and diseases from the NHGRI-EBI GWAS Catalog. This application provides a way to explore the distribution of effect sizes of loci affecting these traits, as well as their contribution to heritability. Furthermore, it allows predictions to be made of the change in the expected mean effect size, and in the heritability, as new loci are found. The application enables inferences on whether the additive contribution of loci expected to be discovered in the future will be able to explain the estimates of familial heritability for the different traits. We illustrate the use of this tool, compare some of the results obtained with those from a previous meta-analysis, and discuss its uses and limitations.
|
37
|
López-Cortegano E, Caballero A. Inferring the Nature of Missing Heritability in Human Traits Using Data from the GWAS Catalog. Genetics 2019; 212:891-904. [PMID: 31123044 PMCID: PMC6614893 DOI: 10.1534/genetics.119.302077] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Received: 03/05/2019] [Accepted: 05/11/2019] [Indexed: 02/07/2023]
Abstract
Thousands of genes responsible for many diseases and other common traits in humans have been detected by Genome Wide Association Studies (GWAS) in the last decade. However, candidate causal variants found so far usually explain only a small fraction of the heritability estimated by family data. The most common explanation for this observation is that the missing heritability corresponds to variants, either rare or common, with very small effect, which pass undetected due to a lack of statistical power. We carried out a meta-analysis using data from the NHGRI-EBI GWAS Catalog in order to explore the observed distribution of locus effects for a set of 42 complex traits and to quantify their contribution to narrow-sense heritability. With the data at hand, we were able to predict the expected distribution of locus effects for 16 traits and diseases, their expected contribution to heritability, and the missing number of loci yet to be discovered to fully explain the familial heritability estimates. Our results indicate that, for 6 out of the 16 traits, the additive contribution of a great number of loci is unable to explain the familial (broad-sense) heritability, suggesting that the gap between GWAS and familial estimates of heritability may not ever be closed for these traits. In contrast, for the other 10 traits, the additive contribution of hundreds or thousands of loci yet to be found could potentially explain the familial heritability estimates. Computer simulations are used to illustrate the possible contribution from nonadditive genetic effects to the gap between GWAS and familial estimates of heritability.
Affiliation(s)
- Armando Caballero
  - Departamento de Bioquímica, Genética e Inmunología, Universidade de Vigo, 36310, Spain
|