1
|
Gu T, Taylor JM, Mukherjee B. A synthetic data integration framework to leverage external summary-level information from heterogeneous populations. Biometrics 2023; 79:3831-3845. [PMID: 36876883 PMCID: PMC10480346 DOI: 10.1111/biom.13852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 02/24/2023] [Indexed: 03/07/2023]
Abstract
There is a growing need for flexible general frameworks that integrate individual-level data with external summary information for improved statistical inference. External information relevant for a risk prediction model may come in multiple forms, through regression coefficient estimates or predicted values of the outcome variable. Different external models may use different sets of predictors and the algorithm they used to predict the outcome Y given these predictors may or may not be known. The underlying populations corresponding to each external model may be different from each other and from the internal study population. Motivated by a prostate cancer risk prediction problem where novel biomarkers are measured only in the internal study, this paper proposes an imputation-based methodology, where the goal is to fit a target regression model with all available predictors in the internal study while utilizing summary information from external models that may have used only a subset of the predictors. The method allows for heterogeneity of covariate effects across the external populations. The proposed approach generates synthetic outcome data in each external population, uses stacked multiple imputation to create a long dataset with complete covariate information. The final analysis of the stacked imputed data is conducted by weighted regression. This flexible and unified approach can improve statistical efficiency of the estimated coefficients in the internal study, improve predictions by utilizing even partial information available from models that use a subset of the full set of covariates used in the internal study, and provide statistical inference for the external population with potentially different covariate effects from the internal population.
Collapse
Affiliation(s)
- Tian Gu
- Department of Biostatistics, University of Michigan, Ann Arbor, U.S.A
| | | | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, U.S.A
| |
Collapse
|
2
|
Margaritte-Jeannin P, Vernet R, Budu-Aggrey A, Ege M, Madore AM, Linhard C, Mohamdi H, von Mutius E, Granell R, Demenais F, Laprise C, Bouzigon E, Dizier MH. TNS1 and NRXN1 Genes Interacting With Early-Life Smoking Exposure in Asthma-Plus-Eczema Susceptibility. ALLERGY, ASTHMA & IMMUNOLOGY RESEARCH 2023; 15:779-794. [PMID: 37957795 PMCID: PMC10643854 DOI: 10.4168/aair.2023.15.6.779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 05/15/2023] [Accepted: 06/13/2023] [Indexed: 11/15/2023]
Abstract
PURPOSE Numerous genes have been associated with allergic diseases (asthma, allergic rhinitis, and eczema), but they explain only part of their heritability. This is partly because most previous studies ignored complex mechanisms such as gene-environment (G-E) interactions and complex phenotypes such as co-morbidity. However, it was recently evidenced that the co-morbidity of asthma-plus-eczema appears as a sub-entity depending on specific genetic factors. Besides, evidence also suggest that gene-by-early life environmental tobacco smoke (ETS) exposure interactions play a role in asthma, but were never investigated for asthma-plus-eczema. To identify genetic variants interacting with ETS exposure that influence asthma-plus-eczema susceptibility. METHODS To conduct a genome-wide interaction study (GWIS) of asthma-plus-eczema according to ETS exposure, we applied a 2-stage strategy with a first selection of single nucleotide polymorphisms (SNPs) from genome-wide association meta-analysis to be tested at a second stage by interaction meta-analysis. All meta-analyses were conducted across 4 studies including a total of 5,516 European-ancestry individuals, of whom 1,164 had both asthma and eczema. RESULTS Two SNPs showed significant interactions with ETS exposure. They were located in 2 genes, NRXN1 (2p16) and TNS1 (2q35), never reported associated and/or interacting with ETS exposure for asthma, eczema or more generally for allergic diseases. TNS1 is a promising candidate gene because of its link to lung and skin diseases with possible interactive effect with tobacco smoke exposure. CONCLUSIONS This first GWIS of asthma-plus-eczema with ETS exposure underlines the importance of studying sub-phenotypes such as co-morbidities as well as G-E interactions to detect new susceptibility genes.
Collapse
Affiliation(s)
- Patricia Margaritte-Jeannin
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Raphaël Vernet
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Ashley Budu-Aggrey
- Medical Research Council (MRC) Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Markus Ege
- Dr von Hauner Children's Hospital, Ludwig Maximilian University; Institute of Asthma and Allergy prevention, Helmholtz Centre Munich; Comprehensive Pneumology Center Munich (CPC-M), German Center for Lung Research, Munich, Germany
| | - Anne-Marie Madore
- Département des sciences fondamentales, Centre intersectoriel en santé durable (CISD), Université du Québec à Chicoutimi, Saguenay, QC, Canada
| | - Christophe Linhard
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Hamida Mohamdi
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Erika von Mutius
- Dr von Hauner Children's Hospital, Ludwig Maximilian University; Institute of Asthma and Allergy prevention, Helmholtz Centre Munich; Comprehensive Pneumology Center Munich (CPC-M), German Center for Lung Research, Munich, Germany
| | - Raquell Granell
- Medical Research Council (MRC) Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Florence Demenais
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Cathrine Laprise
- Département des sciences fondamentales, Centre intersectoriel en santé durable (CISD), Université du Québec à Chicoutimi, Saguenay, QC, Canada
| | - Emmanuelle Bouzigon
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France
| | - Marie-Hélène Dizier
- Université Paris Cité, UMRS 1124, INSERM, Genomic Epidemiology and Multifactorial Diseases Group, Paris, France.
| |
Collapse
|
3
|
Lewinger JP, Kawaguchi ES, Gauderman WJ. A note on p-value multiple-testing adjustment for two-step genome-wide gene-environment interactions scans. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.06.27.23291946. [PMID: 37425767 PMCID: PMC10327251 DOI: 10.1101/2023.06.27.23291946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Two-step testing is the state-of-the art approach for performing genome-wide interaction scans (GWIS). It is computationally efficient and yields higher power than standard single-step-based GWIS for virtually all biologically plausible scenarios. However, while two-step tests control the genome-wide type I error rate at the desired level, the lack of associated valid p-values can make it difficult for users to compare with single step-results. We show how multiple-testing adjusted p-values can be defined for two-step test based on standard multiple-testing theory, and how they can be in turn scaled to make valid comparisons with single-step tests possible.
Collapse
Affiliation(s)
- Juan Pablo Lewinger
- Department of Population and Public Health Sciences, University of Southern California
| | - Eric S Kawaguchi
- Department of Population and Public Health Sciences, University of Southern California
| | - W James Gauderman
- Department of Population and Public Health Sciences, University of Southern California
| |
Collapse
|
4
|
Carreras-Torres R, Kim AE, Lin Y, Diez-Obrero V, Bien SA, Qu C, Wang J, Dimou N, Aglago EK, Albanes D, Arndt V, Baurley JW, Berndt SI, Bézieau S, Bishop DT, Bouras E, Brenner H, Budiarto A, Campbell PT, Casey G, Chan AT, Chang-Claude J, Chen X, Conti DV, Dampier CH, Devall MAM, Drew DA, Figueiredo JC, Gallinger S, Giles GG, Gruber SB, Gsur A, Gunter MJ, Harrison TA, Hidaka A, Hoffmeister M, Huyghe JR, Jenkins MA, Jordahl KM, Kawaguchi E, Keku TO, Kundaje A, Le Marchand L, Lewinger JP, Li L, Mahesworo B, Morrison JL, Murphy N, Nan H, Nassir R, Newcomb PA, Obón-Santacana M, Ogino S, Ose J, Pai RK, Palmer JR, Papadimitriou N, Pardamean B, Peoples AR, Pharoah PDP, Platz EA, Rennert G, Ruiz-Narvaez E, Sakoda LC, Scacheri PC, Schmit SL, Schoen RE, Shcherbina A, Slattery ML, Stern MC, Su YR, Tangen CM, Thomas DC, Tian Y, Tsilidis KK, Ulrich CM, van Duijnhoven FJB, Van Guelpen B, Visvanathan K, Vodicka P, Cenggoro TW, Weinstein SJ, White E, Wolk A, Woods MO, Hsu L, Peters U, Moreno V, Gauderman WJ. Genome-wide Interaction Study with Smoking for Colorectal Cancer Risk Identifies Novel Genetic Loci Related to Tumor Suppression, Inflammation, and Immune Response. Cancer Epidemiol Biomarkers Prev 2023; 32:315-328. [PMID: 36576985 PMCID: PMC9992283 DOI: 10.1158/1055-9965.epi-22-0763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 09/19/2022] [Accepted: 12/19/2022] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Tobacco smoking is an established risk factor for colorectal cancer. However, genetically defined population subgroups may have increased susceptibility to smoking-related effects on colorectal cancer. METHODS A genome-wide interaction scan was performed including 33,756 colorectal cancer cases and 44,346 controls from three genetic consortia. RESULTS Evidence of an interaction was observed between smoking status (ever vs. never smokers) and a locus on 3p12.1 (rs9880919, P = 4.58 × 10-8), with higher associated risk in subjects carrying the GG genotype [OR, 1.25; 95% confidence interval (CI), 1.20-1.30] compared with the other genotypes (OR <1.17 for GA and AA). Among ever smokers, we observed interactions between smoking intensity (increase in 10 cigarettes smoked per day) and two loci on 6p21.33 (rs4151657, P = 1.72 × 10-8) and 8q24.23 (rs7005722, P = 2.88 × 10-8). Subjects carrying the rs4151657 TT genotype showed higher risk (OR, 1.12; 95% CI, 1.09-1.16) compared with the other genotypes (OR <1.06 for TC and CC). Similarly, higher risk was observed among subjects carrying the rs7005722 AA genotype (OR, 1.17; 95% CI, 1.07-1.28) compared with the other genotypes (OR <1.13 for AC and CC). Functional annotation revealed that SNPs in 3p12.1 and 6p21.33 loci were located in regulatory regions, and were associated with expression levels of nearby genes. Genetic models predicting gene expression revealed that smoking parameters were associated with lower colorectal cancer risk with higher expression levels of CADM2 (3p12.1) and ATF6B (6p21.33). CONCLUSIONS Our study identified novel genetic loci that may modulate the risk for colorectal cancer of smoking status and intensity, linked to tumor suppression and immune response. IMPACT These findings can guide potential prevention treatments.
Collapse
Affiliation(s)
- Robert Carreras-Torres
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
- Oncology Data Analytics Program, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain
- Digestive Diseases and Microbiota Group, Girona Biomedical Research Institute (IDIBGI), Salt, 17190, Girona, Spain
| | - Andre E Kim
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Yi Lin
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Virginia Diez-Obrero
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
- Oncology Data Analytics Program, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - Stephanie A Bien
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Conghui Qu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Jun Wang
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Niki Dimou
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Elom K Aglago
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Demetrius Albanes
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Volker Arndt
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - James W Baurley
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Stéphane Bézieau
- Service de Génétique Médicale, Centre Hospitalier Universitaire (CHU) Nantes, Nantes, France
| | - D Timothy Bishop
- Leeds Institute of Cancer and Pathology, University of Leeds, Leeds, UK
| | - Emmanouil Bouras
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany
- German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Arif Budiarto
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - Peter T Campbell
- Behavioral and Epidemiology Research Group, American Cancer Society, Atlanta, Georgia, USA
| | - Graham Casey
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, Virginia, USA
| | - Andrew T Chan
- Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Xuechen Chen
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - David V Conti
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Christopher H Dampier
- Department of General Surgery, University of Virginia School of Medicine, Charlottesville, Virginia, USA
| | - Matthew AM Devall
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, Virginia, USA
| | - David A Drew
- Clinical & Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Jane C Figueiredo
- Department of Medicine, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Steven Gallinger
- Lunenfeld Tanenbaum Research Institute, Mount Sinai Hospital, University of Toronto, Toronto, Ontario, Canada
| | - Graham G Giles
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, Victoria, Australia
| | - Stephen B Gruber
- Department of Medical Oncology & Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Andrea Gsur
- Institute of Cancer Research, Department of Medicine I, Medical University Vienna, Vienna, Austria
| | - Marc J Gunter
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Tabitha A Harrison
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Akihisa Hidaka
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Michael Hoffmeister
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jeroen R Huyghe
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Mark A Jenkins
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Victoria, Australia
| | - Kristina M Jordahl
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Eric Kawaguchi
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Temitope O Keku
- Center for Gastrointestinal Biology and Disease, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Anshul Kundaje
- Department of Genetics, Department of Computer Science, Stanford University, Stanford, California, USA
| | | | - Juan Pablo Lewinger
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Li Li
- Department of Family Medicine, University of Virginia, Charlottesville, Virginia, USA
| | - Bharuno Mahesworo
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - John L Morrison
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Neil Murphy
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Hongmei Nan
- Department of Epidemiology, Richard M. Fairbanks School of Public Health, Indianapolis, Indiana, USA
| | - Rami Nassir
- Department of Pathology, School of Medicine, Umm Al-Qura’a University, Saudi Arabia
| | - Polly A Newcomb
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Mireia Obón-Santacana
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
- Oncology Data Analytics Program, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - Shuji Ogino
- Program in MPE Molecular Pathological Epidemiology, Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA; Department of Oncologic Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA; Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Jennifer Ose
- Huntsman Cancer Institute, Salt Lake City, Utah, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah, USA
| | - Rish K Pai
- Department of Laboratory Medicine and Pathology, Mayo Clinic Arizona, Scottsdale, Arizona, USA
| | - Julie R Palmer
- Slone Epidemiology Center at Boston University, Boston, MA, USA
| | - Nikos Papadimitriou
- Nutrition and Metabolism Branch, International Agency for Research on Cancer, Lyon, France
| | - Bens Pardamean
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | | | - Paul D P Pharoah
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Elizabeth A Platz
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Gad Rennert
- Department of Community Medicine and Epidemiology, Lady Davis Carmel Medical Center, Haifa, Israel
| | - Edward Ruiz-Narvaez
- Department of Nutritional Sciences, University of Michigan School of Public Health, Ann Arbor, Michigan, USA
| | - Lori C Sakoda
- Division of Research, Kaiser Permanente Northern California, Oakland, California, USA
| | - Peter C Scacheri
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Stephanie L Schmit
- Genomic Medicine Institute, Cleveland Clinic, Cleveland, Ohio, USA
- Population and Cancer Prevention Program, Case Comprehensive Cancer Center, Cleveland, Ohio, USA
| | - Robert E Schoen
- Department of Medicine and Epidemiology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, USA
| | - Anna Shcherbina
- Biomedical Informatics Program, Dept. of Biomedical Data Sciences, Stanford University
| | - Martha L Slattery
- Department of Internal Medicine, University of Utah, Salt Lake City, Utah, USA
| | - Mariana C Stern
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Yu-Ru Su
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Catherine M Tangen
- SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Duncan C Thomas
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Yu Tian
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- School of Public Health, Capital Medical University, Beijing, China
| | - Konstantinos K Tsilidis
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
- Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece
| | - Cornelia M Ulrich
- Huntsman Cancer Institute, Salt Lake City, Utah, USA
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah, USA
| | - Franzel JB van Duijnhoven
- Division of Human Nutrition and Health, Wageningen University & Research, Wageningen, The Netherlands
| | - Bethany Van Guelpen
- Department of Radiation Sciences, Oncology Unit, Umeå University, Umeå, Sweden
- Wallenberg Centre for Molecular Medicine, Umeå University, Umeå, Sweden
| | - Kala Visvanathan
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Pavel Vodicka
- Department of Molecular Biology of Cancer, Institute of Experimental Medicine of the Czech Academy of Sciences, Prague, and Biomedical Center, Medical Faculty, Pilsen, Czech Republic
| | - Tjeng Wawan Cenggoro
- Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia
| | - Stephanie J Weinstein
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Emily White
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Alicja Wolk
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Michael O Woods
- Memorial University of Newfoundland, Discipline of Genetics, St. John's, Canada
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Ulrike Peters
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- School of Public Health, University of Washington, Seattle, Washington, USA
| | - Victor Moreno
- Colorectal Cancer Group, ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L'Hospitalet de Llobregat, Barcelona, Spain
- Oncology Data Analytics Program, Catalan Institute of Oncology, L'Hospitalet de Llobregat, Barcelona, Spain
- Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain
| | - W James Gauderman
- Division of Biostatistics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
5
|
Kawaguchi ES, Kim AE, Pablo Lewinger J, Gauderman WJ. Improved two-step testing of genome-wide gene-environment interactions. Genet Epidemiol 2023; 47:152-166. [PMID: 36571162 PMCID: PMC9974838 DOI: 10.1002/gepi.22509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 10/13/2022] [Accepted: 11/11/2022] [Indexed: 12/27/2022]
Abstract
Two-step tests for gene-environment (G × E $G\times E$ ) interactions exploit marginal single-nucleotide polymorphism (SNP) effects to improve the power of a genome-wide interaction scan. They combine a screening step based on marginal effects used to "bin" SNPs for weighted hypothesis testing in the second step to deliver greater power over single-step tests while preserving the genome-wide Type I error. However, the presence of many SNPs with detectable marginal effects on the trait of interest can reduce power by "displacing" true interactions with weaker marginal effects and by adding to the number of tests that need to be corrected for multiple testing. We introduce a new significance-based allocation into bins for Step-2G × E $G\times E$ testing that overcomes the displacement issue and propose a computationally efficient approach to account for multiple testing within bins. Simulation results demonstrate that these simple improvements can provide substantially greater power than current methods under several scenarios. An application to a multistudy collaboration for understanding colorectal cancer reveals a G × Sex interaction located near the SMAD7 gene.
Collapse
Affiliation(s)
- Eric S. Kawaguchi
- Department of Population and Public Health Sciences, University of Southern California, California, USA
| | - Andre E. Kim
- Department of Population and Public Health Sciences, University of Southern California, California, USA
| | - Juan Pablo Lewinger
- Department of Population and Public Health Sciences, University of Southern California, California, USA
| | - W. James Gauderman
- Department of Population and Public Health Sciences, University of Southern California, California, USA
| |
Collapse
|
6
|
Hecker J, Prokopenko D, Moll M, Lee S, Kim W, Qiao D, Voorhies K, Kim W, Vansteelandt S, Hobbs BD, Cho MH, Silverman EK, Lutz SM, DeMeo DL, Weiss ST, Lange C. A robust and adaptive framework for interaction testing in quantitative traits between multiple genetic loci and exposure variables. PLoS Genet 2022; 18:e1010464. [PMID: 36383614 PMCID: PMC9668174 DOI: 10.1371/journal.pgen.1010464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 10/04/2022] [Indexed: 11/17/2022] Open
Abstract
The identification and understanding of gene-environment interactions can provide insights into the pathways and mechanisms underlying complex diseases. However, testing for gene-environment interaction remains a challenge since a.) statistical power is often limited and b.) modeling of environmental effects is nontrivial and such model misspecifications can lead to false positive interaction findings. To address the lack of statistical power, recent methods aim to identify interactions on an aggregated level using, for example, polygenic risk scores. While this strategy can increase the power to detect interactions, identifying contributing genes and pathways is difficult based on these relatively global results. Here, we propose RITSS (Robust Interaction Testing using Sample Splitting), a gene-environment interaction testing framework for quantitative traits that is based on sample splitting and robust test statistics. RITSS can incorporate sets of genetic variants and/or multiple environmental factors. Based on the user's choice of statistical/machine learning approaches, a screening step selects and combines potential interactions into scores with improved interpretability. In the testing step, the application of robust statistics minimizes the susceptibility to main effect misspecifications. Using extensive simulation studies, we demonstrate that RITSS controls the type 1 error rate in a wide range of scenarios, and we show how the screening strategy influences statistical power. In an application to lung function phenotypes and human height in the UK Biobank, RITSS identified highly significant interactions based on subcomponents of genetic risk scores. While the contributing single variant interaction signals are weak, our results indicate interaction patterns that result in strong aggregated effects, providing potential insights into underlying gene-environment interaction mechanisms.
Collapse
Affiliation(s)
- Julian Hecker
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Dmitry Prokopenko
- Harvard Medical School, Boston, Massachusetts, United States of America
- Genetics and Aging Unit and McCance Center for Brain Health, Department of Neurology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Matthew Moll
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
| | - Sanghun Lee
- Department of Medical Consilience, Division of Medicine, Graduate School, Dankook University, Yongin, South Korea
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| | - Wonji Kim
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Dandi Qiao
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Kirsten Voorhies
- Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Population Medicine, PRecisiOn Medicine Translational Research (PROMoTeR) Center, Harvard Pilgrim Health Care, Boston, Massachusetts, United States of America
| | - Woori Kim
- Harvard Medical School, Boston, Massachusetts, United States of America
- Systems Biology and Computer Science Program, Ann Romney Center for Neurological Diseases, Department of Neurology, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Stijn Vansteelandt
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Gent, Belgium
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Brian D. Hobbs
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
| | - Michael H. Cho
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
| | - Edwin K. Silverman
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
| | - Sharon M. Lutz
- Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
- Department of Population Medicine, PRecisiOn Medicine Translational Research (PROMoTeR) Center, Harvard Pilgrim Health Care, Boston, Massachusetts, United States of America
| | - Dawn L. DeMeo
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Scott T. Weiss
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
| | - Christoph Lange
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America
| |
Collapse
|
7
|
Kerick M, Acosta-Herrera M, Simeón-Aznar CP, Callejas JL, Assassi S, Proudman SM, Nikpour M, Hunzelmann N, Moroncini G, de Vries-Bouwstra JK, Orozco G, Barton A, Herrick AL, Terao C, Allanore Y, Fonseca C, Alarcón-Riquelme ME, Radstake TRDJ, Beretta L, Denton CP, Mayes MD, Martin J. Complement component C4 structural variation and quantitative traits contribute to sex-biased vulnerability in systemic sclerosis. NPJ Genom Med 2022; 7:57. [PMID: 36198672 PMCID: PMC9534873 DOI: 10.1038/s41525-022-00327-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 09/22/2022] [Indexed: 11/16/2022] Open
Abstract
Copy number (CN) polymorphisms of complement C4 play distinct roles in many conditions, including immune-mediated diseases. We investigated the association of C4 CN with systemic sclerosis (SSc) risk. Imputed total C4, C4A, C4B, and HERV-K CN were analyzed in 26,633 individuals and validated in an independent cohort. Our results showed that higher C4 CN confers protection to SSc, and deviations from CN parity of C4A and C4B augmented risk. The protection contributed per copy of C4A and C4B differed by sex. Stronger protection was afforded by C4A in men and by C4B in women. C4 CN correlated well with its gene expression and serum protein levels, and less C4 was detected for both in SSc patients. Conditioned analysis suggests that C4 genetics strongly contributes to the SSc association within the major histocompatibility complex locus and highlights classical alleles and amino acid variants of HLA-DRB1 and HLA-DPB1 as C4-independent signals.
Collapse
Affiliation(s)
- Martin Kerick
- Department of Cell Biology and Immunology, Institute of Parasitology and Biomedicine López-Neyra, CSIC, Granada, Spain.
| | - Marialbert Acosta-Herrera
- Department of Cell Biology and Immunology, Institute of Parasitology and Biomedicine López-Neyra, CSIC, Granada, Spain.
- Systemic Autoimmune Disease Unit, Hospital Clínico San Cecilio, Instituto de Investigación Biosanitaria Ibs. GRANADA, Granada, Spain.
| | | | | | - Shervin Assassi
- Department of Rheumatology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Susanna M Proudman
- Rheumatology Unit, Royal Adelaide Hospital and University of Adelaide, Adelaide, SA, Australia
| | - Mandana Nikpour
- The University of Melbourne at St. Vincent's Hospital, Melbourne, VIC, Australia
| | | | - Gianluca Moroncini
- Department of Clinical and Molecular Science, Università Politecnica delle Marche e Ospedali Riuniti, Ancona, Italy
| | | | - Gisela Orozco
- Center for Genetics and Genomics Versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK
- NIHR Manchester Biomedical Research Center, Manchester University NHS Foundation Trust, Manchester, Greater Manchester, UK
| | - Anne Barton
- Center for Genetics and Genomics Versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK
- NIHR Manchester Biomedical Research Center, Manchester University NHS Foundation Trust, Manchester, Greater Manchester, UK
| | - Ariane L Herrick
- Division of Musculoskeletal and Dermatological Sciences, The University of Manchester, Northern care Alliance NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Yannick Allanore
- Department of Rheumatology A, Hospital Cochin, Paris, Île-de-France, France
| | - Carmen Fonseca
- Center for Rheumatology, Royal Free and University College Medical School, London, UK
| | - Marta Eugenia Alarcón-Riquelme
- Center for Genomics and Oncological Research (GENYO), Pfizer-University of Granada-Andalusian Regional Government, Granada, Spain
| | - Timothy R D J Radstake
- Department of Rheumatology and Clinical Immunology, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Lorenzo Beretta
- Referral Center for Systemic Autoimmune Diseases, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico di Milano, Milan, Italy
| | - Christopher P Denton
- Center for Rheumatology, Royal Free and University College Medical School, London, UK
| | - Maureen D Mayes
- Department of Rheumatology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Javier Martin
- Department of Cell Biology and Immunology, Institute of Parasitology and Biomedicine López-Neyra, CSIC, Granada, Spain.
| |
Collapse
|
8
|
Maxwell TJ, Franks PW, Kahn SE, Knowler WC, Mather KJ, Florez JC, Jablonski KA. Quantitative trait loci, G×E and G×G for glycemic traits: response to metformin and placebo in the Diabetes Prevention Program (DPP). J Hum Genet 2022; 67:465-473. [PMID: 35260800 PMCID: PMC10102970 DOI: 10.1038/s10038-022-01027-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 02/16/2022] [Accepted: 02/17/2022] [Indexed: 11/09/2022]
Abstract
The complex genetic architecture of type-2-diabetes (T2D) includes gene-by-environment (G×E) and gene-by-gene (G×G) interactions. To identify G×E and G×G, we screened markers for patterns indicative of interactions (relationship loci [rQTL] and variance heterogeneity loci [vQTL]). rQTL exist when the correlation between multiple traits varies by genotype and vQTL occur when the variance of a trait differs by genotype (potentially flagging G×G and G×E). In the metformin and placebo arms of the DPP (n = 1762) we screened 280,965 exomic and intergenic SNPs, for rQTL and vQTL patterns in association with year one changes from baseline in glycemia and related traits (insulinogenic index [IGI], insulin sensitivity index [ISI], fasting glucose and fasting insulin). Significant (p < 1.8 × 10-7) rQTL and vQTL generated a priori hypotheses of individual G×E tests for a SNP × metformin treatment interaction and secondarily for G×G screens. Several rQTL and vQTL identified led to 6 nominally significant (p < 0.05) metformin treatment × SNP interactions (4 for IGI, one insulin, and one glucose) and 12G×G interactions (all IGI) that exceeded experiment-wide significance (p < 4.1 × 10-9). Some loci are directly associated with incident diabetes, and others are rQTL and modify a trait's relationship with diabetes (2 diabetes/glucose, 2 diabetes/insulin, 1 diabetes/IGI). rs3197999, an ISI/insulin rQTL, is a possible gene damaging missense mutation in MST1, is associated with ulcerative colitis, sclerosing cholangitis, Crohn's disease, BMI and coronary artery disease. This study demonstrates evidence for context-dependent effects (G×G & G×E) and the complexity of these T2D-related traits.
Collapse
Affiliation(s)
- Taylor J Maxwell
- Computational Biology Institute, The George Washington University, Ashburn, VA, USA.
| | - Paul W Franks
- Genetic & Molecular Epidemiology Unit, Lund University Diabetes Center, Lund, Sweden
| | - Steven E Kahn
- VA Puget Sound Health Care System and University of Washington, Seattle, WA, USA
| | - William C Knowler
- National Institute of Diabetes and Digestive and Kidney Diseases, Phoenix, AZ, USA
| | - Kieren J Mather
- Center for Diabetes and Metabolic Diseases & Division of Endocrinology & Metabolism, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Jose C Florez
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Programs in Metabolism and Medical & Population Genetics, Broad Institute, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Kathleen A Jablonski
- The Biostatistics Center, The Milken Institute of Public Health, The George Washington University, Rockville, MD, USA
| | | |
Collapse
|
9
|
Wang J, Patel A, Wason JM, Newcombe PJ. Two-stage penalized regression screening to detect biomarker-treatment interactions in randomized clinical trials. Biometrics 2022; 78:141-150. [PMID: 33448327 PMCID: PMC7613856 DOI: 10.1111/biom.13424] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 12/16/2020] [Accepted: 12/31/2020] [Indexed: 12/30/2022]
Abstract
High-dimensional biomarkers such as genomics are increasingly being measured in randomized clinical trials. Consequently, there is a growing interest in developing methods that improve the power to detect biomarker-treatment interactions. We adapt recently proposed two-stage interaction detecting procedures in the setting of randomized clinical trials. We also propose a new stage 1 multivariate screening strategy using ridge regression to account for correlations among biomarkers. For this multivariate screening, we prove the asymptotic between-stage independence, required for familywise error rate control, under biomarker-treatment independence. Simulation results show that in various scenarios, the ridge regression screening procedure can provide substantially greater power than the traditional one-biomarker-at-a-time screening procedure in highly correlated data. We also exemplify our approach in two real clinical trial data applications.
Collapse
Affiliation(s)
- Jixiong Wang
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Ashish Patel
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - James M.S. Wason
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK,Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, UK
| | | |
Collapse
|
10
|
Kawaguchi ES, Li G, Lewinger JP, Gauderman WJ. Two-step hypothesis testing to detect gene-environment interactions in a genome-wide scan with a survival endpoint. Stat Med 2022; 41:1644-1657. [PMID: 35075649 PMCID: PMC9007892 DOI: 10.1002/sim.9319] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 11/10/2021] [Accepted: 12/26/2021] [Indexed: 01/13/2023]
Abstract
Defined by their genetic profile, individuals may exhibit differential clinical outcomes due to an environmental exposure. Identifying subgroups based on specific exposure-modifying genes can lead to targeted interventions and focused studies. Genome-wide interaction scans (GWIS) can be performed to identify such genes, but these scans typically suffer from low power due to the large multiple testing burden. We provide a novel framework for powerful two-step hypothesis tests for GWIS with a time-to-event endpoint under the Cox proportional hazards model. In the Cox regression setting, we develop an approach that prioritizes genes for Step-2 G × E testing based on a carefully constructed Step-1 screening procedure. Simulation results demonstrate this two-step approach can lead to substantially higher power for identifying gene-environment ( G × E ) interactions compared to the standard GWIS while preserving the family wise error rate over a range of scenarios. In a taxane-anthracycline chemotherapy study for breast cancer patients, the two-step approach identifies several gene expression by treatment interactions that would not be detected using the standard GWIS.
Collapse
Affiliation(s)
- Eric S Kawaguchi
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA
| | - Gang Li
- Department of Biostatistics, University of California, Los Angeles, Los Angeles, California, USA.,Department of Computational Medicine, University of California, Los Angeles, Los Angeles, California, USA
| | - Juan Pablo Lewinger
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA
| | - W James Gauderman
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
11
|
Lin WY, Chan CC, Liu YL, Yang AC, Tsai SJ, Kuo PH. Sex-specific autosomal genetic effects across 26 human complex traits. Hum Mol Genet 2021; 29:1218-1228. [PMID: 32160288 DOI: 10.1093/hmg/ddaa040] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 12/26/2019] [Accepted: 03/05/2020] [Indexed: 12/28/2022] Open
Abstract
Previous studies have shown that men and women have different genetic architectures across many traits. However, except waist-to-hip ratio (WHR) and waist circumference (WC), it remains unknown whether the genetic effects of a certain trait are weaker or stronger on men/women. With ~18 000 Taiwan Biobank subjects, we comprehensively investigate sexual heterogeneity in autosomal genetic effects, for traits regarding cardiovascular health, diabetes, kidney, liver, anthropometric profiles, blood, etc. 'Gene-by-sex interactions' (G $\times$ S) were detected in 18 out of 26 traits, each with an interaction P-value (${{P}}_{{INT}}$) less than $0.05/104={0.00048}$, where 104 is the number of tests conducted in this study. The most significant evidence of G $\times$ S was found in WHR (${{P}}_{{INT}}$ = 3.2 $\times{{10}}^{-{55}}$) and WC (${{P}}_{{INT}}$ = 2.3$\times{{10}}^{-{41}}$). As a novel G$\times$S investigation for other traits, we here find that the autosomal genetic effects are weaker on women than on men, for low-density lipoprotein cholesterol (LDL-C), uric acid (UA) and diabetes-related traits such as fasting glucose and glycated hemoglobin. For LDL-C and UA, the evidence of G$\times$S is especially notable in subjects aged less than 50 years, where estrogen can play a role in attenuating the autosomal genetic effects of these two traits. Men and women have systematically distinct environmental contexts caused by hormonal milieu and their specific society roles, which may trigger diverse gene expressions despite the same DNA materials. As many environmental exposures are difficult to collect and quantify, sex can serve as a good surrogate for these factors.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Chang-Chuan Chan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan.,Institute of Environmental and Occupational Health Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Albert C Yang
- Division of Interdisciplinary Medicine and Biotechnology, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, MA, USA.,Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
| | - Shih-Jen Tsai
- Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan.,Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan.,Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
12
|
Majumdar A, Burch KS, Haldar T, Sankararaman S, Pasaniuc B, Gauderman WJ, Witte JS. A two-step approach to testing overall effect of gene-environment interaction for multiple phenotypes. Bioinformatics 2021; 36:5640-5648. [PMID: 33453114 DOI: 10.1093/bioinformatics/btaa1083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 12/09/2020] [Accepted: 12/17/2020] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION While gene-environment (GxE) interactions contribute importantly to many different phenotypes, detecting such interactions requires well-powered studies and has proven difficult. To address this, we combine two approaches to improve GxE power: simultaneously evaluating multiple phenotypes and using a two-step analysis approach. Previous work shows that the power to identify a main genetic effect can be improved by simultaneously analyzing multiple related phenotypes. For a univariate phenotype, two-step methods produce higher power for detecting a GxE interaction compared to single step analysis. Therefore, we propose a two-step approach to test for an overall GxE effect for multiple phenotypes. RESULTS Using simulations we demonstrate that, when more than one phenotype has GxE effect (i.e., GxE pleiotropy), our approach offers substantial gain in power (18%-43%) to detect an aggregate-level GxE effect for a multivariate phenotype compared to an analogous two-step method to identify GxE effect for a univariate phenotype. We applied the proposed approach to simultaneously analyze three lipids, LDL, HDL and Triglyceride with the frequency of alcohol consumption as environmental factor in the UK Biobank. The method identified two loci with an overall GxE effect on the vector of lipids, one of which was missed by the competing approaches. AVAILABILITY We provide an R package MPGE implementing the proposed approach which is available from CRAN: https://cran.r-project.org/web/packages/MPGE/index.html.
Collapse
Affiliation(s)
- Arunabha Majumdar
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.,Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Kathryn S Burch
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Sriram Sankararaman
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
| | - Bogdan Pasaniuc
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.,Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
| | - W James Gauderman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - John S Witte
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| |
Collapse
|
13
|
Lin WY, Liu YL, Yang AC, Tsai SJ, Kuo PH. Active Cigarette Smoking Is Associated With an Exacerbation of Genetic Susceptibility to Diabetes. Diabetes 2020; 69:2819-2829. [PMID: 33004471 DOI: 10.2337/db20-0156] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 09/26/2020] [Indexed: 11/13/2022]
Abstract
The heritability levels of two traits for diabetes diagnosis, serum fasting glucose (FG) and glycated hemoglobin (HbA1c), were estimated to be 51-62%. Studies have shown that cigarette smoking is a modifiable risk factor for diabetes. It is important to uncover whether smoking may modify the genetic risk of diabetes. This study included unrelated Taiwan Biobank subjects in a discovery cohort (TWB1) of 25,460 subjects and a replication cohort (TWB2) of 58,774 subjects. Genetic risk score (GRS) of each TWB2 subject was calculated with weights retrieved from the TWB1 analyses. We then assessed the significance of GRS-smoking interactions on FG, HbA1c, and diabetes while adjusting for covariates. A total of five smoking measurements were investigated, including active smoking status, pack-years, years as a smoker, packs smoked per day, and hours as a passive smoker per week. Except for passive smoking, all smoking measurements were associated with FG, HbA1c, and diabetes (P < 0.0033) and were associated with an exacerbation of the genetic risk of FG and HbA1c (P Interaction < 0.0033). For example, each 1 SD increase in GRS is associated with a 1.68% higher FG in subjects consuming one more pack of cigarettes per day (P Interaction = 1.9 × 10-7). Smoking cessation is especially important for people who are more genetically predisposed to diabetes.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Albert C Yang
- Division of Interdisciplinary Medicine and Biotechnology, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, MA
- Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
| | - Shih-Jen Tsai
- Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
- Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
- Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Psychiatry, National Taiwan University Hospital, Taipei, Taiwan
| |
Collapse
|
14
|
Mbemi A, Khanna S, Njiki S, Yedjou CG, Tchounwou PB. Impact of Gene-Environment Interactions on Cancer Development. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E8089. [PMID: 33153024 PMCID: PMC7662361 DOI: 10.3390/ijerph17218089] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 10/26/2020] [Accepted: 10/30/2020] [Indexed: 12/24/2022]
Abstract
Several epidemiological and experimental studies have demonstrated that many human diseases are not only caused by specific genetic and environmental factors but also by gene-environment interactions. Although it has been widely reported that genetic polymorphisms play a critical role in human susceptibility to cancer and other chronic disease conditions, many single nucleotide polymorphisms (SNPs) are caused by somatic mutations resulting from human exposure to environmental stressors. Scientific evidence suggests that the etiology of many chronic illnesses is caused by the joint effect between genetics and the environment. Research has also pointed out that the interactions of environmental factors with specific allelic variants highly modulate the susceptibility to diseases. Hence, many scientific discoveries on gene-environment interactions have elucidated the impact of their combined effect on the incidence and/or prevalence rate of human diseases. In this review, we provide an overview of the nature of gene-environment interactions, and discuss their role in human cancers, with special emphases on lung, colorectal, bladder, breast, ovarian, and prostate cancers.
Collapse
Affiliation(s)
- Ariane Mbemi
- NIH/NIMHD RCMI-Center for Health Disparities Research, Jackson State University, 1400 Lynch Street, Box 18750, Jackson, MS 39217, USA; (A.M.); (S.N.)
- Department of Biology, College of Science, Engineering and Technology, Jackson State University, 1400 Lynch Street, Box 18540, Jackson, MS 39217, USA
| | - Sunali Khanna
- Department of Oral Medicine and Radiology, Nair Hospital Dental College, Municipal Corporation of Greater Mumbai, Mumbai 400 008, India;
| | - Sylvianne Njiki
- NIH/NIMHD RCMI-Center for Health Disparities Research, Jackson State University, 1400 Lynch Street, Box 18750, Jackson, MS 39217, USA; (A.M.); (S.N.)
- Department of Biology, College of Science, Engineering and Technology, Jackson State University, 1400 Lynch Street, Box 18540, Jackson, MS 39217, USA
| | - Clement G. Yedjou
- Department of Biological Sciences, College of Science and Technology, Florida Agricultural and Mechanical University, 1610 S. Martin Luther King Blvd., Tallahassee, FL 32307, USA;
| | - Paul B. Tchounwou
- NIH/NIMHD RCMI-Center for Health Disparities Research, Jackson State University, 1400 Lynch Street, Box 18750, Jackson, MS 39217, USA; (A.M.); (S.N.)
- Department of Biology, College of Science, Engineering and Technology, Jackson State University, 1400 Lynch Street, Box 18540, Jackson, MS 39217, USA
| |
Collapse
|
15
|
Arthur VL, Guan W, Loza BL, Keating B, Chen J. Joint testing of donor and recipient genetic matching scores and recipient genotype has robust power for finding genes associated with transplant outcomes. Genet Epidemiol 2020; 44:893-907. [PMID: 32783273 PMCID: PMC7658035 DOI: 10.1002/gepi.22349] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 07/09/2020] [Accepted: 07/31/2020] [Indexed: 01/05/2023]
Abstract
Genetic matching between transplant donor and recipient pairs has traditionally focused on the human leukocyte antigen (HLA) regions of the genome, but recent studies suggest that matching for non-HLA regions may be important as well. We assess four genetic matching scores for use in association analyses of transplant outcomes. These scores describe genetic ancestry distance using identity-by-state, or genetic incompatibility or mismatch of the two genomes and therefore may reflect different underlying biological mechanisms for donor and recipient genes to influence transplant outcomes. Our simulation studies show that jointly testing these scores with the recipient genotype is a powerful method for preliminary screening and discovery of transplant outcome related single nucleotide polymorphisms (SNPs) and gene regions. Following these joint tests with marginal testing of the recipient genotype and matching score separately can lead to further understanding of the biological mechanisms behind transplant outcomes. In addition, we present results of a liver transplant data analysis that shows joint testing can detect SNPs significantly associated with acute rejection in liver transplant.
Collapse
Affiliation(s)
- Victoria L Arthur
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Weihua Guan
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, USA
| | - Bao-li Loza
- Department of Surgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Brendan Keating
- Department of Surgery, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jinbo Chen
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
16
|
Lin WY, Huang CC, Liu YL, Tsai SJ, Kuo PH. Polygenic approaches to detect gene-environment interactions when external information is unavailable. Brief Bioinform 2020; 20:2236-2252. [PMID: 30219835 PMCID: PMC6954453 DOI: 10.1093/bib/bby086] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Revised: 08/14/2018] [Accepted: 08/16/2018] [Indexed: 12/18/2022] Open
Abstract
The exploration of 'gene-environment interactions' (G × E) is important for disease prediction and prevention. The scientific community usually uses external information to construct a genetic risk score (GRS), and then tests the interaction between this GRS and an environmental factor (E). However, external genome-wide association studies (GWAS) are not always available, especially for non-Caucasian ethnicity. Although GRS is an analysis tool to detect G × E in GWAS, its performance remains unclear when there is no external information. Our 'adaptive combination of Bayes factors method' (ADABF) can aggregate G × E signals and test the significance of G × E by a polygenic test. We here explore a powerful polygenic approach for G × E when external information is unavailable, by comparing our ADABF with the GRS based on marginal effects of SNPs (GRS-M) and GRS based on SNP × E interactions (GRS-I). ADABF is the most powerful method in the absence of SNP main effects, whereas GRS-M is generally the best test when single-nucleotide polymorphisms main effects exist. GRS-I is the least powerful test due to its data-splitting strategy. Furthermore, we apply these methods to Taiwan Biobank data. ADABF and GRS-M identified gene × alcohol and gene × smoking interactions on blood pressure (BP). BP-increasing alleles elevate more BP in drinkers (smokers) than in nondrinkers (nonsmokers). This work provides guidance to choose a polygenic approach to detect G × E when external information is unavailable.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Ching-Chieh Huang
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli County, Taiwan
| | - Shih-Jen Tsai
- Department of Psychiatry, TaipeiVeterans General Hospital, Taipei, Taiwan.,Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
17
|
Lin WY, Lin YS, Chan CC, Liu YL, Tsai SJ, Kuo PH. Using Genetic Risk Score Approaches to Infer Whether an Environmental Factor Attenuates or Exacerbates the Adverse Influence of a Candidate Gene. Front Genet 2020; 11:331. [PMID: 32457790 PMCID: PMC7225361 DOI: 10.3389/fgene.2020.00331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Accepted: 03/20/2020] [Indexed: 11/18/2022] Open
Abstract
Some candidate genes have been robustly reported to be associated with complex traits, such as the fat mass and obesity-associated (FTO) gene on body mass index (BMI), and the fibroblast growth factor 5 (FGF5) gene on blood pressure levels. It is of interest to know whether an environmental factor (E) can attenuate or exacerbate the adverse influence of a candidate gene. To this end, we here evaluate the performance of “genetic risk score” (GRS) approaches to detect “gene-environment interactions” (G × E). In the first stage, a GRS is calculated according to the genotypes of variants in a candidate gene. In the second stage, we test whether E can significantly modify this GRS effect. This two-stage procedure can not only provide a p-value for a G × E test but also guide inferences on how E modifies the adverse effect of a gene. With systematic simulations, we compared several ways to construct a GRS. If E exacerbates the adverse influence of a gene, GRS formed by the elastic net (ENET) or the least absolute shrinkage and selection operator (LASSO) is recommended. However, the performance of ENET or LASSO will be compromised if E attenuates the adverse influence of a gene, and using the ridge regression (RIDGE) can be more powerful in this situation. Applying RIDGE to 18,424 subjects in the Taiwan Biobank, we showed that performing regular exercise can attenuate the adverse influence of the FTO gene on four obesity measures: BMI (p = 0.0009), body fat percentage (p = 0.0031), waist circumference (p = 0.0052), and hip circumference (p = 0.0001). As another example, we used RIDGE and found the FGF5 gene has a stronger effect on blood pressure in Han Chinese with a higher waist-to-hip ratio [p = 0.0013 for diastolic blood pressure (DBP) and p = 0.0027 for systolic blood pressure (SBP)]. This study provides an evaluation on the GRS approaches, which is important to infer whether E attenuates or exacerbates the adverse influence of a candidate gene.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Shun Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Chang-Chuan Chan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan.,Institute of Environmental and Occupational Health Sciences, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Shih-Jen Tsai
- Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan.,Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan.,Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
18
|
Quantification of the overall contribution of gene-environment interaction for obesity-related traits. Nat Commun 2020; 11:1385. [PMID: 32170055 PMCID: PMC7070002 DOI: 10.1038/s41467-020-15107-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 02/11/2020] [Indexed: 12/03/2022] Open
Abstract
The growing sample size of genome-wide association studies has facilitated the discovery of gene-environment interactions (GxE). Here we propose a maximum likelihood method to estimate the contribution of GxE to continuous traits taking into account all interacting environmental variables, without the need to measure any. Extensive simulations demonstrate that our method provides unbiased interaction estimates and excellent coverage. We also offer strategies to distinguish specific GxE from general scale effects. Applying our method to 32 traits in the UK Biobank reveals that while the genetic risk score (GRS) of 376 variants explains 5.2% of body mass index (BMI) variance, GRSxE explains an additional 1.9%. Nevertheless, this interaction holds for any variable with identical correlation to BMI as the GRS, hence may not be GRS-specific. Still, we observe that the global contribution of specific GRSxE to complex traits is substantial for nine obesity-related measures (including leg impedance and trunk fat-free mass). Most gene-by-environment interaction methods rely on the availability of the interacting environment. Here, the authors propose a robust maximum likelihood method for estimating the overall statistical interaction between a genetic risk score for a continuous outcome and all environmental variables.
Collapse
|
19
|
Zhang M, Yu Y, Wang S, Salvatore M, G Fritsche L, He Z, Mukherjee B. Interaction analysis under misspecification of main effects: Some common mistakes and simple solutions. Stat Med 2020; 39:1675-1694. [PMID: 32101638 DOI: 10.1002/sim.8505] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Revised: 01/22/2020] [Accepted: 01/23/2020] [Indexed: 11/09/2022]
Abstract
The statistical practice of modeling interaction with two linear main effects and a product term is ubiquitous in the statistical and epidemiological literature. Most data modelers are aware that the misspecification of main effects can potentially cause severe type I error inflation in tests for interactions, leading to spurious detection of interactions. However, modeling practice has not changed. In this article, we focus on the specific situation where the main effects in the model are misspecified as linear terms and characterize its impact on common tests for statistical interaction. We then propose some simple alternatives that fix the issue of potential type I error inflation in testing interaction due to main effect misspecification. We show that when using the sandwich variance estimator for a linear regression model with a quantitative outcome and two independent factors, both the Wald and score tests asymptotically maintain the correct type I error rate. However, if the independence assumption does not hold or the outcome is binary, using the sandwich estimator does not fix the problem. We further demonstrate that flexibly modeling the main effect under a generalized additive model can largely reduce or often remove bias in the estimates and maintain the correct type I error rate for both quantitative and binary outcomes regardless of the independence assumption. We show, under the independence assumption and for a continuous outcome, overfitting and flexibly modeling the main effects does not lead to power loss asymptotically relative to a correctly specified main effect model. Our simulation study further demonstrates the empirical fact that using flexible models for the main effects does not result in a significant loss of power for testing interaction in general. Our results provide an improved understanding of the strengths and limitations for tests of interaction in the presence of main effect misspecification. Using data from a large biobank study "The Michigan Genomics Initiative", we present two examples of interaction analysis in support of our results.
Collapse
Affiliation(s)
- Min Zhang
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Youfei Yu
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Shikun Wang
- Department of Biostatistics, MD Anderson Cancer Center, Houston, Texas
| | - Maxwell Salvatore
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Lars G Fritsche
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| | - Zihuai He
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, California
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan
| |
Collapse
|
20
|
Bi W, Zhao Z, Dey R, Fritsche LG, Mukherjee B, Lee S. A Fast and Accurate Method for Genome-wide Scale Phenome-wide G × E Analysis and Its Application to UK Biobank. Am J Hum Genet 2019; 105:1182-1192. [PMID: 31735295 PMCID: PMC6904814 DOI: 10.1016/j.ajhg.2019.10.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 10/14/2019] [Indexed: 02/06/2023] Open
Abstract
The etiology of most complex diseases involves genetic variants, environmental factors, and gene-environment interaction (G × E) effects. Compared with marginal genetic association studies, G × E analysis requires more samples and detailed measure of environmental exposures, and this limits the possible discoveries. Large-scale population-based biobanks with detailed phenotypic and environmental information, such as UK-Biobank, can be ideal resources for identifying G × E effects. However, due to the large computation cost and the presence of case-control imbalance, existing methods often fail. Here we propose a scalable and accurate method, SPAGE (SaddlePoint Approximation implementation of G × E analysis), that is applicable for genome-wide scale phenome-wide G × E studies. SPAGE fits a genotype-independent logistic model only once across the genome-wide analysis in order to reduce computation cost, and SPAGE uses a saddlepoint approximation (SPA) to calibrate the test statistics for analysis of phenotypes with unbalanced case-control ratios. Simulation studies show that SPAGE is 33-79 times faster than the Wald test and 72-439 times faster than the Firth's test, and SPAGE can control type I error rates at the genome-wide significance level even when case-control ratios are extremely unbalanced. Through the analysis of UK-Biobank data of 344,341 white British European-ancestry samples, we show that SPAGE can efficiently analyze large samples while controlling for unbalanced case-control ratios.
Collapse
Affiliation(s)
- Wenjian Bi
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Zhangchen Zhao
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Rounak Dey
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA 02115, USA
| | - Lars G Fritsche
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Seunggeun Lee
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
21
|
Dai JY, LeBlanc M. Case-only trees and random forests for exploring genotype-specific treatment effects in randomized clinical trials with dichotomous endpoints. J R Stat Soc Ser C Appl Stat 2019; 68:1371-1391. [PMID: 32489221 PMCID: PMC7266264 DOI: 10.1111/rssc.12366] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Discovering gene-treatment interactions in clinical trials is of rising interest in the era of precision medicine. Nonparametric statistical learning methods such as trees and random forests are useful tools for building prediction rules. In this article, we introduce trees and random forests to the recently proposed case-only approach for discovering gene-treatment interactions and estimating marker-specific treatment effects for a dichotomous trial endpoints. The motivational example is a case-control genetic association study in the Prostate Cancer Prevention Trial (PCPT), which tested the hypothesis whether finasteride can prevent prostate cancer. We compare this novel approach to the interaction tree method previously proposed. Because of the modeling simplicity - directly targeting at interaction - and the statistical efficiency of the case-only approach, case-only trees and random forests yield more accurate prediction of heterogeneous treatment effects and better measure of variable importance, relative to the interaction tree method which uses data from both cases and controls. Application of the proposed case-only trees and random forests to the PCPT study yielded a discovery of genotypes that may influence the prevention effect of finasteride.
Collapse
Affiliation(s)
- James Y Dai
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, USA
| | - Michael LeBlanc
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| |
Collapse
|
22
|
Liu Y, Huang J, Urbanowicz RJ, Chen K, Manduchi E, Greene CS, Moore JH, Scheet P, Chen Y. Embracing study heterogeneity for finding genetic interactions in large-scale research consortia. Genet Epidemiol 2019; 44:52-66. [PMID: 31583758 DOI: 10.1002/gepi.22262] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Revised: 08/02/2019] [Accepted: 08/09/2019] [Indexed: 11/12/2022]
Abstract
Genetic interactions have been recognized as a potentially important contributor to the heritability of complex diseases. Nevertheless, due to small effect sizes and stringent multiple-testing correction, identifying genetic interactions in complex diseases is particularly challenging. To address the above challenges, many genomic research initiatives collaborate to form large-scale consortia and develop open access to enable sharing of genome-wide association study (GWAS) data. Despite the perceived benefits of data sharing from large consortia, a number of practical issues have arisen, such as privacy concerns on individual genomic information and heterogeneous data sources from distributed GWAS databases. In the context of large consortia, we demonstrate that the heterogeneously appearing marginal effects over distributed GWAS databases can offer new insights into genetic interactions for which conventional methods have had limited success. In this paper, we develop a novel two-stage testing procedure, named phylogenY-based effect-size tests for interactions using first 2 moments (YETI2), to detect genetic interactions through both pooled marginal effects, in terms of averaging site-specific marginal effects, and heterogeneity in marginal effects across sites, using a meta-analytic framework. YETI2 can not only be applied to large consortia without shared personal information but also can be used to leverage underlying heterogeneity in marginal effects to prioritize potential genetic interactions. We investigate the performance of YETI2 through simulation studies and apply YETI2 to bladder cancer data from dbGaP.
Collapse
Affiliation(s)
- Yulun Liu
- Department of Population and Data Sciences, The University of Texas Southwestern Medical Center, Dallas, Texas
| | - Jing Huang
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Ryan J Urbanowicz
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Kun Chen
- Department of Statistics, University of Connecticut, Storrs, Connecticut
| | - Elisabetta Manduchi
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Casey S Greene
- Department of Pharmacology, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Jason H Moore
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Paul Scheet
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, Texas
| | - Yong Chen
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania.,Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania
| |
Collapse
|
23
|
Liu Y, Sun W, Reiner AP, Kooperberg C, He Q. Statistical inference of genetic pathway analysis in high dimensions. Biometrika 2019; 106:651. [PMID: 31427824 DOI: 10.1093/biomet/asz033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Indexed: 11/13/2022] Open
Abstract
Genetic pathway analysis has become an important tool for investigating the association between a group of genetic variants and traits. With dense genotyping and extensive imputation, the number of genetic variants in biological pathways has increased considerably and sometimes exceeds the sample size [Formula: see text]. Conducting genetic pathway analysis and statistical inference in such settings is challenging. We introduce an approach that can handle pathways whose dimension [Formula: see text] could be greater than [Formula: see text]. Our method can be used to detect pathways that have nonsparse weak signals, as well as pathways that have sparse but stronger signals. We establish the asymptotic distribution for the proposed statistic and conduct theoretical analysis on its power. Simulation studies show that our test has correct Type I error control and is more powerful than existing approaches. An application to a genome-wide association study of high-density lipoproteins demonstrates the proposed approach.
Collapse
Affiliation(s)
- Yang Liu
- Department of Mathematics and Statistics, Wright State University, 3640 Colonel Glenn Highway, Dayton, Ohio, U.S.A
| | - Wei Sun
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington, U.S.A
| | - Alexander P Reiner
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington, U.S.A
| | - Charles Kooperberg
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington, U.S.A
| | - Qianchuan He
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington, U.S.A
| |
Collapse
|
24
|
Lin WY, Chan CC, Liu YL, Yang AC, Tsai SJ, Kuo PH. Performing different kinds of physical exercise differentially attenuates the genetic effects on obesity measures: Evidence from 18,424 Taiwan Biobank participants. PLoS Genet 2019; 15:e1008277. [PMID: 31369549 PMCID: PMC6675047 DOI: 10.1371/journal.pgen.1008277] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 06/26/2019] [Indexed: 12/17/2022] Open
Abstract
Obesity is a worldwide health problem that is closely linked to many metabolic disorders. Regular physical exercise has been found to attenuate the genetic predisposition to obesity. However, it remains unknown what kinds of exercise can modify the genetic risk of obesity. This study included 18,424 unrelated Han Chinese adults aged 30–70 years who participated in the Taiwan Biobank (TWB). A total of 5 obesity measures were investigated here, including body mass index (BMI), body fat percentage (BFP), waist circumference (WC), hip circumference (HC), and waist-to-hip ratio (WHR). Because there have been no large genome-wide association studies on obesity for Han Chinese, we used the TWB internal weights to construct genetic risk scores (GRSs) for each obesity measure, and then test the significance of GRS-by-exercise interactions. The significance level throughout this work was set at 0.05/550 = 9.1x10-5 because a total of 550 tests were performed. Performing regular exercise was found to attenuate the genetic effects on 4 obesity measures, including BMI, BFP, WC, and HC. Among the 18 kinds of self-reported regular exercise, 6 mitigated the genetic effects on at least one obesity measure. Regular jogging blunted the genetic effects on BMI, BFP, and HC. Mountain climbing, walking, exercise walking, international standard dancing, and a longer practice of yoga also attenuated the genetic effects on BMI. Exercises such as cycling, stretching exercise, swimming, dance dance revolution, and qigong were not found to modify the genetic effects on any obesity measure. Across all 5 obesity measures, regular jogging consistently presented the most significant interactions with GRSs. Our findings show that the genetic effects on obesity measures can be decreased to various extents by performing different kinds of exercise. The benefits of regular physical exercise are more impactful in subjects who are more predisposed to obesity. The complex interplay of genetics and lifestyle makes obesity a challenging issue. Previous studies have found performing regular physical exercise could blunt the genetic effects on body mass index (BMI). However, BMI does not take into account lean body mass or identify central obesity. Moreover, it remains unclear what kinds of exercise could more effectively attenuate the genetic effects on obesity measures. With a sample of 18,424 unrelated Han Chinese adults, we comprehensively investigated gene-exercise interactions on 5 obesity measures: BMI, body fat percentage, waist circumference, hip circumference, and waist-to-hip ratio. Moreover, we tested whether the genetic effects on obesity measures could be modified by any of 18 kinds of self-reported regular exercise. Because no large genome-wide association studies on obesity have been done for Han Chinese, we constructed genetic risk scores with internal weights for analyses. Among these exercises, regular jogging consistently presented the strongest evidence to mitigate the genetic effects on all 5 obesity measures. Moreover, mountain climbing, walking, exercise walking, international standard dancing, and a longer practice of yoga attenuated the genetic effects on BMI. The benefits of regularly performing these 6 kinds of exercise are more impactful in subjects who are more predisposed to obesity.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
- * E-mail: (WYL); (PHK)
| | - Chang-Chuan Chan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
- Institute of Occupational Medicine and Industrial Hygiene, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Zhunan, Miaoli County, Taiwan
| | - Albert C. Yang
- Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
- Division of Interdisciplinary Medicine and Biotechnology, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, Massachusetts, United States of America
- Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
| | - Shih-Jen Tsai
- Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
- Institute of Brain Science, National Yang-Ming University, Taipei, Taiwan
- Department of Psychiatry, Taipei Veterans General Hospital, Beitou District, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
- Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
- * E-mail: (WYL); (PHK)
| |
Collapse
|
25
|
Yu Y, Xia L, Lee S, Zhou X, Stringham HM, Boehnke M, Mukherjee B. Subset-Based Analysis Using Gene-Environment Interactions for Discovery of Genetic Associations across Multiple Studies or Phenotypes. Hum Hered 2019; 83:283-314. [PMID: 31132756 DOI: 10.1159/000496867] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 01/04/2019] [Indexed: 01/11/2023] Open
Abstract
OBJECTIVES Classical methods for combining summary data from genome-wide association studies only use marginal genetic effects, and power can be compromised in the presence of heterogeneity. We aim to enhance the discovery of novel associated loci in the presence of heterogeneity of genetic effects in subgroups defined by an environmental factor. METHODS We present a pvalue-assisted subset testing for associations (pASTA) framework that generalizes the previously proposed association analysis based on subsets (ASSET) method by incorporating gene-environment (G-E) interactions into the testing procedure. We conduct simulation studies and provide two data examples. RESULTS Simulation studies show that our proposal is more powerful than methods based on marginal associations in the presence of G-E interactions and maintains comparable power even in their absence. Both data examples demonstrate that our method can increase power to detect overall genetic associations and identify novel studies/phenotypes that contribute to the association. CONCLUSIONS Our proposed method can be a useful screening tool to identify candidate single nucleotide polymorphisms that are potentially associated with the trait(s) of interest for further validation. It also allows researchers to determine the most probable subset of traits that exhibit genetic associations in addition to the enhancement of power.
Collapse
Affiliation(s)
- Youfei Yu
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Lu Xia
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Seunggeun Lee
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Heather M Stringham
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Michael Boehnke
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA, .,Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA,
| |
Collapse
|
26
|
Gauderman WJ, Kim A, Conti DV, Morrison J, Thomas DC, Vora H, Lewinger JP. A Unified Model for the Analysis of Gene-Environment Interaction. Am J Epidemiol 2019; 188:760-767. [PMID: 30649161 DOI: 10.1093/aje/kwy278] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 12/13/2018] [Accepted: 12/17/2018] [Indexed: 11/14/2022] Open
Abstract
Gene-environment (G × E) interaction is important for many complex traits. In a case-control study of a disease trait, logistic regression is the standard approach used to model disease as a function of a gene (G), an environmental factor (E), G × E interaction, and adjustment covariates. We propose an alternative model with G as the outcome and show how it provides a unified framework for obtaining results from all of the common G × E tests. These include the 1-degree-of-freedom (df) test of G × E interaction, the 2-df joint test of G and G × E, the case-only and empirical Bayes tests, and several 2-step tests. In the context of this unified model, we propose a novel 3-df test and demonstrate that it provides robust power across a wide range of underlying G × E interaction models. We demonstrate the 3-df test in a genome-wide scan of G × sex interaction for childhood asthma using data from the Children's Health Study (Southern California, 1993-2001). This scan identified a strong G × sex interaction at the phosphodiesterase gene 4D locus (PDE4D), a known asthma-related locus, with a strong effect in males (per-allele odds ratio = 1.70; P = 3.8 × 10-8) and virtually no effect in females. We describe a software program, G×EScan (University of Southern California, Los Angeles, California), which can be used to fit standard and unified models for genome-wide G × E studies.
Collapse
Affiliation(s)
- W James Gauderman
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Andre Kim
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - David V Conti
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - John Morrison
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Duncan C Thomas
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Hita Vora
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Juan Pablo Lewinger
- Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| |
Collapse
|
27
|
Chen Y, Adrianto I, Ianuzzi MC, Garman L, Montgomery CG, Rybicki BA, Levin AM, Li J. Extended methods for gene-environment-wide interaction scans in studies of admixed individuals with varying degrees of relationships. Genet Epidemiol 2019; 43:414-426. [PMID: 30793815 DOI: 10.1002/gepi.22196] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 12/26/2018] [Accepted: 01/24/2019] [Indexed: 11/08/2022]
Abstract
The etiology of many complex diseases involves both environmental exposures and inherited genetic predisposition as well as interactions between them. Gene-environment-wide interaction studies (GEWIS) provide a means to identify the interactions between genetic variation and environmental exposures that underlie disease risk. However, current GEWIS methods lack the capability to adjust for the potentially complex correlations in studies with varying degrees of relationships (both known and unknown) among individuals in admixed populations. We developed novel generalized estimating equation (GEE) based methods-GEE-adaptive and GEE-joint-to account for phenotypic correlations due to kinship while accounting for covariates, including, measures of genome-wide ancestry. In simulation studies of admixed individuals, both methods controlled family-wise error rates, an advantage over the case-only approach. They demonstrated higher power than traditional case-control methods across a wide range of underlying alternative hypotheses, especially where both marginal and interaction effects were present. We applied the proposed method to conduct a GEWIS of a known sarcoidosis risk factor (insecticide exposure) and risk of sarcoidosis in African Americans and identified two novel loci with suggestive evidence of G × E interaction.
Collapse
Affiliation(s)
- Yalei Chen
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Indra Adrianto
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Michael C Ianuzzi
- Department of Internal Medicine, Northwell Staten Island University Hospital, Staten Island, New York, New York
| | - Lori Garman
- Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma
| | - Courtney G Montgomery
- Arthritis and Clinical Immunology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma
| | - Benjamin A Rybicki
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan
| | - Albert M Levin
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| | - Jia Li
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan.,Center for Bioinformatics, Henry Ford Health System, Detroit, Michigan
| |
Collapse
|
28
|
Lin WY, Huang CC, Liu YL, Tsai SJ, Kuo PH. Genome-Wide Gene-Environment Interaction Analysis Using Set-Based Association Tests. Front Genet 2019; 9:715. [PMID: 30693016 PMCID: PMC6339974 DOI: 10.3389/fgene.2018.00715] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Accepted: 12/20/2018] [Indexed: 12/22/2022] Open
Abstract
The identification of gene-environment interactions (G × E) may eventually guide health-related choices and medical interventions for complex diseases. More powerful methods must be developed to identify G × E. The “adaptive combination of Bayes factors method” (ADABF) has been proposed as a powerful genome-wide polygenic approach to detect G × E. In this work, we evaluate its performance when serving as a gene-based G × E test. We compare ADABF with six tests including the “Set-Based gene-EnviRonment InterAction test” (SBERIA), “gene-environment set association test” (GESAT), etc. With extensive simulations, SBERIA and ADABF are found to be more powerful than other G × E tests. However, SBERIA suffers from a power loss when 50% SNP main effects are in the same direction with the SNP × E interaction effects while 50% are in the opposite direction. We further applied these seven G × E methods to the Taiwan Biobank data to explore gene× alcohol interactions on blood pressure levels. The ADAMTS7P1 gene at chromosome 15q25.2 was detected to interact with alcohol consumption on diastolic blood pressure (p = 9.5 × 10−7, according to the GESAT test). At this gene, the P-values provided by other six tests all reached the suggestive significance level (p < 5 × 10−5). Regarding the computation time required for a genome-wide G × E analysis, SBERIA is the fastest method, followed by ADABF. Considering the validity, power performance, robustness, and computation time, ADABF is recommended for genome-wide G × E analyses.
Collapse
Affiliation(s)
- Wan-Yu Lin
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Ching-Chieh Huang
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Zhunan, Taiwan
| | - Shih-Jen Tsai
- Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan.,Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.,Department of Public Health, College of Public Health, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
29
|
Ramstein GP, Evans J, Nandety A, Saha MC, Brummer EC, Kaeppler SM, Buell CR, Casler MD. Candidate Variants for Additive and Interactive Effects on Bioenergy Traits in Switchgrass ( Panicum virgatum L.) Identified by Genome-Wide Association Analyses. THE PLANT GENOME 2018; 11:180002. [PMID: 30512032 DOI: 10.3835/plantgenome2018.01.0002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Switchgrass ( L.) is a promising herbaceous energy crop, but further gains in biomass yield and quality must be achieved to enable a viable bioenergy industry. Developing DNA markers can contribute to such progress, but depiction of genetic bases should be reliable, involving simple additive marker effects and also interactions with genetic backgrounds (e.g., ecotypes) or synergies with other markers. We analyzed plant height, C content, N content, and mineral concentration in a diverse panel consisting of 512 genotypes of upland and lowland ecotypes. We performed association analyses based on exome capture sequencing and tested 439,170 markers for marginal effects, 83,290 markers for marker × ecotype interactions, and up to 311,445 marker pairs for pairwise interactions. Analyses of pairwise interactions focused on subsets of marker pairs preselected on the basis of marginal marker effects, gene ontology annotation, and pairwise marker associations. Our tests identified 12 significant effects. Homology and gene expression information corroborated seven effects and indicated plausible causal pathways: flowering time and lignin synthesis for plant height; plant growth and senescence for C content and mineral concentration. Four pairwise interactions were detected, including three interactions preselected on the basis of pairwise marker correlations. Furthermore, a marker × ecotype interaction and a pairwise interaction were confirmed in an independent switchgrass panel. Our analyses identified reliable candidate variants for important bioenergy traits. Moreover, they exemplified the importance of interactive effects for depicting genetic bases and illustrated the usefulness of preselecting marker pairs for identifying pairwise marker interactions in association studies.
Collapse
|
30
|
Carlson JC, Nidey NL, Butali A, Buxo CJ, Christensen K, Deleyiannis FWD, Hecht JT, Field LL, Moreno-Uribe LM, Orioli IM, Poletta FA, Padilla C, Vieira AR, Weinberg SM, Wehby GL, Feingold E, Murray JC, Marazita ML, Leslie EJ. Genome-wide interaction studies identify sex-specific risk alleles for nonsyndromic orofacial clefts. Genet Epidemiol 2018; 42:664-672. [PMID: 30277614 DOI: 10.1002/gepi.22158] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Revised: 07/17/2018] [Accepted: 07/28/2018] [Indexed: 01/11/2023]
Abstract
Nonsyndromic cleft lip with or without cleft palate (NSCL/P) is the most common craniofacial birth defect in humans and is notable for its apparent sexual dimorphism where approximately twice as many males are affected as females. The sources of this disparity are largely unknown, but interactions between genetic and sex effects are likely contributors. We examined gene-by-sex (G × S) interactions in a worldwide sample of 2,142 NSCL/P cases and 1,700 controls recruited from 13 countries. First, we performed genome-wide joint tests of the genetic (G) and G × S effects genome-wide using logistic regression assuming an additive genetic model and adjusting for 18 principal components of ancestry. We further interrogated loci with suggestive results from the joint test ( p < 1.00 × 10 -5 ) by examining the G × S effects from the same model. Out of the 133 loci with suggestive results ( p < 1.00 × 10 -5 ) for the joint test, we observed one genome-wide significant G × S effect in the 10q21 locus (rs72804706; p = 6.69 × 10 -9 ; OR = 2.62 CI [1.89, 3.62]) and 16 suggestive G × S effects. At the intergenic 10q21 locus, the risk of NSCL/P is estimated to increase with additional copies of the minor allele for females, but the opposite effect for males. Our observation that the impact of genetic variants on NSCL/P risk differs for males and females may further our understanding of the genetic architecture of NSCL/P and the sex differences underlying clefts and other birth defects.
Collapse
Affiliation(s)
- Jenna C Carlson
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania.,Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Nichole L Nidey
- Department of Epidemiology, College of Public Health, University of Iowa, Iowa City, Iowa
| | - Azeez Butali
- Department of Oral Pathology, Radiology and Medicine, Dows Institute for Dental Research, College of Dentistry, University of Iowa, Iowa City, Iowa
| | - Carmen J Buxo
- Dental and Craniofacial Genomics Core, School of Dental Medicine, University of Puerto Rico, San Juan, Puerto Rico
| | - Kaare Christensen
- Department of Epidemiology, Institute of Public Health, University of Southern Denmark, Odense, Denmark
| | - Frederic W-D Deleyiannis
- Department of Surgery, Plastic and Reconstructive Surgery Division, University of Colorado School of Medicine, Denver, Colorado
| | - Jacqueline T Hecht
- Department of Pediatrics, McGovern Medical School and School of Dentistry, UT Health at Houston, Houston, Texas
| | - L Leigh Field
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Lina M Moreno-Uribe
- Department of Orthodontics, College of Dentistry, University of Iowa, Iowa City, Iowa
| | - Ieda M Orioli
- ECLAMC (Latin American Collaborative Study of Congenital Malformations) at INAGEMP (National Institute of Population Medical Genetics), Rio de Janeiro, Brazil.,Department of Genetics, Institute of Biology, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Fernando A Poletta
- ECLAMC (Latin American Collaborative Study of Congenital Malformations) at INAGEMP (National Institute of Population Medical Genetics), Rio de Janeiro, Brazil.,CEMIC-CONICET: Center for Medical Education and Clinical Research "Norberto Quirno", Buenos Aires, Argentina
| | - Carmencita Padilla
- Department of Pediatrics, College of Medicine, University of the Philippines, Manila, Philippines.,The Philippine Genome Center, University of the Philippines System, Manilla, Philippines
| | - Alexandre R Vieira
- Department of Oral Biology, Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Seth M Weinberg
- Department of Oral Biology, Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - George L Wehby
- Department of Health Management and Policy, College of Public Health, University of Iowa, Iowa City, Iowa
| | - Eleanor Feingold
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania.,Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Jeffrey C Murray
- Department of Pediatrics, Carver College of Medicine, University of Iowa, Iowa City, Iowa
| | - Mary L Marazita
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania.,Department of Oral Biology, Center for Craniofacial and Dental Genetics, School of Dental Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania
| | - Elizabeth J Leslie
- Department of Human Genetics, Emory University School of Medicine, Emory University, Atlanta, Georgia
| |
Collapse
|
31
|
Genome-wide association study for variants that modulate relationships between cerebrospinal fluid amyloid-beta 42, tau, and p-tau levels. ALZHEIMERS RESEARCH & THERAPY 2018; 10:86. [PMID: 30153862 PMCID: PMC6114488 DOI: 10.1186/s13195-018-0410-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 07/23/2018] [Indexed: 11/10/2022]
Abstract
Background A relationship quantitative trait locus exists when the correlation between multiple traits varies by genotype for that locus. Relationship quantitative trait loci (rQTL) are often involved in gene-by-gene (G×G) interactions or gene-by-environmental interactions, making them a powerful tool for detecting G×G. Methods We performed genome-wide association studies to identify rQTL between tau and Aβ42 and ptau and Aβ42 with over 3000 individuals using age, gender, series, APOE ε2, APOE ε4, and two principal components for population structure as covariates. Each significant rQTL was separately screened for interactions with other loci for each trait in the rQTL model. Parametric bootstrapping was used to assess significance. Results We found four significant tau/Aβ42 rQTL from three unique locations and six ptau/Aβ42 rQTL from five unique locations. G×G screens with these rQTL produced four significant G×G interactions (one Aβ42, two ptau, and one tau) with four rQTL where each second locus was from a unique location. On follow-up, rs1036819 and rs74025622 were associated with Alzheimer’s disease (AD) case/control status; rs15205 and rs79099429 were associated with rate of decline. Conclusions The two most significant rQTL (rs8027714 and rs1036819) for ptau/Aβ42 are on different chromosomes and both are strong hits for pelvic organ prolapse. While diseases of the nervous system can cause pelvic organ prolapse, it is unlikely related to the ptau/Aβ42 relationship but may suggest that these two loci share a pathway. In addition to a ptau/Aβ42 rQTL and association with AD case/control status, rs1036819 is a strong rQTL for case/control status/Aβ42 and for tau/Aβ42. It resides in the ZFAT gene, which is related to autoimmune thyroid disease. For tau, rs9817620 interacts with the tau/Aβ42 rQTL rs74025622. It is in the CHL1 gene, which is a neural cell adhesion molecule and may be involved in signal transduction pathways. CHL1 is related to BACE1, which is a β-secretase enzyme that initiates production of the β-amyloid peptide involved in AD and is a primary drug target. Overall, there are numerous loci that affect the relationship between these important AD endophenotypes and some are due to interactions with other loci. Some affect the risk of AD and/or rate of progression. Electronic supplementary material The online version of this article (10.1186/s13195-018-0410-y) contains supplementary material, which is available to authorized users.
Collapse
|
32
|
Wang J, Liu Q, Pierce BL, Huo D, Olopade OI, Ahsan H, Chen LS. A meta-analysis approach with filtering for identifying gene-level gene-environment interactions. Genet Epidemiol 2018; 42:434-446. [PMID: 29430690 PMCID: PMC6013347 DOI: 10.1002/gepi.22115] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 12/13/2017] [Accepted: 01/02/2018] [Indexed: 02/02/2023]
Abstract
There is a growing recognition that gene-environment interaction (G × E) plays a pivotal role in the development and progression of complex diseases. Despite a wealth of genetic data on various complex diseases/traits generated from association and sequencing studies, detecting G × E via genome-wide analysis remains challenging due to power issues. In genome-wide G × E studies, a common strategy to improve power is to first conduct a filtering test and retain only the genetic variants that pass the filtering step for subsequent G × E analyses. Two-stage, multistage, and unified tests have been proposed to jointly consider the filtering statistics in G × E tests. However, such G × E tests based on data from a single study may still be underpowered. Meanwhile, large-scale consortia have been formed to borrow strength across studies and populations. In this work, motivated by existing single-study G × E tests with filtering and the needs for meta-analysis G × E approaches based on consortia data, we propose a meta-analysis framework for detecting gene-based G × E effects, and introduce meta-analysis-based filtering statistics in the gene-level G × E tests. Simulations demonstrate the advantages of the proposed method-the ofGEM test. We apply the proposed tests to existing data from two breast cancer consortia to identify the genes harboring genetic variants with age-dependent penetrance (i.e., gene-age interaction effects). We develop an R software package ofGEM for the proposed meta-analysis tests.
Collapse
Affiliation(s)
- Jiebiao Wang
- Department of Public Health Sciences, The University of Chicago, Chicago, Illinois, USA
| | | | - Brandon L. Pierce
- Department of Public Health Sciences, The University of Chicago, Chicago, Illinois, USA
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, USA
| | - Dezheng Huo
- Department of Public Health Sciences, The University of Chicago, Chicago, Illinois, USA
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
| | - Olufunmilayo I. Olopade
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, USA
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
- Center for Clinical Cancer Genetics & Global Health, The University of Chicago Medical Center, Chicago, Illinois, USA
| | - Habibul Ahsan
- Department of Public Health Sciences, The University of Chicago, Chicago, Illinois, USA
- Department of Human Genetics, The University of Chicago, Chicago, Illinois, USA
- Department of Medicine, The University of Chicago, Chicago, Illinois, USA
| | - Lin S. Chen
- Department of Public Health Sciences, The University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
33
|
Alarcon F, Nuel G. Detecting latent exposure in genome-wide association studies using a breakpoint model for logistic regression. Stat Methods Med Res 2018; 28:1781-1792. [PMID: 29921158 DOI: 10.1177/0962280218776385] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Detecting gene-environment (G × E) interactions in the context of genome-wide association studies (GWAS) is a challenging problem since standard methods generally present a lack of power. An additional difficulty arises from the fact that the causal exposure is seldom observed and only a proxy of this exposure is observed. This leads to an additional drop in terms of power and it explains the failure of standard methods in detecting interactions, even very strong ones. In this article, we consider the latent exposure as a source of heterogeneity and we propose a new powerful method, named "Breakpoint Model for Logistic Regression" (BMLR), based on a breakpoint model, in order to detect G × E interactions when causal exposure is unobserved. First, the BMLR method is compared to the ordered-subset analysis for case-control method, which has been developed for the same purpose, through simulations. This highlights the ability of BMLR to detect the heterogeneity, and therefore, to detect interaction with latent exposure. Finally, the BMLR method is compared to standard methods, such as Plink, to perform a GWAS on a published realistic benchmark.
Collapse
Affiliation(s)
- Flora Alarcon
- 1 Laboratoire MAP5, Université Paris Descartes and CNRS, Sorbonne Paris Cité, Paris, France
| | - Gregory Nuel
- 2 Institute of Mathematics (INSMI), National Center for French Research (CNRS), Paris, France.,3 Stochastic and Biology Group, LPSM (CNRS 8001), Sorbonne Université, Paris, France
| |
Collapse
|
34
|
Dai JY, Liang J, LeBlanc M, Prentice RL, Janes H. Case-only approach to identifying markers predicting treatment effects on the relative risk scale. Biometrics 2018; 74:753-763. [PMID: 28960244 PMCID: PMC5874156 DOI: 10.1111/biom.12789] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Revised: 06/01/2017] [Accepted: 08/01/2017] [Indexed: 11/29/2022]
Abstract
Retrospectively measuring markers on stored baseline samples from participants in a randomized controlled trial (RCT) may provide high quality evidence as to the value of the markers for treatment selection. Originally developed for approximating gene-environment interactions in the odds ratio scale, the case-only method has recently been advocated for assessing gene-treatment interactions on rare disease endpoints in randomized clinical trials. In this article, the case-only approach is shown to provide a consistent and efficient estimator of marker by treatment interactions and marker-specific treatment effects on the relative risk scale. The prohibitive rare-disease assumption is no longer needed, broadening the utility of the case-only approach. The case-only method is resource-efficient as markers only need to be measured in cases only. It eliminates the need to model the marker's main effect, and can be used with any parametric or nonparametric learning method. The utility of this approach is illustrated by an application to genetic data in the Women's Health Initiative (WHI) hormone therapy trial.
Collapse
Affiliation(s)
- James Y. Dai
- Public Health Sciences Division and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| | - Jason Liang
- Public Health Sciences Division and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| | - Michael LeBlanc
- Public Health Sciences Division and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| | - Ross L. Prentice
- Public Health Sciences Division and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| | - Holly Janes
- Public Health Sciences Division and Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A
| |
Collapse
|
35
|
Gene-by-Psychosocial Factor Interactions Influence Diastolic Blood Pressure in European and African Ancestry Populations: Meta-Analysis of Four Cohort Studies. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2017; 14:ijerph14121596. [PMID: 29258278 PMCID: PMC5751013 DOI: 10.3390/ijerph14121596] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Revised: 12/04/2017] [Accepted: 12/07/2017] [Indexed: 02/07/2023]
Abstract
Inter-individual variability in blood pressure (BP) is influenced by both genetic and non-genetic factors including socioeconomic and psychosocial stressors. A deeper understanding of the gene-by-socioeconomic/psychosocial factor interactions on BP may help to identify individuals that are genetically susceptible to high BP in specific social contexts. In this study, we used a genomic region-based method for longitudinal analysis, Longitudinal Gene-Environment-Wide Interaction Studies (LGEWIS), to evaluate the effects of interactions between known socioeconomic/psychosocial and genetic risk factors on systolic and diastolic BP in four large epidemiologic cohorts of European and/or African ancestry. After correction for multiple testing, two interactions were significantly associated with diastolic BP. In European ancestry participants, outward/trait anger score had a significant interaction with the C10orf107 genomic region (p = 0.0019). In African ancestry participants, depressive symptom score had a significant interaction with the HFE genomic region (p = 0.0048). This study provides a foundation for using genomic region-based longitudinal analysis to identify subgroups of the population that may be at greater risk of elevated BP due to the combined influence of genetic and socioeconomic/psychosocial risk factors.
Collapse
|
36
|
Ahmad S, Mora S, Franks PW, Orho-Melander M, Ridker PM, Hu FB, Chasman DI. Adiposity and Genetic Factors in Relation to Triglycerides and Triglyceride-Rich Lipoproteins in the Women's Genome Health Study. Clin Chem 2017; 64:231-241. [PMID: 29097515 DOI: 10.1373/clinchem.2017.280545] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 10/11/2017] [Indexed: 02/07/2023]
Abstract
BACKGROUND Previous results from Scandinavian cohorts have shown that obesity accentuates the effects of common genetic susceptibility variants on increased triglycerides (TG). Whether such interactions are present in the US population and further selective for particular TG-rich lipoprotein subfractions is unknown. METHODS We examined these questions using body mass index (BMI) and waist circumference (WC) among women of European ancestry from the Women's Genome Health Study (WGHS) (n = 21840 for BMI; n = 19313 for WC). A weighted genetic risk score (TG-wGRS) based on 40 published TG-associated single-nucleotide polymorphisms was calculated using published effect estimates. RESULTS Comparing overweight (BMI ≥ 25 kg/m2) and normal weight (BMI < 25 kg/m2) WGHS women, each unit increase of TG-wGRS was associated with TG increases of 1.013% and 1.011%, respectively, and this differential association was significant (Pinteraction = 0.014). Metaanalyses combining results for WGHS BMI with the 4 Scandinavian cohorts (INTER99, HEALTH2006, GLACIER, MDC) (total n = 40026) yielded a more significant interaction (Pinteraction = 0.001). Similarly, we observed differential association of the TG-wGRS with TG (Pinteraction = 0.006) in strata of WC (<80 cm vs ≥80 cm). Metaanalysis with 2 additional cohorts reporting WC (INTER99 and HEALTH2006) (total n = 27834) was significant with consistent effects (Pinteraction = 0.006). We also observed highly significant interactions of the TG-wGRS across the strata of BMI with very large, medium, and small TG-rich lipoprotein subfractions measured by nuclear magnetic resonance spectroscopy (all Pinteractions < 0.0001). The differential effects were strongest for very large TG-rich lipoprotein. CONCLUSIONS Our results support the original findings and suggest that obese individuals may be more susceptible to aggregated genetic risk associated with common TG-raising alleles, with effects accentuated in the large TG-rich lipoprotein subfraction.
Collapse
Affiliation(s)
- Shafqat Ahmad
- Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA; .,Preventive Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Samia Mora
- Preventive Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Center for Lipid Metabolomics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Paul W Franks
- Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA.,Department of Clinical Sciences, Genetic and Molecular Epidemiology Unit, Clinical Research Centre, Lund University, Skåne University Hospital, Malmö, Sweden.,Department of Public Health and Clinical Medicine, Section for Medicine, Umeå University, Umeå, Sweden
| | - Marju Orho-Melander
- Diabetes and Cardiovascular Disease-Genetic Epidemiology, Department of Clinical Sciences, Lund University, Malmö, Sweden
| | - Paul M Ridker
- Preventive Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA.,Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Frank B Hu
- Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA.,Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA
| | - Daniel I Chasman
- Preventive Medicine Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA;
| |
Collapse
|
37
|
McAllister K, Mechanic LE, Amos C, Aschard H, Blair IA, Chatterjee N, Conti D, Gauderman WJ, Hsu L, Hutter CM, Jankowska MM, Kerr J, Kraft P, Montgomery SB, Mukherjee B, Papanicolaou GJ, Patel CJ, Ritchie MD, Ritz BR, Thomas DC, Wei P, Witte JS. Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 2017; 186:753-761. [PMID: 28978193 PMCID: PMC5860428 DOI: 10.1093/aje/kwx227] [Citation(s) in RCA: 106] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 03/14/2017] [Accepted: 03/16/2017] [Indexed: 12/25/2022] Open
Abstract
Recently, many new approaches, study designs, and statistical and analytical methods have emerged for studying gene-environment interactions (G×Es) in large-scale studies of human populations. There are opportunities in this field, particularly with respect to the incorporation of -omics and next-generation sequencing data and continual improvement in measures of environmental exposures implicated in complex disease outcomes. In a workshop called "Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases," held October 17-18, 2014, by the National Institute of Environmental Health Sciences and the National Cancer Institute in conjunction with the annual American Society of Human Genetics meeting, participants explored new approaches and tools that have been developed in recent years for G×E discovery. This paper highlights current and critical issues and themes in G×E research that need additional consideration, including the improved data analytical methods, environmental exposure assessment, and incorporation of functional data and annotations.
Collapse
Affiliation(s)
| | - Leah E. Mechanic
- Correspondence to Dr. Leah E. Mechanic, Genomic Epidemiology Branch, Epidemiology and Genomics Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, 9609 Medical Center Drive, Room 4E104, MSC 9763, Bethesda, MD 20892 (e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Gauderman WJ, Mukherjee B, Aschard H, Hsu L, Lewinger JP, Patel CJ, Witte JS, Amos C, Tai CG, Conti D, Torgerson DG, Lee S, Chatterjee N. Update on the State of the Science for Analytical Methods for Gene-Environment Interactions. Am J Epidemiol 2017; 186:762-770. [PMID: 28978192 PMCID: PMC5859988 DOI: 10.1093/aje/kwx228] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 04/24/2017] [Accepted: 04/25/2017] [Indexed: 12/14/2022] Open
Abstract
The analysis of gene-environment interaction (G×E) may hold the key for further understanding the etiology of many complex traits. The current availability of high-volume genetic data, the wide range in types of environmental data that can be measured, and the formation of consortiums of multiple studies provide new opportunities to identify G×E but also new analytical challenges. In this article, we summarize several statistical approaches that can be used to test for G×E in a genome-wide association study. These include traditional models of G×E in a case-control or quantitative trait study as well as alternative approaches that can provide substantially greater power. The latest methods for analyzing G×E with gene sets and with data in a consortium setting are summarized, as are issues that arise due to the complexity of environmental data. We provide some speculation on why detecting G×E in a genome-wide association study has thus far been difficult. We conclude with a description of software programs that can be used to implement most of the methods described in the paper.
Collapse
Affiliation(s)
- W. James Gauderman
- Correspondence to Dr. W. James Gauderman, Division of Biostatistics, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, 2001 North Soto Street, 202-K, Los Angeles, CA 90032 (e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Ritchie MD, Davis JR, Aschard H, Battle A, Conti D, Du M, Eskin E, Fallin MD, Hsu L, Kraft P, Moore JH, Pierce BL, Bien SA, Thomas DC, Wei P, Montgomery SB. Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 2017; 186:771-777. [PMID: 28978191 PMCID: PMC5860556 DOI: 10.1093/aje/kwx229] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 04/07/2017] [Accepted: 04/10/2017] [Indexed: 12/12/2022] Open
Abstract
A growing knowledge base of genetic and environmental information has greatly enabled the study of disease risk factors. However, the computational complexity and statistical burden of testing all variants by all environments has required novel study designs and hypothesis-driven approaches. We discuss how incorporating biological knowledge from model organisms, functional genomics, and integrative approaches can empower the discovery of novel gene-environment interactions and discuss specific methodological considerations with each approach. We consider specific examples where the application of these approaches has uncovered effects of gene-environment interactions relevant to drug response and immunity, and we highlight how such improvements enable a greater understanding of the pathogenesis of disease and the realization of precision medicine.
Collapse
Affiliation(s)
- Marylyn D. Ritchie
- Correspondence to Dr. Stephen B. Montgomery, Departments of Genetics and Pathology, Stanford University School of Medicine, Stanford, CA 94305 (e-mail: ); or Dr. Marylyn D. Ritchie, Geisinger Health System, 205 Hood Center for Health Research, Center Street, Danville, PA 17821(e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Stephen B. Montgomery
- Correspondence to Dr. Stephen B. Montgomery, Departments of Genetics and Pathology, Stanford University School of Medicine, Stanford, CA 94305 (e-mail: ); or Dr. Marylyn D. Ritchie, Geisinger Health System, 205 Hood Center for Health Research, Center Street, Danville, PA 17821(e-mail: )
| |
Collapse
|
40
|
Sharafeldin N, Slattery ML, Liu Q, Franco-Villalobos C, Caan BJ, Potter JD, Yasui Y. Multiple Gene-Environment Interactions on the Angiogenesis Gene-Pathway Impact Rectal Cancer Risk and Survival. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2017; 14:ijerph14101146. [PMID: 28956832 PMCID: PMC5664647 DOI: 10.3390/ijerph14101146] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2017] [Revised: 09/06/2017] [Accepted: 09/23/2017] [Indexed: 12/25/2022]
Abstract
Characterization of gene-environment interactions (GEIs) in cancer is limited. We aimed at identifying GEIs in rectal cancer focusing on a relevant biologic process involving the angiogenesis pathway and relevant environmental exposures: cigarette smoking, alcohol consumption, and animal protein intake. We analyzed data from 747 rectal cancer cases and 956 controls from the Diet, Activity and Lifestyle as a Risk Factor for Rectal Cancer study. We applied a 3-step analysis approach: first, we searched for interactions among single nucleotide polymorphisms on the pathway genes; second, we searched for interactions among the genes, both steps using Logic regression; third, we examined the GEIs significant at the 5% level using logistic regression for cancer risk and Cox proportional hazards models for survival. Permutation-based test was used for multiple testing adjustment. We identified 8 significant GEIs associated with risk among 6 genes adjusting for multiple testing: TNF (OR = 1.85, 95% CI: 1.10, 3.11), TLR4 (OR = 2.34, 95% CI: 1.38, 3.98), and EGR2 (OR = 2.23, 95% CI: 1.04, 4.78) with smoking; IGF1R (OR = 1.69, 95% CI: 1.04, 2.72), TLR4 (OR = 2.10, 95% CI: 1.22, 3.60) and EGR2 (OR = 2.12, 95% CI: 1.01, 4.46) with alcohol; and PDGFB (OR = 1.75, 95% CI: 1.04, 2.92) and MMP1 (OR = 2.44, 95% CI: 1.24, 4.81) with protein. Five GEIs were associated with survival at the 5% significance level but not after multiple testing adjustment: CXCR1 (HR = 2.06, 95% CI: 1.13, 3.75) with smoking; and KDR (HR = 4.36, 95% CI: 1.62, 11.73), TLR2 (HR = 9.06, 95% CI: 1.14, 72.11), EGR2 (HR = 2.45, 95% CI: 1.42, 4.22), and EGFR (HR = 6.33, 95% CI: 1.95, 20.54) with protein. GEIs between angiogenesis genes and smoking, alcohol, and animal protein impact rectal cancer risk. Our results support the importance of considering the biologic hypothesis to characterize GEIs associated with cancer outcomes.
Collapse
Affiliation(s)
- Noha Sharafeldin
- School of Public Health, University of Alberta, Edmonton, AB T6G 2R3, Canada.
- Department of Medicine, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA.
| | - Martha L Slattery
- Department of Internal Medicine, University of Utah Health Sciences Center, Salt Lake City, UT 84132, USA.
| | - Qi Liu
- School of Public Health, University of Alberta, Edmonton, AB T6G 2R3, Canada.
| | | | - Bette J Caan
- Division of Research, Kaiser Permanente Medical Care Program, Oakland, CA 94612, USA.
| | - John D Potter
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
- Department of Epidemiology, School of Public Health, University of Washington, Seattle, WA 98195, USA.
- Centre for Public Health Research, Massey University, P.O. Box 756, Wellington 6140, New Zealand.
| | - Yutaka Yasui
- School of Public Health, University of Alberta, Edmonton, AB T6G 2R3, Canada.
- Department of Epidemiology & Cancer Control, St. Jude Children's Research Hospital, Memphis, TN 38105, USA.
| |
Collapse
|
41
|
Winkler TW, Justice AE, Cupples LA, Kronenberg F, Kutalik Z, Heid IM. Approaches to detect genetic effects that differ between two strata in genome-wide meta-analyses: Recommendations based on a systematic evaluation. PLoS One 2017; 12:e0181038. [PMID: 28749953 PMCID: PMC5531538 DOI: 10.1371/journal.pone.0181038] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 06/26/2017] [Indexed: 12/13/2022] Open
Abstract
Genome-wide association meta-analyses (GWAMAs) conducted separately by two strata have identified differences in genetic effects between strata, such as sex-differences for body fat distribution. However, there are several approaches to identify such differences and an uncertainty which approach to use. Assuming the availability of stratified GWAMA results, we compare various approaches to identify between-strata differences in genetic effects. We evaluate type I error and power via simulations and analytical comparisons for different scenarios of strata designs and for different types of between-strata differences. For strata of equal size, we find that the genome-wide test for difference without any filtering is the best approach to detect stratum-specific genetic effects with opposite directions, while filtering for overall association followed by the difference test is best to identify effects that are predominant in one stratum. When there is no a priori hypothesis on the type of difference, a combination of both approaches can be recommended. Some approaches violate type I error control when conducted in the same data set. For strata of unequal size, the best approach depends on whether the genetic effect is predominant in the larger or in the smaller stratum. Based on real data from GIANT (>175 000 individuals), we exemplify the impact of the approaches on the detection of sex-differences for body fat distribution (identifying up to 10 loci). Our recommendations provide tangible guidelines for future GWAMAs that aim at identifying between-strata differences. A better understanding of such effects will help pinpoint the underlying mechanisms.
Collapse
Affiliation(s)
- Thomas W. Winkler
- Department of Genetic Epidemiology, University of Regensburg, Regensburg, Germany
| | - Anne E. Justice
- Department of Epidemiology, University of North Carolina, Chapel Hill, NC, United States of America
| | - L. Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, United States of America
- NHLBI Framingham Heart Study, Framingham, MA, United States of America
| | - Florian Kronenberg
- Division of Genetic Epidemiology, Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Zoltán Kutalik
- Institute of Social and Preventive Medicine, CHUV-UNIL, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Iris M. Heid
- Department of Genetic Epidemiology, University of Regensburg, Regensburg, Germany
- * E-mail:
| | | |
Collapse
|
42
|
Robotics in Lower-Limb Rehabilitation after Stroke. Behav Neurol 2017; 2017:3731802. [PMID: 28659660 PMCID: PMC5480018 DOI: 10.1155/2017/3731802] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 04/02/2017] [Accepted: 04/10/2017] [Indexed: 12/02/2022] Open
Abstract
With the increase in the elderly, stroke has become a common disease, often leading to motor dysfunction and even permanent disability. Lower-limb rehabilitation robots can help patients to carry out reasonable and effective training to improve the motor function of paralyzed extremity. In this paper, the developments of lower-limb rehabilitation robots in the past decades are reviewed. Specifically, we provide a classification, a comparison, and a design overview of the driving modes, training paradigm, and control strategy of the lower-limb rehabilitation robots in the reviewed literature. A brief review on the gait detection technology of lower-limb rehabilitation robots is also presented. Finally, we discuss the future directions of the lower-limb rehabilitation robots.
Collapse
|
43
|
A combination test for detection of gene-environment interaction in cohort studies. Genet Epidemiol 2017; 41:396-412. [DOI: 10.1002/gepi.22043] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 02/06/2017] [Accepted: 02/06/2017] [Indexed: 12/24/2022]
|
44
|
Burghardt LT, Young ND, Tiffin P. A Guide to Genome-Wide Association Mapping in Plants. ACTA ACUST UNITED AC 2017; 2:22-38. [PMID: 31725973 DOI: 10.1002/cppb.20041] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Genome-wide association studies (GWAS) have developed into a valuable approach for identifying the genetic basis of phenotypic variation. In this article, we provide an overview of the design, analysis, and interpretation of GWAS. First, we present results from simulations that explore key elements of experimental design as well as considerations for collecting the relevant genomic and phenotypic data. Next, we outline current statistical methods and tools used for GWA analyses and discuss the inclusion of covariates to account for population structure and the interpretation of results. Given that many false positive associations will occur in any GWA analysis, we highlight strategies for prioritizing GWA candidates for further statistical and empirical validation. While focused on plants, the material we cover is also applicable to other systems. © 2017 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Liana T Burghardt
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota
| | - Nevin D Young
- Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota
| |
Collapse
|
45
|
Sheng X, Xie W, Zhou Y. Analysis of genotype effects for the immunosuppression via two-step method. BIO WEB OF CONFERENCES 2017. [DOI: 10.1051/bioconf/20170802010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
46
|
Unsupervised gene set testing based on random matrix theory. BMC Bioinformatics 2016; 17:442. [PMID: 27809777 PMCID: PMC5096314 DOI: 10.1186/s12859-016-1299-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 10/21/2016] [Indexed: 11/10/2022] Open
Abstract
Background Results Conclusions Electronic supplementary material
Collapse
|
47
|
Frost HR, Amos CI, Moore JH. A global test for gene-gene interactions based on random matrix theory. Genet Epidemiol 2016; 40:689-701. [PMID: 27386793 PMCID: PMC5132142 DOI: 10.1002/gepi.21990] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Revised: 05/04/2016] [Accepted: 06/05/2016] [Indexed: 11/29/2022]
Abstract
Statistical interactions between markers of genetic variation, or gene-gene interactions, are believed to play an important role in the etiology of many multifactorial diseases and other complex phenotypes. Unfortunately, detecting gene-gene interactions is extremely challenging due to the large number of potential interactions and ambiguity regarding marker coding and interaction scale. For many data sets, there is insufficient statistical power to evaluate all candidate gene-gene interactions. In these cases, a global test for gene-gene interactions may be the best option. Global tests have much greater power relative to multiple individual interaction tests and can be used on subsets of the markers as an initial filter prior to testing for specific interactions. In this paper, we describe a novel global test for gene-gene interactions, the global epistasis test (GET), that is based on results from random matrix theory. As we show via simulation studies based on previously proposed models for common diseases including rheumatoid arthritis, type 2 diabetes, and breast cancer, our proposed GET method has superior performance characteristics relative to existing global gene-gene interaction tests. A glaucoma GWAS data set is used to demonstrate the practical utility of the GET method.
Collapse
Affiliation(s)
- H. Robert Frost
- Department of Biomedical Data ScienceGeisel School of Medicine, Dartmouth CollegeHanoverNew HampshireUnited States of America
| | - Christopher I. Amos
- Department of Biomedical Data ScienceGeisel School of Medicine, Dartmouth CollegeHanoverNew HampshireUnited States of America
| | - Jason H. Moore
- Division of InformaticsDepartment of Biostatistics and EpidemiologyInstitute for Biomedical InformaticsPerelman School of Medicine, University of PennsylvaniaPhiladelphiaPennsylvaniaUnited States of America
| |
Collapse
|
48
|
Liu Y, Chen Y, Scheet P. A meta-analytic framework for detection of genetic interactions. Genet Epidemiol 2016; 40:534-543. [PMID: 27528046 PMCID: PMC5081230 DOI: 10.1002/gepi.21996] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 06/16/2016] [Accepted: 07/14/2016] [Indexed: 12/27/2022]
Abstract
With varying, but substantial, proportions of heritability remaining unexplained by summaries of single-SNP genetic variation, there is a demand for methods that extract maximal information from genetic association studies. One source of variation that is difficult to assess is genetic interactions. A major challenge for naive detection methods is the large number of possible combinations, with a requisite need to correct for multiple testing. Assumptions of large marginal effects, to reduce the search space, may be restrictive and miss higher order interactions with modest marginal effects. In this paper, we propose a new procedure for detecting gene-by-gene interactions through heterogeneity in estimated low-order (e.g., marginal) effect sizes by leveraging population structure, or ancestral differences, among studies in which the same phenotypes were measured. We implement this approach in a meta-analytic framework, which offers numerous advantages, such as robustness and computational efficiency, and is necessary when data-sharing limitations restrict joint analysis. We effectively apply a dimension reduction procedure that scales to allow searches for higher order interactions. For comparison to our method, which we term phylogenY-aware Effect-size Tests for Interactions (YETI), we adapt an existing method that assumes interacting loci will exhibit strong marginal effects to our meta-analytic framework. As expected, YETI excels when multiple studies are from highly differentiated populations and maintains its superiority in these conditions even when marginal effects are small. When these conditions are less extreme, the advantage of our method wanes. We assess the Type-I error and power characteristics of complementary approaches to evaluate their strengths and limitations.
Collapse
Affiliation(s)
- Yulun Liu
- Division of Biostatistics, The University of Texas School of Public Health, Houston, TX 77030, USA
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Yong Chen
- Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Paul Scheet
- Division of Biostatistics, The University of Texas School of Public Health, Houston, TX 77030, USA
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
- The University of Texas Graduate School of Biomedical Sciences, Houston, TX 77030, USA
| |
Collapse
|
49
|
Gupta J, Johansson E, Bernstein JA, Chakraborty R, Khurana Hershey GK, Rothenberg ME, Mersha TB. Resolving the etiology of atopic disorders by using genetic analysis of racial ancestry. J Allergy Clin Immunol 2016; 138:676-699. [PMID: 27297995 PMCID: PMC5014679 DOI: 10.1016/j.jaci.2016.02.045] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Revised: 02/09/2016] [Accepted: 02/25/2016] [Indexed: 12/23/2022]
Abstract
Atopic dermatitis (AD), food allergy, allergic rhinitis, and asthma are common atopic disorders of complex etiology. The frequently observed atopic march from early AD to asthma, allergic rhinitis, or both later in life and the extensive comorbidity of atopic disorders suggest common causal mechanisms in addition to distinct ones. Indeed, both disease-specific and shared genomic regions exist for atopic disorders. Their prevalence also varies among races; for example, AD and asthma have a higher prevalence in African Americans when compared with European Americans. Whether this disparity stems from true genetic or race-specific environmental risk factors or both is unknown. Thus far, the majority of the genetic studies on atopic diseases have used populations of European ancestry, limiting their generalizability. Large-cohort initiatives and new analytic methods, such as admixture mapping, are currently being used to address this knowledge gap. Here we discuss the unique and shared genetic risk factors for atopic disorders in the context of ancestry variations and the promise of high-throughput "-omics"-based systems biology approach in providing greater insight to deconstruct their genetic and nongenetic etiologies. Future research will also focus on deep phenotyping and genotyping of diverse racial ancestry, gene-environment, and gene-gene interactions.
Collapse
Affiliation(s)
- Jayanta Gupta
- Division of Asthma Research, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, Ohio
| | - Elisabet Johansson
- Division of Asthma Research, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, Ohio
| | - Jonathan A Bernstein
- Division of Immunology/Allergy Section, Department of Internal Medicine, University of Cincinnati, Cincinnati, Ohio
| | - Ranajit Chakraborty
- Center for Computational Genomics, Institute of Applied Genetics, Department of Molecular and Medical Genetics, University of North Texas Health Science Center, Fort Worth, Tex
| | - Gurjit K Khurana Hershey
- Division of Asthma Research, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, Ohio
| | - Marc E Rothenberg
- Division of Allergy and Immunology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, Ohio
| | - Tesfaye B Mersha
- Division of Asthma Research, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, Ohio.
| |
Collapse
|
50
|
Frost HR, Shen L, Saykin AJ, Williams SM, Moore JH. Identifying significant gene-environment interactions using a combination of screening testing and hierarchical false discovery rate control. Genet Epidemiol 2016; 40:544-557. [PMID: 27578615 PMCID: PMC5108431 DOI: 10.1002/gepi.21997] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 07/01/2016] [Accepted: 07/17/2016] [Indexed: 12/27/2022]
Abstract
Although gene‐environment (G× E) interactions play an important role in many biological systems, detecting these interactions within genome‐wide data can be challenging due to the loss in statistical power incurred by multiple hypothesis correction. To address the challenge of poor power and the limitations of existing multistage methods, we recently developed a screening‐testing approach for G× E interaction detection that combines elastic net penalized regression with joint estimation to support a single omnibus test for the presence of G× E interactions. In our original work on this technique, however, we did not assess type I error control or power and evaluated the method using just a single, small bladder cancer data set. In this paper, we extend the original method in two important directions and provide a more rigorous performance evaluation. First, we introduce a hierarchical false discovery rate approach to formally assess the significance of individual G× E interactions. Second, to support the analysis of truly genome‐wide data sets, we incorporate a score statistic‐based prescreening step to reduce the number of single nucleotide polymorphisms prior to fitting the first stage penalized regression model. To assess the statistical properties of our method, we compare the type I error rate and statistical power of our approach with competing techniques using both simple simulation designs as well as designs based on real disease architectures. Finally, we demonstrate the ability of our approach to identify biologically plausible SNP‐education interactions relative to Alzheimer's disease status using genome‐wide association study data from the Alzheimer's Disease Neuroimaging Initiative (ADNI).
Collapse
Affiliation(s)
- H Robert Frost
- Departments of Biomedical Data Science and Genetics, Institute for Quantitative Biomedical Sciences, Geisel School of Medicine, Dartmouth College, Hanover, NH 03755, USA.
| | - Li Shen
- Center for Neuroimaging and Indiana Alzheimer's Disease Center, Department of Radiology and Imaging Sciences, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Andrew J Saykin
- Center for Neuroimaging and Indiana Alzheimer's Disease Center, Department of Radiology and Imaging Sciences, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Scott M Williams
- Departments of Biomedical Data Science and Genetics, Institute for Quantitative Biomedical Sciences, Geisel School of Medicine, Dartmouth College, Hanover, NH 03755, USA
| | - Jason H Moore
- Departments of Biomedical Data Science and Genetics, Institute for Quantitative Biomedical Sciences, Geisel School of Medicine, Dartmouth College, Hanover, NH 03755, USA.,Division of Informatics, Department of Biostatistics and Epidemiology, Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104-6021, USA
| | | |
Collapse
|