1
|
Patchen BK, Zhang J, Gaddis N, Bartz TM, Chen J, Debban C, Leonard H, Nguyen NQ, Seo J, Tern C, Allen R, DeMeo DL, Fornage M, Melbourne C, Minto M, Moll M, O'Connor G, Pottinger T, Psaty BM, Rich SS, Rotter JI, Silverman EK, Stratford J, Graham Barr R, Cho MH, Gharib SA, Manichaikul A, North K, Oelsner EC, Simonsick EM, Tobin MD, Yu B, Choi SH, Dupuis J, Cassano PA, Hancock DB. Multi-ancestry genome-wide association study reveals novel genetic signals for lung function decline. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.11.25.24317794. [PMID: 39649580 PMCID: PMC11623738 DOI: 10.1101/2024.11.25.24317794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Rationale Accelerated decline in lung function contributes to the development of chronic respiratory disease. Despite evidence for a genetic component, few genetic associations with lung function decline have been identified. Objectives To evaluate genome-wide associations and putative downstream functionality of genetic variants with lung function decline in diverse general population cohorts. Methods We conducted genome-wide association study (GWAS) analyses of decline in the forced expiratory volume in the first second (FEV1), forced vital capacity (FVC), and their ratio (FEV1/FVC) in participants across six cohorts in the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium and the UK Biobank. Genotypes were imputed to TOPMed (CHARGE cohorts) or Haplotype Reference Consortium (HRC) (UK Biobank) reference panels, and GWAS analyses used generalized estimating equation models with robust standard error. Models were stratified by cohort, ancestry, and sex, and adjusted for important lung function confounders and genotype principal components. Results were combined in cross-ancestry and ancestry-specific meta-analyses. Selected top variants were tested for replication in two independent COPD-enriched cohorts. Measurements and Main Results Our discovery analyses included 52,056 self-reported White (N=44,988), Black (N=5,788), Hispanic (N=550), and Chinese American (N=730) participants with a mean of 2.3 spirometry measurements and 8.6 years of follow-up. Functional mapping of GWAS meta-analysis results identified 361 distinct genome-wide significant (p<5E-08) variants in one or more of the FEV1, FVC, and FEV1/FVC decline phenotypes, which overlapped with previously reported genetic signals for several related pulmonary traits. Of these, 8 variants, or 20.5% of the variant set available for replication testing, were nominally associated (p<0.05) with at least one decline phenotype in COPD-enriched cohorts (White [N=4,778] and Black [N=1,118]). Using the GWAS results, gene-level analysis implicated 38 genes, including eight (XIRP2, GRIN2D, SATB1, MARCHF4, SIPA1L2, ANO5, H2BC10, and FAF2) with consistent associations across ancestries or decline phenotypes. Annotation class analysis revealed significant enrichment of several regulatory processes for corticosteroid biosynthesis and metabolism. Drug repurposing analysis identified 43 approved compounds targeting eight of the implicated 38 genes. Conclusions Our multi-ancestry GWAS meta-analyses identified numerous genetic loci associated with lung function decline. These findings contribute knowledge to the genetic architecture of lung function decline, provide evidence for a role of endogenous corticosteroids in the etiology of lung function decline, and identify drug targets that merit further study for potential repurposing to slow lung function decline and treat lung disease.
Collapse
Affiliation(s)
- Bonnie K Patchen
- Division of Nutritional Sciences, Cornell University
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
- Harvard Medical School, Boston, MA
| | - Jingwen Zhang
- Boston University School of Public Health, Boston, MA
| | | | - Traci M Bartz
- Cardiovascular Health Research Unit, Departments of Biostatistics, Medicine, Epidemiology, Health Systems and Population Health, University of Washington, Seattle, WA
| | - Jing Chen
- Department of Population Health Sciences, University of Leicester, Leicester, UK
| | - Catherine Debban
- Department of Genome Sciences, University of Virginia School of Medicine, Charlottesville, VA
| | - Hampton Leonard
- Laboratory of Neurogenetics, National Institute of Aging, National Institute of Health, Bethesda, MD
| | - Ngoc Quynh Nguyen
- School of Public Health, University of Texas Health Science Center, Houston, TX
| | - Jungkun Seo
- Department of Epidemiology, University of North Carolina, Chapel Hill, NC
- Department of MetaBioHealth, Sungkyunkwan University (SKKU), Suwon, Republic of Korea
| | - Courtney Tern
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
| | - Richard Allen
- College of Life Sciences, University of Leicester, Leicester, UK
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
- Harvard Medical School, Boston, MA
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center, Houston, TX
| | - Carl Melbourne
- Department of Population Health Sciences, University of Leicester, Leicester, UK
- UK Biobank, Ltd., Stockport, UK
| | | | - Matthew Moll
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
- Harvard Medical School, Boston, MA
| | | | - Tess Pottinger
- Institute for Genomic Medicine, Columbia University Irving Medical Center, New York, NY
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Departments of Biostatistics, Medicine, Epidemiology, Health Systems and Population Health, University of Washington, Seattle, WA
| | - Stephen S Rich
- Department of Genome Sciences, University of Virginia School of Medicine, Charlottesville, VA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
| | - Edwin K Silverman
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
- Harvard Medical School, Boston, MA
| | | | - R Graham Barr
- Department of Medicine, Columbia University College of Physicians and Surgeons, New York, NY
| | - Michael H Cho
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA
- Harvard Medical School, Boston, MA
| | - Sina A Gharib
- Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington, Seattle, WA
| | - Ani Manichaikul
- Department of Genome Sciences, University of Virginia School of Medicine, Charlottesville, VA
| | - Kari North
- Department of Epidemiology, University of North Carolina, Chapel Hill, NC
| | - Elizabeth C Oelsner
- Department of Medicine, Columbia University College of Physicians and Surgeons, New York, NY
| | | | - Martin D Tobin
- Department of Population Health Sciences, University of Leicester, Leicester, UK
| | - Bing Yu
- School of Public Health, University of Texas Health Science Center, Houston, TX
| | | | - Josee Dupuis
- Boston University School of Public Health, Boston, MA
- Department of Epidemiology, Biostatistics and Occupational Health, School of Population and Global Health, McGill University, Montréal, Québec
| | - Patricia A Cassano
- Division of Nutritional Sciences, Cornell University
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY
| | | |
Collapse
|
2
|
Shin JE, Shin N, Park T, Park M. Multipartite network analysis to identify environmental and genetic associations of metabolic syndrome in the Korean population. Sci Rep 2024; 14:20283. [PMID: 39217223 PMCID: PMC11366034 DOI: 10.1038/s41598-024-71217-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 08/26/2024] [Indexed: 09/04/2024] Open
Abstract
Network analysis has become a crucial tool in genetic research, enabling the exploration of associations between genes and diseases. Its utility extends beyond genetics to include the assessment of environmental factors. Unipartite network analysis is commonly used in genomics to visualize initial insights and relationships among variables. Syndromic diseases, such as metabolic syndrome, are characterized by the simultaneous occurrence of various signs, symptoms, and clinicopathological features. Metabolic syndrome encompasses hypertension, diabetes, obesity, and dyslipidemia, and both genetic and environmental factors contribute to its development. Given that relevant data often consist of distinct sets of variables, a more intuitive visualization method is needed. This study applied multipartite network analysis as an effective method to understand the associations among genetic, environmental, and disease components in syndromic diseases. We considered three distinct variable sets: genetic factors, environmental factors, and disease components. The process involved projecting a tripartite network onto a two-mode bipartite network and then simplifying it into a one-mode network. This approach facilitated the visualization of relationships among factors across different sets and within individual sets. To transition from multipartite to unipartite networks, we suggest both sequential and concurrent projection methods. Data from the Korean Association Resource (KARE) project were utilized, including 352,228 SNPs from 8840 individuals, alongside information on environmental factors such as lifestyle, dietary, and socioeconomic factors. The single-SNP analysis step filtered SNPs, supplemented by reference SNPs reported in a genome-wide association study catalog. The resulting network patterns differed significantly by sex: demographic factors and fat intake were crucial for women, while alcohol consumption was central for men. Indirect relationships were identified through projected bipartite networks, revealing that SNPs such as rs4244457, rs2156552, and rs10899345 had lifestyle interactions on metabolic components. Our approach offers several advantages: it simplifies the visualization of complex relationships among different datasets, identifies environmental interactions, and provides insights into SNP clusters sharing common environmental factors and metabolic components. This framework provides a comprehensive approach to elucidate the mechanisms underlying complex diseases like metabolic syndrome.
Collapse
Affiliation(s)
- Ji-Eun Shin
- Department of Biomedical Informatics, Konyang University, Daejeon, Republic of Korea
| | - Nari Shin
- Department of Statistics, Korea University, Seoul, Republic of Korea
| | - Taesung Park
- Department of Statistics, Seoul National University, Seoul, Republic of Korea
| | - Mira Park
- Department of Preventive Medicine, Eulji University, Daejeon, Republic of Korea.
| |
Collapse
|
3
|
Lee M, Park T, Shin JY, Park M. A comprehensive multi-task deep learning approach for predicting metabolic syndrome with genetic, nutritional, and clinical data. Sci Rep 2024; 14:17851. [PMID: 39090161 PMCID: PMC11294629 DOI: 10.1038/s41598-024-68541-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 07/24/2024] [Indexed: 08/04/2024] Open
Abstract
Metabolic syndrome (MetS) is a complex disorder characterized by a cluster of metabolic abnormalities, including abdominal obesity, hypertension, elevated triglycerides, reduced high-density lipoprotein cholesterol, and impaired glucose tolerance. It poses a significant public health concern, as individuals with MetS are at an increased risk of developing cardiovascular diseases and type 2 diabetes. Early and accurate identification of individuals at risk for MetS is essential. Various machine learning approaches have been employed to predict MetS, such as logistic regression, support vector machines, and several boosting techniques. However, these methods use MetS as a binary status and do not consider that MetS comprises five components. Therefore, a method that focuses on these characteristics of MetS is needed. In this study, we propose a multi-task deep learning model designed to predict MetS and its five components simultaneously. The benefit of multi-task learning is that it can manage multiple tasks with a single model, and learning related tasks may enhance the model's predictive performance. To assess the efficacy of our proposed method, we compared its performance with that of several single-task approaches, including logistic regression, support vector machine, CatBoost, LightGBM, XGBoost and one-dimensional convolutional neural network. For the construction of our multi-task deep learning model, we utilized data from the Korean Association Resource (KARE) project, which includes 352,228 single nucleotide polymorphisms (SNPs) from 7729 individuals. We also considered lifestyle, dietary, and socio-economic factors that affect chronic diseases, in addition to genomic data. By evaluating metrics such as accuracy, precision, F1-score, and the area under the receiver operating characteristic curve, we demonstrate that our multi-task learning model surpasses traditional single-task machine learning models in predicting MetS.
Collapse
Affiliation(s)
- Minhyuk Lee
- Department of Statistics, Korea University, Seoul, Republic of Korea
| | - Taesung Park
- Department of Statistics, Seoul National University, Seoul, Republic of Korea
| | - Ji-Yeon Shin
- Department of Preventive Medicine, School of Medicine, Kyungpook National University, Daegu, Republic of Korea.
| | - Mira Park
- Department of Preventive Medicine, School of Medicine, Eulji University, Daejeon, Republic of Korea.
| |
Collapse
|
4
|
Lee HA, Park H, Park B. Genetic predisposition, lifestyle inflammation score, food-based dietary inflammatory index, and the risk for incident diabetes: Findings from the KoGES data. Nutr Metab Cardiovasc Dis 2024; 34:642-650. [PMID: 38161120 DOI: 10.1016/j.numecd.2023.10.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 09/21/2023] [Accepted: 10/22/2023] [Indexed: 01/03/2024]
Abstract
BACKGROUND AND AIMS We investigated whether genetic predisposition, the Lifestyle Inflammation Score (LIS), or the Food-based Dietary Inflammatory Index (FDII) were associated with diabetes incidence and whether these factors interact. METHODS AND RESULTS The study was conducted using population-based cohort data derived from the Korean Genome and Epidemiology Study, which included 6568 people aged 40-69 years. Based on 25 genetic variants related to diabetes, genetic risk scores (GRSs) were determined and LISs and FDIIs were calculated and stratified into quartiles. We investigated the effects of gene-lifestyle interactions on the incident diabetes. The multivariate Cox proportional hazard model was used to generate hazard ratios with 95 % CIs. During the 16-year follow-up period, diabetes incidence was 13.6 per 1000 person-years. A dose-response association with diabetes was observed for both GRS and LIS quartiles but not for FDII quartiles. The GRS and LIS were also independently associated with diabetes incidence in a multivariate model. Compared to the bottom quartile, the top LIS quartile and the top GRS quartile had a 2.4-fold (95 % CI, 2.0-2.8) and a 1.4-fold (95 % CI, 1.2-1.7) higher diabetes risk, respectively. However, the FDII exhibited null association. When each genetic variant was evaluated, the top versus bottom LIS quartiles exhibited heterogeneous diabetes risks for rs560887 within G6PC2, rs7072268 within HK1, and rs837763 within CDT1; however, these differences were not statistically significant in multiple comparison. CONCLUSION Both GRS and LIS factors independently affect the incident diabetes, but their interaction effect showed insignificant association. Therefore, regardless of genetic susceptibility, more effort is needed to lower the risk for diabetes by improving lifestyle behaviors.
Collapse
Affiliation(s)
- Hye Ah Lee
- Clinical Trial Center, Ewha Womans University Mokdong Hospital, Seoul, Republic of Korea.
| | - Hyesook Park
- Department of Preventive Medicine, College of Medicine, Ewha Womans University, Seoul, Republic of Korea; Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, Republic of Korea
| | - Bomi Park
- Department of Preventive Medicine, College of Medicine, Chung-Ang University, Seoul, Republic of Korea
| |
Collapse
|
5
|
Li D, Kim W, An J, Kim S, Lee S, Do A, Kim W, Lee S, Yoon D, Lee K, Ha S, Silverman EK, Cho M, Shin C, Won S. Heritability Analyses Uncover Shared Genetic Effects of Lung Function and Change over Time. Genes (Basel) 2022; 13:genes13071261. [PMID: 35886044 PMCID: PMC9316642 DOI: 10.3390/genes13071261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 07/05/2022] [Accepted: 07/08/2022] [Indexed: 01/27/2023] Open
Abstract
Genetic influence on lung functions has been identified in previous studies; however, the relative longitudinal effects of genetic factors and their interactions with smoking on lung function remain unclear. Here, we identified the longitudinal effects of genetic variants on lung function by determining single nucleotide polymorphism (SNP) heritability and genetic correlations, and by analyzing interactions with smoking. Subject-specific means and annual change rates were calculated for eight spirometric measures obtained from 6622 Korean adults aged 40−69 years every two years for 14 years, and their heritabilities were estimated separately. Statistically significant (p < 0.05) heritability for the subject-specific means of all spirometric measures (8~32%) and change rates of forced expiratory volume in 1 s to forced vital capacity ratio (FEV1/FVC; 16%) and post-bronchodilator FEV1/FVC (17%) were detected. Significant genetic correlations of the change rate with the subject-specific mean were observed for FEV1/FVC (ρg = 0.64) and post-bronchodilator FEV1/FVC (ρg = 0.47). Furthermore, post-bronchodilator FEV1/FVC showed significant heritability of SNP-by-smoking interaction (hGXS2 = 0.4) for the annual change rate. The GWAS also detected genome-wide significant SNPs for FEV1 (rs4793538), FEV1/FVC (rs2704589, rs62201158, and rs9391733), and post-bronchodilator FEV1/FVC (rs2445936). We found statistically significant evidence of heritability role on the change in lung function, and this was shared with the effects on cross-sectional measurements. We also found some evidence of interaction with smoking for the change of lung function.
Collapse
Affiliation(s)
- Donghe Li
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea; (D.L.); (A.D.)
- Department of Medicine (Biomedical Genetics), Boston University School of Medicine, Boston, MA 02118, USA
| | - Woojin Kim
- Department of Internal Medicine and Environmental Health Center, School of Medicine, Kangwon National University, Chuncheon 24341, Korea;
| | - Jahoon An
- Department of Public Health Science, Graduate School of Public Health, Seoul National University, Seoul 08826, Korea;
| | - Soriul Kim
- Institute for Human Genomic Study, College of Medicine, Korea University, Seoul 136701, Korea; (S.K.); (S.L.)
| | - Seungku Lee
- Institute for Human Genomic Study, College of Medicine, Korea University, Seoul 136701, Korea; (S.K.); (S.L.)
| | - Ahra Do
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea; (D.L.); (A.D.)
| | - Wonji Kim
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA; (W.K.); (E.K.S.); (M.C.)
| | - Sanghun Lee
- Department of Medical Consilience, Graduate School, Dankook University, Yongin 16890, Korea;
| | - Dankyu Yoon
- Division of Allergy and Respiratory Disease Research, Department of Chronic Disease Convergence Research, National Institute of Health, Korea Disease Control and Prevention Agency, Cheongju 28159, Korea;
| | - Kwangbae Lee
- Korea Medical Institute, Seoul 03173, Korea; (K.L.); (S.H.)
| | - Seounguk Ha
- Korea Medical Institute, Seoul 03173, Korea; (K.L.); (S.H.)
| | - Edwin K. Silverman
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA; (W.K.); (E.K.S.); (M.C.)
| | - Michael Cho
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA; (W.K.); (E.K.S.); (M.C.)
| | - Chol Shin
- Institute for Human Genomic Study, College of Medicine, Korea University, Seoul 136701, Korea; (S.K.); (S.L.)
- Division of Pulmonary Sleep and Critical Care Medicine, Department of Internal Medicine, Korea University Ansan Hospital, Ansan 15355, Korea
- Correspondence: (C.S.); (S.W.)
| | - Sungho Won
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, Korea; (D.L.); (A.D.)
- Department of Public Health Science, Graduate School of Public Health, Seoul National University, Seoul 08826, Korea;
- Institute of Health and Environment, Seoul National University, Seoul 08826, Korea
- RexSoft Inc., Seoul 08826, Korea
- Correspondence: (C.S.); (S.W.)
| |
Collapse
|