1
|
Fathy W, Emeriaud G, Cheriet F. A comprehensive review of ICU readmission prediction models: From statistical methods to deep learning approaches. Artif Intell Med 2025; 165:103126. [PMID: 40300338 DOI: 10.1016/j.artmed.2025.103126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 10/04/2024] [Accepted: 03/29/2025] [Indexed: 05/01/2025]
Abstract
The prediction of Intensive Care Unit (ICU) readmission has become a crucial area of research due to the increasing demand for ICU resources and the need to provide timely interventions to critically ill patients. In recent years, several studies have explored the use of statistical, machine learning (ML), and deep learning (DL) models to predict ICU readmission. This review paper presents an extensive overview of these studies and discusses the challenges associated with ICU readmission prediction. We categorize the studies based on the type of model used and evaluate their strengths and limitations. We also discuss the performance metrics used to evaluate the models and their potential clinical applications. In addition, this review explores current methodologies, data usage, and recent advances in interpretability and explainable AI for medical applications, offering insights to guide future research and development in this field. Finally, we identify gaps in the current literature and provide recommendations for future research. Recent advances like ML and DL have moderately improved the prediction of the risk of ICU readmission. However, more progress is needed to reach the precision required to build computerized decision support tools.
Collapse
Affiliation(s)
- Waleed Fathy
- Department of Computer and Software Engineering, Polytechnique Montréal, Montreal, Quebec, Canada; Department of Electronic and Communication Engineering, Zagazig Univeristy, Zagazig, Sharkia, Egypt.
| | - Guillaume Emeriaud
- Department of Pediatrics, CHU Sainte-Justine, Université de Montréal, Montreal, Quebec, Canada.
| | - Farida Cheriet
- Department of Computer and Software Engineering, Polytechnique Montréal, Montreal, Quebec, Canada.
| |
Collapse
|
2
|
Yan C, Grabowska ME, Thakkar R, Dickson AL, Embí PJ, Feng Q, Denny JC, Kerchberger VE, Malin BA, Wei WQ. Beyond Phecodes: leveraging PheMAP to identify patients lacking diagnosis codes in electronic health records. J Am Med Inform Assoc 2025; 32:1007-1014. [PMID: 40156924 DOI: 10.1093/jamia/ocaf055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Revised: 02/16/2025] [Accepted: 03/18/2025] [Indexed: 04/01/2025] Open
Abstract
OBJECTIVE Diagnosis codes documented in electronic health records (EHR) are often relied upon to clinically phenotype patients for biomedical research. However, these diagnoses can be incomplete and inaccurate, leading to false negatives when searching for patients with phenotypes of interest. This study aims to determine whether PheMAP, a comprehensive knowledgebase integrating multiple clinical terminologies beyond diagnosis to capture phenotypes, can effectively identify patients lacking relevant EHR diagnosis codes. MATERIALS AND METHODS We investigated a collection of 3.5 million patient records from Vanderbilt University Medical Center's EHR and focused on 4 well-studied phenotypes: (1) type 2 diabetes mellitus (T2DM), (2) dementia, (3) prostate cancer, and (4) sensorineural hearing loss. We applied PheMAP to match structured concepts in patient records and calculated a phenotype risk score (PheScore) to indicate patient-phenotype similarity. Patients meeting predefined PheScore criteria but lacking diagnosis codes were identified. Clinically knowledgeable experts adjudicated randomly selected patients per phenotype as Positive, Possibly Positive, or Negative. RESULTS Our approach indicated that 5.3% of patients lacked a diagnosis for T2DM, 4.5% for dementia, 2.2% for prostate cancer, and 0.2% for sensorineural hearing loss. The expert review indicated 100% precision (for Possibly Positive or Positive cases) for dementia and sensorineural hearing loss, and 90.0% and 85.0% precision for T2DM and prostate cancer, respectively. Excluding Possibly Positive cases, the precision for T2DM and prostate cancer was 88.9% and 81.3%, respectively. CONCLUSIONS Leveraging clinical terminologies incorporated by PheMAP can effectively identify patients with phenotypes who lack EHR diagnosis codes, thereby enhancing phenotyping quality and related research reliability.
Collapse
Affiliation(s)
- Chao Yan
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Monika E Grabowska
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Rut Thakkar
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Alyson L Dickson
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Peter J Embí
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - QiPing Feng
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Joshua C Denny
- All of United States Research Program, National Institute of Health, Bethesda, MD 20892, United States
| | - Vern Eric Kerchberger
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Bradley A Malin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Computer Science, Vanderbilt University, Nashville, TN 37203, United States
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Computer Science, Vanderbilt University, Nashville, TN 37203, United States
| |
Collapse
|
3
|
Reddy IA, Han L, Sanchez-Roige S, Niarchou M, Ruderfer DM, Davis LK. Identification of Transdiagnostic Childhood Externalizing Pathology Within an Electronic Medical Records Database and Application to the Analysis of Rare Copy Number Variation. Am J Med Genet B Neuropsychiatr Genet 2025; 198:e33020. [PMID: 39744833 PMCID: PMC12048253 DOI: 10.1002/ajmg.b.33020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 11/18/2024] [Accepted: 12/18/2024] [Indexed: 01/19/2025]
Abstract
Externalizing traits and behaviors are broadly defined by impairments in self-regulation and impulse control that typically begin in childhood and adolescence. Externalizing behaviors, traits, and symptoms span a range of traditional psychiatric diagnostic categories. In this study, we sought to generate an algorithm that could reliably identify transdiagnostic childhood-onset externalizing cases and controls within a university hospital electronic health record (EHR) database. Within the Vanderbilt University Medical Center (VUMC) EHR, our algorithm identified cases with a clinician-validated positive predictive value of 90% and controls with a negative predictive value of 88%. In individuals of genetically defined European ancestry (CEU-clustered; Ncase = 487, Ncontrol = 5638), case status was significantly associated with psychiatric comorbidity and with elevated externalizing polygenic scores (OR: 1.20; 95% CI: 1.09-1.33; p = 1.14 × 10-3; based on published genome-wide association data). To test whether our cohort definitions could be applied to generate novel genetic insights, we examined rare (allele frequency < 0.5%) copy number variation. An association (OR: 9.70; CI: 3.24-29.0) was identified in the CEU-clustered cohort on chromosome 2 (chr2: 45,408,678-45,551,530; duplication), although the statistical strength of this association was modest (p = 0.052). We also examined the role of an externalizing burden score based on the number of externalizing diagnoses present in cases and found similar results to our case-control analysis. This analysis identified several other statistically significant CNV region associations. This study provides a framework for identifying childhood externalizing case-control cohorts within an EHR. Future work should validate this framework within other health systems. A broadly applicable algorithm, like this one, may allow for detection of rare outcomes or outcomes in populations historically excluded from genomic research through meta-analysis of data across health care systems.
Collapse
Affiliation(s)
- India A Reddy
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Lide Han
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Sandra Sanchez-Roige
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Psychiatry, University of California San Diego, La Jolla, California, USA
- Institute for Genomic Medicine, University of California San Diego, La Jolla, California, USA
| | - Maria Niarchou
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Douglas M Ruderfer
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Lea K Davis
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, USA
- Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| |
Collapse
|
4
|
Lake AM, Reddy IA, Havranek R, Davis LK, Fox J. Clinical Characteristics associated with functional seizures in individuals with psychosis. Schizophr Res 2025; 281:209-215. [PMID: 40398098 DOI: 10.1016/j.schres.2025.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Revised: 05/06/2025] [Accepted: 05/12/2025] [Indexed: 05/23/2025]
Abstract
BACKGROUND AND HYPOTHESIS Functional seizures (FS) are episodes characterized by seizure-like events that are not caused by hypersynchronous neuronal activity. Prior studies have suggested an increased prevalence of psychotic disorders among patients with FS, but results have been inconsistent. We hypothesize that FS are associated with psychosis and that among patients with psychosis, the presence of FS may influence patient clinical characteristics, mortality, and medical resource utilization. STUDY DESIGN The association between FS and psychosis was assessed using electronic health records data from a total of 761,848 individuals receiving care at Vanderbilt University Medical Center between 1989 and 2023. Analyses of the association between FS and psychiatric outcomes, sexual trauma, healthcare utilization, and other clinical comorbidities were conducted in a subset of 5219 patients with psychosis. STUDY RESULTS Odds of FS were elevated among patients with psychosis compared to controls (OR = 10.09, 95 % CI = 8.40-12.13). Among patients with psychosis, those with FS exhibited higher rates of suicidality (OR = 2.18 95 % CI = 1.50-3.17), catatonia (OR = 2.15, 95 % CI = 1.33-3.45), sexual trauma history (OR = 2.93, 95 % CI = 2.00-4.29) and had a greater number of antipsychotic trials (4.63 versus 3.37, beta = 1.23, SE = 0.18, adjusted p < 0.001) than those without FS. Furthermore, patients with comorbid FS had more hospital presentations at one, three, five, and ten years after receiving a psychosis diagnosis (adjusted p < 0.001). CONCLUSIONS FS are more common among patients with psychosis and are associated with increased healthcare utilization as well as an increased prevalence of suicidality, catatonia, and certain psychiatric and medical comorbidities.
Collapse
Affiliation(s)
- Allison M Lake
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
| | - India A Reddy
- Department of Psychiatry & Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Robert Havranek
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Lea K Davis
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Department of Psychiatry & Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jonah Fox
- Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
5
|
Jasper EA, Mautz BS, Hellwege JN, Piekos JA, Jones SH, Zhang Y, Torstenson ES, Pendergrass SA, Lee MTM, Edwards TL, Velez Edwards DR. A phenome-wide association study of uterine fibroids reveals a marked burden of comorbidities. COMMUNICATIONS MEDICINE 2025; 5:174. [PMID: 40374878 DOI: 10.1038/s43856-025-00884-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 04/25/2025] [Indexed: 05/18/2025] Open
Abstract
BACKGROUND The burden of comorbidities in those with uterine fibroids compared to those without fibroids is understudied. We performed a phenome-wide association study to systematically assess the association between fibroids and other conditions. METHODS Vanderbilt University Medical Center's Synthetic Derivative and Geisinger Health System Database, two electronic health record databases, were used for discovery and validation. Non-Hispanic Black and White females were included. Fibroid cases were identified through a previously validated algorithm. Race-stratified and multi-population phenome-wide association analyses, adjusting for age and body mass index, were performed before statistically significant, validated results were meta-analyzed. RESULTS There were 52,295 and 26,918 (9022 and 10,232 fibroid cases) females included in discovery and validation analyses. In multi-population meta-analysis, 389 conditions were associated with fibroid risk, with evidence of enrichment of circulatory, dermatologic, genitourinary, musculoskeletal, and sense organ conditions. The strongest associations within and across racial groups included conditions previously associated with fibroids. Numerous novel diagnoses, including cancers in female genital organs, were tied to fibroid status. CONCLUSIONS Overall, individuals with fibroids have a marked increase in comorbidities compared to those without fibroids. This approach to evaluate the health context of fibroids highlights the potential to understand fibroid etiology through studying the common biology of comorbid diagnoses and through disease networks.
Collapse
Affiliation(s)
- Elizabeth A Jasper
- Division of Quantitative and Clinical Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, USA
- Center for Precision Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
- Institute for Medicine and Public Health, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, TN, USA
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA
| | - Brian S Mautz
- Population Analytics, Analytics & Insights, Data Sciences, Janssen Research & Development, Spring House, PA, USA
| | - Jacklyn N Hellwege
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA
- Population Analytics, Analytics & Insights, Data Sciences, Janssen Research & Development, Spring House, PA, USA
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Sarah H Jones
- Institute for Medicine and Public Health, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Yanfei Zhang
- Genomic Medicine Institute, Geisinger Health Systems, Danville, PA, USA
| | - Eric S Torstenson
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Sarah A Pendergrass
- Genentech, South San Francisco, CA, USA
- Department of Biomedical and Translational Informatics, Geisinger, Rockville, MD, USA
| | | | - Todd L Edwards
- Institute for Medicine and Public Health, Vanderbilt University Medical Center, Nashville, TN, USA
- Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, TN, USA
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Digna R Velez Edwards
- Division of Quantitative and Clinical Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, USA.
- Center for Precision Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
- Institute for Medicine and Public Health, Vanderbilt University Medical Center, Nashville, TN, USA.
- Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, TN, USA.
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
6
|
Meurers T, Otte K, Abu Attieh H, Briki F, Despraz J, Halilovic M, Kaabachi B, Milicevic V, Müller A, Papapostolou G, Wirth FN, Raisaro JL, Prasser F. A quantitative analysis of the use of anonymization in biomedical research. NPJ Digit Med 2025; 8:279. [PMID: 40369095 PMCID: PMC12078711 DOI: 10.1038/s41746-025-01644-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Accepted: 04/16/2025] [Indexed: 05/16/2025] Open
Abstract
Anonymized biomedical data sharing faces several challenges. This systematic review analyzes 1084 PubMed-indexed studies (2018-2022) using anonymized biomedical data to quantify usage trends across geographic, regulatory, and cultural regions to identify effective approaches and inform implementation agendas. We identified a significant yearly increase in such studies with a slope of 2.16 articles per 100,000 when normalized against the total number of PubMed-indexed articles (p = 0.021). Most studies used data from the US, UK, and Australia (78.2%). This trend remained when normalized by country-specific research output. Cross-border sharing was rare (10.5% of studies). We identified twelve common data sources, primarily in the US (seven) and UK (three), including commercial (seven) and public entities (five). The prevalence of anonymization in the US, UK, and Australia suggests their practices could guide broader adoption. Rare cross-border anonymized data sharing and differences between countries with comparable regulations underscore the need for global standards.
Collapse
Affiliation(s)
- Thierry Meurers
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
| | - Karen Otte
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Hammam Abu Attieh
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Farah Briki
- Biomedical Data Science Center, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Jérémie Despraz
- Biomedical Data Science Center, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Mehmed Halilovic
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Bayrem Kaabachi
- Biomedical Data Science Center, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Vladimir Milicevic
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Armin Müller
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Grigorios Papapostolou
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Felix Nikolaus Wirth
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Jean Louis Raisaro
- Biomedical Data Science Center, Centre Hospitalier Universitaire Vaudois, Lausanne, Switzerland
| | - Fabian Prasser
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
| |
Collapse
|
7
|
Rämö JT, Gorman BR, Weng LC, Jurgens SJ, Singhanetr P, Tieger MG, van Dijk EH, Halladay CW, Wang X, Hauser BM, Kim SH, Brinks J, Choi SH, Luo Y, Pyarajan S, Nealon CL, Gorin MB, Wu WC, Anthony SA, Roncone DP, Sobrin L, Kaarniranta K, Yzer S, Palotie A, Peachey NS, Turunen JA, Boon CJ, Ellinor PT, Iyengar SK, Daly MJ, Rossin EJ. Rare genetic variation in PTPRB is associated with central serous chorioretinopathy, varicose veins and glaucoma. Nat Commun 2025; 16:4127. [PMID: 40319023 PMCID: PMC12049426 DOI: 10.1038/s41467-025-58686-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 03/28/2025] [Indexed: 05/07/2025] Open
Abstract
Central serous chorioretinopathy is an eye disease characterized by fluid buildup under the central retina whose etiology is not well understood. Abnormal choroidal veins in central serous chorioretinopathy patients have been shown to have similarities with varicose veins. To identify potential mechanisms, we analyzed genotype data from 1,477 patients and 455,449 controls in FinnGen. We identified an association for a low-frequency (allele frequency = 0.5%) missense variant (rs113791087) in PTPRB, the gene encoding vascular endothelial protein tyrosine phosphatase (odds ratio=2.85, P = 4.5 × 10-9). This was confirmed in a meta-analysis of 2,452 patients and 865,767 controls from 4 studies (odds ratio=3.06, P = 7.4 × 10-15). Rs113791087 was associated with a 56% higher prevalence of retinal abnormalities (35.3% vs 22.6%, P = 8.0 × 10-4) in 708 UK Biobank participants and, surprisingly, with increased risk of varicose veins (odds ratio=1.31, P = 2.3 × 10-11) and reduced risk of glaucoma (odds ratio=0.82, P = 6.9 × 10-9). Predicted loss-of-function variants in PTPRB, though rare in number, were associated with central serous chorioretinopathy in All of Us (odds ratio=17.09, P = 0.018). These findings highlight the significance of vascular endothelial protein tyrosine phosphatase in diverse ocular and systemic veno-vascular diseases.
Collapse
Affiliation(s)
- Joel T Rämö
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Massachusetts Eye and Ear, Boston, MA, USA
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
| | - Bryan R Gorman
- Center for Data and Computational Sciences (C-DACS), VA Cooperative Studies Program, VA Boston Healthcare System, Boston, MA, USA
- Booz Allen Hamilton, McLean, VA, USA
| | - Lu-Chen Weng
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
| | - Sean J Jurgens
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Department of Experimental Cardiology, Amsterdam Cardiovascular Sciences, Heart Failure & Arrhythmias, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands
| | - Panisa Singhanetr
- Massachusetts Eye and Ear, Boston, MA, USA
- Mettapracharak Eye Institute, Mettapracharak (Wat Rai Khing) Hospital, Nakhon Pathom, Thailand
| | - Marisa G Tieger
- New England Eye Center, Tufts Medical Center, Boston, MA, USA
| | - Elon Hc van Dijk
- Department of Ophthalmology, Leiden University Medical Center, Leiden, The Netherlands
| | - Christopher W Halladay
- Center of Innovation in Long Term Services and Supports, Providence VA Medical Center, Providence, RI, USA
| | - Xin Wang
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Blake M Hauser
- Massachusetts Eye and Ear, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Soo Hyun Kim
- Massachusetts Eye and Ear, Boston, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Joost Brinks
- Department of Ophthalmology, Leiden University Medical Center, Leiden, The Netherlands
| | - Seung Hoan Choi
- Department of Biostatistics, Boston University, Boston, MA, USA
| | - Yuyang Luo
- Massachusetts Eye and Ear, Boston, MA, USA
| | - Saiju Pyarajan
- VA Cooperative Studies Program, VA Boston Healthcare System, Boston, MA, USA
- Department of Medicine, Brigham and Women's Hospital and Harvard School of Medicine, Boston, MA, USA
| | - Cari L Nealon
- Eye Clinic, VA Northeast Ohio Healthcare System, Cleveland, OH, USA
| | - Michael B Gorin
- Department of Ophthalmology, David Geffen School of Medicine, Stein Eye Institute, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Human Genetics, David Geffen School of Medicine, Stein Eye Institute, University of California, Los Angeles, Los Angeles, CA, USA
| | - Wen-Chih Wu
- Section of Cardiology, Medical Service, VA Providence Healthcare System, Providence, RI, USA
| | - Scott A Anthony
- Eye Clinic, VA Northeast Ohio Healthcare System, Cleveland, OH, USA
| | - David P Roncone
- Eye Clinic, VA Northeast Ohio Healthcare System, Cleveland, OH, USA
| | - Lucia Sobrin
- Harvard Medical School Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Kai Kaarniranta
- Department of Ophthalmology, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland
| | - Suzanne Yzer
- Department of Ophthalmology, Radboud University Medical Center, Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Aarno Palotie
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Psychiatric and Neurodevelopmental Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Neal S Peachey
- Research Service, VA Northeast Ohio Healthcare System, Cleveland, OH, USA
- Cole Eye Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
- Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, Cleveland, OH, USA
| | - Joni A Turunen
- Folkhälsan Research Center, Biomedicum, Helsinki, Finland
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Camiel Jf Boon
- Department of Ophthalmology, Leiden University Medical Center, Leiden, The Netherlands
- Department of Ophthalmology, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Patrick T Ellinor
- Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Sudha K Iyengar
- Research Service, VA Northeast Ohio Healthcare System, Cleveland, OH, USA
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH, USA
| | - Mark J Daly
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Elizabeth J Rossin
- Harvard Medical School Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
8
|
Dutta D, Chatterjee N. Expanding scope of genetic studies in the era of biobanks. Hum Mol Genet 2025:ddaf054. [PMID: 40312842 DOI: 10.1093/hmg/ddaf054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2025] [Revised: 03/25/2025] [Accepted: 04/08/2025] [Indexed: 05/03/2025] Open
Abstract
Biobanks have become pivotal in genetic research, particularly through genome-wide association studies (GWAS), driving transformative insights into the genetic basis of complex diseases and traits through the integration of genetic data with phenotypic, environmental, family history, and behavioral information. This review explores the distinct design and utility of different biobanks, highlighting their unique contributions to genetic research. We further discuss the utility and methodological advances in combining data from disease-specific study or consortia with that of biobanks, especially focusing on summary statistics based meta-analysis. Subsequently we review the spectrum of additional advantages offered by biobanks in genetic studies in representing population differences, calibration of polygenic scores, assessment of pleiotropy and improving post-GWAS in silico analyses. Advances in sequencing technologies, particularly whole-exome and whole-genome sequencing, have further enabled the discovery of rare variants at biobank scale. Among recent developments, the integration of large-scale multi-omics data especially proteomics and metabolomics, within biobanks provides deeper insights into disease mechanisms and regulatory pathways. Despite challenges like ascertainment strategies and phenotypic misclassification, biobanks continue to evolve, driving methodological innovation and enabling precision medicine. We highlight the contributions of biobanks to genetic research, their growing integration with multi-omics, and finally discuss their future potential for advancing healthcare and therapeutic development.
Collapse
Affiliation(s)
- Diptavo Dutta
- Integrative Tumor Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, Rockville, MD, 20879, United States
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins University, 615 N Wolfe Street, Baltimore, MD, 21205, United States
- Department of Oncology, Johns Hopkins University, 615 N Wolfe Street, Baltimore, MD, 21205, United States
| |
Collapse
|
9
|
Coon H, Shabalin AA, DiBlasi E, Monson ET, Han S, Kaufman EA, Chen D, Kious B, Molina N, Yu Z, Staley MJ, Crockett DK, Colbert SM, Mullins N, Bakian AV, Docherty AR, Keeshin BR. Absence of nonfatal suicidal behavior preceding suicide death reveals differences in clinical risks. Psychiatry Res 2025; 347:116391. [PMID: 40020535 PMCID: PMC11976895 DOI: 10.1016/j.psychres.2025.116391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 12/17/2024] [Accepted: 02/05/2025] [Indexed: 03/03/2025]
Abstract
Nonfatal suicidal behavior is the most robust predictor of suicide death. However, only ∼10 % of those who survive an attempt go on to die by suicide. Moreover, ∼50 % of suicide deaths occur in the absence of prior known attempts, suggesting risks other than nonfatal suicide attempt need to be identified to help prevent suicide mortality. We studied data from 4,000 population-ascertained suicide deaths and 26,191 population controls to improve understanding of suicide deaths without prior nonfatal attempts. This study included 2,253 suicide deaths and 3,375 controls with evidence of nonfatal suicidal ideation or behaviors (SUI_SI/SB and CTL_SI/SB) from diagnostic codes and natural language processing of electronic health records notes. Characteristics of these groups were compared to 1,669 suicides with no prior nonfatal SI/SB (SUI_None) and 22,816 controls with no lifetime suicidality (CTL_None). The SUI_None and CTL_None groups had fewer overall diagnoses and were older than SUI_SI/SB and CTL_SI/SB. Mental health diagnoses were far less common in both the SUI_None and CTL_None groups; mental health problems were far less associated with suicide death than with presence of SI/SB. Physical health diagnoses were conversely more often associated with risk of suicide death than with presence of SI/SB. Pending replication, results indicate highly significant clinical differences among suicide deaths with versus without prior nonfatal SI/SB, and suggest that, for a substantial number of individuals at risk for suicide mortality, history of SI/SB does not serve as an effective clinical marker of risk.
Collapse
Affiliation(s)
- Hilary Coon
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA.
| | - Andrey A Shabalin
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Emily DiBlasi
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Eric T Monson
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Seonggyun Han
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Erin A Kaufman
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA.
| | - Danli Chen
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Brent Kious
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | - Zhe Yu
- Pedigree & Population Resource, Huntsman Cancer Institute, University of Utah, Salt Lake City, UT, USA
| | - Michael J Staley
- Utah State Office of the Medical Examiner, Utah Department of Health and Human Services, Salt Lake City, UT, USA
| | - David K Crockett
- Clinical Analytics, Intermountain Health, Salt Lake City, UT, USA
| | - Sarah M Colbert
- Department of Psychiatry, Mount Sinai School of Medicine, New York, NY, USA
| | - Niamh Mullins
- Department of Psychiatry, Mount Sinai School of Medicine, New York, NY, USA
| | - Amanda V Bakian
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Anna R Docherty
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Brooks R Keeshin
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA; Department of Pediatrics, University of Utah, Salt Lake City, UT, USA; Primary Children's Hospital Center for Safe and Healthy Families, Salt Lake City, UT, USA; Department of Public Health and Caring Science, Child Health and Parenting (CHAP), Uppsala University, Uppsala, Sweden
| |
Collapse
|
10
|
Ng JCM, Schooling CM. Sex-specific Mendelian randomization phenome-wide association study of basal metabolic rate. Sci Rep 2025; 15:14368. [PMID: 40274879 PMCID: PMC12022104 DOI: 10.1038/s41598-025-98017-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 04/08/2025] [Indexed: 04/26/2025] Open
Abstract
Observationally, higher basal metabolic rate (BMR) is associated with metabolism-related disorders, cancer, aging, and mortality. In this Mendelian randomization (MR) phenome-wide association study, using two-sample MR methods, we systematically and comprehensively investigated the health effects of genetically predicted BMR across the phenome sex-specifically. We obtained sex-specific genetic variants strongly (p < 5 × 10- 8) and independently (r2 < 0.001) predicting BMR from the UK Biobank and applied them to over 1,000 phenotypes within the same study. We combined genetic variant-specific Wald estimates using inverse-variance weighting, supplemented by sensitivity analysis. We used a false-discovery rate correction to allow for multiple comparisons as well as multivariable MR adjusted for body mass index and testosterone to investigate the independent effects of BMR on phenotypes with significant univariable associations. We obtained 217/219 genetic variants predicting BMR and applied them to 1,150/1,242 phenotypes in men/women, respectively. BMR was associated with 190/270 phenotypes in univariable analysis and 122/123 phenotypes in multivariable analysis in men/women. Examples of robust associations in multivariable analysis included those with neoplasms, diseases of the circulatory system, and growth and reproductive investment. In conclusion, BMR might affect a wide range of health-related outcomes. The underlying mechanisms and interactions between phenotypes warrant further study, as BMR is modifiable.
Collapse
Affiliation(s)
- Jack C M Ng
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pok Fu Lam, Hong Kong Special Administrative Region, China
| | - C Mary Schooling
- School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Pok Fu Lam, Hong Kong Special Administrative Region, China.
- Graduate School of Public Health and Health Policy, The City University of New York, 55 West 125th St, New York, NY, 10027, USA.
| |
Collapse
|
11
|
Nam Y, Lee DG, Woerner J, Lee SH, Lee MJ, Jo SH, Jung J, Heo SC, Jo CH, Kim D. Phenome-wide comorbidity network analysis reveals clinical risk patterns in enthesopathy and enthesitis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.04.21.25326169. [PMID: 40313276 PMCID: PMC12045441 DOI: 10.1101/2025.04.21.25326169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Background Enthesopathy and enthesitis, including rotator cuff disease and other tendon disorders, represent a heterogeneous group of musculoskeletal conditions with complex etiologies. Understanding how systemic health profiles influence their onset remains a critical challenge in musculoskeletal medicine. Methods We conducted a large-scale, phenome-wide comorbidity analysis using longitudinal electronic health records (EHR) from 432,757 UK Biobank participants. Incident cases of peripheral enthesopathies were compared to controls across 434 baseline disease phenotypes. A directed ego network was constructed to link significantly associated comorbidities to the target condition using odds ratio-based associations. Unsupervised clustering via UMAP and DBSCAN identified data-driven comorbidity clusters, which were consolidated into unified endotypes-interpreted as distinct systemic profiles contributing to disease risk. Additionally, metapath-based trajectory analysis was applied to uncover temporally structured multimorbidity chains leading to disease onset. Results We identified 183 baseline conditions significantly associated with the future development of enthesopathy (FDR < 0.05). Network clustering revealed eight comorbidity clusters, which were consolidated into four unified endotypes: Metabolic-Psychosomatic, Inflammatory-Multisystem, Mechanical-Injury-driven, and Aging-Intervention-related. Metapath analysis uncovered common three-step disease trajectories, such as metabolic-infectious-musculoskeletal and inflammatory skin-to-joint progressions, highlighting potential mechanistic pathways. These endotypes showed diverse clinical features but shared biological coherence, suggesting that different systemic health profiles can converge to drive tendon-related disease. Conclusions This study introduces a scalable framework for identifying systemic multimorbidity patterns underlying enthesopathy and enthesitis using phenome-wide comorbidity networks. By integrating network clustering and metapath analysis, we uncover interpretable, data-driven endotypes that may inform individualized risk assessment and targeted care strategies. These findings contribute to the growing field of biobank-scale disease modeling and offer a foundation for precision approaches in musculoskeletal medicine.
Collapse
Affiliation(s)
- Yonghyun Nam
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Dong-Gi Lee
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Jakob Woerner
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Se-Hwan Lee
- McKay Orthopaedic Research Laboratory, Department of Orthopaedic Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Min Ji Lee
- Department of Orthopedic Surgery, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
- Department of Translational Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Sung-Han Jo
- McKay Orthopaedic Research Laboratory, Department of Orthopaedic Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jaeun Jung
- McKay Orthopaedic Research Laboratory, Department of Orthopaedic Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Su Chin Heo
- McKay Orthopaedic Research Laboratory, Department of Orthopaedic Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Translational Musculoskeletal Research Center, Corporal Michael J Crescenz VA Medical Center, Philadelphia, PA, 19104, USA
| | - Chris Hyunchul Jo
- Department of Orthopedic Surgery, SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul, Republic of Korea
- Department of Translational Medicine, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Dokyoon Kim
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA 19104 USA
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA 19104 USA
| |
Collapse
|
12
|
Han J, Gerring ZF, Wang L, Bahlo M. GeneSetPheno: a web application for the integration, summary, and visualization of gene and variant-phenotype associations across gene sets. BIOINFORMATICS ADVANCES 2025; 5:vbaf078. [PMID: 40260119 PMCID: PMC12011357 DOI: 10.1093/bioadv/vbaf078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2024] [Revised: 03/20/2025] [Accepted: 04/15/2025] [Indexed: 04/23/2025]
Abstract
Motivation The comprehensive study of genotype-phenotype relationships requires the integration of multiple data types to "triangulate" signals and derive meaningful biological conclusions. Large-scale biobanks and public resources generate a wealth of comprehensive results, facilitating the discovery of associations between genes or genetic variants and multiple phenotypes. However, analyzing these data across resources presents several challenges, including limited flexibility in gene set analysis, the integration of multipe databases, and the need for effective data visualization to aid interpretation. Results GeneSetPheno is a user-friendly graphical interface that integrates, summarizes, and visualizes gene and variant-phenotype associations across genomic resources. It allows users to explore interrelationships between genetic variants and phenotypes, offering insights into the genetic factors driving phenotypic variation within user-defined gene sets. GeneSetPheno also supports comparisons across gene sets to identify shared or unique genetic variants, phenotypic associations, biological pathways, and potential gene-gene interactions. GeneSetPheno is a free and highly configurable tool for exploring the complex relationships between gene sets, genetic variants, and phenotypes. Target users include molecular biologists and clinicians who wish to explore a gene or gene set of particular interest. Availability and implementation GeneSetPheno is freely accessible at: https://shiny.wehi.edu.au/han.ji/GeneSetPheno/. The source code is available on GitHub at: https://github.com/bahlolab/GeneSetPheno.
Collapse
Affiliation(s)
- Jiru Han
- Genetics and Gene Regulation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Zachary F Gerring
- Genetics and Gene Regulation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Longfei Wang
- Genetics and Gene Regulation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Melanie Bahlo
- Genetics and Gene Regulation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, VIC 3010, Australia
| |
Collapse
|
13
|
Zhu W, Chen L, Aphinyanaphongs Y, Kastrinos F, Simeone DM, Pochapin M, Stender C, Razavian N, Gonda TA. Identification of patients at risk for pancreatic cancer in a 3-year timeframe based on machine learning algorithms. Sci Rep 2025; 15:11697. [PMID: 40188106 PMCID: PMC11972345 DOI: 10.1038/s41598-025-89607-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Accepted: 02/06/2025] [Indexed: 04/07/2025] Open
Abstract
Early detection of pancreatic cancer (PC) remains challenging largely due to the low population incidence and few known risk factors. However, screening in at-risk populations and detection of early cancer has the potential to significantly alter survival. In this study, we aim to develop a predictive model to identify patients at risk for developing new-onset PC at two and a half to three year time frame. We used the Electronic Health Records (EHR) of a large medical system from 2000 to 2021 (N = 537,410). The EHR data analyzed in this work consists of patients' demographic information, diagnosis records, and lab values, which are used to identify patients who were diagnosed with pancreatic cancer and the risk factors used in the machine learning algorithm for prediction. We identified 73 risk factors of pancreatic cancer with the Phenome-wide Association Study (PheWAS) on a matched case-control cohort. Based on them, we built a large-scale machine learning algorithm based on EHR. A temporally stratified validation based on patients not included in any stage of the training of the model was performed. This model showed an AUROC at 0.742 [0.727, 0.757] which was similar in both the general population and in a subset of the population who has had prior cross-sectional imaging. The rate of diagnosis of pancreatic cancer in those in the top 1 percentile of the risk score was 6 folds higher than the general population. Our model leverages data extracted from a 6-month window of time in the electronic health record to identify patients at nearly sixfold higher than baseline risk of developing pancreatic cancer 2.5-3 years from evaluation. This approach offers an opportunity to define an enriched population entirely based on static data, where current screening may be recommended.
Collapse
Affiliation(s)
- Weicheng Zhu
- Center for Data Science, New York University, New York, NY, USA
| | - Long Chen
- Center for Data Science, New York University, New York, NY, USA
| | - Yindalon Aphinyanaphongs
- Department of Population Health, New York University Grossman School of Medicine, 227 East 30th Street, 6th Floor, New York, NY, 10016, USA
| | - Fay Kastrinos
- Department of Medicine, Division of Digestive and Liver Diseases, Columbia University Irving Medical Center, New York, NY, USA
| | - Diane M Simeone
- Moores Cancer Center, UC San Diego Health, San Diego, CA, USA
| | - Mark Pochapin
- Division of Gastroenterology and Hepatology, Department of Medicine, New York University, 240 East 38th Street, 23rd Floor, New York, NY, 10016, USA
| | - Cody Stender
- Department of Surgery, New York University, New York, NY, USA
| | - Narges Razavian
- Department of Population Health, New York University Grossman School of Medicine, 227 East 30th Street, 6th Floor, New York, NY, 10016, USA.
| | - Tamas A Gonda
- Division of Gastroenterology and Hepatology, Department of Medicine, New York University, 240 East 38th Street, 23rd Floor, New York, NY, 10016, USA.
| |
Collapse
|
14
|
King CP, Chitre AS, Leal‐Gutiérrez JD, Tripi JA, Netzley AH, Horvath AP, Lamparelli AC, George A, Martin C, St. Pierre CL, Missfeldt Sanches T, Bimschleger HV, Gao J, Cheng R, Nguyen K, Holl KL, Polesskaya O, Ishiwari K, Chen H, Robinson TE, Flagel SB, Solberg Woods LC, Palmer AA, Meyer PJ. Genetic Loci Influencing Cue-Reactivity in Heterogeneous Stock Rats. GENES, BRAIN, AND BEHAVIOR 2025; 24:e70018. [PMID: 40049657 PMCID: PMC11884905 DOI: 10.1111/gbb.70018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 01/23/2025] [Accepted: 02/12/2025] [Indexed: 03/10/2025]
Abstract
Addiction vulnerability is associated with the tendency to attribute incentive salience to reward predictive cues. Both addiction and the attribution of incentive salience are influenced by environmental and genetic factors. To characterize the genetic contributions to incentive salience attribution, we performed a genome-wide association study (GWAS) in a cohort of 1596 heterogeneous stock (HS) rats. Rats underwent a Pavlovian conditioned approach task that characterized the responses to food-associated stimuli ("cues"). Responses ranged from cue-directed "sign-tracking" behavior to food-cup directed "goal-tracking" behavior (12 measures, SNP heritability: 0.051-0.215). Next, rats performed novel operant responses for unrewarded presentations of the cue using the conditioned reinforcement procedure. GWAS identified 14 quantitative trait loci (QTLs) for 11 of the 12 traits across both tasks. Interval sizes of these QTLs varied widely. Seven traits shared a QTL on chromosome 1 that contained a few genes (e.g., Tenm4, Mir708) that have been associated with substance use disorders and other psychiatric disorders in humans. Other candidate genes (e.g., Wnt11, Pak1) in this region had coding variants and expression-QTLs in mesocorticolimbic regions of the brain. We also conducted a Phenome-Wide Association Study (PheWAS) on addiction-related behaviors in HS rats and found that the QTL on chromosome 1 was also associated with nicotine self-administration in a separate cohort of HS rats. These results provide a starting point for the molecular genetic dissection of incentive motivational processes and provide further support for a relationship between the attribution of incentive salience and drug abuse-related traits.
Collapse
Affiliation(s)
- Christopher P. King
- Department of PsychologyUniversity at BuffaloBuffaloNew YorkUSA
- Clinical and Research Institute on AddictionsBuffaloNew YorkUSA
| | - Apurva S. Chitre
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
| | | | - Jordan A. Tripi
- Department of PsychologyUniversity at BuffaloBuffaloNew YorkUSA
| | - Alesa H. Netzley
- Department of Emergency MedicineUniversity of MichiganAnn ArborMichiganUSA
| | - Aidan P. Horvath
- Department of PsychologyUniversity of MichiganAnn ArborMichiganUSA
| | | | - Anthony George
- Clinical and Research Institute on AddictionsBuffaloNew YorkUSA
| | - Connor Martin
- Clinical and Research Institute on AddictionsBuffaloNew YorkUSA
| | | | | | | | - Jianjun Gao
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Riyan Cheng
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Khai‐Minh Nguyen
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Katie L. Holl
- Department of PhysiologyMedical College of WisconsinMilwaukeeWisconsinUSA
| | - Oksana Polesskaya
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Keita Ishiwari
- Clinical and Research Institute on AddictionsBuffaloNew YorkUSA
- Department of Pharmacology and ToxicologyUniversity at BuffaloBuffaloNew YorkUSA
| | - Hao Chen
- Department of Pharmacology, Addiction Science and ToxicologyUniversity of Tennessee Health Science CenterMemphisTennesseeUSA
| | | | - Shelly B. Flagel
- Department of PsychiatryUniversity of MichiganAnn ArborMichiganUSA
- Michigan Neuroscience Institute, University of MichiganAnn ArborMichiganUSA
| | - Leah C. Solberg Woods
- Department of Internal Medicine, Molecular Medicine, Center on Diabetes, Obesity and MetabolismWake Forest School of MedicineWinston‐SalemNorth CarolinaUSA
| | - Abraham A. Palmer
- Department of PsychiatryUniversity of California San DiegoLa JollaCaliforniaUSA
- Institute for Genomic Medicine, University of California San DiegoLa JollaCaliforniaUSA
| | - Paul J. Meyer
- Department of PsychologyUniversity at BuffaloBuffaloNew YorkUSA
| |
Collapse
|
15
|
Mihov M, Shoctor H, Douglas A, Hay DC, O'Shaughnessy PJ, Iredale JP, Shaw S, Fowler PA, Grassmann F. Linking epidemiology and genomics of maternal smoking during pregnancy in utero and in ageing: a population-based study using human foetuses and the UK Biobank cohort. EBioMedicine 2025; 114:105590. [PMID: 40074595 DOI: 10.1016/j.ebiom.2025.105590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 01/10/2025] [Accepted: 01/22/2025] [Indexed: 03/14/2025] Open
Abstract
BACKGROUND Maternal smoking and foetal exposure to nicotine and other harmful chemicals in utero remains a serious public health issue with little knowledge about the underlying genetics and consequences of maternal smoking in ageing individuals. Here, we investigated the epidemiology and genomic architecture of maternal smoking in a middle-aged population and compare the results to effects observed in the developing foetus. METHODS In the current project, we included 351,562 participants from the UK Biobank (UKB) and estimated exposure to maternal smoking status during pregnancy through self-reporting from the UKB participants about the mother's smoking status around their birth. In addition, we analysed 64 foetal liver transcriptomic expression datasets collected from women seeking elective pregnancy terminations. Foetal maternal smoking exposure was confirmed through measurement of foetal plasma cotinine levels. FINDINGS Foetal exposure to maternal smoking had a greater impact on males than females, with more differentially expressed genes in liver tissue (3313 vs. 1163) and higher liver pathway activation. In the UKB, maternal smoking exposure was linked to an unhealthy lifestyle, lower education, and liver damage. In a genome-wide analysis in the UKB, we leveraged the shared genetic basis between affected offspring and their mothers and identified five genome-wide significant regions. We found a low heritability of the trait (∼4%) and implicated several disease-related genes in a transcriptome-wide association study. Maternal smoking increased all-cause mortality risk (Hazard ratio and 95% CI: 1.10 [1.04; 1.16], P = 4.04 × 10-4), which was attenuated in non-smoking males. INTERPRETATION Although male foetuses are more affected than females by maternal smoking in pregnancy, this effect was largely reduced in middle-aged individuals. Importantly, our results highlight that the overall 10% increased mortality due to maternal smoking in pregnancy was greatly attenuated in non-smokers. This study demonstrates the importance of campaigns promoting offspring smoking prevention in families where the parent(s) smoke. FUNDING Funding for this project was provided by the University of Aberdeen, the Science Initiative Panel of the Institute of Medical Science, the UK Medical Research Council, the Seventh Framework Programme of the European Union under Grant Agreement 212885 (REEF), NHS Grampian Endowments grants and the European Commission Horizon Europe research grant Agreement 101094099 (INITIALISE).
Collapse
Affiliation(s)
- Mihail Mihov
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Aberdeen, UK.
| | - Hannah Shoctor
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Aberdeen, UK
| | - Alex Douglas
- Institute of Applied Health Sciences, School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, UK
| | - David C Hay
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, The University of Edinburgh, Edinburgh, UK
| | | | | | - Sophie Shaw
- All Wales Medical Genomics Service, Institute of Medical Genetics, University Hospital of Wales, Cardiff, UK
| | - Paul A Fowler
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Aberdeen, UK
| | - Felix Grassmann
- Institute of Medical Sciences, School of Medicine, Medical Sciences & Nutrition, University of Aberdeen, Aberdeen, UK; Institute for Clinical Research and Systems Medicine, Health and Medical University, Potsdam, Germany
| |
Collapse
|
16
|
Zhong X, Jia G, Yin Z, Cheng K, Rzhetsky A, Li B, Cox NJ. Longitudinal Analysis of Electronic Health Records Reveals Medical Conditions Associated with Subsequent Alzheimer's Disease Development. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.03.22.25324197. [PMID: 40196258 PMCID: PMC11974777 DOI: 10.1101/2025.03.22.25324197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
Background Several health conditions are known to increase the risk of Alzheimer's disease (AD). We aim to systematically identify medical conditions that are associated with subsequent development of AD by leveraging the growing resources of electronic health records (EHRs). Methods This retrospective cohort study used de-identified EHRs from two independent databases (MarketScan and VUMC) with 153 million individuals to identify AD cases and age- and gender-matched controls. By tracking their EHRs over a 10-year window before AD diagnosis and comparing the EHRs between AD cases and controls, we identified medical conditions that occur more likely in those who later develop AD. We further assessed the genetic underpinnings of these conditions in relation to AD genetics using data from two large-scale biobanks (BioVU and UK Biobank, total N=450,000). Results We identified 43,508 AD cases and 419,455 matched controls in MarketScan, and 1,320 AD cases and 12,720 matched controls in VUMC. We detected 406 and 102 medical phenotypes that are significantly enriched among the future AD cases in MarketScan and VUMC databases, respectively. In both EHR databases, mental disorders and neurological disorders emerged as the top two most enriched clinical categories. More than 70 medical phenotypes are replicated in both EHR databases, which are dominated by mental disorders (e.g., depression), neurological disorders (e.g., sleep orders), circulatory system disorders (e.g. cerebral atherosclerosis) and endocrine/metabolic disorders (e.g., type 2 diabetes). We identified 19 phenotypes that are either associated with individual risk variants of AD or a polygenic risk score of AD. Conclusions In this study, analysis of longitudinal EHRs from independent large-scale databases enables robust identification of health conditions associated with subsequent development of AD, highlighting potential opportunities of therapeutics and interventions to reduce AD risk.
Collapse
Affiliation(s)
- Xue Zhong
- Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN
| | - Gengjie Jia
- Department of Medicine, Institute of Genomics and Systems Biology, University of Chicago, Chicago, IL
| | - Zhijun Yin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
| | - Kerou Cheng
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
| | - Andrey Rzhetsky
- Department of Human Genetics, Department of Medicine, University of Chicago, Chicago, IL
| | - Bingshan Li
- Department of Molecular Physiology and Biophysics, Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN
| | - Nancy J. Cox
- Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN
| |
Collapse
|
17
|
Le MHN, Nguyen PK, Nguyen TPT, Nguyen HQ, Tam DNH, Huynh HH, Huynh PK, Le NQK. An in-depth review of AI-powered advancements in cancer drug discovery. Biochim Biophys Acta Mol Basis Dis 2025; 1871:167680. [PMID: 39837431 DOI: 10.1016/j.bbadis.2025.167680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Revised: 01/12/2025] [Accepted: 01/16/2025] [Indexed: 01/23/2025]
Abstract
The convergence of artificial intelligence (AI) and genomics is redefining cancer drug discovery by facilitating the development of personalized and effective therapies. This review examines the transformative role of AI technologies, including deep learning and advanced data analytics, in accelerating key stages of the drug discovery process: target identification, drug design, clinical trial optimization, and drug response prediction. Cutting-edge tools such as DrugnomeAI and PandaOmics have made substantial contributions to therapeutic target identification, while AI's predictive capabilities are driving personalized treatment strategies. Additionally, advancements like AlphaFold highlight AI's capacity to address intricate challenges in drug development. However, the field faces significant challenges, including the management of large-scale genomic datasets and ethical concerns surrounding AI deployment in healthcare. This review underscores the promise of data-centric AI approaches and emphasizes the necessity of continued innovation and interdisciplinary collaboration. Together, AI and genomics are charting a path toward more precise, efficient, and transformative cancer therapeutics.
Collapse
Affiliation(s)
- Minh Huu Nhat Le
- International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan; AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan
| | - Phat Ky Nguyen
- International Master/Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan; AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan.
| | | | - Hien Quang Nguyen
- Cardiovascular Research Department, Methodist Hospital, Merrillville, IN 46410, USA
| | - Dao Ngoc Hien Tam
- Regulatory Affairs Department, Asia Shine Trading & Service Co. LTD, Viet Nam
| | - Han Hong Huynh
- International Master Program for Translational Science, College of Medical Science and Technology, Taipei Medical University, Taipei 110, Taiwan
| | - Phat Kim Huynh
- Department of Industrial and Systems Engineering, North Carolina A&T State University, Greensboro, NC 27411, USA.
| | - Nguyen Quoc Khanh Le
- AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan; In-Service Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan; Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan.
| |
Collapse
|
18
|
Lee YH, Zhang Y, Espinosa Dice AL, Li JH, Tubbs JD, Feng YCA, Ge T, Maihofer AX, Nievergelt CM, Smoller JW, Koenen KC, Roberts AL, Slopen N. Towards Scalable Biomarker Discovery in Posttraumatic Stress Disorder: Triangulating Genomic and Phenotypic Evidence from a Health System Biobank. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.02.27.25322886. [PMID: 40061358 PMCID: PMC11888531 DOI: 10.1101/2025.02.27.25322886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Importance Biomarkers can potentially improve the diagnosis, monitoring, and treatment of posttraumatic stress disorder (PTSD). However, PTSD biomarkers that are scalable and easily integrated into real-world clinical settings have not been identified. Objective To triangulate phenotypic and genomic evidence from a health system biobank with a goal of identifying scalable and clinically relevant biomarkers for PTSD. Design setting and participants The analysis was conducted between June to November 2024 using genomic samples and laboratory test results recorded in the Mass General Brigham (MGB) Health System. The analysis included 23,743 European ancestry participants from the nested MGB Biobank study. Exposures The first exposure was polygenic risk score (PRS) for PTSD, calculated using the largest available European ancestry genome-wide association study (GWAS), employing a Bayesian polygenic scoring method. The second exposure was a clinical diagnosis of PTSD, determined by the presence of two or more qualifying PTSD phecodes in the longitudinal electronic health records (EHR). Main outcomes and measures The primary outcomes were the inverse normal quantile transformed, median lab values of 241 laboratory traits with non-zero h 2 SNP estimates. Results Sixteen unique laboratory traits across the cardiometabolic, hematologic, hepatic, and immune systems were implicated in both genomic and phenotypic lab-wide association scans (LabWAS). Two-sample Mendelian randomization analyses provided evidence of potential unidirectional causal effects of PTSD liability on five laboratory traits. Conclusion and relevance These findings demonstrate the potential of a triangulation approach to uncover scalable and clinically relevant biomarkers for PTSD.
Collapse
Affiliation(s)
- Younga Heather Lee
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Department of Psychiatry, Harvard Medical School, Boston, MA
- Broad Trauma Initiative, Broad Institute of MIT and Harvard, Cambridge, MA
- Department of Epidemiology, Havard T. H. Chan School of Public Health, Boston, MA
| | - Yingzhe Zhang
- Department of Epidemiology, Havard T. H. Chan School of Public Health, Boston, MA
| | | | - Josephine H Li
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Diabetes Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA
- Department of Medicine, Harvard Medical School, Boston, MA
| | - Justin D Tubbs
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Department of Psychiatry, Harvard Medical School, Boston, MA
| | - Yen-Chen Anne Feng
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Department of Psychiatry, Harvard Medical School, Boston, MA
- Institute of Health Data Analytics and Statistics, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Tian Ge
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Department of Psychiatry, Harvard Medical School, Boston, MA
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA
| | - Adam X Maihofer
- Department of Psychiatry, University of California San Diego, La Jolla, CA
- Veterans Affairs San Diego Healthcare System, Center of Excellence for Stress and Mental Health, San Diego, CA
- Veterans Affairs San Diego Healthcare System, Research Service, San Diego, CA
| | - Caroline M Nievergelt
- Department of Psychiatry, University of California San Diego, La Jolla, CA
- Veterans Affairs San Diego Healthcare System, Center of Excellence for Stress and Mental Health, San Diego, CA
- Veterans Affairs San Diego Healthcare System, Research Service, San Diego, CA
| | - Jordan W Smoller
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
- Department of Psychiatry, Harvard Medical School, Boston, MA
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA
| | - Karestan C Koenen
- Department of Psychiatry, Harvard Medical School, Boston, MA
- Broad Trauma Initiative, Broad Institute of MIT and Harvard, Cambridge, MA
- Department of Epidemiology, Havard T. H. Chan School of Public Health, Boston, MA
- Department of Social and Behavioral Sciences, Havard T. H. Chan School of Public Health, Boston, MA
| | - Andrea L Roberts
- Department of Environmental Health, Havard T. H. Chan School of Public Health, Boston, MA
| | - Natalie Slopen
- Department of Social and Behavioral Sciences, Havard T. H. Chan School of Public Health, Boston, MA
- Center on the Developing Child, Harvard University, Cambridge, MA
| |
Collapse
|
19
|
Du J, Yu Y, Zhang M, Wu Z, Ryan AM, Mukherjee B. Outcome adaptive propensity score methods for handling censoring and high-dimensionality: Application to insurance claims. Stat Methods Med Res 2025:9622802241306856. [PMID: 40013476 DOI: 10.1177/09622802241306856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2025]
Abstract
Propensity scores are commonly used to reduce the confounding bias in non-randomized observational studies for estimating the average treatment effect. An important assumption underlying this approach is that all confounders that are associated with both the treatment and the outcome of interest are measured and included in the propensity score model. In the absence of strong prior knowledge about potential confounders, researchers may agnostically want to adjust for a high-dimensional set of pre-treatment variables. As such, variable selection procedure is needed for propensity score estimation. In addition, studies show that including variables related to treatment only in the propensity score model may inflate the variance of the treatment effect estimators, while including variables that are predictive of only the outcome can improve efficiency. In this article, we propose to incorporate outcome-covariate relationship in the propensity score model by including the predicted binary outcome probability as a covariate. Our approach can be easily adapted to an ensemble of variable selection methods, including regularization methods and modern machine-learning tools based on classification and regression trees. We evaluate our method to estimate the treatment effects on a binary outcome, which is possibly censored, across multiple treatment groups. Simulation studies indicate that incorporating outcome probability for estimating the propensity scores can improve statistical efficiency and protect against model misspecification. The proposed methods are applied to a cohort of advanced-stage prostate cancer patients identified from a private insurance claims database for comparing the adverse effects of four commonly used drugs for treating castration-resistant prostate cancer.
Collapse
Affiliation(s)
- Jiacong Du
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Youfei Yu
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Min Zhang
- Vanke School of Public Health, Tsinghua University, Beijing, China
| | - Zhenke Wu
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Andrew M Ryan
- Department of Health Management and Policy, University of Michigan, Ann Arbor, MI, USA
| | - Bhramar Mukherjee
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
20
|
Colbert SMC, Lepow L, Fennessy B, Iwata N, Ikeda M, Saito T, Terao C, Preuss M, Pathak J, Mann JJ, Coon H, Mullins N. Distinguishing clinical and genetic risk factors for suicidal ideation and behavior in a diverse hospital population. Transl Psychiatry 2025; 15:63. [PMID: 39979244 PMCID: PMC11842747 DOI: 10.1038/s41398-025-03287-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 01/13/2025] [Accepted: 02/12/2025] [Indexed: 02/22/2025] Open
Abstract
Suicidal ideation (SI) and behavior (SB) are major public health concerns, but risk factors for their development and progression are poorly understood. We used ICD codes and a natural language processing algorithm to identify individuals in a hospital biobank with SI-only, SB, and controls without either. We compared the profiles of SB and SI-only patients to controls, and each other, using phenome-wide association studies (PheWAS) and polygenic risk scores (PRS). PheWAS identified many risk factors for SB and SI-only, plus specific psychiatric disorders which may be involved in progression from SI-only to SB. PRS for suicide attempt were only associated with SB, and even after accounting for psychiatric disorder PRS. SI PRS were only associated with SI-only, although not after accounting for psychiatric disorder PRS. These findings advance understanding of distinct genetic and clinical risk factors for SB and SI-only, which will aid in early detection and intervention efforts.
Collapse
Affiliation(s)
- Sarah M C Colbert
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Lauren Lepow
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Brian Fennessy
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Nakao Iwata
- Department of Psychiatry, Fujita Health University School of Medicine, Toyoake, Japan
| | - Masashi Ikeda
- Department of Psychiatry, Fujita Health University School of Medicine, Toyoake, Japan
- Department of Psychiatry, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Takeo Saito
- Department of Psychiatry, Fujita Health University School of Medicine, Toyoake, Japan
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
- Department of Applied Genetics, The School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Michael Preuss
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jyotishman Pathak
- Department of Population Health Sciences, Weill Cornell Medicine/NewYork-Presbyterian, New York, NY, USA
| | - J John Mann
- Department of Psychiatry, Columbia University Irving Medical Center, Columbia University, New York, NY, USA
- Department of Radiology, Columbia University Irving Medical Center, Columbia University, New York, NY, USA
- Division of Molecular Imaging and Neuropathology, New York State Psychiatric Institute, New York, NY, USA
| | - Hilary Coon
- Department of Psychiatry & Huntsman Mental Health Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Niamh Mullins
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
21
|
Motelow JE, Malakar A, Murthy SBK, Verbitsky M, Kahn A, Estrella E, Kunkel L, Wiesenhahn M, Becket J, Harris N, Lee R, Adam R, Kiryluk K, Gharavi AG, Brownstein CA. Interstitial Cystitis: a phenotype and rare variant exome sequencing study: Interstitial Cystitis: a phenotype and exome sequencing study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.02.16.25322147. [PMID: 40034785 PMCID: PMC11875234 DOI: 10.1101/2025.02.16.25322147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Interstitial cystitis/bladder pain syndrome (IC/BPS) is a poorly understood and underdiagnosed syndrome of chronic bladder/pelvic pain with urinary frequency and urgency. Though IC/BPS can be hereditary, little is known of its genetic etiology. Using the eMERGE data, we confirmed known phenotypic associations such as gastroesophageal reflux disease and irritable bowel syndrome and detected new associations, including osteoarthrosis/osteoarthritis and Barrett's esophagus. An exome wide ultra-rare variants analysis in 348 IC/BPS and 11,981 controls extended the previously reported association with ATP2C1 and ATP2A2, implicated in Mendelian desquamating skin disorders, but did not provide evidence for other previously proposed pathogenic pathways such as bladder development, nociception or inflammation. Pathway analysis detected new associations with "anaphase-promoting complex-dependent catabolic process", the "regulation of MAPK cascade" and "integrin binding". These findings suggest perturbations in biological networks for epithelial integrity and cell cycle progression in IC/BPS pathogenesis, and provide a roadmap for its future investigation.
Collapse
Affiliation(s)
- Joshua E Motelow
- Division of Critical Care and Hospital Medicine, Department of Pediatrics, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
| | - Ayan Malakar
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Sarath Babu Krishna Murthy
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Miguel Verbitsky
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Atlas Kahn
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
| | - Elicia Estrella
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA
| | - Louis Kunkel
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA
- The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston MA
| | - Madelyn Wiesenhahn
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA
- The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston MA
| | - Jaimee Becket
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Natasha Harris
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Richard Lee
- Department of Urology, Boston Children's Hospital, Harvard Medical School, Boston MA
| | - Rosalyn Adam
- Department of Urology, Boston Children's Hospital, Harvard Medical School, Boston MA
- Department of Surgery, Harvard Medical School, Boston, MA
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
| | - Ali G Gharavi
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY
- Center for Precision Medicine and Genomics, Department of Medicine, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY
| | - Catherine A Brownstein
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA
- The Manton Center for Orphan Disease Research, Boston Children's Hospital, Harvard Medical School, Boston MA
| |
Collapse
|
22
|
Scalici A, Miller-Fleming TW, Shuey MM, Baker JT, Betti M, Hirbo J, Knapik EW, Cox NJ. Gene and phenome-based analysis of the shared genetic architecture of eye diseases. Am J Hum Genet 2025; 112:318-331. [PMID: 39879988 PMCID: PMC11866973 DOI: 10.1016/j.ajhg.2025.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 12/31/2024] [Accepted: 01/03/2025] [Indexed: 01/31/2025] Open
Abstract
While many eye disorders are linked through defects in vascularization and optic nerve degeneration, genetic correlation studies have yielded variable results despite shared features. For example, glaucoma and myopia both share optic neuropathy as a feature, but genetic correlation studies demonstrated minimal overlap. By leveraging electronic health record (EHR) resources that contain genetic variables such as genetically predicted gene expression (GPGE), researchers have the potential to improve the identification of shared genetic drivers of disease by incorporating knowledge of shared features to identify disease-causing mechanisms. In this study, we examined shared genetic architecture across eye diseases. Our gene-based approach used transcriptome-wide association methods to identify shared transcriptomic profiles across eye diseases within BioVU, Vanderbilt University Medical Center's (VUMC's) EHR-linked biobank. Our phenome-based approach leveraged phenome-wide association studies (PheWASs) to identify eye disease comorbidities. Using the beta estimates from the significantly associated comorbidities, we constructed a phenotypic risk score (PheRS) representing a weighted sum of an individual's eye disease comorbidities. This PheRS is predictive of eye disease status and associated with the altered GPGE of significant genes in an independent population. The implementation of both gene- and phenome-based approaches can expand genetic associations and shed greater insight into the underlying mechanisms of shared genetic architecture across eye diseases.
Collapse
Affiliation(s)
- Alexandra Scalici
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Tyne W Miller-Fleming
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Megan M Shuey
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - James T Baker
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Michael Betti
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Jibril Hirbo
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ela W Knapik
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Nancy J Cox
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
23
|
Wan NC, Grabowska ME, Kerchberger VE, Wei WQ. Exploring beyond diagnoses in electronic health records to improve discovery: a review of the phenome-wide association study. JAMIA Open 2025; 8:ooaf006. [PMID: 40041255 PMCID: PMC11879097 DOI: 10.1093/jamiaopen/ooaf006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Revised: 12/30/2024] [Accepted: 01/24/2025] [Indexed: 03/06/2025] Open
Abstract
Objective The phenome-wide association study (PheWAS) systematically examines the phenotypic spectrum extracted from electronic health records (EHRs) to uncover correlations between phenotypes and exposures. This review explores methodologies, highlights challenges, and outlines future directions for EHR-driven PheWAS. Materials and Methods We searched the PubMed database for articles spanning from 2010 to 2023, and we collected data regarding exposures, phenotypes, cohorts, terminologies, replication, and ancestry. Results Our search yielded 690 articles. Following exclusion criteria, we identified 291 articles published between January 1, 2010, and December 31, 2023. A total number of 162 (55.6%) articles defined phenomes using phecodes, indicating that research is reliant on the organization of billing codes. Moreover, 72.8% of articles utilized exposures consisting of genetic data, and the majority (69.4%) of PheWAS lacked replication analyses. Discussion Existing literature underscores the need for deeper phenotyping, variability in PheWAS exposure variables, and absence of replication in PheWAS. Current applications of PheWAS mainly focus on cardiovascular, metabolic, and endocrine phenotypes; thus, applications of PheWAS in uncommon diseases, which may lack structured data, remain largely understudied. Conclusions With modern EHRs, future PheWAS should extend beyond diagnosis codes and consider additional data like clinical notes or medications to create comprehensive phenotype profiles that consider severity, temporality, risk, and ancestry. Furthermore, data interoperability initiatives may help mitigate the paucity of PheWAS replication analyses. With the growing availability of data in EHR, PheWAS will remain a powerful tool in precision medicine.
Collapse
Affiliation(s)
- Nicholas C Wan
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37240, United States
| | - Monika E Grabowska
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
| | - Vern Eric Kerchberger
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
| |
Collapse
|
24
|
Sriram V, Woerner J, Ahn YY, Kim D. The interplay of sex and genotype in disease associations: a comprehensive network analysis in the UK Biobank. Hum Genomics 2025; 19:4. [PMID: 39825454 PMCID: PMC11740496 DOI: 10.1186/s40246-024-00710-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Accepted: 12/17/2024] [Indexed: 01/20/2025] Open
Abstract
BACKGROUND Disease comorbidities and longer-term complications, arising from biologically related associations across phenotypes, can lead to increased risk of severe health outcomes. Given that many diseases exhibit sex-specific differences in their genetics, our objective was to determine whether genotype-by-sex (GxS) interactions similarly influence cross-phenotype associations. Through comparison of sex-stratified disease-disease networks (DDNs)-where nodes represent diseases and edges represent their relationships-we investigate sex differences in patterns of polygenicity and pleiotropy between diseases. RESULTS Using UK Biobank summary statistics, we built male- and female-specific DDNs for 103 diseases. This revealed that male and female diseasomes have similar topology and central diseases (e.g., hypertensive, chronic respiratory, and thyroid-based disorders), yet some phenotypes exhibit sex-specific influence in cross-phenotype associations. Multiple sclerosis and osteoarthritis are central only in the female DDN, while cardiometabolic diseases and skin cancer are more prominent in the male DDN. Edge comparison indicated similar shared genetics between the two graphs relative to a random model of disease association, though notable discrepancies in embedding distances and clustering patterns imply a more expansive genetic influence on multimorbidity risk for females than males. Analysis of pleiotropic contributions of two sexually-dimorphic single-nucleotide polymorphisms related to thyroid disorders further validated a distinct genetic architecture across sexes that influences associations, confirmed through examination of corresponding gene expression profiles from the GTEx Portal. CONCLUSIONS Our analysis affirms the presence of GxS interactions in cross-phenotype associations, emphasizing the need to investigate the role of sex in disease onset and its importance in biomedical discovery and precision medicine research.
Collapse
Affiliation(s)
- Vivek Sriram
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Richards Building B304, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA
| | - Jakob Woerner
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Richards Building B304, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA
| | - Yong-Yeol Ahn
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing, and Engineering, Indiana University Bloomington, Bloomington, IN, 47405, USA.
| | - Dokyoon Kim
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Richards Building B304, 3700 Hamilton Walk, Philadelphia, PA, 19104, USA.
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
25
|
Friedman SF, Khurshid S, Venn RA, Wang X, Diamant N, Di Achille P, Weng LC, Choi SH, Reeder C, Pirruccello JP, Singh P, Lau ES, Philippakis A, Anderson CD, Maddah M, Batra P, Ellinor PT, Ho JE, Lubitz SA. Unsupervised deep learning of electrocardiograms enables scalable human disease profiling. NPJ Digit Med 2025; 8:23. [PMID: 39799251 PMCID: PMC11724961 DOI: 10.1038/s41746-024-01418-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Accepted: 12/21/2024] [Indexed: 01/15/2025] Open
Abstract
The 12-lead electrocardiogram (ECG) is inexpensive and widely available. Whether conditions across the human disease landscape can be detected using the ECG is unclear. We developed a deep learning denoising autoencoder and systematically evaluated associations between ECG encodings and ~1,600 Phecode-based diseases in three datasets separate from model development, and meta-analyzed the results. The latent space ECG model identified associations with 645 prevalent and 606 incident Phecodes. Associations were most enriched in the circulatory (n = 140, 82% of category-specific Phecodes), respiratory (n = 53, 62%) and endocrine/metabolic (n = 73, 45%) categories, with additional associations across the phenome. The strongest ECG association was with hypertension (p < 2.2×10-308). The ECG latent space model demonstrated more associations than models using standard ECG intervals, and offered favorable discrimination of prevalent disease compared to models comprising age, sex, and race. We further demonstrate how latent space models can be used to generate disease-specific ECG waveforms and facilitate individual disease profiling.
Collapse
Grants
- U01NS069763 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- K24HL105780 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- 21SFRN812095 American Heart Association (American Heart Association, Inc.)
- 18SFRN34250007 American Heart Association (American Heart Association, Inc.)
- 23CDA1050571 American Heart Association (American Heart Association, Inc.)
- 18SFRN34110082 American Heart Association (American Heart Association, Inc.)
- R01HL140224 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01HL139731 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- K23 HL159243 NHLBI NIH HHS
- K23HL159243 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- K08HL159346 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01HL160003 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- 1R01HL092577 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- K23HL169839 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01NS103924 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- K24HL153669 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- 1R01HL139731 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01HL134893 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- 853922 American Heart Association (American Heart Association, Inc.)
- U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- NHLBI BioData Catalyst Fellows program
- European Union MAESTRIA 965286
Collapse
Affiliation(s)
- Sam F Friedman
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Shaan Khurshid
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA
| | - Rachael A Venn
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA
| | - Xin Wang
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nate Diamant
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Paolo Di Achille
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lu-Chen Weng
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Seung Hoan Choi
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Christopher Reeder
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - James P Pirruccello
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Division of Cardiology, University of California San Francisco, San Francisco, San Francisco, CA, USA
| | - Pulkit Singh
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Emily S Lau
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Cardiology, Massachusetts General Hospital, Boston, MA, USA
| | | | - Christopher D Anderson
- Department of Neurology, Brigham and Women's Hospital, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Henry and Allison McCance Center for Brain Health, Massachusetts General Hospital, Boston, MA, USA
| | - Mahnaz Maddah
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Puneet Batra
- Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Patrick T Ellinor
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA
| | - Jennifer E Ho
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- CardioVascular Institute and Division of Cardiology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Steven A Lubitz
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.
- Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Telemachus and Irene Demoulas Family Foundation Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, MA, USA.
| |
Collapse
|
26
|
Jung EM, Raduski AR, Mills LJ, Spector LG. A phenome-wide association study of polygenic scores for selected childhood cancer: Results from the UK Biobank. HGG ADVANCES 2025; 6:100356. [PMID: 39340156 PMCID: PMC11538869 DOI: 10.1016/j.xhgg.2024.100356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 09/24/2024] [Accepted: 09/04/2024] [Indexed: 09/30/2024] Open
Abstract
The aim of this study was to scan phenotypes in adulthood associated with polygenic risk scores (PRS) for childhood cancers with well-articulated genetic architectures-acute lymphoblastic leukemia (ALL), Ewing sarcoma, and neuroblastoma-to examine genetic pleiotropy. Furthermore, we aimed to determine which SNPs could drive associations. Per-SNP summary statistics were extracted for PRS calculation. Participants with white British ancestry were exclusively included for analyses. SNPs were queried from the UK Biobank genotype imputation data. Records from the cancer registry, death registry, and inpatient diagnoses were abstracted for phenome-wide scans. Firth logistic regression was used to estimate odds ratios (ORs) and 95% confidence intervals (CIs) alongside corresponding p values, adjusting for age at recruitment and sex. A total of 244,332 unrelated white British participants were included. We observed a significant association between ALL-PRS and ALL (OR: 1.20e+24, 95% CI: 9.08e+14-1.60e+33). In addition, we observed a significant association between high-risk neuroblastoma PRS and nonrheumatic aortic valve disorders (OR: 43.9, 95% CI: 7.42-260). There were no significant phenotype associations with Ewing sarcoma and neuroblastoma PRS. Regarding individual SNPs, rs17607816 increased the risk of ALL (OR: 6.40, 95% CI: 3.26-12.57). For high-risk neuroblastoma, rs80059929 elevated the risk of atrioventricular block (OR: 3.04, 95% CI: 1.85-4.99). Our findings suggest that individuals with genetic susceptibility to ALL may face a lifelong risk for developing ALL, along with a genetic pleiotropic association between high-risk neuroblastoma and circulatory diseases.
Collapse
Affiliation(s)
- Eun Mi Jung
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA.
| | - Andrew R Raduski
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA
| | - Lauren J Mills
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA; Masonic Cancer Center, University of Minnesota, Minneapolis, MN, USA
| | - Logan G Spector
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA; Masonic Cancer Center, University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
27
|
Zucker R, Kelman G, Linial M. PWAS Hub: exploring gene-based associations of complex diseases with sex dependency. Nucleic Acids Res 2025; 53:D1132-D1143. [PMID: 39565197 PMCID: PMC11701668 DOI: 10.1093/nar/gkae1125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 10/15/2024] [Accepted: 11/18/2024] [Indexed: 11/21/2024] Open
Abstract
The Proteome-Wide Association Study (PWAS) is a protein-based genetic association approach designed to complement traditional variant-based methods like GWAS. PWAS operates in two stages: first, machine learning models predict the impact of genetic variants on protein-coding genes, generating effect scores. These scores are then aggregated into a gene-damaging score for each individual. This score is then used in case-control statistical tests to significantly link to specific phenotypes. PWAS Hub (v1.2) is a user-friendly platform that facilitates the exploration of gene-disease associations using clinical and genetic data from the UK Biobank (UKB), encompassing 500k individuals. PWAS Hub reports on 819 diseases and phenotypes determined by PheCode and ICD-10 clinical codes, each with a minimum of 400 affected individuals. PWAS-derived gene associations were reported for 72% of the tested phenotypes. The PWAS Hub also analyzes gene associations separately for males and females, considering sex-specific genetic effects, inheritance patterns (dominant and recessive), and gene pleiotropy. We illustrated the utility of the PWAS Hub for primary (essential) hypertension (I10), type 2 diabetes mellitus (E11), and specified haematuria (R31) that showed sex-dependent genetic signals. The PWAS Hub, available at pwas.huji.ac.il, is a valuable resource for studying genetic contributions to common diseases and sex-specific effects.
Collapse
Affiliation(s)
- Roei Zucker
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
| | - Guy Kelman
- The Jerusalem Center for Personalized Computational Medicine, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 9112102, Israel
| | - Michal Linial
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
| |
Collapse
|
28
|
De Jager P, Zeng L, Khan A, Lama T, Chitnis T, Weiner H, Wang G, Fujita M, Zipp F, Taga M, Kiryluk K. GWAS highlights the neuronal contribution to multiple sclerosis susceptibility. RESEARCH SQUARE 2025:rs.3.rs-5644532. [PMID: 39866869 PMCID: PMC11760239 DOI: 10.21203/rs.3.rs-5644532/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Multiple Sclerosis (MS) is a chronic inflammatory and neurodegenerative disease affecting the brain and spinal cord. Genetic studies have identified many risk loci, that were thought to primarily impact immune cells and microglia. Here, we performed a multi-ancestry genome-wide association study with 20,831 MS and 729,220 control participants, identifying 236 susceptibility variants outside the Major Histocompatibility Complex, including four novel loci. We derived a polygenic score for MS and, optimized for European ancestry, it is informative for African-American and Latino participants. Integrating single-cell data from blood and brain tissue, we identified 76 genes affected by MS risk variants. Notably, while T cells showed the strongest enrichment, inhibitory neurons emerged as a key cell type. The expression of IL7 and STAT3 are affected only in inhibitory neurons, highlighting the importance of neuronal and glial dysfunction in MS susceptibility.
Collapse
Affiliation(s)
| | - Lu Zeng
- Columbia University Irving Medical Center
| | | | | | | | | | | | | | - Frauke Zipp
- University Medical Center of the Johannes Gutenberg University Mainz
| | - Mariko Taga
- Center for Translational & Computational Neuroimmunology
| | | |
Collapse
|
29
|
Winters JLG, Piekos JA, Hellwege JN, Dikilitas O, Kullo IJ, Schaid DJ, Edwards TL, Velez Edwards DR. Constructing a multi-ancestry polygenic risk score for uterine fibroids using publicly available data highlights need for inclusive genetic research. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2025; 30:268-280. [PMID: 39670376 PMCID: PMC11731894 DOI: 10.1142/9789819807024_0020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2025]
Abstract
Uterine leiomyomata, or fibroids, are common gynecological tumors causing pelvic and menstrual symptoms that can negatively affect quality of life and child-bearing desires. As fibroids grow, symptoms can intensify and lead to invasive treatments that are less likely to preserve fertility. Identifying individuals at highest risk for fibroids can aid in access to earlier diagnoses. Polygenic risk scores (PRS) quantify genetic risk to identify those at highest risk for disease. Utilizing the PRS software PRS-CSx and publicly available genome-wide association study (GWAS) summary statistics from FinnGen and Biobank Japan, we constructed a multi-ancestry (META) PRS for fibroids. We validated the META PRS in two cross-ancestry cohorts. In the cross-ancestry Electronic Medical Record and Genomics (eMERGE) Network cohort, the META PRS was significantly associated with fibroid status and exhibited 1.11 greater odds for fibroids per standard deviation increase in PRS (95% confidence interval [CI]: 1.05 - 1.17, p = 5.21x10-5). The META PRS was validated in two BioVU cohorts: one using ICD9/ICD10 codes and one requiring imaging confirmation of fibroid status. In the ICD cohort, a standard deviation increase in the META PRS increased the odds of fibroids by 1.23 (95% CI: 1.15 - 1.32, p = 9.68x10-9), while in the imaging cohort, the odds increased by 1.26 (95% CI: 1.18 - 1.35, p = 2.40x10-11). We subsequently constructed single ancestry PRS for FinnGen (European ancestry [EUR]) and Biobank Japan (East Asian ancestry [EAS]) using PRS-CS and discovered a nominally significant association in the eMERGE cohort within fibroids and EAS PRS but not EUR PRS (95% CI: 1.09 - 1.20, p = 1.64x10-7). These findings highlight the strong predictive power of multi-ancestry PRS over single ancestry PRS. This study underscores the necessity of diverse population inclusion in genetic research to ensure precision medicine benefits all individuals equitably.
Collapse
Affiliation(s)
- Jessica L G Winters
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Jacqueline A Piekos
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Jacklyn N Hellwege
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Ozan Dikilitas
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN 55905, USA
| | - Daniel J Schaid
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN 55905, USA
| | - Todd L Edwards
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, USA,
| | - Digna R Velez Edwards
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, USA,
| |
Collapse
|
30
|
Šimon M, Čater M, Kunej T, Morton NM, Horvat S. A bioinformatics toolbox to prioritize causal genetic variants in candidate regions. Trends Genet 2025; 41:33-46. [PMID: 39414414 DOI: 10.1016/j.tig.2024.09.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 08/28/2024] [Accepted: 09/19/2024] [Indexed: 10/18/2024]
Abstract
This review addresses the significant challenge of identifying causal genetic variants within quantitative trait loci (QTLs) for complex traits and diseases. Despite progress in detecting the ever-larger number of such loci, establishing causality remains daunting. We advocate for integrating bioinformatics and multiomics analyses to streamline the prioritization of candidate genes' variants. Our case study on the Pla2g4e gene, identified previously as a positional candidate obesity gene through genetic mapping and expression studies, demonstrates how applying multiomic data filtered through regulatory elements containing SNPs can refine the search for causative variants. This approach can yield results that guide more efficient experimental strategies, accelerating genetic research toward functional validation and therapeutic development.
Collapse
Affiliation(s)
- Martin Šimon
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, 1230 Domžale, Slovenia
| | - Maša Čater
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, 1230 Domžale, Slovenia
| | - Tanja Kunej
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, 1230 Domžale, Slovenia
| | - Nicholas M Morton
- Department of Biosciences, Centre for Systems Health and Integrated Metabolic Research, School of Science and Technology, Nottingham Trent University, Clifton Lane, Nottingham NG11 8NS, UK.
| | - Simon Horvat
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, 1230 Domžale, Slovenia.
| |
Collapse
|
31
|
Merritt VC, Zhang R, Sherva R, Ly MT, Marra D, Panizzon MS, Tsuang DW, Hauger RL, Logue MW. Curation and validation of electronic medical record-based dementia diagnoses in the VA Million Veteran Program. J Alzheimers Dis 2025; 103:180-193. [PMID: 39692476 DOI: 10.1177/13872877241299130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2024]
Abstract
BACKGROUND The age distribution and diversity of the VA Million Veteran Program (MVP) cohort make it a valuable resource for studying the genetics of Alzheimer's disease (AD) and related dementias (ADRD). OBJECTIVE We present and evaluate the performance of several International Classification of Diseases (ICD) code-based classification algorithms for AD, ADRD, and dementia for use in MVP genetic studies and other studies using VA electronic medical record (EMR) data. These were benchmarked relative to existing ICD algorithms and AD-medication-identified cases. METHODS We used chart review of n = 103 MVP participants to evaluate diagnostic utility of the algorithms. Suitability for genetic studies was examined by assessing association with APOE ε4, the strongest genetic AD risk factor, in a large MVP cohort (n = 286 K). RESULTS The newly developed MVP-ADRD algorithm performed well, comparable to the existing PheCode dementia algorithm (Phe-Dementia) in terms of sensitivity (0.95 and 0.95) and specificity (0.65 and 0.70). The strongest APOE ε4 associations were observed in cases identified using MVP-ADRD and Phe-Dementia augmented with medication-identified cases (MVP-ADRD or medication, p = 3.6 ×10-290; Phe-Dementia or medication, p = 1.4 ×10-290). Performance was improved when cases were restricted to those with onset age ≥60. CONCLUSIONS We found that our MVP-developed ICD-based algorithms had good performance in chart review and generated strong genetic signals, especially after inclusion of medication-identified cases. Ultimately, our MVP-derived algorithms are likely to have good performance in the broader VA, and their performance may also be suitable for use in other large-scale EMR-based biobanks in the absence of definitive biomarkers such as amyloid-PET and cerebrospinal fluid biomarkers.
Collapse
Affiliation(s)
- Victoria C Merritt
- Research Service, VA San Diego Healthcare System, San Diego, USA
- Department of Psychiatry, School of Medicine, University of California, San Diego, La Jolla, USA
- Center of Excellence for Stress and Mental Health, VA San Diego Healthcare System, San Diego, USA
| | - Rui Zhang
- National Center for PTSD, Behavioral Sciences Division, VA Boston Healthcare System, Boston, USA
| | - Richard Sherva
- National Center for PTSD, Behavioral Sciences Division, VA Boston Healthcare System, Boston, USA
- Boston University Chobanian & Avedisian School of Medicine, Biomedical Genetics, Boston, USA
| | - Monica T Ly
- Research Service, VA San Diego Healthcare System, San Diego, USA
- Department of Psychiatry, School of Medicine, University of California, San Diego, La Jolla, USA
- Department of Neurology, Boston University School of Medicine, Boston, USA
| | - David Marra
- Department of Psychiatry, Boston University Chobanian & Avedisian School of Medicine, Boston, USA
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, USA
| | - Matthew S Panizzon
- Department of Psychiatry, School of Medicine, University of California, San Diego, La Jolla, USA
- Center for Behavior Genetics of Aging, University of California, San Diego, La Jolla, USA
| | - Debby W Tsuang
- Geriatric Research, Education, and Clinical Center, VA Puget Sound Health Care System, Seattle, USA
- Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, USA
| | - Richard L Hauger
- Research Service, VA San Diego Healthcare System, San Diego, USA
- Department of Psychiatry, School of Medicine, University of California, San Diego, La Jolla, USA
- Center of Excellence for Stress and Mental Health, VA San Diego Healthcare System, San Diego, USA
- Center for Behavior Genetics of Aging, University of California, San Diego, La Jolla, USA
| | - Mark W Logue
- National Center for PTSD, Behavioral Sciences Division, VA Boston Healthcare System, Boston, USA
- Boston University Chobanian & Avedisian School of Medicine, Biomedical Genetics, Boston, USA
- Department of Neurology, Boston University School of Medicine, Boston, USA
- Department of Biostatistics, Boston University School of Public Health, Boston, USA
| |
Collapse
|
32
|
Reed ZE, Wootton RE, Khouja JN, Richardson TG, Sanderson E, Davey Smith G, Munafò MR. Exploring pleiotropy in Mendelian randomisation analyses: What are genetic variants associated with 'cigarette smoking initiation' really capturing? Genet Epidemiol 2025; 49:e22583. [PMID: 39099143 PMCID: PMC7616876 DOI: 10.1002/gepi.22583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 06/28/2024] [Accepted: 07/17/2024] [Indexed: 08/06/2024]
Abstract
Genetic variants used as instruments for exposures in Mendelian randomisation (MR) analyses may have horizontal pleiotropic effects (i.e., influence outcomes via pathways other than through the exposure), which can undermine the validity of results. We examined the extent of this using smoking behaviours as an example. We first ran a phenome-wide association study in UK Biobank, using a smoking initiation genetic instrument. From the most strongly associated phenotypes, we selected those we considered could either plausibly or not plausibly be caused by smoking. We examined associations between genetic instruments for smoking initiation, smoking heaviness and lifetime smoking and these phenotypes in UK Biobank and the Avon Longitudinal Study of Parents and Children (ALSPAC). We conducted negative control analyses among never smokers, including children. We found evidence that smoking-related genetic instruments were associated with phenotypes not plausibly caused by smoking in UK Biobank and (to a lesser extent) ALSPAC. We observed associations with phenotypes among never smokers. Our results demonstrate that smoking-related genetic risk scores are associated with unexpected phenotypes that are less plausibly downstream of smoking. This may reflect horizontal pleiotropy in these genetic risk scores, and we would encourage researchers to exercise caution this when using these and genetic risk scores for other complex behavioural exposures. We outline approaches that could be taken to consider this and overcome issues caused by potential horizontal pleiotropy, for example, in genetically informed causal inference analyses (e.g., MR) it is important to consider negative control outcomes and triangulation approaches, to avoid arriving at incorrect conclusions.
Collapse
Affiliation(s)
- Zoe E. Reed
- School of Psychological ScienceUniversity of BristolBristolUK
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
| | - Robyn E. Wootton
- School of Psychological ScienceUniversity of BristolBristolUK
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
- Nic Waals Institute, Lovisenberg Diaconal HospitalOsloNorway
| | - Jasmine N. Khouja
- School of Psychological ScienceUniversity of BristolBristolUK
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
| | - Tom G. Richardson
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
| | - Eleanor Sanderson
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
| | - George Davey Smith
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
| | - Marcus R. Munafò
- School of Psychological ScienceUniversity of BristolBristolUK
- MRC Integrative Epidemiology Unit, Bristol Medical SchoolUniversity of BristolBristolUK
- National Institute for Health Research Bristol Biomedical Research Centre, University Hospitals Bristol NHS Foundation TrustUniversity of BristolBristolUK
| |
Collapse
|
33
|
Lake AM, Zhou Y, Wang B, Actkins KV, Zhang Y, Shelley JP, Rajamani A, Steigman M, Kennedy CJ, Smoller JW, Choi KW, Khankari NK, Davis LK. Sexual Trauma, Polygenic Scores, and Mental Health Diagnoses and Outcomes. JAMA Psychiatry 2025; 82:75-84. [PMID: 39475956 PMCID: PMC11581726 DOI: 10.1001/jamapsychiatry.2024.3426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 08/16/2024] [Indexed: 11/13/2024]
Abstract
Importance Leveraging real-world clinical biobanks to investigate the associations between genetic and environmental risk factors for mental illness may help direct clinical screening efforts and evaluate the portability of polygenic scores across environmental contexts. Objective To examine the associations between sexual trauma, polygenic liability to mental health outcomes, and clinical diagnoses of schizophrenia, bipolar disorder, and major depressive disorder in a clinical biobank setting. Design, Setting, and Participants This genetic association study was conducted using clinical and genotyping data from 96 002 participants across hospital-linked biobanks located at Vanderbilt University Medical Center (VUMC), Nashville, Tennessee (including 58 262 individuals with high genetic similarity to the 1000 Genomes Project [1KG] Northern European from Utah reference population [1KG-EU-clustered] and 11 047 with high genetic similarity to the 1KG African-ancestry reference population of Yoruba in Ibadan, Nigeria [1KG-YRI-clustered]), and Mass General Brigham (MGB), Boston, Massachusetts (26 693 individuals with high genetic similarity to the combined European-ancestry superpopulation [1KG-EU-clustered]). Clinical data analyzed included diagnostic billing codes and clinical notes spanning from 1976 to 2023. Data analysis was performed from 2022 to 2024. Exposures Clinically documented sexual trauma disclosures and polygenic scores for schizophrenia, bipolar disorder, and major depressive disorder. Main Outcomes and Measures Diagnoses of schizophrenia, bipolar disorder, and major depressive disorder, determined by aggregating related diagnostic billing codes, were the dependent variables in logistic regression models including sexual trauma disclosure status, polygenic scores, and their interactions as the independent variables. Results Across the VUMC and MGB biobanks, 96 002 individuals were included in analyses (VUMC 1KG-EU-clustered: 33 011 [56.7%] female; median [range] age, 56.8 [10.0 to >89] years; MGB 1KG-EU-clustered: 14 647 [54.9%] female; median [range] age, 58.0 [10.0 to >89] years; VUMC 1KG-YRI-clustered: 6961 [63.0%] female; median [range] age, 44.6 [10.1 to >89] years). Sexual trauma history was associated with all mental health conditions across institutions (ORs ranged from 8.83 [95% CI, 5.50-14.18] for schizophrenia in the VUMC 1KG-YRI-clustered cohort to 17.65 [95% CI, 12.77-24.40] for schizophrenia in the VUMC 1KG-EU-clustered cohort). Sexual trauma history and polygenic scores jointly explained 3.8% to 8.8% of mental health phenotypic variance. Schizophrenia and bipolar disorder polygenic scores had greater associations with mental health outcomes in individuals with no documented disclosures of sexual trauma (schizophrenia interaction: OR, 0.70 [95% CI, 0.56-0.88]; bipolar disorder interaction: OR, 0.83 [95% CI, 0.74-0.94]). Conclusions and Relevance Sexual trauma and mental health polygenic scores, while correlated with one another, were independent and joint risk factors for severe mental illness in a large, diverse hospital biobank population. Furthermore, associations of schizophrenia and bipolar disorder polygenic scores with respective diagnoses were greater in those without disclosures, suggesting that genetic predisposition to mental illness as measured by polygenic scores may be less impactful in the presence of this severe environmental risk factor.
Collapse
Affiliation(s)
- Allison M. Lake
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Yu Zhou
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston
| | - Bo Wang
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston
| | - Ky’Era V. Actkins
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Yingzhe Zhang
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts
| | - John P. Shelley
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Anindita Rajamani
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis
| | - Michael Steigman
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
| | - Chris J. Kennedy
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
| | - Jordan W. Smoller
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston
| | - Karmel W. Choi
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston
| | - Nikhil K. Khankari
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
| | - Lea K. Davis
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee
- Division of Data-Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York
| |
Collapse
|
34
|
Hall M, Skinderhaug MK, Almaas E. Phenome-wide association network demonstrates close connection with individual disease trajectories from the HUNT study. PLoS One 2024; 19:e0311485. [PMID: 39729424 DOI: 10.1371/journal.pone.0311485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Accepted: 09/16/2024] [Indexed: 12/29/2024] Open
Abstract
Disease networks offer a potential road map of connections between diseases. Several studies have created disease networks where diseases are connected either based on shared genes or Single Nucleotide Polymorphism (SNP) associations. However, it is still unclear to which degree SNP-based networks map to empirical, co-observed diseases within a different, general, adult study population spanning over a long time period. We created a SNP-based phenome-wide association network (PheNet) from a large population using the UK biobank phenome-wide association studies. Importantly, the SNP-associations are unbiased towards much studied diseases, adjusted for linkage disequilibrium, case/control imbalances, as well as relatedness. We map the PheNet to significantly co-occurring diseases in the Norwegian HUNT study population, and further, identify consecutively occurring diseases with significant ordering in occurrence, independent of age and gender in the PheNet. Our analysis reveals an overlap far larger than expected by chance between the two disease networks, with diseases typically connecting within their own category. Upon examining the sequential occurrence of diseases in the HUNT dataset, we find a giant component consisting of mostly cardiovascular disorders. This allows us to identify sequentially occurring diseases that are genetically linked and co-occur frequently, while also highlighting non-sequential diseases. Furthermore, we observe that survivors of severe cardiovascular diseases subsequently often face less severe conditions, but with a reduced time until their next fatal illness. The HUNT sub-PheNet showing both genetically and co-observed diseases offers an interesting framework to study groups of diseases and examine if they, in fact, are comorbidities. We find that the HUNT sub-PheNet offers the possibility to pinpoint exactly which mutation(s) constitute shared cause of the diseases. This could be of great benefit to both researchers and clinicians studying relationships between diseases.
Collapse
Affiliation(s)
- Martina Hall
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Marit K Skinderhaug
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Eivind Almaas
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- K. G. Jebsen Center for Genetic Epidemiology, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| |
Collapse
|
35
|
Tran TC, Schlueter DJ, Zeng C, Mo H, Carroll RJ, Denny JC. PheWAS analysis on large-scale biobank data with PheTK. Bioinformatics 2024; 41:btae719. [PMID: 39657951 PMCID: PMC11709244 DOI: 10.1093/bioinformatics/btae719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 10/16/2024] [Accepted: 12/06/2024] [Indexed: 12/12/2024] Open
Abstract
SUMMARY With the rapid growth of genetic data linked to electronic health record (EHR) data in huge cohorts, large-scale phenome-wide association study (PheWAS) have become powerful discovery tools in biomedical research. PheWAS is an analysis method to study phenotype associations utilizing longitudinal EHR data. Previous PheWAS packages were developed mostly with smaller datasets and with earlier PheWAS approaches. PheTK was designed to simplify analysis and efficiently handle biobank-scale data. PheTK uses multithreading and supports a full PheWAS workflow including extraction of data from OMOP databases and Hail matrix tables as well as PheWAS analysis for both phecode version 1.2 and phecodeX. Benchmarking results showed PheTK took 64% less time than the R PheWAS package to complete the same workflow. PheTK can be run locally or on cloud platforms such as the All of Us Researcher Workbench (All of Us) or the UK Biobank (UKB) Research Analysis Platform (RAP). AVAILABILITY AND IMPLEMENTATION The PheTK package is freely available on the Python Package Index, on GitHub under GNU General Public License (GPL-3) at https://github.com/nhgritctran/PheTK, and on Zenodo, DOI 10.5281/zenodo.14217954, at https://doi.org/10.5281/zenodo.14217954. PheTK is implemented in Python and platform independent.
Collapse
Affiliation(s)
- Tam C Tran
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States
| | - David J Schlueter
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States
- University of Toronto, ON, M5S 1A1, Canada
| | - Chenjie Zeng
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States
| | - Huan Mo
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States
| | - Robert J Carroll
- Vanderbilt University School of Medicine, Nashville, TN, 37240, United States
| | - Joshua C Denny
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, United States
- All of Us Research Program, National Institutes of Health, Bethesda, MD, 20892, United States
| |
Collapse
|
36
|
Wilcox H, Saxena R, Winkelman JW, Dashti HS. Clinical and genetic associations for night eating syndrome in a patient biobank. J Eat Disord 2024; 12:211. [PMID: 39716312 DOI: 10.1186/s40337-024-01180-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 12/12/2024] [Indexed: 12/25/2024] Open
Abstract
OBJECTIVE Night eating syndrome (NES) is an eating disorder characterized by evening hyperphagia. Despite having a prevalence comparable to some other eating disorders, NES remains sparsely investigated and poorly characterized. The present study examined the phenotypic and genetic associations for NES in the clinical Mass General Brigham Biobank. METHOD Cases of NES were identified through relevant billing codes for eating disorders (F50.89/F50.9) and subsequent chart review; patients likely without NES were set as controls. Other diagnoses were determined from billing codes and collapsed into one of 1,857 distinct phenotypes based on clinical similarity. NES associations with diagnoses were systematically conducted in phenome-wide association scans using logistic regression models with adjustments for age, sex, race, and ethnicity. Polygenic scores for six related traits, namely for anorexia nervosa, depression, insomnia, sleep apnea, obesity, and type 2 diabetes were tested for associations with NES among participants of European ancestry using adjusted logistic regression models. RESULTS Phenome-wide scans comparing patients with NES against controls (cases n = 88; controls n = 64,539) identified associations with 159 clinical diagnoses spanning 13 broad disease groups including endocrine/metabolic and digestive diseases. Notable associations were evident for bariatric surgery, vitamin D deficiency, sleep disorders (sleep apnea, insomnia, and restless legs syndrome), and attention deficit hyperactivity disorder. The polygenic scores for insomnia and obesity were associated with higher odds of NES (insomnia: odds ratio [OR], 1.24; 95% CI, 1.07, 1.43; obesity: 1.98; 95% CI, 1.71, 2.28). DISCUSSION Complementary phenome-wide and genetic exploratory analyses provided information on unique and shared features of NES, offering insights that may facilitate its precise definition, diagnosis, and the development of targeted therapeutic interventions.
Collapse
Affiliation(s)
- Hannah Wilcox
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Richa Saxena
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of M.I.T and Harvard, Cambridge, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - John W Winkelman
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
- Sleep Disorders Clinical Research Program, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Hassan S Dashti
- Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
- Programs in Metabolism and Medical & Population Genetics, The Broad Institute of M.I.T and Harvard, Cambridge, MA, USA.
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA.
- Division of Nutrition, Harvard Medical School, Boston, MA, USA.
- , 55 Fruit Street, Edwards 410C, Boston, MA, 02114, USA.
| |
Collapse
|
37
|
Zietz M, Gisladottir U, LaRow Brown K, Tatonetti NP. WebGWAS: A web server for instant GWAS on arbitrary phenotypes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.12.11.24318870. [PMID: 39711729 PMCID: PMC11661389 DOI: 10.1101/2024.12.11.24318870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Complex disease genetics is a key area of research for reducing disease and improving human health. Genome-wide association studies (GWAS) help in this research by identifying regions of the genome that contribute to complex disease risk. However, GWAS are computationally intensive and require access to individual-level genetic and health information, which presents concerns about privacy and imposes costs on researchers seeking to study complex diseases. Publicly released pan-biobank GWAS summary statistics provide immediate access to results for a subset of phenotypes, but they do not inform about all phenotypes or hand-crafted phenotype definitions, which are often more relevant to study. Here, we present WebGWAS, a new tool that allows researchers to obtain GWAS summary statistics for a phenotype of interest without needing access to individual-level genetic and phenotypic data. Our public web app can be used to study custom phenotype definitions, including inclusion and exclusion criteria, and to produce approximate GWAS summary statistics for that phenotype. WebGWAS computes approximate GWAS summary statistics very quickly (<10 seconds), and it does not store private health information. We also show how the statistical approximation underlying WebGWAS can be used to accelerate the computation of multi-phenotype GWAS among correlated phenotypes. Our tool provides a faster approach to GWAS for researchers interested in complex disease, providing approximate summary statistics in short order, without the need to collect, process, and produce GWAS results. Overall, this method advances complex disease research by facilitating more accessible and cost-effective genetic studies using large observational data.
Collapse
Affiliation(s)
- Michael Zietz
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, West Hollywood, CA 90069
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY 10032
| | - Undina Gisladottir
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY 10032
| | - Kathleen LaRow Brown
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY 10032
| | - Nicholas P. Tatonetti
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, West Hollywood, CA 90069
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY 10032
| |
Collapse
|
38
|
Zeng L, Atlas K, Lama T, Chitnis T, Weiner H, Wang G, Fujita M, Zipp F, Taga M, Kiryluk K, De Jager PL. GWAS highlights the neuronal contribution to multiple sclerosis susceptibility. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.12.04.24318500. [PMID: 39677438 PMCID: PMC11643295 DOI: 10.1101/2024.12.04.24318500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/17/2024]
Abstract
Multiple Sclerosis (MS) is a chronic inflammatory and neurodegenerative disease affecting the brain and spinal cord. Genetic studies have identified many risk loci, that were thought to primarily impact immune cells and microglia. Here, we performed a multi-ancestry genome-wide association study with 20,831 MS and 729,220 control participants, identifying 236 susceptibility variants outside the Major Histocompatibility Complex, including four novel loci. We derived a polygenic score for MS and, optimized for European ancestry, it is informative for African-American and Latino participants. Integrating single-cell data from blood and brain tissue, we identified 76 genes affected by MS risk variants. Notably, while T cells showed the strongest enrichment, inhibitory neurons emerged as a key cell type, highlighting the importance of neuronal and glial dysfunction in MS susceptibility.
Collapse
Affiliation(s)
- Lu Zeng
- Center for Translational and Computational Neuroimmunology & Columbia Multiple Sclerosis Center, Department of Neurology, Columbia University Irving Medical Center, New York, NY, USA
| | - Khan Atlas
- Division of Nephrology, Dept of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY, USA
| | - Tsering Lama
- Center for Translational and Computational Neuroimmunology & Columbia Multiple Sclerosis Center, Department of Neurology, Columbia University Irving Medical Center, New York, NY, USA
| | | | - Tanuja Chitnis
- Anne Romney Center for Neurologic Diseases and Brigham Multiple Sclerosis Center, Department of Neurology, Brigham & Women’s Hospital, Boston MA
| | - Howard Weiner
- Anne Romney Center for Neurologic Diseases and Brigham Multiple Sclerosis Center, Department of Neurology, Brigham & Women’s Hospital, Boston MA
| | - Gao Wang
- The Gertrude H. Sergievsky Center and the Department of Neurology, Columbia University, New York, NY, USA
| | - Masashi Fujita
- Center for Translational and Computational Neuroimmunology & Columbia Multiple Sclerosis Center, Department of Neurology, Columbia University Irving Medical Center, New York, NY, USA
| | - Frauke Zipp
- Department of Neurology and Focus Program Translational Neuroscience (FTN), Rhine Main Neuroscience Network (rmn2), University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany
| | - Mariko Taga
- Center for Translational and Computational Neuroimmunology & Columbia Multiple Sclerosis Center, Department of Neurology, Columbia University Irving Medical Center, New York, NY, USA
| | - Krzysztof Kiryluk
- Division of Nephrology, Dept of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY, USA
| | - Philip L. De Jager
- Center for Translational and Computational Neuroimmunology & Columbia Multiple Sclerosis Center, Department of Neurology, Columbia University Irving Medical Center, New York, NY, USA
| |
Collapse
|
39
|
Li R, Benz L, Duan R, Denny JC, Hakonarson H, Mosley JD, Smoller JW, Wei WQ, Lumley T, Ritchie MD, Moore JH, Chen Y. A One-Shot Lossless Algorithm for Cross-Cohort Learning in Mixed-Outcomes Analysis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.01.09.24301073. [PMID: 38260403 PMCID: PMC10802662 DOI: 10.1101/2024.01.09.24301073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
In cross-cohort studies, integrating diverse datasets, such as electronic health records (EHRs), is both essential and challenging due to cohort-specific variations, distributed data storage, and data privacy concerns. Traditional methods often require data pooling or complex data harmonization, which can reduce efficiency and limit the scope of cross-cohort learning. We introduce mixWAS, a one-shot, lossless algorithm that efficiently integrates distributed EHR datasets via summary statistics. Unlike existing approaches, mixWAS preserves cohort-specific covariate associations and supports simultaneous mixed-outcome analyses. Simulations demonstrate that mixWAS outperforms conventional methods in accuracy and efficiency across various scenarios. Applied to EHR data from seven cohorts in the US, mixWAS identified 4,534 significant cross-cohort genetic associations among traits such as blood lipids, BMI, and circulatory diseases. Validation with an independent UK EHR dataset confirmed 97.7% of these associations, underscoring the algorithm's robustness. By enabling lossless cross-cohort integration, mixWAS improves the precision of multi-outcome analyses and expands the potential for actionable insights in healthcare research.
Collapse
Affiliation(s)
- Ruowang Li
- Department of Computational Biomedicine, Cedars-Sinai Medical Center
| | - Luke Benz
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | - Rui Duan
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | - Joshua C Denny
- National Human Genome Research Institute, National Institutes of Health
| | - Hakon Hakonarson
- Division of Human Genetics, Children's Hospital of Philadelphia
- Center for Applied Genomics, Children's Hospital of Philadelphia
- Department of Pediatrics, University of Pennsylvania, Perelman School of Medicine
| | - Jonathan D Mosley
- Department of Medicine, Vanderbilt University Medical Center
- Department of Biomedical Informatics, Vanderbilt University Medical Center
| | - Jordan W Smoller
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center
| | | | - Marylyn D Ritchie
- Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania
| | - Jason H Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center
| | - Yong Chen
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania
| |
Collapse
|
40
|
Yu YH, Pridgen KM, Nelson TJ, Miller DR, Wells JM, Assimes TL, O'Donnell CJ, Tsao PS, Chang KM, Lynch JA. Oral Health, Inflammation, and Cardiometabolic Factors in the VA Million Veteran Program. JDR Clin Trans Res 2024:23800844241291780. [PMID: 39629945 DOI: 10.1177/23800844241291780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2025] Open
Abstract
INTRODUCTION Associations between cardiometabolic comorbidities and self-reported oral health (OH) are often underexplored in large biobank datasets. While these associations are unaffected by dental care access, they could be mediated by immune responses and inflammation. OBJECTIVES This study assessed the associations between cardiometabolic comorbidities and self-reported OH, periodontitis, and tooth loss using the International Classification of Diseases (ICD) codes in participants from the U.S. Veterans Affairs Million Veteran Program (MVP), adjusting for immune and inflammatory covariates. METHODS Data from 154,167 MVP participants were extracted from January 2011 to September 2021, including lifetime cardiometabolic comorbidities, self-reported OH, ICD-coded periodontitis and tooth loss, and laboratory measurements. Multivariate logistic regression analysis was used to calculate the odds ratios of cardiometabolic comorbidities for self-reported OH, periodontitis, and tooth loss, adjusting for demographic, socioeconomic, cardiovascular, and inflammatory (neutrophil and lymphocyte cell counts) risk factors. A separate dataset was used for additional sensitivity analyses, adjusting for serum levels of C-reactive protein and albumin. RESULTS Complete data were analyzed for 154,167 participants (19%). Most participants (92%) were male and from European ancestry (94%). The mean age was 65.5 y (SD 11.4 y). Ten percent of participants had excellent self-reported OH. Fourteen percent had any periodontitis, and 17% had any tooth loss. Significant associations were found between tooth loss and congestive heart failure (odds ratio [OR], 1.74, P < 0.001) and peripheral vascular diseases (OR, 1.82, P < 0.001). There were also significant associations between congestive heart failure and self-reported OH (excellent versus "poor/fair/good/very good"), with increasing odds as self-reported OH declined (P < 0.001 for trend). These associations remained significant even after sensitivity analyses, albeit with slight attenuation. CONCLUSION This study of veterans underscores the important cardiometabolic links of self-reported poor OH and tooth loss, akin to those observed with periodontitis, even after adjusting for potential confounders related to demographics, lifestyle, and inflammation. KNOWLEDGE TRANSFER STATEMENT Exploring cardiometabolic associations with self-reported OH, clinically diagnosed periodontitis, and tooth loss using the ICD in the Veterans Affairs Million Veteran Program, we found significant associations. These associations persisted after adjustment for inflammatory confounders. These findings emphasized the benefit of assessing OH as a vital indicator of overall cardiometabolic health in large-scale biobank studies.
Collapse
Affiliation(s)
- Y H Yu
- VA Bedford Healthcare System, Bedford, MA, USA
- Department of Periodontology, Tufts University School of Dental Medicine, Boston, MA, USA
| | - K M Pridgen
- VA Informatics and Computing Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT, USA
- Department of Internal Medicine, Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - T J Nelson
- VA Informatics and Computing Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT, USA
- VA San Diego Healthcare System, San Diego, CA, USA
| | - D R Miller
- Center for Healthcare Organization and Implementation Research, VA Bedford Healthcare System, Bedford, MA, USA
- Center for Population Health, Department of Biomedical and Nutritional Sciences, University of Massachusetts, Lowell, MA, USA
| | - J M Wells
- Department of Periodontology, Tufts University School of Dental Medicine, Boston, MA, USA
- Department of Public Health, Zuckerberg College of Health Sciences, University of Massachusetts, Lowell, MA, USA
- Center of Biomedical and Health Research in Data Sciences (CHORDS), University of Massachusetts, Lowell, MA, USA
| | - T L Assimes
- Epidemiology and Information Center for Genomics, VA Palo Alto Health Care System, Palo Alto, CA, USA
- Department of Medicine (Division of Cardiovascular Medicine), Stanford University School of Medicine, Palo Alto, CA, USA
| | - C J O'Donnell
- Department of Medicine, VA Boston Healthcare System, Harvard Medical School, Boston, MA, USA
| | - P S Tsao
- Epidemiology and Information Center for Genomics, VA Palo Alto Health Care System, Palo Alto, CA, USA
- Department of Medicine (Division of Cardiovascular Medicine), Stanford University School of Medicine, Palo Alto, CA, USA
| | - K M Chang
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA
- University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - J A Lynch
- VA Informatics and Computing Infrastructure, VA Salt Lake City Health Care System, Salt Lake City, UT, USA
- Department of Internal Medicine, Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| |
Collapse
|
41
|
Zhang J, Ryu JY, Tirado SR, Dickinson LD, Abosch A, Aziz-Sultan MA, Boulos AS, Barrow DL, Batjer HH, Binyamin TR, Blackburn SL, Chang EF, Chen PR, Colby GP, Cosgrove GR, David CA, Day AL, Folkerth RD, Frerichs KU, Howard BM, Jahromi BR, Niemela M, Ojemann SG, Patel NJ, Richardson RM, Shi X, Valle-Giler EP, Wang AC, Welch BG, Williams Z, Zusman EE, Weiss ST, Du R. A Transcriptomic Comparative Study of Cranial Vasculature. Transl Stroke Res 2024; 15:1108-1122. [PMID: 37612482 DOI: 10.1007/s12975-023-01186-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 07/06/2023] [Accepted: 08/07/2023] [Indexed: 08/25/2023]
Abstract
In genetic studies of cerebrovascular diseases, the optimal vessels to use as controls remain unclear. Our goal is to compare the transcriptomic profiles among 3 different types of control vessels: superficial temporal artery (STA), middle cerebral arteries (MCA), and arteries from the circle of Willis obtained from autopsies (AU). We examined the transcriptomic profiles of STA, MCA, and AU using RNAseq. We also investigated the effects of using these control groups on the results of the comparisons between aneurysms and the control arteries. Our study showed that when comparing pathological cerebral arteries to control groups, all control groups presented similar responses in the activation of immunological processes, the regulation of intracellular signaling pathways, and extracellular matrix productions, despite their intrinsic biological differences. When compared to STA, AU exhibited upregulation of stress and apoptosis genes, whereas MCA showed upregulation of genes associated with tRNA/rRNA processing. Moreover, our results suggest that the matched case-control study design, which involves control STA samples collected from the same subjects of matched aneurysm samples in our study, can improve the identification of non-inherited disease-associated genes. Given the challenges associated with obtaining fresh intracranial arteries from healthy individuals, our study suggests that using MCA, AU, or paired STA samples as controls are feasible strategies for future large-scale studies investigating cerebral vasculopathies. However, the intrinsic differences of each type of control should be taken into consideration when interpreting the results. With the limitations of each control type, it may be most optimal to use multiple tissues as controls.
Collapse
Affiliation(s)
- Jianing Zhang
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - Jee-Yeon Ryu
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - Selena-Rae Tirado
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | | | - Aviva Abosch
- Department of Neurosurgery, University of Nebraska Medical Center, Omaha, NE, USA
| | - M Ali Aziz-Sultan
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - Alan S Boulos
- Department of Neurosurgery, Albany Medical Center, Albany, NY, USA
| | - Daniel L Barrow
- Department of Neurosurgery, Emory University, Atlanta, GA, USA
| | - H Hunt Batjer
- Department of Neurosurgery, University of Texas Southwestern, Dallas, TX, USA
| | | | - Spiros L Blackburn
- Department of Neurosurgery, University of Texas Health Science Center, Houston, TX, USA
| | - Edward F Chang
- Department of Neurosurgery, University of California San Francisco, San Francisco, CA, USA
- Department of Neurosurgery, University of California Los Angeles, Los Angeles, CA, USA
| | - P Roc Chen
- Department of Neurosurgery, University of Texas Health Science Center, Houston, TX, USA
| | - Geoffrey P Colby
- Department of Neurosurgery, University of California Los Angeles, Los Angeles, CA, USA
| | - G Rees Cosgrove
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - Carlos A David
- Department of Neurosurgery, University of North Carolina Chapel Hill, Chapel Hill, NC, USA
| | - Arthur L Day
- Department of Neurosurgery, University of Texas Health Science Center, Houston, TX, USA
| | - Rebecca D Folkerth
- Department of Forensic Medicine, New York University Grossman School of Medicine, New York, NY, USA
| | - Kai U Frerichs
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - Brian M Howard
- Department of Neurosurgery, Emory University, Atlanta, GA, USA
| | - Behnam R Jahromi
- Department of Neurosurgery, Helsinki University and Helsinki University Hospital, Helsinki, Finland
| | - Mika Niemela
- Department of Neurosurgery, Helsinki University and Helsinki University Hospital, Helsinki, Finland
| | - Steven G Ojemann
- Department of Neurosurgery, University of Colorado, Denver, CO, USA
| | - Nirav J Patel
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA
| | - R Mark Richardson
- Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, USA
| | - Xiangen Shi
- Department of Neurosurgery, Affiliated Fuxing Hospital, Capital Medical University, Beijing, China
| | | | - Anthony C Wang
- Department of Neurosurgery, University of California Los Angeles, Los Angeles, CA, USA
| | - Babu G Welch
- Department of Neurosurgery, University of Texas Southwestern, Dallas, TX, USA
| | - Ziv Williams
- Department of Neurosurgery, Massachusetts General Hospital, Boston, MA, USA
| | | | - Scott T Weiss
- Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Rose Du
- Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 75 Francis Street, Boston, MA, 02115, USA.
| |
Collapse
|
42
|
Shyr C, Sulieman L, Harris PA. Illuminating the landscape of high-level clinical trial opportunities in the All of Us Research Program. J Am Med Inform Assoc 2024; 31:2890-2898. [PMID: 38622899 PMCID: PMC11631138 DOI: 10.1093/jamia/ocae062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 03/02/2024] [Accepted: 03/07/2024] [Indexed: 04/17/2024] Open
Abstract
OBJECTIVE With its size and diversity, the All of Us Research Program has the potential to power and improve representation in clinical trials through ancillary studies like Nutrition for Precision Health. We sought to characterize high-level trial opportunities for the diverse participants and sponsors of future trial investment. MATERIALS AND METHODS We matched All of Us participants with available trials on ClinicalTrials.gov based on medical conditions, age, sex, and geographic location. Based on the number of matched trials, we (1) developed the Trial Opportunities Compass (TOC) to help sponsors assess trial investment portfolios, (2) characterized the landscape of trial opportunities in a phenome-wide association study (PheWAS), and (3) assessed the relationship between trial opportunities and social determinants of health (SDoH) to identify potential barriers to trial participation. RESULTS Our study included 181 529 All of Us participants and 18 634 trials. The TOC identified opportunities for portfolio investment and gaps in currently available trials across federal, industrial, and academic sponsors. PheWAS results revealed an emphasis on mental disorder-related trials, with anxiety disorder having the highest adjusted increase in the number of matched trials (59% [95% CI, 57-62]; P < 1e-300). Participants from certain communities underrepresented in biomedical research, including self-reported racial and ethnic minorities, had more matched trials after adjusting for other factors. Living in a nonmetropolitan area was associated with up to 13.1 times fewer matched trials. DISCUSSION AND CONCLUSION All of Us data are a valuable resource for identifying trial opportunities to inform trial portfolio planning. Characterizing these opportunities with consideration for SDoH can provide guidance on prioritizing the most pressing barriers to trial participation.
Collapse
Affiliation(s)
- Cathy Shyr
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Lina Sulieman
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Paul A Harris
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37240, United States
| |
Collapse
|
43
|
Coombes BJ, Sanchez-Ruiz JA, Fennessy B, Pazdernik VK, Adekkanattu P, Nuñez NA, Lepow L, Melhuish Beaupre LM, Ryu E, Talati A, Mann JJ, Weissman MM, Olfson M, Pathak J, Charney AW, Biernacka JM. Clinical associations with treatment resistance in depression: An electronic health record study. Psychiatry Res 2024; 342:116203. [PMID: 39321638 PMCID: PMC11617277 DOI: 10.1016/j.psychres.2024.116203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Revised: 09/03/2024] [Accepted: 09/15/2024] [Indexed: 09/27/2024]
Abstract
Treatment resistance is common in major depressive disorder (MDD), yet clinical risk factors are not well understood. Using a discovery-replication design, we conducted phenome-wide association studies (PheWASs) of MDD treatment resistance in two electronic health record (EHR)-linked biobanks. The PheWAS included participants with an MDD diagnosis in the EHR and at least one antidepressant (AD) prescription. Participant lifetime diagnoses were mapped to phecodes. PheWASs were conducted for three treatment resistance outcomes based on AD prescription data: number of unique ADs prescribed, ≥1 and ≥2 CE switches. Of the 180 phecodes significantly associated with these outcomes in the discovery cohort (n = 12,558), 71 replicated (n = 8,206). In addition to identifying known clinical factors for treatment resistance in MDD, the total unique AD prescriptions was associated with additional clinical variables including irritable bowel syndrome, gastroesophageal reflux disease, symptomatic menopause, and spondylosis. We calculated polygenic risk of specific-associated conditions and tested their association with AD outcomes revealing that genetic risk for many of these conditions is also associated with the total unique AD prescriptions. The number of unique ADs prescribed, which is easily assessed in EHRs, provides a more nuanced measure of treatment resistance, and may facilitate future research and clinical application in this area.
Collapse
Affiliation(s)
- Brandon J Coombes
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA.
| | | | - Brian Fennessy
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Prakash Adekkanattu
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA; Clinical and Translational Science Center, Weill Cornell Medicine, New York, NY, USA
| | - Nicolas A Nuñez
- Department of Psychiatry & Psychology, Mayo Clinic, Rochester, MN, USA
| | - Lauren Lepow
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Euijung Ryu
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Ardesheer Talati
- Department of Psychiatry, Vagelos College of Physicians and Surgeons Columbia University & NY State Psychiatric Institute, New York, NY, USA
| | - J John Mann
- Department of Psychiatry, Vagelos College of Physicians and Surgeons Columbia University & NY State Psychiatric Institute, New York, NY, USA
| | - Myrna M Weissman
- Department of Psychiatry, Vagelos College of Physicians and Surgeons Columbia University & NY State Psychiatric Institute, New York, NY, USA
| | - Mark Olfson
- Department of Psychiatry, Vagelos College of Physicians and Surgeons Columbia University & NY State Psychiatric Institute, New York, NY, USA
| | - Jyotishman Pathak
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA; Department of Psychiatry, Weill Cornell Medicine, New York, NY, USA
| | - Alexander W Charney
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA; Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joanna M Biernacka
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA; Department of Psychiatry & Psychology, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
44
|
Hartwell EE, Jinwala Z, Milone J, Ramirez S, Gelernter J, Kranzler HR, Kember RL. Application of polygenic scores to a deeply phenotyped sample enriched for substance use disorders reveals extensive pleiotropy with psychiatric and somatic traits. Neuropsychopharmacology 2024; 49:1958-1967. [PMID: 39043921 PMCID: PMC11480112 DOI: 10.1038/s41386-024-01922-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 06/07/2024] [Accepted: 06/28/2024] [Indexed: 07/25/2024]
Abstract
Co-occurring psychiatric, medical, and substance use disorders (SUDs) are common, but the complex pathways leading to such comorbidities are poorly understood. A greater understanding of genetic influences on this phenomenon could inform precision medicine efforts. We used the Yale-Penn dataset, a cross-sectional sample enriched for individuals with SUDs, to examine pleiotropic effects of genetic liability for psychiatric and somatic traits. Participants completed an in-depth interview that provides information on demographics, environment, medical illnesses, and psychiatric and SUDs. Polygenic scores (PGS) for psychiatric disorders and somatic traits were calculated in European-ancestry (EUR; n = 5691) participants and, when discovery datasets were available, for African-ancestry (AFR; n = 4918) participants. Phenome-wide association studies (PheWAS) were then conducted. In AFR participants, the only PGS with significant associations was bipolar disorder (BD), all of which were with substance use phenotypes. In EUR participants, PGS for major depressive disorder (MDD), generalized anxiety disorder (GAD), post-traumatic stress disorder (PTSD), schizophrenia (SCZ), body mass index (BMI), coronary artery disease (CAD), and type 2 diabetes (T2D) all showed significant associations, the majority of which were with phenotypes in the substance use categories. For instance, PGSMDD was associated with over 200 phenotypes, 15 of which were depression-related (e.g., depression criterion count), 55 of which were other psychiatric phenotypes, and 126 of which were substance use phenotypes; and PGSBMI was associated with 138 phenotypes, 105 of which were substance related. Genetic liability for psychiatric and somatic traits is associated with numerous phenotypes across multiple categories, indicative of the broad genetic liability of these traits.
Collapse
Affiliation(s)
- Emily E Hartwell
- Crescenz VA Medical Center, Philadelphia, PA, USA
- University of Pennsylvania, Philadelphia, PA, USA
| | - Zeal Jinwala
- Crescenz VA Medical Center, Philadelphia, PA, USA
- University of Pennsylvania, Philadelphia, PA, USA
| | | | | | - Joel Gelernter
- West Haven VA Medical Center, West Haven, CT, USA
- Yale University, New Haven, CT, USA
| | - Henry R Kranzler
- Crescenz VA Medical Center, Philadelphia, PA, USA
- University of Pennsylvania, Philadelphia, PA, USA
| | - Rachel L Kember
- Crescenz VA Medical Center, Philadelphia, PA, USA.
- University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
45
|
Kunej T, Šimon M, Luštrek B, Horvat S, Potočnik K. Examining genotype-phenotype associations of GRAM domain proteins using GWAS, PheWAS and literature review in cattle, human, pig, mouse and chicken. Sci Rep 2024; 14:28889. [PMID: 39572677 PMCID: PMC11582632 DOI: 10.1038/s41598-024-80117-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 11/14/2024] [Indexed: 11/24/2024] Open
Abstract
The GRAMD genes are involved in maintaining cholesterol homeostasis, apoptosis, cancer and production traits in livestock. A lipid-binding GRAM domain is implicated in lipid transport and metabolism. The functions of GRAMD proteins remain incompletely understood. The aim of the present study was therefore to investigate the associations between six GRAMD genes in cattle using data from the international genomic evaluation of the Interbull InterGenomics Centre and to evaluate genotype-phenotype associations in human, cattle, pig, mouse and, chicken. Genotyping of 55,013 bulls was performed using DNA microarrays and 11 SNPs were mapped to the five GRAMD genes. A phenome-wide association study (PheWAS) tested associations between the 11 SNPs and 36 traits. The integrated analysis of SNP effects, rankings, and clustering patterns revealed their potential for improving cattle productivity, health, and robustness, and established a baseline for the targeted improvement of cattle traits. This study lays the groundwork for functional experiments aimed at uncovering the mechanism of action of GRAMD genes and to evaluate the potential of using GRAMD sequence variants for selection programs in dairy cattle. The study presents an example of how the combination of GWAS and the PheWAS offers a promising toolbox for the systematic functional annotation of vertebrate genomes.
Collapse
Affiliation(s)
- Tanja Kunej
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, Domžale, SI-1230, Slovenia.
| | - Martin Šimon
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, Domžale, SI-1230, Slovenia
| | - Barbara Luštrek
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, Domžale, SI-1230, Slovenia
| | - Simon Horvat
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, Domžale, SI-1230, Slovenia
| | - Klemen Potočnik
- Biotechnical Faculty, Department of Animal Science, University of Ljubljana, Groblje 3, Domžale, SI-1230, Slovenia.
| |
Collapse
|
46
|
Huffman JE, Nicholas J, Hahn J, Heath AS, Raffield LM, Yanek LR, Brody JA, Thibord F, Almasy L, Bartz TM, Bielak LF, Bowler RP, Carrasquilla GD, Chasman DI, Chen MH, Emmert DB, Ghanbari M, Haessler J, Hottenga JJ, Kleber ME, Le NQ, Lee J, Lewis JP, Li-Gao R, Luan J, Malmberg A, Mangino M, Marioni RE, Martinez-Perez A, Pankratz N, Polasek O, Richmond A, Rodriguez BAT, Rotter JI, Steri M, Suchon P, Trompet S, Weiss S, Zare M, Auer P, Cho MH, Christofidou P, Davies G, de Geus E, Deleuze JF, Delgado GE, Ekunwe L, Faraday N, Gögele M, Greinacher A, Gao H, Howard T, Joshi PK, Kilpeläinen TO, Lahti J, Linneberg A, Naitza S, Noordam R, Paüls-Vergés F, Rich SS, Rosendaal FR, Rudan I, Ryan KA, Souto JC, van Rooij FJA, Wang H, Zhao W, Becker LC, Beswick A, Brown MR, Cade BE, Campbell H, Cho K, Crapo JD, Curran JE, de Maat MPM, Doyle M, Elliott P, Floyd JS, Fuchsberger C, Grarup N, Guo X, Harris SE, Hou L, Kolcic I, Kooperberg C, Menni C, Nauck M, O'Connell JR, Orrù V, Psaty BM, Räikkönen K, Smith JA, Soria JM, Stott DJ, van Hylckama Vlieg A, Watkins H, Willemsen G, Wilson PWF, Ben-Shlomo Y, et alHuffman JE, Nicholas J, Hahn J, Heath AS, Raffield LM, Yanek LR, Brody JA, Thibord F, Almasy L, Bartz TM, Bielak LF, Bowler RP, Carrasquilla GD, Chasman DI, Chen MH, Emmert DB, Ghanbari M, Haessler J, Hottenga JJ, Kleber ME, Le NQ, Lee J, Lewis JP, Li-Gao R, Luan J, Malmberg A, Mangino M, Marioni RE, Martinez-Perez A, Pankratz N, Polasek O, Richmond A, Rodriguez BAT, Rotter JI, Steri M, Suchon P, Trompet S, Weiss S, Zare M, Auer P, Cho MH, Christofidou P, Davies G, de Geus E, Deleuze JF, Delgado GE, Ekunwe L, Faraday N, Gögele M, Greinacher A, Gao H, Howard T, Joshi PK, Kilpeläinen TO, Lahti J, Linneberg A, Naitza S, Noordam R, Paüls-Vergés F, Rich SS, Rosendaal FR, Rudan I, Ryan KA, Souto JC, van Rooij FJA, Wang H, Zhao W, Becker LC, Beswick A, Brown MR, Cade BE, Campbell H, Cho K, Crapo JD, Curran JE, de Maat MPM, Doyle M, Elliott P, Floyd JS, Fuchsberger C, Grarup N, Guo X, Harris SE, Hou L, Kolcic I, Kooperberg C, Menni C, Nauck M, O'Connell JR, Orrù V, Psaty BM, Räikkönen K, Smith JA, Soria JM, Stott DJ, van Hylckama Vlieg A, Watkins H, Willemsen G, Wilson PWF, Ben-Shlomo Y, Blangero J, Boomsma D, Cox SR, Dehghan A, Eriksson JG, Fiorillo E, Fornage M, Hansen T, Hayward C, Ikram MA, Jukema JW, Kardia SLR, Lange LA, März W, Mathias RA, Mitchell BD, Mook-Kanamori DO, Morange PE, Pedersen O, Pramstaller PP, Redline S, Reiner A, Ridker PM, Silverman EK, Spector TD, Völker U, Wareham NJ, Wilson JF, Yao J, Trégouët DA, Johnson AD, Wolberg AS, de Vries PS, Sabater-Lleal M, Morrison AC, Smith NL. Whole-genome analysis of plasma fibrinogen reveals population-differentiated genetic regulators with putative liver roles. Blood 2024; 144:2248-2265. [PMID: 39226462 PMCID: PMC11600029 DOI: 10.1182/blood.2023022596] [Show More Authors] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 06/12/2024] [Accepted: 06/17/2024] [Indexed: 09/05/2024] Open
Abstract
ABSTRACT Genetic studies have identified numerous regions associated with plasma fibrinogen levels in Europeans, yet missing heritability and limited inclusion of non-Europeans necessitates further studies with improved power and sensitivity. Compared with array-based genotyping, whole-genome sequencing (WGS) data provide better coverage of the genome and better representation of non-European variants. To better understand the genetic landscape regulating plasma fibrinogen levels, we meta-analyzed WGS data from the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program (n = 32 572), with array-based genotype data from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium (n = 131 340) imputed to the TOPMed or Haplotype Reference Consortium panel. We identified 18 loci that have not been identified in prior genetic studies of fibrinogen. Of these, 4 are driven by common variants of small effect with reported minor allele frequency (MAF) at least 10 percentage points higher in African populations. Three signals (SERPINA1, ZFP36L2, and TLR10) contain predicted deleterious missense variants. Two loci, SOCS3 and HPN, each harbor 2 conditionally distinct, noncoding variants. The gene region encoding the fibrinogen protein chain subunits (FGG;FGB;FGA) contains 7 distinct signals, including 1 novel signal driven by rs28577061, a variant common in African ancestry populations but extremely rare in Europeans (MAFAFR = 0.180; MAFEUR = 0.008). Through phenome-wide association studies in the VA Million Veteran Program, we found associations between fibrinogen polygenic risk scores and thrombotic and inflammatory disease phenotypes, including an association with gout. Our findings demonstrate the utility of WGS to augment genetic discovery in diverse populations and offer new insights for putative mechanisms of fibrinogen regulation.
Collapse
Affiliation(s)
- Jennifer E. Huffman
- Palo Alto VA Institute for Research, VA Palo Alto Heath Care System, Palo Alto, CA
- MAVERIC, VA Boston Healthcare System, Boston, MA
| | - Jayna Nicholas
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Julie Hahn
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Adam S. Heath
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Laura M. Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Lisa R. Yanek
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA
| | - Florian Thibord
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
| | - Laura Almasy
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Traci M. Bartz
- Department of Biostatistics, University of Washington, Seattle, WA
| | - Lawrence F. Bielak
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
| | | | - Germán D. Carrasquilla
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Daniel I. Chasman
- Division of Preventive Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Ming-Huei Chen
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
| | - David B. Emmert
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
| | - Mohsen Ghanbari
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Jeffrey Haessler
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA
| | - Jouke-Jan Hottenga
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
| | - Marcus E. Kleber
- SYNLAB MVZ für Humangenetik Mannheim, Mannheim, Germany
- Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Ngoc-Quynh Le
- Unit of Genomics of Complex Disease, Institut d’Investigació Biomèdica Sant Pau, Barcelona, Spain
| | - Jiwon Lee
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Joshua P. Lewis
- Department of Medicine, University of Maryland, Baltimore, MD
| | - Ruifang Li-Gao
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
| | - Jian'an Luan
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Anni Malmberg
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Massimo Mangino
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
- National Institute for Health and Care Research Biomedical Research Centre, Guy’s and St Thomas’ Foundation Trust, London, United Kingdom
| | - Riccardo E. Marioni
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Angel Martinez-Perez
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
| | - Nathan Pankratz
- Department of Laboratory Medicine and Pathology, University of Minnesota Medical School, Minneapolis, MN
| | - Ozren Polasek
- Faculty of Medicine, University of Split, Split, Croatia
| | - Anne Richmond
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Benjamin A. T. Rodriguez
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
| | - Jerome I. Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
| | - Maristella Steri
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato-Cagliari, Italy
| | - Pierre Suchon
- Centre de Recherche en Cardiovascular et Nutrition, INSERM, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Aix Marseille University, Marseille, France
- Laboratory of Haematology, La Timone Hospital, Marseille, France
| | - Stella Trompet
- Section of Gerontology and Geriatrics, Department of Internal Medicin, Leiden University Medical Center, Leiden, The Netherlands
| | - Stefan Weiss
- Interfaculty Institute for Genetics and Functional Genomics, Department of Functional Genomics, German Center for Cardiovascular Research, Partner Site Greifswald, University Medicine Greifswald, Greifswald, Germany
| | - Marjan Zare
- Maternal-Fetal Medicine Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
| | - Paul Auer
- Division of Biostatistics, Institute for Health and Equity, and Cancer Center, Medical College of Wisconsin, Milwaukee, WI
| | - Michael H. Cho
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Paraskevi Christofidou
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
| | - Gail Davies
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Eco de Geus
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
| | - Jean-François Deleuze
- Centre National de Recherche en Génomique Humaine, Commissariat a l'Energie Atomique et aux Energies Alternatives, Université Paris-Saclay, Evry, France
| | - Graciela E. Delgado
- Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Lynette Ekunwe
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS
| | - Nauder Faraday
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Martin Gögele
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
| | - Andreas Greinacher
- Department of Transfusion Medicine, University Medicine Greifswald, Greifswald, Germany
| | - He Gao
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
- Medical Research Council-Public Health England Centre for Environment and Health, School of Public Health, Imperial College London, London, United Kingdom
| | - Tom Howard
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
| | - Peter K. Joshi
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
| | - Tuomas O. Kilpeläinen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Jari Lahti
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Allan Linneberg
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Copenhagen, Denmark
| | - Silvia Naitza
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato-Cagliari, Italy
| | - Raymond Noordam
- Section of Gerontology and Geriatrics, Department of Internal Medicin, Leiden University Medical Center, Leiden, The Netherlands
| | - Ferran Paüls-Vergés
- Unit of Genomics of Complex Disease, Institut d’Investigació Biomèdica Sant Pau, Barcelona, Spain
| | - Stephen S. Rich
- Department of Public Health Sciences, Center for Public Health Genomics, University of Virginia, Charlottesville, VA
| | - Frits R. Rosendaal
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
| | - Igor Rudan
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
| | | | - Juan Carlos Souto
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
- Unit of Thrombosis and Hemostasis, Hospital de la Santa Creu i Sant Pau, Barcelona, Spain
| | - Frank J. A. van Rooij
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Heming Wang
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI
| | - Lewis C. Becker
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Andrew Beswick
- Translational Health Sciences, University of Bristol, Bristol, United Kingdom
| | - Michael R. Brown
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Brian E. Cade
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Harry Campbell
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
| | - Kelly Cho
- MAVERIC, VA Boston Healthcare System, Boston, MA
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
| | | | - Joanne E. Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
| | - Moniek P. M. de Maat
- Department of Hematology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Margaret Doyle
- Department of Pathology and Laboratory Medicine, The University of Vermont Larner College of Medicine, Colchester, VT
| | - Paul Elliott
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
- Medical Research Council-Public Health England Centre for Environment and Health, School of Public Health, Imperial College London, London, United Kingdom
- United Kingdom-Dementia Research Institute, Imperial College London, London, United Kingdom
| | - James S. Floyd
- Departments of Medicine and Epidemiology, University of Washington, Seattle, WA
| | - Christian Fuchsberger
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
| | - Niels Grarup
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
| | - Sarah E. Harris
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Lifang Hou
- Department of Preventive Medicine, Northwestern University, Chicago, IL
| | - Ivana Kolcic
- Faculty of Medicine, University of Split, Split, Croatia
| | - Charles Kooperberg
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA
| | - Cristina Menni
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
| | - Matthias Nauck
- Institute of Clinical Chemistry and Laboratory Medicine, German Center for Cardiovascular Research, Partner Site Greifswald, Greifswald, Germany, University Medicine Greifswald, Greifswald, Germany
| | | | - Valeria Orrù
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Lanusei, Italy
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA
- Department of Epidemiology, University of Washington, Seattle, WA
- Department of Health Systems and Population Health, University of Washington, Seattle, WA
| | - Katri Räikkönen
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Jennifer A. Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI
| | - Jose Manuel Soria
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
| | - David J. Stott
- Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, Scotland, United Kingdom
| | | | - Hugh Watkins
- Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Gonneke Willemsen
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
| | - Peter W. F. Wilson
- VA Atlanta Healthcare System, Decatur, GA
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
| | - Yoav Ben-Shlomo
- Poulation Health Sciences, University of Bristol, Bristol, United Kingdom
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
| | - Dorret Boomsma
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
| | - Simon R. Cox
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Abbas Dehghan
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
- Medical Research Council-Public Health England Centre for Environment and Health, School of Public Health, Imperial College London, London, United Kingdom
- United Kingdom-Dementia Research Institute, Imperial College London, London, United Kingdom
| | - Johan G. Eriksson
- Department of General Practice and Primary Health Care, University of Helsinki, Helsinki, Finland
- Folkhälsan Research Centre, Helsinki, Finland
- Department of Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University Singapore, Singapore, Singapore
| | - Edoardo Fiorillo
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Lanusei, Italy
| | - Myriam Fornage
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
| | - Torben Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Caroline Hayward
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - M. Arfan Ikram
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - J. Wouter Jukema
- Department of Cardiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Heart Institute, Utrecht, The Netherlands
| | - Sharon L. R. Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
| | - Leslie A. Lange
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO
| | - Winfried März
- Synlab Academy, Synlab Holding Deutschland GmbH, and Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
| | - Rasika A. Mathias
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Braxton D. Mitchell
- Department of Medicine, University of Maryland, Baltimore, MD
- Geriatric Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD
| | - Dennis O. Mook-Kanamori
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
| | - Pierre-Emmanuel Morange
- Centre de Recherche en Cardiovascular et Nutrition, INSERM, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Aix Marseille University, Marseille, France
- Laboratory of Haematology, La Timone Hospital, Marseille, France
| | - Oluf Pedersen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Peter P. Pramstaller
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
| | - Susan Redline
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Department of Medicine, Beth Israel Deaconness Medical Center, Boston, MA
| | - Alexander Reiner
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA
| | - Paul M. Ridker
- Division of Preventive Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Edwin K. Silverman
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Tim D. Spector
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
| | - Uwe Völker
- Interfaculty Institute for Genetics and Functional Genomics, Department of Functional Genomics, German Center for Cardiovascular Research, Partner Site Greifswald, University Medicine Greifswald, Greifswald, Germany
| | - Nicholas J. Wareham
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - James F. Wilson
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
| | - Jie Yao
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
| | - VA Million Veteran Program
- Palo Alto VA Institute for Research, VA Palo Alto Heath Care System, Palo Alto, CA
- MAVERIC, VA Boston Healthcare System, Boston, MA
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
- Department of Biostatistics, University of Washington, Seattle, WA
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
- National Jewish Health, Denver, CO
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
- Division of Preventive Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- SYNLAB MVZ für Humangenetik Mannheim, Mannheim, Germany
- Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
- Unit of Genomics of Complex Disease, Institut d’Investigació Biomèdica Sant Pau, Barcelona, Spain
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Department of Medicine, University of Maryland, Baltimore, MD
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
- National Institute for Health and Care Research Biomedical Research Centre, Guy’s and St Thomas’ Foundation Trust, London, United Kingdom
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
- Department of Laboratory Medicine and Pathology, University of Minnesota Medical School, Minneapolis, MN
- Faculty of Medicine, University of Split, Split, Croatia
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato-Cagliari, Italy
- Centre de Recherche en Cardiovascular et Nutrition, INSERM, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Aix Marseille University, Marseille, France
- Laboratory of Haematology, La Timone Hospital, Marseille, France
- Section of Gerontology and Geriatrics, Department of Internal Medicin, Leiden University Medical Center, Leiden, The Netherlands
- Interfaculty Institute for Genetics and Functional Genomics, Department of Functional Genomics, German Center for Cardiovascular Research, Partner Site Greifswald, University Medicine Greifswald, Greifswald, Germany
- Maternal-Fetal Medicine Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
- Division of Biostatistics, Institute for Health and Equity, and Cancer Center, Medical College of Wisconsin, Milwaukee, WI
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- Centre National de Recherche en Génomique Humaine, Commissariat a l'Energie Atomique et aux Energies Alternatives, Université Paris-Saclay, Evry, France
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
- Department of Transfusion Medicine, University Medicine Greifswald, Greifswald, Germany
- Medical Research Council-Public Health England Centre for Environment and Health, School of Public Health, Imperial College London, London, United Kingdom
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Copenhagen, Denmark
- Department of Public Health Sciences, Center for Public Health Genomics, University of Virginia, Charlottesville, VA
- Unit of Thrombosis and Hemostasis, Hospital de la Santa Creu i Sant Pau, Barcelona, Spain
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI
- Translational Health Sciences, University of Bristol, Bristol, United Kingdom
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
- Department of Hematology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Department of Pathology and Laboratory Medicine, The University of Vermont Larner College of Medicine, Colchester, VT
- United Kingdom-Dementia Research Institute, Imperial College London, London, United Kingdom
- Departments of Medicine and Epidemiology, University of Washington, Seattle, WA
- Department of Preventive Medicine, Northwestern University, Chicago, IL
- Institute of Clinical Chemistry and Laboratory Medicine, German Center for Cardiovascular Research, Partner Site Greifswald, Greifswald, Germany, University Medicine Greifswald, Greifswald, Germany
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Lanusei, Italy
- Department of Epidemiology, University of Washington, Seattle, WA
- Department of Health Systems and Population Health, University of Washington, Seattle, WA
- Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, Scotland, United Kingdom
- Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom
- VA Atlanta Healthcare System, Decatur, GA
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
- Poulation Health Sciences, University of Bristol, Bristol, United Kingdom
- Department of General Practice and Primary Health Care, University of Helsinki, Helsinki, Finland
- Folkhälsan Research Centre, Helsinki, Finland
- Department of Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University Singapore, Singapore, Singapore
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
- Department of Cardiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Heart Institute, Utrecht, The Netherlands
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO
- Synlab Academy, Synlab Holding Deutschland GmbH, and Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
- Geriatric Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
- Department of Medicine, Beth Israel Deaconness Medical Center, Boston, MA
- Bordeaux Population Health Research Center, INSERM UMR 1219, University of Bordeaux, Bordeaux, France
- Department of Pathology and Laboratory Medicine and UNC Blood Research Center, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institutet, Center for Molecular Medicine, Stockholm, Sweden
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA
| | - NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium
- Palo Alto VA Institute for Research, VA Palo Alto Heath Care System, Palo Alto, CA
- MAVERIC, VA Boston Healthcare System, Boston, MA
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
- Department of Biostatistics, University of Washington, Seattle, WA
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI
- National Jewish Health, Denver, CO
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
- Division of Preventive Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Institute for Biomedicine (affiliated to the University of Lübeck), Eurac Research, Bolzano, Italy
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, WA
- Netherlands Twin Register, Department of Biological Psychology, Vrije Universiteit, Amsterdam, The Netherlands
- SYNLAB MVZ für Humangenetik Mannheim, Mannheim, Germany
- Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
- Unit of Genomics of Complex Disease, Institut d’Investigació Biomèdica Sant Pau, Barcelona, Spain
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Department of Medicine, University of Maryland, Baltimore, MD
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
- Department of Twin Research and Genetic Epidemiology, Kings College London, London, United Kingdom
- National Institute for Health and Care Research Biomedical Research Centre, Guy’s and St Thomas’ Foundation Trust, London, United Kingdom
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
- Department of Laboratory Medicine and Pathology, University of Minnesota Medical School, Minneapolis, MN
- Faculty of Medicine, University of Split, Split, Croatia
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Monserrato-Cagliari, Italy
- Centre de Recherche en Cardiovascular et Nutrition, INSERM, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Aix Marseille University, Marseille, France
- Laboratory of Haematology, La Timone Hospital, Marseille, France
- Section of Gerontology and Geriatrics, Department of Internal Medicin, Leiden University Medical Center, Leiden, The Netherlands
- Interfaculty Institute for Genetics and Functional Genomics, Department of Functional Genomics, German Center for Cardiovascular Research, Partner Site Greifswald, University Medicine Greifswald, Greifswald, Germany
- Maternal-Fetal Medicine Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, United Kingdom
- Division of Biostatistics, Institute for Health and Equity, and Cancer Center, Medical College of Wisconsin, Milwaukee, WI
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, Scotland, United Kingdom
- Centre National de Recherche en Génomique Humaine, Commissariat a l'Energie Atomique et aux Energies Alternatives, Université Paris-Saclay, Evry, France
- Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS
- Department of Anesthesiology and Critical Care Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
- Department of Transfusion Medicine, University Medicine Greifswald, Greifswald, Germany
- Medical Research Council-Public Health England Centre for Environment and Health, School of Public Health, Imperial College London, London, United Kingdom
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
- Centre for Global Health Research, Usher Institute for Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, Scotland
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Copenhagen, Denmark
- Department of Public Health Sciences, Center for Public Health Genomics, University of Virginia, Charlottesville, VA
- Unit of Thrombosis and Hemostasis, Hospital de la Santa Creu i Sant Pau, Barcelona, Spain
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI
- Translational Health Sciences, University of Bristol, Bristol, United Kingdom
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX
- Department of Hematology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Department of Pathology and Laboratory Medicine, The University of Vermont Larner College of Medicine, Colchester, VT
- United Kingdom-Dementia Research Institute, Imperial College London, London, United Kingdom
- Departments of Medicine and Epidemiology, University of Washington, Seattle, WA
- Department of Preventive Medicine, Northwestern University, Chicago, IL
- Institute of Clinical Chemistry and Laboratory Medicine, German Center for Cardiovascular Research, Partner Site Greifswald, Greifswald, Germany, University Medicine Greifswald, Greifswald, Germany
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Lanusei, Italy
- Department of Epidemiology, University of Washington, Seattle, WA
- Department of Health Systems and Population Health, University of Washington, Seattle, WA
- Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, Scotland, United Kingdom
- Radcliffe Department of Medicine, University of Oxford, Oxford, United Kingdom
- VA Atlanta Healthcare System, Decatur, GA
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA
- Poulation Health Sciences, University of Bristol, Bristol, United Kingdom
- Department of General Practice and Primary Health Care, University of Helsinki, Helsinki, Finland
- Folkhälsan Research Centre, Helsinki, Finland
- Department of Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University Singapore, Singapore, Singapore
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX
- Department of Cardiology, Leiden University Medical Center, Leiden, The Netherlands
- Netherlands Heart Institute, Utrecht, The Netherlands
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO
- Synlab Academy, Synlab Holding Deutschland GmbH, and Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
- Geriatric Research and Education Clinical Center, Baltimore Veterans Administration Medical Center, Baltimore, MD
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, The Netherlands
- Department of Medicine, Beth Israel Deaconness Medical Center, Boston, MA
- Bordeaux Population Health Research Center, INSERM UMR 1219, University of Bordeaux, Bordeaux, France
- Department of Pathology and Laboratory Medicine and UNC Blood Research Center, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institutet, Center for Molecular Medicine, Stockholm, Sweden
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA
| | - David-Alexandre Trégouët
- Bordeaux Population Health Research Center, INSERM UMR 1219, University of Bordeaux, Bordeaux, France
| | - Andrew D. Johnson
- National Heart, Lung, and Blood Institute, Division of Intramural Research, Population Sciences Branch, The Framingham Heart Study, Framingham, MA
| | - Alisa S. Wolberg
- Department of Pathology and Laboratory Medicine and UNC Blood Research Center, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Paul S. de Vries
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Maria Sabater-Lleal
- Unit of Genomics of Complex Disease, Institut de Recerca Sant Paul, Barcelona, Spain
- Centre for Biomedical Network Research on Rare Diseases, Instituto de Salud Carlos III, Madrid, Spain
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institutet, Center for Molecular Medicine, Stockholm, Sweden
| | - Alanna C. Morrison
- Human Genetics Center, Department of Epidemiology, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Nicholas L. Smith
- Department of Epidemiology, University of Washington, Seattle, WA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA
| |
Collapse
|
47
|
Jeong HG, Jeon M, Ryu KJ, Kim J, Choe BY, Joo YY, Park H. Similar but Distinct Comorbidity Patterns Between Polycystic Ovary Syndrome and Endometriosis in Korean Women: A Nationwide Cohort Study. J Korean Med Sci 2024; 39:e284. [PMID: 39561807 DOI: 10.3346/jkms.2024.39.e284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 08/15/2024] [Indexed: 11/21/2024] Open
Abstract
BACKGROUND Polycystic ovary syndrome (PCOS) and endometriosis are widely recognized as significant risk factors affecting the reproductive health of women. The underlying mechanisms impacting fertility may vary, potentially leading to divergent outcomes. We aimed to examine and contrast the prevalence patterns of diseases coexisting with PCOS and endometriosis, using a large-scale nationwide insurance claims data from Asian women of reproductive age. METHODS We analyzed health insurance and examination data of 157,662 Korean women aged 15-45 years, drawn from the Korea National Health Insurance Service-National Sample Cohort database. International Classification of Disease, Tenth Revision codes were mapped to phenome-wide association study codes (phecodes). Subsequently, multivariate logistic regression was performed to assess the comorbidity patterns among patients diagnosed with PCOS and endometriosis and healthy control groups. RESULTS Our analysis revealed that PCOS was correlated with a wider range of metabolic disorders and symptoms, such as hyperlipidemia, type 2 diabetes, various gastrointestinal (GI) issues, and an array of pregnancy-related complications. Conversely, endometriosis was more prevalent among benign neoplasms of female reproductive and digestive organs, endometrial hyperplasia, and angina pectoris. Notably, infertility and glaucoma demonstrated significant associations with both conditions. Furthermore, a comparison of symptom-related codes in women with endometriosis revealed a predominance of pain-related symptoms, whereas those with PCOS exhibited a broader spectrum, encompassing pain, pruritus, GI problems, cough, fever, menstrual cycle disorders, edema, and dizziness. CONCLUSION PCOS and endometriosis, which are prevalent gynecological disorders affecting similar age groups of women, rarely co-occur and exhibit unique comorbidity profiles. Tailored healthcare strategies that take into account these distinct patterns have the potential to enhance long-term healthcare outcomes of affected patients. Further research is required to elucidate the underlying mechanisms and contrasting comorbidity profiles between PCOS and endometriosis.
Collapse
Affiliation(s)
- Hye Gyeong Jeong
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Seoul, Korea
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | - Minhyek Jeon
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ki-Jin Ryu
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Seoul, Korea
| | - Jina Kim
- Department of Statistics, Korea University, Seoul, Korea
| | - Byeol Yi Choe
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Seoul, Korea
| | - Yoonjung Yoonie Joo
- Department of Digital Health, Samsung Advanced Institute for Health Sciences & Technology (SAIHST), Samsung Medical Center, Sungkyunkwan University, Seoul, Korea.
| | - Hyuntae Park
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Seoul, Korea.
| |
Collapse
|
48
|
Li M, Li X, Pan K, Geva A, Yang D, Sweet SM, Bonzel CL, Ayakulangara Panickan V, Xiong X, Mandl K, Cai T. Multisource representation learning for pediatric knowledge extraction from electronic health records. NPJ Digit Med 2024; 7:319. [PMID: 39533050 PMCID: PMC11558010 DOI: 10.1038/s41746-024-01320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 10/28/2024] [Indexed: 11/16/2024] Open
Abstract
Electronic Health Record (EHR) systems are particularly valuable in pediatrics due to high barriers in clinical studies, but pediatric EHR data often suffer from low content density. Existing EHR code embeddings tailored for the general patient population fail to address the unique needs of pediatric patients. To bridge this gap, we introduce a transfer learning approach, MUltisource Graph Synthesis (MUGS), aimed at accurate knowledge extraction and relation detection in pediatric contexts. MUGS integrates graphical data from both pediatric and general EHR systems, along with hierarchical medical ontologies, to create embeddings that adaptively capture both the homogeneity and heterogeneity between hospital systems. These embeddings enable refined EHR feature engineering and nuanced patient profiling, proving particularly effective in identifying pediatric patients similar to specific profiles, with a focus on pulmonary hypertension (PH). MUGS embeddings, resistant to negative transfer, outperform other benchmark methods in multiple applications, advancing evidence-based pediatric research.
Collapse
Affiliation(s)
- Mengyan Li
- Department of Mathematical Sciences, Bentley University, Waltham, MA, USA
| | - Xiaoou Li
- School of Statistics, University of Minnesota, Minneapolis, MN, USA
| | - Kevin Pan
- Mission San Jose High School, Fremont, CA, USA
| | - Alon Geva
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA
- Department of Anesthesiology, Critical Care, and Pain Medicine, Boston Children's Hospital, Boston, MA, USA
- Department of Anaesthesia, Harvard Medical School, Boston, MA, USA
| | - Doris Yang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Sara Morini Sweet
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Clara-Lea Bonzel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | | | - Xin Xiong
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Kenneth Mandl
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Tianxi Cai
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| |
Collapse
|
49
|
Merritt VC, Chen AW, Bonzel CL, Hong C, Sangar R, Morini Sweet S, Sorg SF, Chanfreau-Coffinier C. Development and validation of an electronic health record-based algorithm for identifying TBI in the VA: A VA Million Veteran Program study. Brain Inj 2024; 38:1084-1092. [PMID: 39004925 DOI: 10.1080/02699052.2024.2373920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 06/12/2024] [Accepted: 06/24/2024] [Indexed: 07/16/2024]
Abstract
The purpose of this study was to develop and validate an algorithm for identifying Veterans with a history of traumatic brain injury (TBI) in the Veterans Affairs (VA) electronic health record using VA Million Veteran Program (MVP) data. Manual chart review (n = 200) was first used to establish 'gold standard' diagnosis labels for TBI ('Yes TBI' vs. 'No TBI'). To develop our algorithm, we used PheCAP, a semi-supervised pipeline that relied on the chart review diagnosis labels to train and create a prediction model for TBI. Cross-validation was used to train and evaluate the proposed algorithm, 'TBI-PheCAP.' TBI-PheCAP performance was compared to existing TBI algorithms and phenotyping methods, and the final algorithm was run on all MVP participants (n = 702,740) to assign a predicted probability for TBI and a binary classification status choosing specificity = 90%. The TBI-PheCAP algorithm had an area under the receiver operating characteristic curve of 0.92, sensitivity of 84%, and positive predictive value (PPV) of 98% at specificity = 90%. TBI-PheCAP generally performed better than other classification methods, with equivalent or higher sensitivity and PPV than existing rules-based TBI algorithms and MVP TBI-related survey data. Given its strong classification metrics, the TBI-PheCAP algorithm is recommended for use in future population-based TBI research.
Collapse
Affiliation(s)
- Victoria C Merritt
- VA San Diego Healthcare System (VASDHS), San Diego, CA, USA
- Department of Psychiatry, University of California San Diego, La Jolla, CA, USA
- Center of Excellence for Stress and Mental Health, VASDHS, San Diego, CA, USA
| | | | | | - Chuan Hong
- Department of Biostatistics and Bioinformatics, Duke University, Durham, NH, USA
| | | | | | - Scott F Sorg
- Home Base, A Red Sox Foundation and Massachusetts General Hospital Program, Boston, MA, USA
| | | |
Collapse
|
50
|
Breeyear JH, Mitchell SL, Nealon CL, Hellwege JN, Charest B, Khakharia A, Halladay CW, Yang J, Garriga GA, Wilson OD, Basnet TB, Hung AM, Reaven PD, Meigs JB, Rhee MK, Sun Y, Lynch MG, Sobrin L, Brantley MA, Sun YV, Wilson PW, Iyengar SK, Peachey NS, Phillips LS, Edwards TL, Giri A. Development of electronic health record based algorithms to identify individuals with diabetic retinopathy. J Am Med Inform Assoc 2024; 31:2560-2570. [PMID: 39158361 PMCID: PMC11491608 DOI: 10.1093/jamia/ocae213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 07/17/2024] [Accepted: 07/30/2024] [Indexed: 08/20/2024] Open
Abstract
OBJECTIVES To develop, validate, and implement algorithms to identify diabetic retinopathy (DR) cases and controls from electronic health care records (EHRs). MATERIALS AND METHODS We developed and validated electronic health record (EHR)-based algorithms to identify DR cases and individuals with type I or II diabetes without DR (controls) in 3 independent EHR systems: Vanderbilt University Medical Center Synthetic Derivative (VUMC), the VA Northeast Ohio Healthcare System (VANEOHS), and Massachusetts General Brigham (MGB). Cases were required to meet 1 of the following 3 criteria: (1) 2 or more dates with any DR ICD-9/10 code documented in the EHR, (2) at least one affirmative health-factor or EPIC code for DR along with an ICD9/10 code for DR on a different day, or (3) at least one ICD-9/10 code for any DR occurring within 24 hours of an ophthalmology examination. Criteria for controls included affirmative evidence for diabetes as well as an ophthalmology examination. RESULTS The algorithms, developed and evaluated in VUMC through manual chart review, resulted in a positive predictive value (PPV) of 0.93 for cases and negative predictive value (NPV) of 0.91 for controls. Implementation of algorithms yielded similar metrics in VANEOHS (PPV = 0.94; NPV = 0.86) and lower in MGB (PPV = 0.84; NPV = 0.76). In comparison, the algorithm for DR implemented in Phenome-wide association study (PheWAS) in VUMC yielded similar PPV (0.92) but substantially reduced NPV (0.48). Implementation of the algorithms to the Million Veteran Program identified over 62 000 DR cases with genetic data including 14 549 African Americans and 6209 Hispanics with DR. CONCLUSIONS/DISCUSSION We demonstrate the robustness of the algorithms at 3 separate healthcare centers, with a minimum PPV of 0.84 and substantially improved NPV than existing automated methods. We strongly encourage independent validation and incorporation of features unique to each EHR to enhance algorithm performance for DR cases and controls.
Collapse
Affiliation(s)
- Joseph H Breeyear
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
| | - Sabrina L Mitchell
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Department of Ophthalmology and Visual Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Cari L Nealon
- Eye Clinic, VA Northeast Ohio Healthcare System, Cleveland, OH 44106, United States
| | - Jacklyn N Hellwege
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, United States
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Brian Charest
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA 02111, United States
| | - Anjali Khakharia
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
- Department of Medicine and Geriatrics, Emory University School of Medicine, Atlanta, GA 30307, United States
| | | | - Janine Yang
- Department of Ophthalmology, Mass Eye and Ear Infirmary, Harvard Medical School, Boston, MA 02114, United States
| | - Gustavo A Garriga
- Division of Quantitative and Clinical Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Otis D Wilson
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Division of Nephrology and Hypertension, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Til B Basnet
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, United States
- Division of Quantitative and Clinical Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Adriana M Hung
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Division of Nephrology and Hypertension, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Peter D Reaven
- Phoenix VA Health Care System, Phoenix, AZ 85012, United States
- College of Medicine, University of Arizona, Phoenix, AZ 85721, United States
| | - James B Meigs
- Program in Medical and Population Genetics, Broad Institute, Cambridge, MA 02142, United States
- Department of Medicine, Harvard Medical School, Boston, MA 02115, United States
- Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA 02114, United States
| | - Mary K Rhee
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
- Division of Endocrinology, Metabolism, and Lipids, Department of Medicine, Emory University School of Medicine, Atlanta, GA 30307, United States
| | - Yang Sun
- Department of Ophthalmology, Stanford University School of Medicine, Palo Alto, CA 94305, United States
| | - Mary G Lynch
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
| | - Lucia Sobrin
- Department of Ophthalmology, Mass Eye and Ear Infirmary, Harvard Medical School, Boston, MA 02114, United States
| | - Milam A Brantley
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Department of Ophthalmology and Visual Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, United States
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, United States
| | - Yan V Sun
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
- Department of Epidemiology, Emory University Rollins School of Public Health, Atlanta, GA 30307, United States
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA 30307, United States
| | - Peter W Wilson
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
- Division of Cardiology, Department of Medicine, Emory University School of Medicine, Atlanta, GA 30307, United States
| | - Sudha K Iyengar
- Research Service, VA Northeast Ohio Healthcare System, Cleveland, OH 44106, United States
- Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, OH 44106, United States
| | - Neal S Peachey
- Research Service, VA Northeast Ohio Healthcare System, Cleveland, OH 44106, United States
- Cole Eye Institute, Cleveland Clinic, Cleveland, OH 44106, United States
- Department of Ophthalmology, Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, Cleveland, OH 44195, United States
| | - Lawrence S Phillips
- VA Atlanta Healthcare System, Decatur, GA 30033, United States
- Division of Endocrinology, Metabolism, and Lipids, Department of Medicine, Emory University School of Medicine, Atlanta, GA 30307, United States
| | - Todd L Edwards
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
| | - Ayush Giri
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
- VA Tennessee Valley Healthcare System (626), Nashville, TN 37212, United States
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37232, United States
- Division of Quantitative and Clinical Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| |
Collapse
|