1
|
Shah Y, Kulm S, Nauseef JT, Chen Z, Elemento O, Kensler KH, Sharaf RN. Benchmarking multi-ancestry prostate cancer polygenic risk scores in a real-world cohort. PLoS Comput Biol 2024; 20:e1011990. [PMID: 38598551 PMCID: PMC11034641 DOI: 10.1371/journal.pcbi.1011990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 04/22/2024] [Accepted: 03/11/2024] [Indexed: 04/12/2024] Open
Abstract
Prostate cancer is a heritable disease with ancestry-biased incidence and mortality. Polygenic risk scores (PRSs) offer promising advancements in predicting disease risk, including prostate cancer. While their accuracy continues to improve, research aimed at enhancing their effectiveness within African and Asian populations remains key for equitable use. Recent algorithmic developments for PRS derivation have resulted in improved pan-ancestral risk prediction for several diseases. In this study, we benchmark the predictive power of six widely used PRS derivation algorithms, including four of which adjust for ancestry, against prostate cancer cases and controls from the UK Biobank and All of Us cohorts. We find modest improvement in discriminatory ability when compared with a simple method that prioritizes variants, clumping, and published polygenic risk scores. Our findings underscore the importance of improving upon risk prediction algorithms and the sampling of diverse cohorts.
Collapse
Affiliation(s)
- Yajas Shah
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York City, New York, United States of America
| | - Scott Kulm
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York City, New York, United States of America
| | - Jones T. Nauseef
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Medicine—Hematology and Medical Oncology, Weill Cornell Medicine, New York City, New York, United States of America
| | - Zhengming Chen
- Department of Population Health Sciences, Weill Cornell Medicine, New York City, New York, United States of America
| | - Olivier Elemento
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York City, New York, United States of America
| | - Kevin H. Kensler
- Department of Population Health Sciences, Weill Cornell Medicine, New York City, New York, United States of America
| | - Ravi N. Sharaf
- Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Population Health Sciences, Weill Cornell Medicine, New York City, New York, United States of America
- Department of Medicine–Gastroenterology and Hepatology, Weill Cornell Medicine, New York City, New York, United States of America
| |
Collapse
|
2
|
Busby GB, Kulm S, Bolli A, Kintzle J, Domenico PD, Bottà G. Ancestry-specific polygenic risk scores are risk enhancers for clinical cardiovascular disease assessments. Nat Commun 2023; 14:7105. [PMID: 37925478 PMCID: PMC10625612 DOI: 10.1038/s41467-023-42897-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 10/25/2023] [Indexed: 11/06/2023] Open
Abstract
Clinical implementation of new prediction models requires evaluation of their utility in a broad range of intended use populations. Here we develop and validate ancestry-specific Polygenic Risk Scores (PRSs) for Coronary Artery Disease (CAD) using 29,389 individuals from diverse cohorts and genetic ancestry groups. The CAD PRSs outperform published scores with an average Odds Ratio per Standard Deviation of 1.57 (SD = 0.14) and identify between 12% and 24% of individuals with high genetic risk. Using this risk factor to reclassify borderline or intermediate 10 year Atherosclerotic Cardiovascular Disease (ASCVD) risk improves assessments for both CAD (Net Reclassification Improvement (NRI) = 13.14% (95% CI 9.23-17.06%)) and ASCVD (NRI = 10.70 (95% CI 7.35-14.05)) in an independent cohort of 9,691 individuals. Our analyses demonstrate that using PRSs as Risk Enhancers improves ASCVD risk assessments outlining an approach for guiding ASCVD prevention with genetic information.
Collapse
Affiliation(s)
| | - Scott Kulm
- Allelica Inc, 447 Broadway, New York, NY, 10013, USA
| | | | - Jen Kintzle
- Allelica Inc, 447 Broadway, New York, NY, 10013, USA
| | | | | |
Collapse
|
3
|
Wickersham M, Bartelo N, Kulm S, Liu Y, Zhang Y, Elemento O. USING MACHINE LEARNING METHODS TO ASSESS THE RISK OF ALCOHOL MISUSE IN OLDER ADULTS. Res Sq 2023:rs.3.rs-3154584. [PMID: 37886491 PMCID: PMC10602059 DOI: 10.21203/rs.3.rs-3154584/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
The population of older adults, defined in this study as those 50 years of age or older, continues to increase every year. Substance misuse, particularly alcohol misuse, is often neglected in these individuals. To better identify older adults who might not be properly assessed for alcohol misuse, we have derived a risk assessment tool using patients from the United Kingdom Biobank (UKB), which was validated on patients in the Weill Cornell Medicine (WCM) electronic health record (EHR). The model and tooling created stratifies the risk of alcohol misuse in older adults using 10 features that are commonly found in most EHR systems. We found that the area under the receiver operating curve (AUROC) to correctly predict alcohol misuse in older adults for the UKB and WCM models were 0.84 and 0.78, respectively. We further show that of those who self-identified as having ongoing alcohol misuse in the UKB cohort, only 12.5% of these patients had any alcohol-related F.10 ICD-10 code. Extending this to the WCM cohort, we forecast that 7,838 out of 12,360 older adults with no F.10 ICD-10 code (63.4%) may be missed as having alcohol misuse in the EHR. Overall, this study importantly prioritizes the health of older adults by being able to predict alcohol misuse in an understudied population.
Collapse
Affiliation(s)
- Matthew Wickersham
- Weill-Cornell/Rockefeller/Sloan-Kettering Tri-Institutional MD-PhD Program, New York, New York, United States
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York, United States
| | - Nicholas Bartelo
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York, United States
| | - Scott Kulm
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York, United States
| | - Yifan Liu
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, United States
| | - Yiye Zhang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, United States
- Department of Emergency Medicine, Weill Cornell Medicine, New York, New York, United States
| | - Olivier Elemento
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York, United States
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York, United States
| |
Collapse
|
4
|
Kulm S, Kaidi AC, Kolin D, Langhans MT, Bostrom MP, Elemento O, Shen TS. Genetic Risk Factors for End-Stage Hip Osteoarthritis Treated With Total Hip Arthroplasty: A Genome-wide Association Study. J Arthroplasty 2023; 38:2149-2153.e1. [PMID: 37179025 DOI: 10.1016/j.arth.2023.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 04/28/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023] Open
Abstract
BACKGROUND Although a genetic component to hip osteoarthritis (OA) has been described, focused evaluation of the genetic components of end-stage disease is limited. We present a genomewide association study for patients undergoing total hip arthroplasty (THA) to characterize the genetic risk factors associated with end-stage hip osteoarthritis (ESHO), defined as utilization of the procedure. METHODS Patients who underwent primary THA for hip OA were identified in a national patient data repository using administrative codes. Fifteen thousand three hundred and fifty-five patients with ESHO and 374,193 control patients were identified. Whole genome regression of genotypic data for patients who underwent primary THA for hip OA corrected for age, sex, and body mass index (BMI) was performed. Multivariate logistic regression models were used to evaluate the composite genetic risk from the identified genetic variants. RESULTS There were 13 significant genes identified. Composite genetic factors resulted in an odds ratio 1.04 for ESHO (P < .001). The effect of genetics was lower than that of age (Odds Ratio (OR): 2.38; P < .001) and BMI (1.81; P < .001). CONCLUSION Multiple genetic variants, including 5 novel loci, were associated with end-stage hip OA treated with primary THA. Age and BMI were associated with greater odds of developing end-stage disease when compared to genetic factors.
Collapse
Affiliation(s)
- Scott Kulm
- Weill Cornell Medicine, New York, New York; Englander Institute for Precision Medicine, New York, New York
| | - Austin C Kaidi
- Adult Reconstruction and Joint Replacement, Hospital for Special Surgery, New York, New York
| | - David Kolin
- Adult Reconstruction and Joint Replacement, Hospital for Special Surgery, New York, New York
| | - Mark T Langhans
- Adult Reconstruction and Joint Replacement, Hospital for Special Surgery, New York, New York
| | - Mathias P Bostrom
- Adult Reconstruction and Joint Replacement, Hospital for Special Surgery, New York, New York
| | - Olivier Elemento
- Weill Cornell Medicine, New York, New York; Englander Institute for Precision Medicine, New York, New York
| | - Tony S Shen
- Adult Reconstruction and Joint Replacement, Hospital for Special Surgery, New York, New York
| |
Collapse
|
5
|
Kandola MS, Kulm S, Kim LK, Markowitz SM, Liu CF, Thomas G, Ip JE, Lerman BB, Elemento O, Cheung JW. Population-Level Prevalence of Rare Variants Associated With Atrial Fibrillation and its Impact on Patient Outcomes. JACC Clin Electrophysiol 2023; 9:1137-1146. [PMID: 36669898 DOI: 10.1016/j.jacep.2022.11.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 11/21/2022] [Accepted: 11/22/2022] [Indexed: 01/20/2023]
Abstract
BACKGROUND Whole exome sequencing may identify rare pathogenic/likely pathogenic variants (LPVs) that are linked to atrial fibrillation (AF). The impact of LPVs associated with AF on a population level on outcomes is unclear. OBJECTIVES This study sought to examine the association of LPVs with AF and their impact on clinical outcomes using the UK Biobank, a national repository of participants with available whole exome sequencing data. METHODS A total of 200,631 individuals in the UK Biobank were studied. Incident and prevalent AF, comorbidities, and outcomes were identified using self-reported assessments and hospital stay operative, and death registry records. LPVs were determined using arrhythmia and cardiomyopathy gene panels with LOFTEE and ClinVar to predict variants of functional significance. RESULTS Compared with control subjects, there was a modestly increased prevalence of LPVs among 9,585 patients with AF (2.0% vs 1.7%, respectively; P = 0.01). Among those with prevalent AF at <45 years of age, 4.2% were LPV carriers. LPVs in TTN and PKP2 were associated with AF with adjusted odds ratios of 2.69 (95% CI: 1.57-4.61) and 2.69 (95% CI: 1.54-4.68), respectively. There was no significant difference in combined ischemic stroke, heart failure hospitalization, and mortality among patients who have AF with and without LPVs (25.1% vs 23.8%; P = 0.49). Among participants with AF and available cardiac magnetic resonance imaging data, LPV carriers had lower left ventricular ejection fractions than non-LPV carriers (42% vs 52%; P = 0.027). CONCLUSIONS Patients with AF had a modestly increased prevalence of LPVs. Among reference arrhythmia and cardiomyopathy genes, the contribution of rare variants to AF risk at a population level is modest and its impact on outcomes appears to be limited, despite an association of LPVs with reduced left ventricular ejection fraction among patients with AF.
Collapse
Affiliation(s)
- Manjinder S Kandola
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - Scott Kulm
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York, USA
| | - Luke K Kim
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - Steven M Markowitz
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - Christopher F Liu
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - George Thomas
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - James E Ip
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - Bruce B Lerman
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA
| | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, New York, USA
| | - Jim W Cheung
- Weill Cornell Cardiovascular Outcomes Research Group, Department of Medicine, Division of Cardiology, Weill Cornell Medicine-New York Presbyterian Hospital, New York, New York, USA.
| |
Collapse
|
6
|
Bovonratwet P, Kulm S, Kolin DA, Song J, Morse KW, Cunningham ME, Albert TJ, Sandhu HS, Kim HJ, Iyer S, Elemento O, Qureshi SA. Identification of Novel Genetic Markers for the Risk of Spinal Pathologies: A Genome-Wide Association Study of 2 Biobanks. J Bone Joint Surg Am 2023:00004623-990000000-00758. [PMID: 36927824 DOI: 10.2106/jbjs.22.00872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]
Abstract
BACKGROUND Identifying genetic risk factors for spinal disorders may lead to knowledge regarding underlying molecular mechanisms and the development of new treatments. METHODS Cases of lumbar spondylolisthesis, spinal stenosis, degenerative disc disease, and pseudarthrosis after spinal fusion were identified from the UK Biobank. Controls were patients without the diagnosis. Whole-genome regressions were used to test for genetic variants potentially implicated in the occurrence of each phenotype. External validation was performed in FinnGen. RESULTS A total of 389,413 participants were identified from the UK Biobank. A locus on chromosome 2 spanning GFPT1, NFU1, AAK1, and LOC124906020 was implicated in lumbar spondylolisthesis. Two loci on chromosomes 2 and 12 spanning genes GFPT1, NFU1, and PDE3A were implicated in spinal stenosis. Three loci on chromosomes 6, 10, and 15 spanning genes CHST3, LOC102723493, and SMAD3 were implicated in degenerative disc disease. Finally, 2 novel loci on chromosomes 5 and 9, with the latter corresponding to the LOC105376270 gene, were implicated in pseudarthrosis. Some of these variants associated with spinal stenosis and degenerative disc disease were also replicated in FinnGen. CONCLUSIONS This study revealed nucleotide variations in select genetic loci that were potentially implicated in 4 different spinal pathologies, providing potential insights into the pathological mechanisms. LEVEL OF EVIDENCE Prognostic Level III. See Instructions for Authors for a complete description of levels of evidence.
Collapse
Affiliation(s)
| | - Scott Kulm
- Caryl and Israel Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY
| | - David A Kolin
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | - Junho Song
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | - Kyle W Morse
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | | | - Todd J Albert
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | | | - Han Jo Kim
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | - Sravisht Iyer
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| | - Olivier Elemento
- Caryl and Israel Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY
| | - Sheeraz A Qureshi
- Department of Spine Surgery, Hospital for Special Surgery, New York, NY
| |
Collapse
|
7
|
Kulm S, Langhans MT, Shen TS, Kolin DA, Elemento O, Rodeo SA. Genome-Wide Association Study of Adhesive Capsulitis Suggests Significant Genetic Risk Factors. J Bone Joint Surg Am 2022; 104:1869-1876. [PMID: 36223477 DOI: 10.2106/jbjs.21.01407] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
BACKGROUND Adhesive capsulitis of the shoulder involves loss of passive range of motion with associated pain and can develop spontaneously, with no obvious injury or inciting event. The pathomechanism of this disorder remains to be elucidated, but known risk factors for adhesive capsulitis include diabetes, female sex, and thyroid dysfunction. Additionally, transcriptional profiling and pedigree analyses have suggested a role for genetics. Identification of elements of genetic risk for adhesive capsulitis using population-based techniques can provide the basis for guiding both the personalized treatment of patients based on their genetic profiles and the development of new treatments by identification of the pathomechanism. METHODS A genome-wide association study (GWAS) was conducted using the U.K. Biobank (a collection of approximately 500,000 patients with genetic data and associated ICD-10 [International Classification of Diseases, 10th Revision] codes), comparing 2,142 patients with the ICD-10 code for adhesive capsulitis (M750) to those without. Separate GWASs were conducted controlling for 2 of the known risk factors of adhesive capsulitis-hypothyroidism and diabetes. Logistic regression analysis was conducted controlling for factors including sex, thyroid dysfunction, diabetes, shoulder dislocation, smoking, and genetics. RESULTS Three loci of significance were identified: rs34315830 (in WNT7B; odds ratio [OR] = 1.28; 95% confidence interval [CI], 1.22 to 1.39), rs2965196 (in MAU2; OR = 1.67; 95% CI, 1.39 to 2.00), and rs1912256 (in POU1F1; OR = 1.22; 95% CI, 1.14 to 1.31). These loci retained significance when controlling for thyroid dysfunction and diabetes. The OR for total genetic risk was 5.81 (95% CI, 4.08 to 8.31), compared with 1.70 (95% CI, 1.18 to 2.36) for hypothyroidism and 4.23 (95% CI, 2.32 to 7.05) for diabetes. CONCLUSIONS The total genetic risk associated with adhesive capsulitis was significant and similar to the risks associated with hypothyroidism and diabetes. Identification of WNT7B, POU1F1, and MAU2 implicates the Wnt pathway and cell proliferation response in the pathomechanism of adhesive capsulitis. LEVEL OF EVIDENCE Prognostic Level III . See Instructions for Authors for a complete description of levels of evidence.
Collapse
|
8
|
Kulm S, Kolin DA, Langhans MT, Kaidi AC, Elemento O, Bostrom MP, Shen TS. Characterization of Genetic Risk of End-Stage Knee Osteoarthritis Treated with Total Knee Arthroplasty: A Genome-Wide Association Study. J Bone Joint Surg Am 2022; 104:1814-1820. [PMID: 36000784 DOI: 10.2106/jbjs.22.00364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
BACKGROUND End-stage knee osteoarthritis (OA) is a highly debilitating disease for which total knee arthroplasty (TKA) serves as an effective treatment option. Although a genetic component to OA in general has been described, evaluation of the genetic contribution to end-stage OA of the knee is limited. To this end, we present a genome-wide association study involving patients undergoing TKA for primary knee OA to characterize the genetic features of severe disease on a population level. METHODS Individuals with the diagnosis of knee OA who underwent primary TKA were identified in the U.K. Biobank using administrative codes. The U.K. Biobank is a data repository containing prospectively collected clinical and genomic data for >500,000 patients. A genome-wide association analysis was performed using the REGENIE software package. Logistic regression was also used to compare the total genetic risk between subgroups stratified by age and body mass index (BMI). RESULTS A total of 16,032 patients with end-stage knee OA who underwent primary TKA were identified. Seven genetic loci were found to be significantly associated with end-stage knee OA. The odds ratio (OR) for developing end-stage knee OA attributable to genetics was 1.12 (95% confidence interval [CI], 1.10 to 1.14), which was lower than the OR associated with BMI (OR = 1.81; 95% CI, 1.78 to 1.83) and age (OR = 2.38; 95% CI, 2.32 to 2.45). The magnitude of the OR for developing end-stage knee OA attributable to genetics was greater in patients <60 years old than in patients ≥60 years old (p = 0.002). CONCLUSIONS This population-level genome-wide association study of end-stage knee OA treated with primary TKA was notable for identifying multiple significant genetic variants. These loci involve genes responsible for cartilage development, cartilage homeostasis, cell signaling, and metabolism. Age and BMI appear to have a greater impact on the risk of developing end-stage disease compared with genetic factors. The genetic contribution to the development of severe disease is greater in younger patients. LEVEL OF EVIDENCE Prognostic Level III . See Instructions for Authors for a complete description of levels of evidence.
Collapse
Affiliation(s)
- Scott Kulm
- Weill Cornell Medicine, Cornell University, New York, NY.,Englander Institute for Precision Medicine, Weill Cornell Medicine, Cornell University, New York, NY
| | - David A Kolin
- Weill Cornell Medicine, Cornell University, New York, NY
| | | | | | - Olivier Elemento
- Weill Cornell Medicine, Cornell University, New York, NY.,Englander Institute for Precision Medicine, Weill Cornell Medicine, Cornell University, New York, NY
| | | | | |
Collapse
|
9
|
Greenberg JA, Ivanov NA, Shah Y, Kulm S, Williams J, Tran CG, Scognamiglio T, Lee YJ, Egan CE, Min IM, Zarnegar R, Howe J, Keutgen X, Fahey TJ, Elemento O, Finnerty BM. Abstract 5270: Developing a predictive model for pancreatic neuroendocrine tumor metastatic potential: A multi-institutional analysis. Cancer Res 2022. [DOI: 10.1158/1538-7445.am2022-5270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Pancreatic neuroendocrine tumors (PNETs) are rare neoplasms that arise from cells in the islets of Langerhans, with surgical resection presently recommended for tumors > 2cm. While many PNETs have the propensity to be indolent, some small tumors display aggressive features with early metastatic potential. We used machine learning to develop a predictive model of metastatic potential dependent upon the transcriptomic signature of primary PNET tissue. To build this model, RNA sequencing data was obtained from the primary tissue of 96 surgically-resected PNETs from various institutions. Two cohorts were generated with equally balanced metastatic PNET composition (15 (32.6%) vs. 13 (26.5%), p=0.52). A differential gene expression analysis identified 20 concordantly differentially expressed genes associated with metastatic status between the two cohorts. Unsupervised surrogate variable analysis estimated and adjusted for significant sources of variation not related to metastatic potential and mitigated unwanted noise and batch effects. A gene set enrichment analysis identified an additional 29 genes that most frequently contributed to the enriched biologic pathways extrapolated from the sequencing data. Log transformed, batch corrected TPM values for these 49 genes were combined with an additional 10 clinically-relevant genes, including ARX and PDX1, that are known to contribute to PNET signatures or oncogenesis. The datasets were subsequently randomized in a 1:1 ratio and informative features with respect to metastatic status were identified utilizing a Boruta algorithm, with a priori exclusion of highly-correlative genes and those that displayed near zero variance. Nine genes, including AURKA, ARX, CDCA8, CPB2, MYT1L, NDC80, PAPPA2, SFMBT1 and ZPLD1, were identified as sufficient to classify the localized or metastatic outcome. Distributed random forests (DRF), generalized linear models (GLM), gradient boosting machines (GBM) and extreme gradient boosting (XGBoost) models were trained utilizing these 9 genes. Training ROC ranged from 0.92 for DRF to 1 for XGboost. When applied to 47 independent validation samples, the testing sensitivity ranged from 75% for DRF to 94% for GBM; specificity ranged from 84% for DRF to 94% to XGboost and GLM; positive predictive value ranged from 72% for DRF to 86% for GLM; negative predictive value ranged from 88% for GLM to 97% to GBM. The degree of predictive agreement between models ranged from 64% to 91%. Taken together, we have developed a highly sensitive predictive model of the metastatic PNET phenotype that is based on expression of nine genes. Its application as a guide for management should be studied prospectively in patients with newly diagnosed PNETs.
Citation Format: Jacques A. Greenberg, Nikolay A. Ivanov, Yajas Shah, Scott Kulm, Jelani Williams, Catherine G. Tran, Theresa Scognamiglio, Yeon Joo Lee, Caitlin E. Egan, Irene M. Min, Rasa Zarnegar, James Howe, Xavier Keutgen, Thomas J. Fahey, Olivier Elemento, Brendan M. Finnerty. Developing a predictive model for pancreatic neuroendocrine tumor metastatic potential: A multi-institutional analysis [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 5270.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - James Howe
- 3University of Iowa Carver College of Medicine, Iowa City, IA
| | | | | | | | | |
Collapse
|
10
|
Kulm S, Kofman L, Mezey J, Elemento O. Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches. JCO Clin Cancer Inform 2022; 6:e2100166. [PMID: 35239414 PMCID: PMC8920463 DOI: 10.1200/cci.21.00166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 01/17/2022] [Accepted: 01/28/2022] [Indexed: 11/20/2022] Open
Abstract
PURPOSE The ability to accurately predict an individual's risk for cancer is critical to the implementation of precision prevention measures. Current cancer risk predictions are frequently made with simple models that use a few proven risk factors, such as the Gail model for breast cancer, which are easy to interpret, but may theoretically be less accurate than advanced machine learning (ML) models. METHODS With the UK Biobank, a large prospective study, we developed models that predicted 13 cancer diagnoses within a 10-year time span. ML and linear models fit with all features, linear models fit with 10 features, and externally developed QCancer models, which are available to more than 4,000 general practices, were assessed. RESULTS The average area under the receiver operator curve (AUC) of the linear models (0.722, SE = 0.015) was greater than the average AUC of the ML models (0.720, SE = 0.016) when all 931 features were used. Linear models with only 10 features generated an average AUC of 0.706 (SE 0.015), which was comparable to the complex models using all features and greater than the average AUC of the QCancer models (0.684, SE 0.021). The high performance of the 10-feature linear model may be caused by the consideration of often omitted feature types, including census records and genetic information. CONCLUSION The high performance of the 10-feature linear models indicate that unbiased selection of diverse features, not ML models, may lead to impressively accurate predictions, possibly enabling personalized screening schedules that increase cancer survival.
Collapse
Affiliation(s)
- Scott Kulm
- Caryl and Israel Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY
- Physiology, Biophysics and Systems Biology Graduate Program, Weill Cornell Medicine, New York, NY
| | - Lior Kofman
- Caryl and Israel Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY
- Department of Computer Science, Tufts University, Medford, MA
| | - Jason Mezey
- Department of Genetic Medicine, Weill Cornell Medicine, New York, NY
- Department of Computational Biology, Cornell University, Ithaca, NY
| | - Olivier Elemento
- Caryl and Israel Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY
- Physiology, Biophysics and Systems Biology Graduate Program, Weill Cornell Medicine, New York, NY
| |
Collapse
|
11
|
Kolin DA, Kulm S, Elemento O. Prediction of primary venous thromboembolism based on clinical and genetic factors within the U.K. Biobank. Sci Rep 2021; 11:21340. [PMID: 34725413 PMCID: PMC8560817 DOI: 10.1038/s41598-021-00796-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 10/15/2021] [Indexed: 01/18/2023] Open
Abstract
Both clinical and genetic factors drive the risk of venous thromboembolism. However, whether clinically recorded risk factors and genetic variants can be combined into a clinically applicable predictive score remains unknown. Using Cox proportional-hazard models, we analyzed the association of risk factors with the likelihood of venous thromboembolism in U.K. Biobank, a large prospective cohort. We then created a polygenic risk score of 36 single nucleotide polymorphisms and a clinical score determined by age, sex, body mass index, previous cancer diagnosis, smoking status, and fracture in the last 5 years. Participants were at significantly increased risk of venous thromboembolism if they were at high clinical risk (subhazard ratio, 4.37 [95% CI, 3.85-4.97]) or high genetic risk (subhazard ratio, 3.02 [95% CI, 2.63-3.47]) relative to participants at low clinical or genetic risk, respectively. The combined model, consisting of clinical and genetic components, was significantly better than either the clinical or the genetic model alone (P < 0.001). Participants at high risk in the combined score had nearly an eightfold increased risk of venous thromboembolism relative to participants at low risk (subhazard ratio, 7.51 [95% CI, 6.28-8.98]). This risk score can be used to guide decisions regarding venous thromboembolism prophylaxis, although external validation is needed.
Collapse
Affiliation(s)
- David A Kolin
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA.
- Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA.
| | - Scott Kulm
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA
- Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA
| | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA
- Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|
12
|
Marderstein AR, Kulm S, Peng C, Tamimi R, Clark AG, Elemento O. A polygenic-score-based approach for identification of gene-drug interactions stratifying breast cancer risk. Am J Hum Genet 2021; 108:1752-1764. [PMID: 34363748 PMCID: PMC8456164 DOI: 10.1016/j.ajhg.2021.07.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Accepted: 07/16/2021] [Indexed: 12/24/2022] Open
Abstract
An individual's genetics can dramatically influence breast cancer (BC) risk. Although clinical measures for prevention do exist, non-invasive personalized measures for reducing BC risk are limited. Commonly used medications are a promising set of modifiable factors, but no previous study has explored whether a range of widely taken approved drugs modulate BC genetics. In this study, we describe a quantitative framework for exploring the interaction between the genetic susceptibility of BC and medication usage among UK Biobank women. We computed BC polygenic scores (PGSs) that summarize BC genetic risk and find that the PGS explains nearly three-times greater variation in disease risk within corticosteroid users compared to non-users. We map 35 genes significantly interacting with corticosteroid use (FDR < 0.1), highlighting the transcription factor NRF2 as a common regulator of gene-corticosteroid interactions in BC. Finally, we discover a regulatory variant strongly stratifying BC risk according to corticosteroid use. Within risk allele carriers, 18.2% of women taking corticosteroids developed BC, compared to 5.1% of the non-users (with an HR = 3.41 per-allele within corticosteroid users). In comparison, there are no differences in BC risk within the reference allele homozygotes. Overall, this work highlights the clinical relevance of gene-drug interactions in disease risk and provides a roadmap for repurposing biobanks in drug repositioning and precision medicine.
Collapse
Affiliation(s)
- Andrew R Marderstein
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Scott Kulm
- Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA
| | - Cheng Peng
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Rulla Tamimi
- Population Health Sciences, Weill Cornell Medicine, New York, NY 10065, USA
| | - Andrew G Clark
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA.
| | - Olivier Elemento
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA.
| |
Collapse
|
13
|
Manohar J, Abedian S, Martini R, Kulm S, Salvatore M, Ho K, Christos P, Campion T, Imperato-McGinley J, Ibrahim S, Evering TH, Phillips E, Tamimi R, Bea V, Balogun OD, Sboner A, Elemento O, Davis MB. Social and Clinical Determinants of COVID-19 Outcomes: Modeling Real-World Data from a Pandemic Epicenter. medRxiv 2021:2021.04.06.21254728. [PMID: 33851193 PMCID: PMC8043490 DOI: 10.1101/2021.04.06.21254728] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
IMPORTANCE As the United States continues to accumulate COVID-19 cases and deaths, and disparities persist, defining the impact of risk factors for poor outcomes across patient groups is imperative. OBJECTIVE Our objective is to use real-world healthcare data to quantify the impact of demographic, clinical, and social determinants associated with adverse COVID-19 outcomes, to identify high-risk scenarios and dynamics of risk among racial and ethnic groups. DESIGN A retrospective cohort of COVID-19 patients diagnosed between March 1 and August 20, 2020. Fully adjusted logistical regression models for hospitalization, severe disease and mortality outcomes across 1-the entire cohort and 2- within self-reported race/ethnicity groups. SETTING Three sites of the NewYork-Presbyterian health care system serving all boroughs of New York City. Data was obtained through automated data abstraction from electronic medical records. PARTICIPANTS During the study timeframe, 110,498 individuals were tested for SARS-CoV-2 in the NewYork-Presbyterian health care system; 11,930 patients were confirmed for COVID-19 by RT-PCR or covid-19 clinical diagnosis. MAIN OUTCOMES AND MEASURES The predictors of interest were patient race/ethnicity, and covariates included demographics, comorbidities, and census tract neighborhood socio-economic status. The outcomes of interest were COVID-19 hospitalization, severe disease, and death. RESULTS Of confirmed COVID-19 patients, 4,895 were hospitalized, 1,070 developed severe disease and 1,654 suffered COVID-19 related death. Clinical factors had stronger impacts than social determinants and several showed race-group specificities, which varied among outcomes. The most significant factors in our all-patients models included: age over 80 (OR=5.78, p= 2.29x10-24) and hypertension (OR=1.89, p=1.26x10-10) having the highest impact on hospitalization, while Type 2 Diabetes was associated with all three outcomes (hospitalization: OR=1.48, p=1.39x10-04; severe disease: OR=1.46, p=4.47x10-09; mortality: OR=1.27, p=0.001). In race-specific models, COPD increased risk of hospitalization only in Non-Hispanics (NH)-Whites (OR=2.70, p=0.009). Obesity (BMI 30+) showed race-specific risk with severe disease NH-Whites (OR=1.48, p=0.038) and NH-Blacks (OR=1.77, p=0.025). For mortality, Cancer was the only risk factor in Hispanics (OR=1.97, p=0.043), and heart failure was only a risk in NH-Asians (OR=2.62, p=0.001). CONCLUSIONS AND RELEVANCE Comorbidities were more influential on COVID-19 outcomes than social determinants, suggesting clinical factors are more predictive of adverse trajectory than social factors.
Collapse
Affiliation(s)
- Jyothi Manohar
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, NY
- Department of Medicine, Weill Cornell Medicine, New York City, NY
| | - Sajjad Abedian
- Information Technology and Services Department, Weill Cornell Medicine, New York City, NY
| | - Rachel Martini
- Department of Surgery, Weill Cornell Medicine, New York City, NY
| | - Scott Kulm
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, NY
- Department of Physiology, Weill Cornell Medicine, New York City, NY
| | - Mirella Salvatore
- Department of Medicine, Weill Cornell Medicine, New York City, NY
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
| | - Kaylee Ho
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
| | - Paul Christos
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
| | - Thomas Campion
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
- Clinical Translational Science Center, Weill Cornell Medicine, New York City, NY
| | | | - Said Ibrahim
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
- Clinical Translational Science Center, Weill Cornell Medicine, New York City, NY
| | | | - Erica Phillips
- Department of Medicine, Weill Cornell Medicine, New York City, NY
- Department of Integrative Medicine, Weill Cornell Medicine, New York City, NY
| | - Rulla Tamimi
- Department of Population Health Sciences, Will Cornell Medicine, New York City, NY
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York City, NY
| | - Vivian Bea
- Department of Surgery, Weill Cornell Medicine, New York City, NY
| | - Onyinye D. Balogun
- Department of Radiation Oncology, Weill Cornell Medicine, New York City, NY
| | - Andrea Sboner
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, NY
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York City, NY
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York City, NY
| | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, NY
- Department of Physiology, Weill Cornell Medicine, New York City, NY
- Clinical Translational Science Center, Weill Cornell Medicine, New York City, NY
- WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York City, NY
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York City, NY
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York City, NY
| | - Melissa Boneta Davis
- Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York City, NY
- Department of Surgery, Weill Cornell Medicine, New York City, NY
- Institute for the Study of Breast Cancer Subtypes, Weill Cornell Medicine, New York City, NY
| |
Collapse
|
14
|
Marderstein AR, Davenport ER, Kulm S, Van Hout CV, Elemento O, Clark AG. Leveraging phenotypic variability to identify genetic interactions in human phenotypes. Am J Hum Genet 2021; 108:49-67. [PMID: 33326753 DOI: 10.1016/j.ajhg.2020.11.016] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 11/23/2020] [Indexed: 12/13/2022] Open
Abstract
Although thousands of loci have been associated with human phenotypes, the role of gene-environment (GxE) interactions in determining individual risk of human diseases remains unclear. This is partly because of the severe erosion of statistical power resulting from the massive number of statistical tests required to detect such interactions. Here, we focus on improving the power of GxE tests by developing a statistical framework for assessing quantitative trait loci (QTLs) associated with the trait means and/or trait variances. When applying this framework to body mass index (BMI), we find that GxE discovery and replication rates are significantly higher when prioritizing genetic variants associated with the variance of the phenotype (vQTLs) compared to when assessing all genetic variants. Moreover, we find that vQTLs are enriched for associations with other non-BMI phenotypes having strong environmental influences, such as diabetes or ulcerative colitis. We show that GxE effects first identified in quantitative traits such as BMI can be used for GxE discovery in disease phenotypes such as diabetes. A clear conclusion is that strong GxE interactions mediate the genetic contribution to body weight and diabetes risk.
Collapse
Affiliation(s)
- Andrew R Marderstein
- Tri-Institutional Program in Computational Biology & Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA; Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA
| | - Emily R Davenport
- Department of Biology, Huck Institutes of the Life Sciences, Institute for Computational and Data Sciences, Pennsylvania State University, University Park, PA 16802, USA
| | - Scott Kulm
- Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA
| | | | - Olivier Elemento
- Institute of Computational Biomedicine, Weill Cornell Medicine, New York, NY 10021, USA; Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY 10021, USA.
| | - Andrew G Clark
- Department of Computational Biology, Cornell University, Ithaca, NY 14850, USA.
| |
Collapse
|
15
|
Kolin DA, Kulm S, Christos PJ, Elemento O. Clinical, regional, and genetic characteristics of Covid-19 patients from UK Biobank. PLoS One 2020; 15:e0241264. [PMID: 33201886 PMCID: PMC7671499 DOI: 10.1371/journal.pone.0241264] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 10/12/2020] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Coronavirus disease 2019 (Covid-19) has rapidly infected millions of people worldwide. Recent studies suggest that racial minorities and patients with comorbidities are at higher risk of Covid-19. In this study, we analyzed the effects of clinical, regional, and genetic factors on Covid-19 positive status. METHODS The UK Biobank is a longitudinal cohort study that recruited participants from 2006 to 2010 from throughout the United Kingdom. Covid-19 test results were provided to UK Biobank starting on March 16, 2020. The main outcome measure in this study was Covid-19 positive status, determined by the presence of any positive test for a single individual. Clinical risk factors were derived from UK Biobank at baseline, and regional risk factors were imputed using census features local to each participant's home zone. We used robust adjusted Poisson regression with clustering by testing laboratory to estimate relative risk. Blood types were derived using genetic variants rs8176719 and rs8176746, and genomewide tests of association were conducted using logistic-Firth hybrid regression. RESULTS This prospective cohort study included 397,064 UK Biobank participants, of whom 968 tested positive for Covid-19. The unadjusted relative risk of Covid-19 for Black participants was 3.66 (95% CI 2.83-4.74), compared to White participants. Adjusting for Townsend deprivation index alone reduced the relative risk to 2.44 (95% CI 1.86-3.20). Comorbidities that significantly increased Covid-19 risk included chronic obstructive pulmonary disease (adjusted relative risk [ARR] 1.64, 95% CI 1.18-2.27), ischemic heart disease (ARR 1.48, 95% CI 1.16-1.89), and depression (ARR 1.32, 95% CI 1.03-1.70). There was some evidence that angiotensin converting enzyme inhibitors (ARR 1.48, 95% CI 1.13-1.93) were associated with increased risk of Covid-19. Each standard deviation increase in the number of total individuals living in a participant's locality was associated with increased risk of Covid-19 (ARR 1.14, 95% CI 1.08-1.20). Analyses of genetically inferred blood types confirmed that participants with type A blood had increased odds of Covid-19 compared to participants with type O blood (odds ratio [OR] 1.16, 95% CI 1.01-1.33). A meta-analysis of genomewide association studies across ancestry groups did not reveal any significant loci. Study limitations include confounding by indication, bias due to limited information on early Covid-19 test results, and inability to accurately gauge disease severity. CONCLUSIONS When assessing the association of Black race with Covid-19, adjusting for deprivation reduced the relative risk of Covid-19 by 33%. In the context of sociological research, these findings suggest that discrimination in the labor market may play a role in the high relative risk of Covid-19 for Black individuals. In this study, we also confirmed the association of blood type A with Covid-19, among other clinical and regional factors.
Collapse
Affiliation(s)
- David A. Kolin
- The Meyer Cancer Center, Weill Cornell Medicine, Caryl and Israel Englander Institute for Precision Medicine, New York, NY, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States of America
| | - Scott Kulm
- The Meyer Cancer Center, Weill Cornell Medicine, Caryl and Israel Englander Institute for Precision Medicine, New York, NY, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States of America
| | - Paul J. Christos
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States of America
| | - Olivier Elemento
- The Meyer Cancer Center, Weill Cornell Medicine, Caryl and Israel Englander Institute for Precision Medicine, New York, NY, United States of America
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States of America
| |
Collapse
|
16
|
Abstract
Objective To explore both clinical and genetic risk factors for Covid-19 in a cohort from the United Kingdom. Design Prospective cohort study. Participants 669 positive Covid-19 patients within a cohort of 502,536 UK Biobank participants, recruited between 2006 and 2010. Main Outcome Measures The main outcome measure was Covid-19 positive status, determined by the presence of any positive test for a single individual. We also assessed risk factors for inpatient and outpatient status for Covid-19 positive individuals. Results We found that black participants were at over three times increased risk of testing positive for Covid-19, relative to white participants, even after adjusting for confounders (adjusted relative risk [ARR] 3.14, 95% confidence interval [CI] 2.28 to 4.31). Asian participants were also at higher risk of Covid-19 (ARR 2.03, 95% CI 1.40 to 2.95). Next, we analyzed the association of comorbidities with Covid-19. We found that participants were at increased risk of Covid-19 if they had chronic obstructive pulmonary disease (ARR 1.54, 95% CI 1.02 to 2.31) or ischemic heart disease (ARR 1.56, 95% CI 1.18 to 2.07). However, there was no evidence that either angiotensin converting enzyme inhibitors (ARR 1.32, 95% CI 0.95 to 1.84) or angiotensin II receptor blockers (ARR 1.37, 95% CI 0.94 to 1.98) increased the risk of Covid-19. We confirmed that blood type A was associated with Covid-19 relative to blood type O individuals, and we also found that the HLA variant DQA1_509 was enriched in Covid-19 positive cases, even after Bonferroni correction (P = 1.0 × 10−5). Conclusions In this study, we found that black and Asian participants were at increased risk of Covid-19, even after adjusting for confounders. We also identified a novel genetic association with the HLA variant DQA1_509. Further investigations of genetic associations with Covid-19 may lead to important discoveries of genetic drivers of severe disease.
Collapse
Affiliation(s)
- David A Kolin
- Caryl and Israel Englander Institute for Precision Medicine, The Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA.,Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA
| | - Scott Kulm
- Caryl and Israel Englander Institute for Precision Medicine, The Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA.,Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA
| | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision Medicine, The Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA.,Physiology, Biophysics, and Systems Biology, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|