1
|
Qin G, Wei J, Sun Y, Du W. Research Advance of Causal Inference in Clinical Medicine: A Bibliometrics Analysis via Citespace. J Multidiscip Healthc 2025; 18:2603-2627. [PMID: 40370682 PMCID: PMC12077415 DOI: 10.2147/jmdh.s516826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2025] [Accepted: 04/25/2025] [Indexed: 05/16/2025] Open
Abstract
Objective Causal inference in clinical medicine provides scientific evidence for precision medicine and individualized treatment by revealing the true associations between interventions and health outcomes. This study aims to conduct a comprehensive bibliometric analysis to identify current research trends, primary themes, and future directions for the application of causal inference in clinical medicine. Methods We conducted a literature search in the Web of Science database using causal inference and medical terminology as subject keywords, covering the period from January 1986 to December 2024. After screening, we obtained 4,316 documents for analysis. Utilizing CiteSpace to generate network diagrams, we analyzed data related to the number of publications, citation analysis, collaboration relationships, keyword co-occurrence, and highlighted terms to illustrate the knowledge map and collaboration network in this field. Results Publications on medical causal inference shows a fluctuating growth trend over time. The United States was the top contributors to this field. Harvard University is the leading research institution. George David Smith is the most prolific author, Robbins JM is the most cited scholar. The major research hotspots concentrated in fields such as epidemiology, coronary heart disease and health. Notably, marginal structural models, counterfactual forecasting, and Mendelian randomization have consistently been key methodologies in research. The burstness of keywords reveals that big data, DNA methylation, and robust estimation are emerging research directions. Conclusion In clinical research, counterfactual forecasting provides prospective guidance for optimizing clinical strategies; Mendelian randomization helps uncover potential therapeutic targets; and marginal structural models enhance the accuracy of causal effect estimation in clinical studies. The future integration of various data sources to improve causal inference methods is anticipated to enhance the sensitivity and specificity of trials, ultimately elucidating the complex mechanisms of diseases and drug effects. The literature retrieve strategy and the metrics of the tools adopted may have a certain impact on the results of this study.
Collapse
Affiliation(s)
- Guoqiang Qin
- School of Management, Nanjing University of Posts and Telecommunications, Nanjing, People’s Republic of China
| | - Jianxiang Wei
- School of Management, Nanjing University of Posts and Telecommunications, Nanjing, People’s Republic of China
- Library, Nanjing University of Posts and Telecommunications, Nanjing, People’s Republic of China
| | - Yuehong Sun
- School of Mathematical Sciences, Nanjing Normal University, Nanjing, People’s Republic of China
| | - Wenwen Du
- School of Management, Nanjing University of Posts and Telecommunications, Nanjing, People’s Republic of China
| |
Collapse
|
2
|
Zhu H, Ni H, Yang Q, Ni J, Ji J, Yang S, Peng F. Evaluating the Bidirectional Causal Effects of Alzheimer's Disease Across Multiple Conditions: A Systematic Review and Meta-Analysis of Mendelian Randomization Studies. Int J Mol Sci 2025; 26:3589. [PMID: 40332115 PMCID: PMC12027472 DOI: 10.3390/ijms26083589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2025] [Revised: 04/04/2025] [Accepted: 04/07/2025] [Indexed: 05/08/2025] Open
Abstract
This study systematically evaluates and meta-analyzes Mendelian randomization studies on the bidirectional causal relationship between Alzheimer's disease (AD) and systemic diseases. We searched five databases, assessed study quality, and extracted data. Diseases were classified using ICD-11, and the meta-analysis was performed with RevMan 5.4. A total of 56 studies identified genetic links between AD susceptibility and systemic diseases. Notably, genetic proxies for hip osteoarthritis (OR = 0.80; p = 0.007) and rheumatoid arthritis (OR = 0.97; p = 0.004) were inversely associated with AD risk, while gout (OR = 1.02; p = 0.049) showed a positive association. Genetic liability to depression (OR = 1.03; p = 0.001) elevated AD risk, and AD genetic risk increased susceptibility to delirium (OR = 1.32; p = 0.0005). Cardiovascular traits, including coronary artery disease (OR = 1.07; p = 0.021) and hypertension (OR = 4.30; p = 0.044), were causally linked to a higher AD risk. Other conditions, such as insomnia, chronic periodontitis, migraine, and certain cancers, exhibited significant genetic correlations. Intriguingly, herpes zoster (OR = 0.87; p = 0.005) and cataracts (OR = 0.96; p = 0.012) demonstrated inverse genetic associations with AD. These findings suggest potential therapeutic targets and preventive strategies, emphasizing the need to address comorbid systemic diseases to reduce AD risk and progression.
Collapse
Affiliation(s)
- Haoning Zhu
- Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Sichuan Engineering Laboratory for Plant-Sourced Drug, Sichuan Research Center for Drug Precision Industrial Technology, West China School of Pharmacy, Sichuan University, Chengdu 610041, China; (H.Z.); (H.N.); (J.N.); (F.P.)
| | - Huitong Ni
- Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Sichuan Engineering Laboratory for Plant-Sourced Drug, Sichuan Research Center for Drug Precision Industrial Technology, West China School of Pharmacy, Sichuan University, Chengdu 610041, China; (H.Z.); (H.N.); (J.N.); (F.P.)
| | - Qiuling Yang
- Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China;
| | - Jiaqi Ni
- Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Sichuan Engineering Laboratory for Plant-Sourced Drug, Sichuan Research Center for Drug Precision Industrial Technology, West China School of Pharmacy, Sichuan University, Chengdu 610041, China; (H.Z.); (H.N.); (J.N.); (F.P.)
| | - Jianguang Ji
- Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Shu Yang
- Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Sichuan Engineering Laboratory for Plant-Sourced Drug, Sichuan Research Center for Drug Precision Industrial Technology, West China School of Pharmacy, Sichuan University, Chengdu 610041, China; (H.Z.); (H.N.); (J.N.); (F.P.)
| | - Fu Peng
- Key Laboratory of Drug-Targeting and Drug Delivery System of the Education Ministry and Sichuan Province, Sichuan Engineering Laboratory for Plant-Sourced Drug, Sichuan Research Center for Drug Precision Industrial Technology, West China School of Pharmacy, Sichuan University, Chengdu 610041, China; (H.Z.); (H.N.); (J.N.); (F.P.)
| |
Collapse
|
3
|
Wan NC, Grabowska ME, Kerchberger VE, Wei WQ. Exploring beyond diagnoses in electronic health records to improve discovery: a review of the phenome-wide association study. JAMIA Open 2025; 8:ooaf006. [PMID: 40041255 PMCID: PMC11879097 DOI: 10.1093/jamiaopen/ooaf006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Revised: 12/30/2024] [Accepted: 01/24/2025] [Indexed: 03/06/2025] Open
Abstract
Objective The phenome-wide association study (PheWAS) systematically examines the phenotypic spectrum extracted from electronic health records (EHRs) to uncover correlations between phenotypes and exposures. This review explores methodologies, highlights challenges, and outlines future directions for EHR-driven PheWAS. Materials and Methods We searched the PubMed database for articles spanning from 2010 to 2023, and we collected data regarding exposures, phenotypes, cohorts, terminologies, replication, and ancestry. Results Our search yielded 690 articles. Following exclusion criteria, we identified 291 articles published between January 1, 2010, and December 31, 2023. A total number of 162 (55.6%) articles defined phenomes using phecodes, indicating that research is reliant on the organization of billing codes. Moreover, 72.8% of articles utilized exposures consisting of genetic data, and the majority (69.4%) of PheWAS lacked replication analyses. Discussion Existing literature underscores the need for deeper phenotyping, variability in PheWAS exposure variables, and absence of replication in PheWAS. Current applications of PheWAS mainly focus on cardiovascular, metabolic, and endocrine phenotypes; thus, applications of PheWAS in uncommon diseases, which may lack structured data, remain largely understudied. Conclusions With modern EHRs, future PheWAS should extend beyond diagnosis codes and consider additional data like clinical notes or medications to create comprehensive phenotype profiles that consider severity, temporality, risk, and ancestry. Furthermore, data interoperability initiatives may help mitigate the paucity of PheWAS replication analyses. With the growing availability of data in EHR, PheWAS will remain a powerful tool in precision medicine.
Collapse
Affiliation(s)
- Nicholas C Wan
- Department of Biomedical Engineering, Vanderbilt University, Nashville, TN 37240, United States
| | - Monika E Grabowska
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
| | - Vern Eric Kerchberger
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, United States
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37302, United States
| |
Collapse
|
4
|
Jung EM, Raduski AR, Mills LJ, Spector LG. A phenome-wide association study of polygenic scores for selected childhood cancer: Results from the UK Biobank. HGG ADVANCES 2025; 6:100356. [PMID: 39340156 PMCID: PMC11538869 DOI: 10.1016/j.xhgg.2024.100356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 09/24/2024] [Accepted: 09/04/2024] [Indexed: 09/30/2024] Open
Abstract
The aim of this study was to scan phenotypes in adulthood associated with polygenic risk scores (PRS) for childhood cancers with well-articulated genetic architectures-acute lymphoblastic leukemia (ALL), Ewing sarcoma, and neuroblastoma-to examine genetic pleiotropy. Furthermore, we aimed to determine which SNPs could drive associations. Per-SNP summary statistics were extracted for PRS calculation. Participants with white British ancestry were exclusively included for analyses. SNPs were queried from the UK Biobank genotype imputation data. Records from the cancer registry, death registry, and inpatient diagnoses were abstracted for phenome-wide scans. Firth logistic regression was used to estimate odds ratios (ORs) and 95% confidence intervals (CIs) alongside corresponding p values, adjusting for age at recruitment and sex. A total of 244,332 unrelated white British participants were included. We observed a significant association between ALL-PRS and ALL (OR: 1.20e+24, 95% CI: 9.08e+14-1.60e+33). In addition, we observed a significant association between high-risk neuroblastoma PRS and nonrheumatic aortic valve disorders (OR: 43.9, 95% CI: 7.42-260). There were no significant phenotype associations with Ewing sarcoma and neuroblastoma PRS. Regarding individual SNPs, rs17607816 increased the risk of ALL (OR: 6.40, 95% CI: 3.26-12.57). For high-risk neuroblastoma, rs80059929 elevated the risk of atrioventricular block (OR: 3.04, 95% CI: 1.85-4.99). Our findings suggest that individuals with genetic susceptibility to ALL may face a lifelong risk for developing ALL, along with a genetic pleiotropic association between high-risk neuroblastoma and circulatory diseases.
Collapse
Affiliation(s)
- Eun Mi Jung
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA.
| | - Andrew R Raduski
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA
| | - Lauren J Mills
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA; Masonic Cancer Center, University of Minnesota, Minneapolis, MN, USA
| | - Logan G Spector
- Division of Epidemiology and Clinical Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN, USA; Masonic Cancer Center, University of Minnesota, Minneapolis, MN, USA.
| |
Collapse
|
5
|
Lei Y, Christian Naj A, Xu H, Li R, Chen Y. Balancing the efforts of chart review and gains in PRS prediction accuracy: An empirical study. J Biomed Inform 2024; 157:104705. [PMID: 39134233 DOI: 10.1016/j.jbi.2024.104705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 06/12/2024] [Accepted: 08/05/2024] [Indexed: 08/22/2024]
Abstract
OBJECTIVE Phenotypic misclassification in genetic association analyses can impact the accuracy of PRS-based prediction models. The bias reduction method proposed by Tong et al. (2019) has demonstrated its efficacy in reducing the effects of bias on the estimation of association parameters between genotype and phenotype while minimizing variance by employing chart reviews on a subset of the data for validating phenotypes, however its improvement of subsequent PRS prediction accuracy remains unclear. Our study aims to fill this gap by assessing the performance of simulated PRS models and estimating the optimal number of chart reviews needed for validation. METHODS To comprehensively assess the efficacy of the bias reduction method proposed by Tong et al. in enhancing the accuracy of PRS-based prediction models, we simulated each phenotype under different correlation structures (an independent model, a weakly correlated model, a strongly correlated model) and introduced error-prone phenotypes using two distinct error mechanisms (differential and non-differential phenotyping errors). To facilitate this, we used genotype and phenotype data from 12 case-control datasets in the Alzheimer's Disease Genetics Consortium (ADGC) to produce simulated phenotypes. The evaluation included analyses across various misclassification rates of original phenotypes as well as quantities of validation set. Additionally, we determined the median threshold, identifying the minimal validation size required for a meaningful improvement in the accuracy of PRS-based predictions across a broad spectrum. RESULTS This simulation study demonstrated that incorporating chart review does not universally guarantee enhanced performance of PRS-based prediction models. Specifically, in scenarios with minimal misclassification rates and limited validation sizes, PRS models utilizing debiased regression coefficients demonstrated inferior predictive capabilities compared to models using error-prone phenotypes. Put differently, the effectiveness of the bias reduction method is contingent upon the misclassification rates of phenotypes and the size of the validation set employed during chart reviews. Notably, when dealing with datasets featuring higher misclassification rates, the advantages of utilizing this bias reduction method become more evident, requiring a smaller validation set to achieve better performance. CONCLUSION This study highlights the importance of choosing an appropriate validation set size to balance between the efforts of chart review and the gain in PRS prediction accuracy. Consequently, our study establishes a valuable guidance for validation planning, across a diverse array of sensitivity and specificity combinations.
Collapse
Affiliation(s)
- Yuqing Lei
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA
| | - Adam Christian Naj
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA; Penn Neurodegeneration Genomics Center, Department of Pathology and Laboratory Medicine, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA
| | - Hua Xu
- Department for Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA
| | - Ruowang Li
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA.
| | - Yong Chen
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
6
|
Fu M, Valiente-Banuet L, Wadhwa SS, Pasaniuc B, Vossel K, Chang TS. Improving genetic risk modeling of dementia from real-world data in underrepresented populations. Commun Biol 2024; 7:1049. [PMID: 39183196 PMCID: PMC11345412 DOI: 10.1038/s42003-024-06742-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 08/16/2024] [Indexed: 08/27/2024] Open
Abstract
Genetic risk modeling for dementia offers significant benefits, but studies based on real-world data, particularly for underrepresented populations, are limited. We employ an Elastic Net model for dementia risk prediction using single-nucleotide polymorphisms prioritized by functional genomic data from multiple neurodegenerative disease genome-wide association studies. We compare this model with APOE and polygenic risk score models across genetic ancestry groups (Hispanic Latino American sample: 610 patients with 126 cases; African American sample: 440 patients with 84 cases; East Asian American sample: 673 patients with 75 cases), using electronic health records from UCLA Health for discovery and the All of Us cohort for validation. Our model significantly outperforms other models across multiple ancestries, improving the area-under-precision-recall curve by 31-84% (Wilcoxon signed-rank test p-value <0.05) and the area-under-the-receiver-operating characteristic by 11-17% (DeLong test p-value <0.05) compared to the APOE and the polygenic risk score models. We identify shared and ancestry-specific risk genes and biological pathways, reinforcing and adding to existing knowledge. Our study highlights the benefits of integrating functional mapping, multiple neurodegenerative diseases, and machine learning for genetic risk models in diverse populations. Our findings hold potential for refining precision medicine strategies in dementia diagnosis.
Collapse
Affiliation(s)
- Mingzhou Fu
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Medical Informatics Home Area, Department of Bioinformatics, University of California, Los Angeles, Los Angeles, CA, 90024, USA
| | - Leopoldo Valiente-Banuet
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Satpal S Wadhwa
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Bogdan Pasaniuc
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA
| | - Keith Vossel
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Timothy S Chang
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
7
|
Chang T, Fu M, Valiente-Banuet L, Wadhwa S, Pasaniuc B, Vossel K. Improving genetic risk modeling of dementia from real-world data in underrepresented populations. RESEARCH SQUARE 2024:rs.3.rs-3911508. [PMID: 38410460 PMCID: PMC10896371 DOI: 10.21203/rs.3.rs-3911508/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/28/2024]
Abstract
BACKGROUND Genetic risk modeling for dementia offers significant benefits, but studies based on real-world data, particularly for underrepresented populations, are limited. METHODS We employed an Elastic Net model for dementia risk prediction using single-nucleotide polymorphisms prioritized by functional genomic data from multiple neurodegenerative disease genome-wide association studies. We compared this model with APOE and polygenic risk score models across genetic ancestry groups, using electronic health records from UCLA Health for discovery and All of Us cohort for validation. RESULTS Our model significantly outperforms other models across multiple ancestries, improving the area-under-precision-recall curve by 21-61% and the area-under-the-receiver-operating characteristic by 10-21% compared to the APOEand the polygenic risk score models. We identified shared and ancestry-specific risk genes and biological pathways, reinforcing and adding to existing knowledge. CONCLUSIONS Our study highlights benefits of integrating functional mapping, multiple neurodegenerative diseases, and machine learning for genetic risk models in diverse populations. Our findings hold potential for refining precision medicine strategies in dementia diagnosis.
Collapse
Affiliation(s)
- Timothy Chang
- David Geffen School of Medicine, University of California, Los Angeles
| | | | | | | | | | | |
Collapse
|
8
|
Fu M, Valiente-Banuet L, Wadhwa SS, UCLA Precision Health Data Discovery Repository Working Group, UCLA Precision Health ATLAS Working Group, Pasaniuc B, Vossel K, Chang TS. Improving genetic risk modeling of dementia from real-world data in underrepresented populations. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.05.24302355. [PMID: 38370649 PMCID: PMC10871463 DOI: 10.1101/2024.02.05.24302355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
BACKGROUND Genetic risk modeling for dementia offers significant benefits, but studies based on real-world data, particularly for underrepresented populations, are limited. METHODS We employed an Elastic Net model for dementia risk prediction using single-nucleotide polymorphisms prioritized by functional genomic data from multiple neurodegenerative disease genome-wide association studies. We compared this model with APOE and polygenic risk score models across genetic ancestry groups, using electronic health records from UCLA Health for discovery and All of Us cohort for validation. RESULTS Our model significantly outperforms other models across multiple ancestries, improving the area-under-precision-recall curve by 21-61% and the area-under-the-receiver-operating characteristic by 10-21% compared to the APOE and the polygenic risk score models. We identified shared and ancestry-specific risk genes and biological pathways, reinforcing and adding to existing knowledge. CONCLUSIONS Our study highlights benefits of integrating functional mapping, multiple neurodegenerative diseases, and machine learning for genetic risk models in diverse populations. Our findings hold potential for refining precision medicine strategies in dementia diagnosis.
Collapse
Affiliation(s)
- Mingzhou Fu
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, United States
- Medical Informatics Home Area, Department of Bioinformatics, University of California, Los Angeles, Los Angeles, CA, 90024, United States
| | - Leopoldo Valiente-Banuet
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, United States
| | - Satpal S. Wadhwa
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, United States
| | | | | | - Bogdan Pasaniuc
- Department of Computational Medicine, David Geffen School of Medicine at UCLA, Los Angeles, CA, 90095, USA
| | - Keith Vossel
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, United States
| | - Timothy S. Chang
- Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, United States
| |
Collapse
|
9
|
Fu M, Yan Y, Olde Loohuis LM, Chang TS. Defining the distance between diseases using SNOMED CT embeddings. J Biomed Inform 2023; 139:104307. [PMID: 36738869 DOI: 10.1016/j.jbi.2023.104307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 12/10/2022] [Accepted: 01/29/2023] [Indexed: 02/05/2023]
Abstract
Characterizing disease relationships is essential to biomedical research to understand disease etiology and improve clinical decision-making. Measurements of distance between disease pairs enable valuable research tasks, such as subgrouping patients and identifying common time courses of disease onset. Distance metrics developed in prior work focused on smaller, targeted disease sets. Distance metrics covering all diseases have not yet been defined, which limits the applications to a broader disease spectrum. Our current study defines disease distances for all disease pairs within the International Classification of Diseases, version 10 (ICD-10), the diagnostic classification system universally used in electronic health records. Our proposed distance is computed based on a biomedical ontology, SNOMED CT (Systemized Nomenclature of Medicine, Clinical Terms), which can also be viewed as a structured knowledge graph. We compared the knowledge graph-based metric to three other distance metrics based on the hierarchical structure of ICD, clinical comorbidity, and genetic correlation, to evaluate how each may capture similar or unique aspects of disease relationships. We show that our knowledge graph-based distance metric captures known phenotypic, clinical, and molecular characteristics at a finer granularity than the other three. With the continued growth of using electronic health records data for research, we believe that our distance metric will play an important role in subgrouping patients for precision health, and enabling individualized disease prevention and treatments.
Collapse
Affiliation(s)
- Mingzhou Fu
- Movement Disorders Program, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, USA; Medical Informatics Home Area, Department of Bioinformatics, University of California, Los Angeles, CA, USA
| | - Yu Yan
- Medical Informatics Home Area, Department of Bioinformatics, University of California, Los Angeles, CA, USA
| | - Loes M Olde Loohuis
- Center for Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA.
| | - Timothy S Chang
- Movement Disorders Program, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, USA.
| |
Collapse
|
10
|
Wu BS, Zhang YR, Yang L, Zhang W, Deng YT, Chen SD, Feng JF, Cheng W, Yu JT. Polygenic Liability to Alzheimer's Disease Is Associated with a Wide Range of Chronic Diseases: A Cohort Study of 312,305 Participants. J Alzheimers Dis 2023; 91:437-447. [PMID: 36442194 DOI: 10.3233/jad-220740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
BACKGROUND Alzheimer's disease (AD) patients rank among the highest levels of comorbidities compared to persons with other diseases. However, it is unclear whether the conditions are caused by shared pathophysiology due to the genetic pleiotropy for AD risk genes. OBJECTIVE To figure out the genetic pleiotropy for AD risk genes in a wide range of diseases. METHODS We estimated the polygenic risk score (PRS) for AD and tested the association between PRS and 16 ICD10 main chapters, 136 ICD10 level-1 chapters, and 377 diseases with cases more than 1,000 in 312,305 individuals without AD diagnosis from the UK Biobank. RESULTS After correction for multiple testing, AD PRS was associated with two main ICD10 chapters: Chapter IV (endocrine, nutritional and metabolic diseases) and Chapter VII (eye and adnexa disorders). When narrowing the definition of the phenotypes, positive associations were observed between AD PRS and other types of dementia (OR = 1.39, 95% CI [1.34, 1.45], p = 1.96E-59) and other degenerative diseases of the nervous system (OR = 1.18, 95% CI [1.13, 1.24], p = 7.74E-10). In contrast, we detected negative associations between AD PRS and diabetes mellitus, obesity, chronic bronchitis, other retinal disorders, pancreas diseases, and cholecystitis without cholelithiasis (ORs range from 0.94 to 0.97, FDR < 0.05). CONCLUSION Our study confirms several associations reported previously and finds some novel results, which extends the knowledge of genetic pleiotropy for AD in a range of diseases. Further mechanistic studies are necessary to illustrate the molecular mechanisms behind these associations.
Collapse
Affiliation(s)
- Bang-Sheng Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Ya-Ru Zhang
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Liu Yang
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Wei Zhang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Yue-Ting Deng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Shi-Dong Chen
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Jian-Feng Feng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Department of Computer Science, University of Warwick, Coventry, UK
| | - Wei Cheng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
- Fudan ISTBI-ZJNU Algorithm Centre for Brain-Inspired Intelligence, Zhejiang Normal University, Jinhua, China
| | - Jin-Tai Yu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| |
Collapse
|
11
|
Guan S, Yu YN, Li B, Gu H, Chen L, Wang N, Wang B, Liu X, Liu J, Wang Z. Discovery of Drug-Responsive Phenomic Alteration-Related Driver Genes in the Treatment of Coronary Heart Disease. Pharmgenomics Pers Med 2023; 16:201-217. [PMID: 36945217 PMCID: PMC10024908 DOI: 10.2147/pgpm.s398522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 02/25/2023] [Indexed: 03/17/2023] Open
Abstract
Background The Xueyu Zheng (XYZ) phenome is central to coronary heart disease (CHD), but efforts to detect genetic associations in the XYZ phenome have been disappointing. Methods The phenomic alteration-related genes (PARGs) for the XYZ phenome were screened using |ρ| > 0.4 and p < 0.05 after treatment with Danhong injection at day 14 and day 30. Then, the driver genes for the Protein-Protein Interaction (PPI) networks of the PARGs established using STRING 11.0 were detected using a personalized network control algorithm (PNC). Finally, the molecular correlations of the driver genes with the XYZ phenome were analyzed with the Gene Ontology (GO) biological processes and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways from a holistic viewpoint. Results A total of 525 and 309 PARGs in the XYZ phenome at day 14 and day 30 were identified. These genes were separately enriched in 48 and 35 pathways. Furthermore, five driver genes were detected. These genes were mainly correlated with endoplasmic reticulum stress-mediated apoptosis and autophagy regulation, which could suppress atherosclerosis progression. Conclusion Our study detected the drug-responsive PARGs of the XYZ phenome in CHD and provides an exemplary strategy to investigate the genetic associations among this common phenome and its component symptoms in patients with CHD. Trial Registration ClinicalTrials.gov, NCT01681316; registered on September 7, 2012.
Collapse
Affiliation(s)
- Shuang Guan
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Ya-Nan Yu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Bing Li
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Hao Gu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Lin Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Nian Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Bo Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Xi Liu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Jun Liu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
| | - Zhong Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, People’s Republic of China
- Correspondence: Zhong Wang; Jun Liu, Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, No. 16 Nanxiaojie, Dongzhimennei, Beijing, People’s Republic of China, Email ;
| |
Collapse
|