1
|
Hajishah H, Kazemi D, Safaee E, Amini MJ, Peisepar M, Tanhapour MM, Tavasol A. Evaluation of machine learning methods for prediction of heart failure mortality and readmission: meta-analysis. BMC Cardiovasc Disord 2025; 25:264. [PMID: 40189534 PMCID: PMC11974104 DOI: 10.1186/s12872-025-04700-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Accepted: 03/24/2025] [Indexed: 04/09/2025] Open
Abstract
BACKGROUND Heart failure (HF) impacts nearly 6 million individuals in the U.S., with a projected 46% increase by 2030, is creating significant healthcare burdens. Predictive models, particularly machine learning (ML)-based models, offer promising solutions to identify patients at greater risk of adverse outcomes, such as mortality and hospital readmission. This review aims to assess the effectiveness of ML models in predicting HF-related outcomes, with a focus on their potential to improve patient care and clinical decision-making. We aim to assess how effectively machine learning models predict mortality and readmission in heart failure patients to improve clinical outcomes. METHOD The study followed PRISMA 2020 guidelines and was registered in the PROSPERO database (CRD42023481167). We conducted a systematic search in PubMed, Scopus, and Web of Science databases using specific keywords related to heart failure, machine learning, mortality and readmission. Extracted data focused on study characteristics, machine learning details, and outcomes, with AUC or c-index used as the primary outcomes for pooling analysis. The PROBAST tool was used to assess bias risk, evaluating models based on participants, predictors, outcomes, and statistical analysis. The meta-analysis pooled AUCs for different machine learning models predicting mortality and readmission. Prediction accuracy data was categorized by timeframes, with high heterogeneity determined by an I² value above 50%, leading to a random-effects model when applicable. Publication bias was assessed using Egger's and Begg's tests, with a p-value below 0.05 considered significant RESULT: A total of 4,505 studies were identified, and after screening, 64 were included in the final analysis, covering 943,941 patients. Of these, 40 studies focused on mortality, 17 on readmission, and 7 on both outcomes. In total, 346 machine learning models were evaluated, with the most common algorithms being random forest, logistic regression, and gradient boosting. The neural network model achieved the highest overall AUC for mortality prediction (0.808), while the support vector machine performed best for readmission prediction (AUC 0.733). The analysis revealed a significant risk of bias, primarily due to reliance on retrospective data and inadequate sample size justification. CONCLUSION In conclusion, this review emphasizes the strong potential of ML models in predicting HF readmission and mortality. ML algorithms show promise in improving prognostic accuracy and enabling personalized patient care. However, challenges like model interpretability, generalizability, and clinical integration persist. Overcoming these requires refined ML techniques and a robust regulatory framework to enhance HF outcomes.
Collapse
Affiliation(s)
- Hamed Hajishah
- Student Research Committee, Tehran Medical Sciences Branch, Islamic Azad University, Tehran, Iran
| | - Danial Kazemi
- Student Research Committee, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Ehsan Safaee
- Student Research Committee, Faculty of Medicine, Shahed University, Tehran, Iran
| | - Mohammad Javad Amini
- Student Research Committee, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Maral Peisepar
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Universal Scientific Education and Research Network (USERN), Tehran, Iran
| | - Mohammad Mahdi Tanhapour
- Student Research Committee, Tehran Medical Sciences Branch, Islamic Azad University, Tehran, Iran
| | - Arian Tavasol
- Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
- Faculaty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
2
|
Lin Q, Zhao W, Zhang H, Chen W, Lian S, Ruan Q, Qu Z, Lin Y, Chai D, Lin X. Predicting the risk of heart failure after acute myocardial infarction using an interpretable machine learning model. Front Cardiovasc Med 2025; 12:1444323. [PMID: 39925976 PMCID: PMC11802525 DOI: 10.3389/fcvm.2025.1444323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Accepted: 01/06/2025] [Indexed: 02/11/2025] Open
Abstract
Background Early prediction of heart failure (HF) after acute myocardial infarction (AMI) is essential for personalized treatment. We aimed to use interpretable machine learning (ML) methods to develop a risk prediction model for HF in AMI patients. Methods We retrospectively included patients initially with AMI who received percutaneous coronary intervention (PCI) in our hospital from November 2016 to February 2020. The primary endpoint was the occurrence of HF within 3 years after operation. For developing a predictive model for HF risk in AMI patients, the least absolute shrinkage and selection operator (LASSO) Regression was used to feature selection, and four ML algorithms including Random Forest (RF), Extreme Gradient Boost (XGBoost), Support Vector Machine (SVM), and Logistic Regression (LR) were employed to develop the model on the training set. The performance evaluation of the prediction model was carried out on the training set and the testing set, utilizing metrics including AUC (Area under the receiver operating characteristic curve), calibration plot, and decision curve analysis (DCA). In addition, we used the Shapley Additive Explanations (SHAP) value to determine the importance of the selected features and interpret the optimal model. Results A total of 1220 AMI patients were included and 244 (20%) patients developed HF during follow-up. Among the four evaluated ML models, the XGBoost model exhibited exceptional accuracy, with an AUC value of 0.922. The SHAP method showed that left ventricular ejection fraction (LVEF), left ventricular end-systolic diameter (LVDs) and lactate dehydrogenase (LDH) were identified as the three most important characteristics to predict HF risk in AMI patients. Individual risk assessment was performed using SHAP plots and waterfall plot analysis. Conclusions Our research demonstrates the potential of ML methods in the early prediction of HF risk in AMI patients. Furthermore, it enhances the interpretability of the XGBoost model through SHAP analysis to guide clinical decision-making.
Collapse
Affiliation(s)
- Qingqing Lin
- Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Wenxiang Zhao
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- Department of Cardiology, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Hailin Zhang
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- Department of Cardiology, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Wenhao Chen
- Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, China
| | - Sheng Lian
- Fujian Key Laboratory of Network Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou, China
| | - Qinyun Ruan
- Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Zhaoyang Qu
- Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Yimin Lin
- Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
| | - Dajun Chai
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- Department of Cardiology, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- Key Laboratory of Metabolic Cardiovascular Disease of Fujian Province Colleges and Universities, Fuzhou, China
- Clinical Research Center for Metabolic Heart Disease of Fujian Province, Fuzhou, China
| | - Xiaoyan Lin
- Department of Ultrasound, The First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- National Regional Medical Center, Binhai Branch of the First Affiliated Hospital, Fujian Medical University, Fuzhou, China
- Key Laboratory of Metabolic Cardiovascular Disease of Fujian Province Colleges and Universities, Fuzhou, China
- Clinical Research Center for Metabolic Heart Disease of Fujian Province, Fuzhou, China
| |
Collapse
|
3
|
Ding L, Yin X, Wen G, Sun D, Xian D, Zhao Y, Zhang M, Yang W, Chen W. Prediction of preterm birth using machine learning: a comprehensive analysis based on large-scale preschool children survey data in Shenzhen of China. BMC Pregnancy Childbirth 2024; 24:810. [PMID: 39633287 PMCID: PMC11616287 DOI: 10.1186/s12884-024-06980-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2024] [Accepted: 11/11/2024] [Indexed: 12/07/2024] Open
Abstract
BACKGROUND Preterm birth (PTB) is a significant cause of neonatal mortality and long-term health issues. Accurate prediction and timely prevention of PTB are essential for reducing associated child mortality and morbidity. Traditional predictive methods face challenges due to heterogeneous risk factors and their interaction effects. This study aims to develop and evaluate six machine learning (ML) models to predict PTB using large-scale children survey data from Shenzhen, China, and to identify key predictors through Shapley Additive Explanations (SHAP) analysis. METHODS Data from 84,050 mother-child pairs, collected in 2021 and 2022, were processed and divided into training, validation, and test sets. Six ML models were tested: L1-Regularised Logistic Regression, Light Gradient Boosting Machine (LightGBM), Naive Bayes, Random Forests, Support Vector Machine, and Extreme Gradient Boosting (XGBoost). Model performance was evaluated based on discrimination, calibration and clinical utility. SHAP analysis was used to interpret the importance and impact of individual features on PTB prediction. RESULTS The XGBoost model demonstrated the best overall performance, with the area under the receiver operating characteristic curve (AUC) scores of 0.752 and 0.757 in the validation and test sets, respectively, along with favorable calibration and clinical utility. Key predictors identified were multiple pregnancies, threatened abortion, and maternal age of conception. SHAP analysis highlighted the positive impacts of multiple pregnancies and threatened abortion, as well as the negative impact of micronutrient supplementation on PTB. CONCLUSION Our study found that ML models, particularly XGBoost, show promise in accurately predicting PTB and identifying key risk factors. These findings provide the potential of ML for enhancing clinical interventions, personalizing prenatal care, and informing public health initiatives.
Collapse
Affiliation(s)
- Liwen Ding
- Department of Epidemiology and Health Statistics, School of Public Health, Sun Yat-Sen University, Guangzhou, 510080, China
| | - Xiaona Yin
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China
| | - Guomin Wen
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China
| | - Dengli Sun
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China
| | - Danxia Xian
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China
| | - Yafen Zhao
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China
| | - Maolin Zhang
- Department of Epidemiology and Health Statistics, School of Public Health, Sun Yat-Sen University, Guangzhou, 510080, China
| | - Weikang Yang
- Women's and Children's Hospital of Longhua District of Shenzhen, Shenzhen, 518109, China.
| | - Weiqing Chen
- Department of Epidemiology and Health Statistics, School of Public Health, Sun Yat-Sen University, Guangzhou, 510080, China.
- School of Health Management, Xinhua College of Guangzhou, Guangzhou, 510080, China.
| |
Collapse
|
4
|
Liu L, Liang L, Luo Y, Han J, Lu D, Cai R, Sethi G, Mai S. Unveiling the Power of Gut Microbiome in Predicting Neoadjuvant Immunochemotherapy Responses in Esophageal Squamous Cell Carcinoma. RESEARCH (WASHINGTON, D.C.) 2024; 7:0529. [PMID: 39545038 PMCID: PMC11562848 DOI: 10.34133/research.0529] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 10/16/2024] [Accepted: 10/18/2024] [Indexed: 11/17/2024]
Abstract
The role of the gut microbiome in enhancing the efficacy of anticancer treatments like chemotherapy and radiotherapy is well acknowledged. However, there is limited empirical evidence on its predictive capabilities for neoadjuvant immunochemotherapy (NICT) responses in esophageal squamous cell carcinoma (ESCC). Our study fills this gap by comprehensively analyzing the gut microbiome's influence on NICT outcomes. We analyzed 16S rRNA gene sequences from 136 fecal samples from 68 ESCC patients before and after NICT, along with 19 samples from healthy controls. After NICT, marked microbiome composition changes were noted, including a decrease in ESCC-associated pathogens and an increase in beneficial microbes such as Limosilactobacillus, Lacticaseibacillus, and Staphylococcus. Baseline microbiota profiles effectively differentiated responders from nonresponders, with responders showing higher levels of short-chain fatty acid (SCFA)-producing bacteria such as Faecalibacterium, Eubacterium_eligens_group, Anaerostipes, and Odoribacter, and nonresponders showing increases in Veillonella, Campylobacter, Atopobium, and Trichococcus. We then divided our patient cohort into training and test sets at a 4:1 ratio and utilized the XGBoost-RFE algorithm to identify 7 key microbial biomarkers-Faecalibacterium, Subdoligranulum, Veillonella, Hungatella, Odoribacter, Butyricicoccus, and HT002. A predictive model was developed using LightGBM, which achieved an area under the receiver operating characteristic curve (AUC) of 86.8% [95% confidence interval (CI), 73.8% to 99.4%] in the training set, 76.8% (95% CI, 41.2% to 99.7%) in the validation set, and 76.5% (95% CI, 50.4% to 100%) in the testing set. Our findings underscore the gut microbiome as a novel source of biomarkers for predicting NICT responses in ESCC, highlighting its potential to enhance personalized treatment strategies and advance the integration of microbiome profiling into clinical practice for modulating cancer treatment responses.
Collapse
Affiliation(s)
- Le Liu
- Integrated Clinical Microecology Center, Shenzhen Hospital,
Southern Medical University, Shenzhen, China
- Department of Gastroenterology, Zhujiang Hospital,
Southern Medical University, Guangzhou, China
| | - Liping Liang
- Department of Gastroenterology and Hepatology, Guangzhou Key Laboratory of Digestive Diseases, Guangzhou Digestive Disease Center, Guangzhou First People’s Hospital, School of Medicine,
South China University of Technology, Guangzhou, China
| | - YingJie Luo
- Department of Thoracic Surgery, Nanfang Hospital,
Southern Medical University, Guangzhou, China
| | - Jimin Han
- School of Life Sciences,
Tsinghua University, Beijing, China
| | - Di Lu
- Department of Thoracic Surgery, Nanfang Hospital,
Southern Medical University, Guangzhou, China
| | - RuiJun Cai
- Department of Thoracic Surgery, Nanfang Hospital,
Southern Medical University, Guangzhou, China
| | - Gautam Sethi
- Department of Pharmacology, Yong Loo Lin School of Medicine,
National University of Singapore, Singapore, Singapore
| | - Shijie Mai
- Department of Thoracic Surgery, Nanfang Hospital,
Southern Medical University, Guangzhou, China
| |
Collapse
|
5
|
Salih AM, Galazzo IB, Gkontra P, Rauseo E, Lee AM, Lekadir K, Radeva P, Petersen SE, Menegaz G. A review of evaluation approaches for explainable AI with applications in cardiology. Artif Intell Rev 2024; 57:240. [PMID: 39132011 PMCID: PMC11315784 DOI: 10.1007/s10462-024-10852-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2024] [Indexed: 08/13/2024]
Abstract
Explainable artificial intelligence (XAI) elucidates the decision-making process of complex AI models and is important in building trust in model predictions. XAI explanations themselves require evaluation as to accuracy and reasonableness and in the context of use of the underlying AI model. This review details the evaluation of XAI in cardiac AI applications and has found that, of the studies examined, 37% evaluated XAI quality using literature results, 11% used clinicians as domain-experts, 11% used proxies or statistical analysis, with the remaining 43% not assessing the XAI used at all. We aim to inspire additional studies within healthcare, urging researchers not only to apply XAI methods but to systematically assess the resulting explanations, as a step towards developing trustworthy and safe models. Supplementary Information The online version contains supplementary material available at 10.1007/s10462-024-10852-w.
Collapse
Affiliation(s)
- Ahmed M. Salih
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
- Department of Population Health Sciences, University of Leicester, University Rd, Leicester, LE1 7RH UK
- Department of Computer Science, University of Zakho, Duhok road, Zakho, Kurdistan Iraq
| | - Ilaria Boscolo Galazzo
- Department of Engineering for Innovative Medicine, University of Verona, S. Francesco, 22, 37129 Verona, Italy
| | - Polyxeni Gkontra
- Artificial Intelligence in Medicine Lab (BCN-AIM), Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
| | - Elisa Rauseo
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
| | - Aaron Mark Lee
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
| | - Karim Lekadir
- Artificial Intelligence in Medicine Lab (BCN-AIM), Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, Barcelona, Spain
| | - Petia Radeva
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
| | - Steffen E. Petersen
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
- Barts Heart Centre, St Bartholomew’s Hospital, Barts Health NHS Trust, West Smithfield, London, UK
- Health Data Research, London, UK
- Alan Turing Institute, London, UK
| | - Gloria Menegaz
- Department of Engineering for Innovative Medicine, University of Verona, S. Francesco, 22, 37129 Verona, Italy
| |
Collapse
|
6
|
Bu ZJ, Jiang N, Li KC, Lu ZL, Zhang N, Yan SS, Chen ZL, Hao YH, Zhang YH, Xu RB, Chi HW, Chen ZY, Liu JP, Wang D, Xu F, Liu ZL. Development and Validation of an Interpretable Machine Learning Model for Early Prognosis Prediction in ICU Patients with Malignant Tumors and Hyperkalemia. Medicine (Baltimore) 2024; 103:e38747. [PMID: 39058887 PMCID: PMC11272258 DOI: 10.1097/md.0000000000038747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 06/07/2024] [Indexed: 07/28/2024] Open
Abstract
This study aims to develop and validate a machine learning (ML) predictive model for assessing mortality in patients with malignant tumors and hyperkalemia (MTH). We extracted data on patients with MTH from the Medical Information Mart for Intensive Care-IV, version 2.2 (MIMIC-IV v2.2) database. The dataset was split into a training set (75%) and a validation set (25%). We used the Least Absolute Shrinkage and Selection Operator (LASSO) regression to identify potential predictors, which included clinical laboratory indicators and vital signs. Pearson correlation analysis tested the correlation between predictors. In-hospital death was the prediction target. The Area Under the Curve (AUC) and accuracy of the training and validation sets of 7 ML algorithms were compared, and the optimal 1 was selected to develop the model. The calibration curve was used to evaluate the prediction accuracy of the model further. SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) enhanced model interpretability. 496 patients with MTH in the Intensive Care Unit (ICU) were included. After screening, 17 clinical features were included in the construction of the ML model, and the Pearson correlation coefficient was <0.8, indicating that the correlation between the clinical features was small. eXtreme Gradient Boosting (XGBoost) outperformed other algorithms, achieving perfect scores in the training set (accuracy: 1.000, AUC: 1.000) and high scores in the validation set (accuracy: 0.734, AUC: 0.733). The calibration curves indicated good predictive calibration of the model. SHAP analysis identified the top 8 predictive factors: urine output, mean heart rate, maximum urea nitrogen, minimum oxygen saturation, minimum mean blood pressure, maximum total bilirubin, mean respiratory rate, and minimum pH. In addition, SHAP and LIME performed in-depth individual case analyses. This study demonstrates the effectiveness of ML methods in predicting mortality risk in ICU patients with MTH. It highlights the importance of predictors like urine output and mean heart rate. SHAP and LIME significantly enhanced the model's interpretability.
Collapse
Affiliation(s)
- Zhi-Jun Bu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Nan Jiang
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- The Third Affiliated Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Ke-Cheng Li
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- Department of Andrology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhi-Lin Lu
- First Clinical College, Hubei University of Chinese Medicine, Wuhan, China
| | - Nan Zhang
- School of International Studies, University of International Business and Economics, Beijing, China
| | - Shao-Shuai Yan
- Department of Thyropathy, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhi-Lin Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yu-Han Hao
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yu-Huan Zhang
- School of Acupuncture and Orthopedics, Hubei University of Chinese Medicine, Wuhan, China
| | - Run-Bing Xu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- Department of Hematology and Oncology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Han-Wei Chi
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Zu-Yi Chen
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Jian-Ping Liu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Dan Wang
- Surgery of Thyroid Gland and Breast, Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
- Hubei Shizhen Laboratory, Wuhan, China
| | - Feng Xu
- The Third Affiliated Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhao-Lan Liu
- Centre for Evidence-Based Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
7
|
Jin J, Lu J, Su X, Xiong Y, Ma S, Kong Y, Xu H. Development and Validation of an ICU-Venous Thromboembolism Prediction Model Using Machine Learning Approaches: A Multicenter Study. Int J Gen Med 2024; 17:3279-3292. [PMID: 39070227 PMCID: PMC11283785 DOI: 10.2147/ijgm.s467374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 07/12/2024] [Indexed: 07/30/2024] Open
Abstract
Purpose The purpose of this study was to establish and validate machine learning-based models for predicting the risk of venous thromboembolism (VTE) in intensive care unit (ICU) patients. Patients and Methods The clinical data of 1494 ICU patients who underwent Doppler ultrasonography or venography between December 2020 and March 2023 were extracted from three tertiary hospitals. The Boruta algorithm was used to screen the essential variables associated with VTE. Five machine learning algorithms were employed: Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Gradient Boosting Decision Tree (GBDT), and Logistic Regression (LR). Hyperparameter optimization was conducted on the predictive model of the training dataset. The performance in the validation dataset was measured using indicators, including the area under curve (AUC) of the receiver operating characteristic (ROC) curve, specificity, and F1 score. Finally, the optimal model was interpreted using the SHapley Additive exPlanation (SHAP) package. Results The incidence of VTE among the ICU patients in this study was 26.04%. We screened 19 crucial features for the risk prediction model development. Among the five models, the RF model performed best, with an AUC of 0.788 (95% CI: 0.738-0.838), an accuracy of 0.759 (95% CI: 0.709-0.809), a sensitivity of 0.633, and a Brier score of 0.166. Conclusion A machine learning-based model for prediction of VTE in ICU patients were successfully developed, which could assist clinical medical staff in identifying high-risk populations for VTE in the early stages so that prevention measures can be implemented to reduce the burden on the ICU patients.
Collapse
Affiliation(s)
- Jie Jin
- School of Nursing, Binzhou Medical University, Binzhou, People’s Republic of China
| | - Jie Lu
- School of Nursing, Binzhou Medical University, Binzhou, People’s Republic of China
| | - Xinyang Su
- Department of Spine Surgery, Binzhou Medical University Hospital, Binzhou, People’s Republic of China
| | - Yinhuan Xiong
- Department of Nursing, Binzhou People’s Hospital, Binzhou, People’s Republic of China
| | - Shasha Ma
- Department of Neurosurgery, Binzhou Medical University Hospital, Binzhou, People’s Republic of China
| | - Yang Kong
- School of Health Management, Binzhou Medical University, Yantai, People’s Republic of China
| | - Hongmei Xu
- School of Nursing, Binzhou Medical University, Binzhou, People’s Republic of China
| |
Collapse
|
8
|
Ketabi M, Andishgar A, Fereidouni Z, Sani MM, Abdollahi A, Vali M, Alkamel A, Tabrizi R. Predicting the risk of mortality and rehospitalization in heart failure patients: A retrospective cohort study by machine learning approach. Clin Cardiol 2024; 47:e24239. [PMID: 38402566 PMCID: PMC10894620 DOI: 10.1002/clc.24239] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 01/17/2024] [Accepted: 02/09/2024] [Indexed: 02/26/2024] Open
Abstract
BACKGROUND Heart failure (HF) is a global problem, affecting more than 26 million people worldwide. This study evaluated the performance of 10 machine learning (ML) algorithms and chose the best algorithm to predict mortality and readmission of HF patients by using The Fasa Registry on Systolic HF (FaRSH) database. HYPOTHESIS ML algorithms may better identify patients at increased risk of HF readmission or death with demographic and clinical data. METHODS Through comprehensive evaluation, the best-performing model was used for prediction. Finally, all the trained models were applied to the test data, which included 20% of the total data. For the final evaluation and comparison of the models, five metrics were used: accuracy, F1-score, sensitivity, specificity and Area Under Curve (AUC). RESULTS Ten ML algorithms were evaluated. The CatBoost (CAT) algorithm uses a series of decision tree models to create a nonlinear model, and this CAT algorithm performed the best of the 10 models studied. According to the three final outcomes from this study, which involved 2488 participants, 366 (14.7%) of the patients were readmitted to the hospital, 97 (3.9%) of the patients died within 1 month of the follow-up, and 342 (13.7%) of the patients died within 1 year of the follow-up. The most significant variables to predict the events were length of stay in the hospital, hemoglobin level, and family history of MI. CONCLUSIONS The ML-based risk stratification tool was able to assess the risk of 5-year all-cause mortality and readmission in patients with HF. ML could provide an explicit explanation of individualized risk prediction and give physicians an intuitive understanding of the influence of critical features in the model.
Collapse
Affiliation(s)
- Marzieh Ketabi
- Student Research CommitteeFasa University of Medical SciencesFasaIran
| | | | - Zhila Fereidouni
- Department of Medical Surgical NursingFasa University of Medical ScienceFarsIran
| | | | - Ashkan Abdollahi
- School of MedicineShiraz University of Medical SciencesShirazIran
| | - Mohebat Vali
- Student Research CommitteeShiraz University of Medical SciencesShirazIran
| | - Abdulhakim Alkamel
- Noncommunicable Diseases Research CenterFasa University of Medical ScienceFasaIran
| | - Reza Tabrizi
- Noncommunicable Diseases Research CenterFasa University of Medical ScienceFasaIran
- Clinical Research Development UnitFasa University of Medical SciencesFasaIran
| |
Collapse
|
9
|
Caterson J, Lewin A, Williamson E. The application of explainable artificial intelligence (XAI) in electronic health record research: A scoping review. Digit Health 2024; 10:20552076241272657. [PMID: 39493635 PMCID: PMC11528818 DOI: 10.1177/20552076241272657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 07/09/2024] [Indexed: 11/05/2024] Open
Abstract
Machine Learning (ML) and Deep Learning (DL) models show potential in surpassing traditional methods including generalised linear models for healthcare predictions, particularly with large, complex datasets. However, low interpretability hinders practical implementation. To address this, Explainable Artificial Intelligence (XAI) methods are proposed, but a comprehensive evaluation of their effectiveness is currently limited. The aim of this scoping review is to critically appraise the application of XAI methods in ML/DL models using Electronic Health Record (EHR) data. In accordance with PRISMA scoping review guidelines, the study searched PUBMED and OVID/MEDLINE (including EMBASE) for publications related to tabular EHR data that employed ML/DL models with XAI. Out of 3220 identified publications, 76 were included. The selected publications published between February 2017 and June 2023, demonstrated an exponential increase over time. Extreme Gradient Boosting and Random Forest models were the most frequently used ML/DL methods, with 51 and 50 publications, respectively. Among XAI methods, Shapley Additive Explanations (SHAP) was predominant in 63 out of 76 publications, followed by partial dependence plots (PDPs) in 11 publications, and Locally Interpretable Model-Agnostic Explanations (LIME) in 8 publications. Despite the growing adoption of XAI methods, their applications varied widely and lacked critical evaluation. This review identifies the increasing use of XAI in tabular EHR research and highlights a deficiency in the reporting of methods and a lack of critical appraisal of validity and robustness. The study emphasises the need for further evaluation of XAI methods and underscores the importance of cautious implementation and interpretation in healthcare settings.
Collapse
Affiliation(s)
| | - Alexandra Lewin
- London School of Hygiene and Tropical Medicine, Bloomsbury, UK
| | | |
Collapse
|
10
|
Huang G, Jin Q, Mao Y. Predicting the 5-Year Risk of Nonalcoholic Fatty Liver Disease Using Machine Learning Models: Prospective Cohort Study. J Med Internet Res 2023; 25:e46891. [PMID: 37698911 PMCID: PMC10523217 DOI: 10.2196/46891] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 08/02/2023] [Accepted: 08/16/2023] [Indexed: 09/13/2023] Open
Abstract
BACKGROUND Nonalcoholic fatty liver disease (NAFLD) has emerged as a worldwide public health issue. Identifying and targeting populations at a heightened risk of developing NAFLD over a 5-year period can help reduce and delay adverse hepatic prognostic events. OBJECTIVE This study aimed to investigate the 5-year incidence of NAFLD in the Chinese population. It also aimed to establish and validate a machine learning model for predicting the 5-year NAFLD risk. METHODS The study population was derived from a 5-year prospective cohort study. A total of 6196 individuals without NAFLD who underwent health checkups in 2010 at Zhenhai Lianhua Hospital in Ningbo, China, were enrolled in this study. Extreme gradient boosting (XGBoost)-recursive feature elimination, combined with the least absolute shrinkage and selection operator (LASSO), was used to screen for characteristic predictors. A total of 6 machine learning models, namely logistic regression, decision tree, support vector machine, random forest, categorical boosting, and XGBoost, were utilized in the construction of a 5-year risk model for NAFLD. Hyperparameter optimization of the predictive model was performed in the training set, and a further evaluation of the model performance was carried out in the internal and external validation sets. RESULTS The 5-year incidence of NAFLD was 18.64% (n=1155) in the study population. We screened 11 predictors for risk prediction model construction. After the hyperparameter optimization, CatBoost demonstrated the best prediction performance in the training set, with an area under the receiver operating characteristic (AUROC) curve of 0.810 (95% CI 0.768-0.852). Logistic regression showed the best prediction performance in the internal and external validation sets, with AUROC curves of 0.778 (95% CI 0.759-0.794) and 0.806 (95% CI 0.788-0.821), respectively. The development of web-based calculators has enhanced the clinical feasibility of the risk prediction model. CONCLUSIONS Developing and validating machine learning models can aid in predicting which populations are at the highest risk of developing NAFLD over a 5-year period, thereby helping delay and reduce the occurrence of adverse liver prognostic events.
Collapse
Affiliation(s)
- Guoqing Huang
- Department of Endocrinology, The First Affiliated Hospital of Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Qiankai Jin
- Department of Endocrinology, The First Affiliated Hospital of Ningbo University, Ningbo, China
- Health Science Center, Ningbo University, Ningbo, China
| | - Yushan Mao
- Department of Endocrinology, The First Affiliated Hospital of Ningbo University, Ningbo, China
| |
Collapse
|
11
|
Khan MS, Arshad MS, Greene SJ, Van Spall HGC, Pandey A, Vemulapalli S, Perakslis E, Butler J. Artificial intelligence and heart failure: A state-of-the-art review. Eur J Heart Fail 2023; 25:1507-1525. [PMID: 37560778 DOI: 10.1002/ejhf.2994] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 08/06/2023] [Accepted: 08/08/2023] [Indexed: 08/11/2023] Open
Abstract
Heart failure (HF) is a heterogeneous syndrome affecting more than 60 million individuals globally. Despite recent advancements in understanding of the pathophysiology of HF, many issues remain including residual risk despite therapy, understanding the pathophysiology and phenotypes of patients with HF and preserved ejection fraction, and the challenges related to integrating a large amount of disparate information available for risk stratification and management of these patients. Risk prediction algorithms based on artificial intelligence (AI) may have superior predictive ability compared to traditional methods in certain instances. AI algorithms can play a pivotal role in the evolution of HF care by facilitating clinical decision making to overcome various challenges such as allocation of treatment to patients who are at highest risk or are more likely to benefit from therapies, prediction of adverse outcomes, and early identification of patients with subclinical disease or worsening HF. With the ability to integrate and synthesize large amounts of data with multidimensional interactions, AI algorithms can supply information with which physicians can improve their ability to make timely and better decisions. In this review, we provide an overview of the AI algorithms that have been developed for establishing early diagnosis of HF, phenotyping HF with preserved ejection fraction, and stratifying HF disease severity. This review also discusses the challenges in clinical deployment of AI algorithms in HF, and the potential path forward for developing future novel learning-based algorithms to improve HF care.
Collapse
Affiliation(s)
| | | | - Stephen J Greene
- Division of Cardiology, Duke University School of Medicine, Durham, NC, USA
- Duke Clinical Research Institute, Durham, NC, USA
| | - Harriette G C Van Spall
- Department of Medicine and Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
| | - Ambarish Pandey
- Canada Population Health Research Institute, Hamilton, ON, Canada
- Division of Cardiology, Department of Internal Medicine, UT Southwestern Medical Center, Dallas, TX, USA
| | - Sreekanth Vemulapalli
- Division of Cardiology, Duke University School of Medicine, Durham, NC, USA
- Duke Clinical Research Institute, Durham, NC, USA
| | | | - Javed Butler
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
- Baylor Scott and White Research Institute, Dallas, TX, USA
| |
Collapse
|