1
|
Hajishah H, Kazemi D, Safaee E, Amini MJ, Peisepar M, Tanhapour MM, Tavasol A. Evaluation of machine learning methods for prediction of heart failure mortality and readmission: meta-analysis. BMC Cardiovasc Disord 2025; 25:264. [PMID: 40189534 PMCID: PMC11974104 DOI: 10.1186/s12872-025-04700-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Accepted: 03/24/2025] [Indexed: 04/09/2025] Open
Abstract
BACKGROUND Heart failure (HF) impacts nearly 6 million individuals in the U.S., with a projected 46% increase by 2030, is creating significant healthcare burdens. Predictive models, particularly machine learning (ML)-based models, offer promising solutions to identify patients at greater risk of adverse outcomes, such as mortality and hospital readmission. This review aims to assess the effectiveness of ML models in predicting HF-related outcomes, with a focus on their potential to improve patient care and clinical decision-making. We aim to assess how effectively machine learning models predict mortality and readmission in heart failure patients to improve clinical outcomes. METHOD The study followed PRISMA 2020 guidelines and was registered in the PROSPERO database (CRD42023481167). We conducted a systematic search in PubMed, Scopus, and Web of Science databases using specific keywords related to heart failure, machine learning, mortality and readmission. Extracted data focused on study characteristics, machine learning details, and outcomes, with AUC or c-index used as the primary outcomes for pooling analysis. The PROBAST tool was used to assess bias risk, evaluating models based on participants, predictors, outcomes, and statistical analysis. The meta-analysis pooled AUCs for different machine learning models predicting mortality and readmission. Prediction accuracy data was categorized by timeframes, with high heterogeneity determined by an I² value above 50%, leading to a random-effects model when applicable. Publication bias was assessed using Egger's and Begg's tests, with a p-value below 0.05 considered significant RESULT: A total of 4,505 studies were identified, and after screening, 64 were included in the final analysis, covering 943,941 patients. Of these, 40 studies focused on mortality, 17 on readmission, and 7 on both outcomes. In total, 346 machine learning models were evaluated, with the most common algorithms being random forest, logistic regression, and gradient boosting. The neural network model achieved the highest overall AUC for mortality prediction (0.808), while the support vector machine performed best for readmission prediction (AUC 0.733). The analysis revealed a significant risk of bias, primarily due to reliance on retrospective data and inadequate sample size justification. CONCLUSION In conclusion, this review emphasizes the strong potential of ML models in predicting HF readmission and mortality. ML algorithms show promise in improving prognostic accuracy and enabling personalized patient care. However, challenges like model interpretability, generalizability, and clinical integration persist. Overcoming these requires refined ML techniques and a robust regulatory framework to enhance HF outcomes.
Collapse
Affiliation(s)
- Hamed Hajishah
- Student Research Committee, Tehran Medical Sciences Branch, Islamic Azad University, Tehran, Iran
| | - Danial Kazemi
- Student Research Committee, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Ehsan Safaee
- Student Research Committee, Faculty of Medicine, Shahed University, Tehran, Iran
| | - Mohammad Javad Amini
- Student Research Committee, School of Medicine, Alborz University of Medical Sciences, Karaj, Iran
| | - Maral Peisepar
- School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Universal Scientific Education and Research Network (USERN), Tehran, Iran
| | - Mohammad Mahdi Tanhapour
- Student Research Committee, Tehran Medical Sciences Branch, Islamic Azad University, Tehran, Iran
| | - Arian Tavasol
- Student Research Committee, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
- Faculaty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
2
|
Heffernan AM, Shin J, Otoki K, Parker RK, Heffernan DS. The application of machine learning models in a resource-constrained environment. Ir J Med Sci 2025:10.1007/s11845-025-03951-2. [PMID: 40172783 DOI: 10.1007/s11845-025-03951-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2025] [Accepted: 03/25/2025] [Indexed: 04/04/2025]
Abstract
BACKGROUND Machine learning models (MLMs) used to influence surgical decision making often require large and complex datasets upon which to train. However, there is a paucity of literature pertaining to the ability to apply standard MLMs to small ICU datasets within resource-constrained institutions. METHODS ML models were applied to a prospective cohort of critically ill mechanically ventilated patients from a teaching hospital in rural Kenya. Characteristics included an ICU scoring system specifically for resource-constrained environments (Tropical Intensive Care Score (TropICS)). Outputs included AUC of the ROC and the feature importance table. Python-based MLMs included XGBoost and KNN. AUC of the ROC was calculated to predict mortality as the primary endpoint. RESULTS There were 294 patients, with an average age of 40.2 years, 64.3% male, 23.8% trauma, and an overall mortality of 60.2%. With respect to mortality patients who died were older (43.5 versus 35 years; p < 0.001), but with no difference in male gender (64.8% versus 63.8%; p = 0.9), or having been transferred from outside facilities (34% versus 21.5%; p = 0.5). Whilst there was no difference in the rate of tachycardia or acidosis, patients who died were more likely to present with hemodynamic instability (31% versus 6%; p < 0.001) and higher clinical severity scores. In predicting mortality, the ML models performed very well (XGBoost AUC = 0.82). Within MLM feature importance, the Tropical Intensive Care Score (TropICS) performed as well as APACHE-II and the SAPS. CONCLUSION ML models can be effectively applied to a small ICU dataset within resource-constrained environments. ML models must demonstrate functionality prior to incorporating within prospective clinical predictive models. Contextualized ICU scoring systems (TropICS) performed well within MLMs.
Collapse
Affiliation(s)
- Addison M Heffernan
- Division of Trauma and Surgical Critical Care, Department of Surgery, Brown University, Rhode Island Hospital, Providence, USA
| | - Jaewook Shin
- Division of Trauma and Surgical Critical Care, Department of Surgery, Brown University, Rhode Island Hospital, Providence, USA
| | - Kemunto Otoki
- Department of Surgery, Tenwek Hospital, Bomet, Kenya
| | | | - Daithi S Heffernan
- Division of Trauma and Surgical Critical Care, Department of Surgery, Brown University, Rhode Island Hospital, Providence, USA.
| |
Collapse
|
3
|
Petmezas G, Papageorgiou VE, Vassilikos V, Pagourelias E, Tachmatzidis D, Tsaklidis G, Katsaggelos AK, Maglaveras N. Enhanced heart failure mortality prediction through model-independent hybrid feature selection and explainable machine learning. J Biomed Inform 2025; 163:104800. [PMID: 39956346 DOI: 10.1016/j.jbi.2025.104800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2024] [Revised: 01/02/2025] [Accepted: 01/23/2025] [Indexed: 02/18/2025]
Abstract
Heart failure (HF) remains a significant public health challenge with high mortality rates. Machine learning (ML) techniques offer a promising approach to predict HF mortality, potentially improving clinical outcomes. However, the effectiveness of these techniques heavily depends on the quality and relevance of the features used. This study introduces a novel hybrid feature selection methodology that combines Extremely Randomized Trees (Extra-Trees) and non-linear correlation measures to enhance 1-year all-cause mortality prediction in HF patients using echocardiographic and key demographic data. Unlike existing feature selection methods that are often tied to specific ML models and produce inconsistent feature sets across different algorithms, our proposed approach is model-independent, ensuring robustness and generalizability. Moreover, the optimal number of predictive features is identified through loss graph inspection, leading to a compact and highly informative subset of seven features. We trained and evaluated seven widely-used ML models on both the full feature set and the selected subset, finding that most models maintained or improved their predictive performance despite an 80% reduction in features. Model interpretability was enhanced using SHapley Additive exPlanations (SHAP), allowing for a detailed examination of how individual features influence predictions. To further assess its effectiveness, we compared our methodology against widely known feature selection techniques across all seven ML models. The results underscore the superiority of our proposed feature set in accurately predicting HF mortality over conventional methods, offering new opportunities for personalized management strategies based on a streamlined and explainable feature subset.
Collapse
Affiliation(s)
- Georgios Petmezas
- 2(nd) Department of Obstetrics and Gynecology, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece.
| | | | - Vassilios Vassilikos
- 3(rd) Department of Cardiology, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Efstathios Pagourelias
- 3(rd) Department of Cardiology, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Dimitrios Tachmatzidis
- 3(rd) Department of Cardiology, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - George Tsaklidis
- Department of Mathematics, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Aggelos K Katsaggelos
- Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL, USA
| | - Nicos Maglaveras
- 2(nd) Department of Obstetrics and Gynecology, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| |
Collapse
|
4
|
Tokac U, Chipps J, Brysiewicz P, Bruce J, Clarke D. Using Machine Learning to Improve Readmission Risk in Surgical Patients in South Africa. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2025; 22:345. [PMID: 40238336 PMCID: PMC11942159 DOI: 10.3390/ijerph22030345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Revised: 02/21/2025] [Accepted: 02/23/2025] [Indexed: 04/18/2025]
Abstract
Unplanned readmission within 30 days is a major challenge both globally and in South Africa. The aim of this study was to develop a machine learning model to predict unplanned surgical and trauma readmission to a public hospital in South Africa from unstructured text data. A retrospective cohort of records of patients was subjected to random forest analysis, using natural language processing and sentiment analysis to deal with data in free text in an electronic registry. Our findings were within the range of global studies, with reported AUC values between 0.54 and 0.92. For trauma unplanned readmissions, the discharge plan score was the most important predictor in the model, and for surgical unplanned readmissions, the problem score was the most important predictor in the model. The use of machine learning and natural language processing improved the accuracy of predicting readmissions.
Collapse
Affiliation(s)
- Umit Tokac
- College of Nursing, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| | - Jennifer Chipps
- School of Nursing, Faculty of Community Health Sciences, University of Western Cape, Cape Town 7530, South Africa
| | - Petra Brysiewicz
- School of Nursing and Public Health, University of KwaZulu-Natal, Durban 4041, South Africa
| | - John Bruce
- School of Clinical Medicine, University of KwaZulu-Natal, Durban 4041, South Africa
| | - Damian Clarke
- School of Clinical Medicine, University of KwaZulu-Natal, Durban 4041, South Africa
- School of Clinical Medicine, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg 2017, South Africa
| |
Collapse
|
5
|
Yan L, Zhang J, Chen L, Zhu Z, Sheng X, Zheng G, Yuan J. Predictive Value of Machine Learning for the Risk of In-Hospital Death in Patients With Heart Failure: A Systematic Review and Meta-Analysis. Clin Cardiol 2025; 48:e70071. [PMID: 39723651 PMCID: PMC11670054 DOI: 10.1002/clc.70071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 12/06/2024] [Accepted: 12/12/2024] [Indexed: 12/28/2024] Open
Abstract
BACKGROUND The efficiency of machine learning (ML) based predictive models in predicting in-hospital mortality for heart failure (HF) patients is a topic of debate. In this context, this study's objective is to conduct a meta-analysis to compare and assess existing prognostic models designed for predicting in-hospital mortality in HF patients. METHODS A systematic search of databases was conducted, including PubMed, Embase, Web of Science, and Cochrane Library up to January 2023. To ensure comprehensiveness, we performed an additional search in June 2023. The Prediction Model Risk of Bias Assessment Tool was employed to assess the validity and reliability of ML models. RESULTS Our analysis incorporated 28 studies involving a total of 106 predictive models based on 14 different ML techniques. In the training data set, these models showed a combined C-index of 0.781, sensitivity of 0.56, and specificity of 0.94. In the validation data set, the models exhibited a combined C-index of 0.758, sensitivity of 0.57, and specificity of 0.84. Logistic regression (LR) was the most frequently used ML algorithm. LR models in the training set had a combined C-index of 0.795, sensitivity of 0.63, and specificity of 0.85, and these measures for LR models in the validation set were 0.751, 0.66, and 0.79, respectively. CONCLUSIONS Our study indicates that although ML is increasingly being leveraged to predict in-hospital mortality for HF patients, the predictive performance remains suboptimal. Although these models have relatively high C-index and specificity, their ability to predict positive events is limited, as indicated by their low sensitivity.
Collapse
Affiliation(s)
- Liyuan Yan
- Department of CardiologyAffiliated Changshu Hospital of Nantong UniversityChangshuJiangsuChina
| | - Jinlong Zhang
- Department of CardiologyThe First People's Hospital of Yancheng, Fourth Affiliated Hospital of Nantong UniversityYanchengJiangsuChina
| | - Le Chen
- Department of CardiologyAffiliated Changshu Hospital of Nantong UniversityChangshuJiangsuChina
| | - Zongcheng Zhu
- Department of CardiologyThe First Affiliated Hospital of Soochow UniversitySuzhouJiangsuChina
| | - Xiaodong Sheng
- Department of CardiologyAffiliated Changshu Hospital of Nantong UniversityChangshuJiangsuChina
| | - Guanqun Zheng
- Department of CardiologyAffiliated Changshu Hospital of Nantong UniversityChangshuJiangsuChina
| | - Jiamin Yuan
- Department of CardiologyThe First Affiliated Hospital of Soochow UniversitySuzhouJiangsuChina
| |
Collapse
|
6
|
Li C, Wang Y, Meng L, Zhong W, Zhang C, Liu T. Integrating EPSOSA-BP neural network algorithm for enhanced accuracy and robustness in optimizing coronary artery disease prediction. Sci Rep 2024; 14:30993. [PMID: 39730803 DOI: 10.1038/s41598-024-82184-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Accepted: 12/03/2024] [Indexed: 12/29/2024] Open
Abstract
Coronary artery disease represents a formidable health threat to middle-aged and elderly populations worldwide. This research introduces an advanced BP neural network algorithm, EPSOSA-BP, which integrates particle swarm optimization, simulated annealing, and a particle elimination mechanism to elevate the precision of heart disease prediction models. To address prior limitations in feature selection, the study employs single-hot encoding and Principal Component Analysis, thereby enhancing the model's feature learning capability. The proposed method achieved remarkable accuracy rates of 93.22% and 95.20% on the UCI and Kaggle datasets, respectively, underscoring its exceptional performance even with small sample sizes. Ablation experiments further validated the efficacy of the data preprocessing and feature selection techniques employed. Notably, the EPSOSA algorithm surpassed classical optimization algorithms in terms of convergence speed, while also demonstrating improved sensitivity and specificity. This model holds significant potential for facilitating early identification of high-risk patients, which could ultimately save lives and optimize the utilization of medical resources. Despite implementation challenges, including technical integration and data standardization, the algorithm shows promise for use in emergency settings and community health services for regular cardiac risk monitoring.
Collapse
Affiliation(s)
- Chengjie Li
- The Key Laboratory for Computer Systems of State Ethnic Affairs Commission, School of Computer and Artificial Intelligence, Southwest Minzu University, Chengdu, 610041, China
- University of Electronic Science and Technology of China, Chengdu, 611731, China
| | - Yanglin Wang
- The Key Laboratory for Computer Systems of State Ethnic Affairs Commission, School of Computer and Artificial Intelligence, Southwest Minzu University, Chengdu, 610041, China
| | - Linghui Meng
- The Key Laboratory for Computer Systems of State Ethnic Affairs Commission, School of Computer and Artificial Intelligence, Southwest Minzu University, Chengdu, 610041, China
| | - Wen Zhong
- Department of General Medicine, Chengdu Third People's Hospital, Chengdu, 610014, China
| | - Chengfang Zhang
- Intelligent Policing and National Security Risk Management Laboratory, Intelligent Policing Key Laboratory of Sichuan Province, Sichuan Police College, Luzhou, China.
| | - Tao Liu
- The Key Laboratory for Computer Systems of State Ethnic Affairs Commission, School of Computer and Artificial Intelligence, Southwest Minzu University, Chengdu, 610041, China
| |
Collapse
|
7
|
Kirdeev A, Burkin K, Vorobev A, Zbirovskaya E, Lifshits G, Nikolaev K, Zelenskaya E, Donnikov M, Kovalenko L, Urvantseva I, Poptsova M. Machine learning models for predicting risks of MACEs for myocardial infarction patients with different VEGFR2 genotypes. Front Med (Lausanne) 2024; 11:1452239. [PMID: 39301488 PMCID: PMC11410707 DOI: 10.3389/fmed.2024.1452239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 08/19/2024] [Indexed: 09/22/2024] Open
Abstract
Background The development of prognostic models for the identification of high-risk myocardial infarction (MI) patients is a crucial step toward personalized medicine. Genetic factors are known to be associated with an increased risk of cardiovascular diseases; however, little is known about whether they can be used to predict major adverse cardiac events (MACEs) for MI patients. This study aimed to build a machine learning (ML) model to predict MACEs in MI patients based on clinical, imaging, laboratory, and genetic features and to assess the influence of genetics on the prognostic power of the model. Methods We analyzed the data from 218 MI patients admitted to the emergency department at the Surgut District Center for Diagnostics and Cardiovascular Surgery, Russia. Upon admission, standard clinical measurements and imaging data were collected for each patient. Additionally, patients were genotyped for VEGFR-2 variation rs2305948 (C/C, C/T, T/T genotypes with T being the minor risk allele). The study included a 9-year follow-up period during which major ischemic events were recorded. We trained and evaluated various ML models, including Gradient Boosting, Random Forest, Logistic Regression, and AutoML. For feature importance analysis, we applied the sequential feature selection (SFS) and Shapley's scheme of additive explanation (SHAP) methods. Results The CatBoost algorithm, with features selected using the SFS method, showed the best performance on the test cohort, achieving a ROC AUC of 0.813. Feature importance analysis identified the dose of statins as the most important factor, with the VEGFR-2 genotype among the top 5. The other important features are coronary artery lesions (coronary artery stenoses ≥70%), left ventricular (LV) parameters such as lateral LV wall and LV mass, diabetes, type of revascularization (CABG or PCI), and age. We also showed that contributions are additive and that high risk can be determined by cumulative negative effects from different prognostic factors. Conclusion Our ML-based approach demonstrated that the VEGFR-2 genotype is associated with an increased risk of MACEs in MI patients. However, the risk can be significantly reduced by high-dose statins and positive factors such as the absence of coronary artery lesions, absence of diabetes, and younger age.
Collapse
Affiliation(s)
- Alexander Kirdeev
- Faculty of Computer Science, AI and Digital Science Institute, International Laboratory of Bioinformatics, Higher School of Economics University, Moscow, Russia
| | - Konstantin Burkin
- Faculty of Computer Science, AI and Digital Science Institute, International Laboratory of Bioinformatics, Higher School of Economics University, Moscow, Russia
| | - Anton Vorobev
- Department of Cardiology, Surgut State University, Surgut, Russia
| | - Elena Zbirovskaya
- Faculty of Computer Science, AI and Digital Science Institute, International Laboratory of Bioinformatics, Higher School of Economics University, Moscow, Russia
| | - Galina Lifshits
- Institute of Chemical Biology and Fundamental Medicine, Novosibirsk, Russia
| | - Konstantin Nikolaev
- Federal Research Center Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Elena Zelenskaya
- Department of Cardiology, Surgut State University, Surgut, Russia
| | - Maxim Donnikov
- Department of Cardiology, Surgut State University, Surgut, Russia
| | - Lyudmila Kovalenko
- Department of General Pathology and Pathophysiology, Surgut State University, Surgut, Russia
| | - Irina Urvantseva
- Department of Cardiology, Surgut State University, Surgut, Russia
- Ugra Center for Diagnostics and Cardiovascular Surgery, Surgut, Russia
| | - Maria Poptsova
- Faculty of Computer Science, AI and Digital Science Institute, International Laboratory of Bioinformatics, Higher School of Economics University, Moscow, Russia
| |
Collapse
|
8
|
Saqib M, Perswani P, Muneem A, Mumtaz H, Neha F, Ali S, Tabassum S. Machine learning in heart failure diagnosis, prediction, and prognosis: review. Ann Med Surg (Lond) 2024; 86:3615-3623. [PMID: 38846887 PMCID: PMC11152866 DOI: 10.1097/ms9.0000000000002138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 04/24/2024] [Indexed: 06/09/2024] Open
Abstract
Globally, cardiovascular diseases take the lives of over 17 million people each year, mostly through myocardial infarction, or MI, and heart failure (HF). This comprehensive literature review examines various aspects related to the diagnosis, prediction, and prognosis of HF in the context of machine learning (ML). The review covers an array of topics, including the diagnosis of HF with preserved ejection fraction (HFpEF) and the identification of high-risk patients with HF with reduced ejection fraction (HFrEF). The prediction of mortality in different HF populations using different ML approaches is explored, encompassing patients in the ICU, and HFpEF patients using biomarkers and gene expression. The review also delves into the prediction of mortality and hospitalization rates in HF patients with mid-range ejection fraction (HFmrEF) using ML methods. The findings highlight the significance of a multidimensional approach that encompasses clinical evaluation, laboratory assessments, and comprehensive research to improve our understanding and management of HF. Promising predictive models incorporating biomarkers, gene expression, and consideration of epigenetics demonstrate potential in estimating mortality and identifying high-risk HFpEF patients. This literature review serves as a valuable resource for researchers, clinicians, and healthcare professionals seeking a comprehensive and updated understanding of the role of ML diagnosis, prediction, and prognosis of HF across different subtypes and patient populations.
Collapse
Affiliation(s)
| | | | - Abraar Muneem
- College of Medicine, The Pennsylvania State University, Hershey, United States
| | | | - Fnu Neha
- Jinnah Sindh Medical University, Karachi
| | | | | |
Collapse
|
9
|
Sutradhar A, Al Rafi M, Shamrat FMJM, Ghosh P, Das S, Islam MA, Ahmed K, Zhou X, Azad AKM, Alyami SA, Moni MA. BOO-ST and CBCEC: two novel hybrid machine learning methods aim to reduce the mortality of heart failure patients. Sci Rep 2023; 13:22874. [PMID: 38129433 PMCID: PMC10739972 DOI: 10.1038/s41598-023-48486-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 11/27/2023] [Indexed: 12/23/2023] Open
Abstract
Heart failure (HF) is a leading cause of mortality worldwide. Machine learning (ML) approaches have shown potential as an early detection tool for improving patient outcomes. Enhancing the effectiveness and clinical applicability of the ML model necessitates training an efficient classifier with a diverse set of high-quality datasets. Hence, we proposed two novel hybrid ML methods ((a) consisting of Boosting, SMOTE, and Tomek links (BOO-ST); (b) combining the best-performing conventional classifier with ensemble classifiers (CBCEC)) to serve as an efficient early warning system for HF mortality. The BOO-ST was introduced to tackle the challenge of class imbalance, while CBCEC was responsible for training the processed and selected features derived from the Feature Importance (FI) and Information Gain (IG) feature selection techniques. We also conducted an explicit and intuitive comprehension to explore the impact of potential characteristics correlating with the fatality cases of HF. The experimental results demonstrated the proposed classifier CBCEC showcases a significant accuracy of 93.67% in terms of providing the early forecasting of HF mortality. Therefore, we can reveal that our proposed aspects (BOO-ST and CBCEC) can be able to play a crucial role in preventing the death rate of HF and reducing stress in the healthcare sector.
Collapse
Affiliation(s)
- Ananda Sutradhar
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City (DSC), Birulia, Savar, Dhaka, 1216, Bangladesh
| | - Mustahsin Al Rafi
- Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City (DSC), Birulia, Savar, Dhaka, 1216, Bangladesh
| | - F M Javed Mehedi Shamrat
- Department of Computer System and Technology, University of Malaya, 50603, Kuala Lumpur, Malaysia
| | - Pronab Ghosh
- Department of Computer Science, Lakehead University, 955 Oliver Rd, Thunder Bay, ON, P7B 5E1, Canada
| | - Subrata Das
- Department of Computer Science, Lakehead University, 955 Oliver Rd, Thunder Bay, ON, P7B 5E1, Canada
| | - Md Anaytul Islam
- Department of Computer Science, Lakehead University, 955 Oliver Rd, Thunder Bay, ON, P7B 5E1, Canada
| | - Kawsar Ahmed
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada
- Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh
- Health Informatics Research Lab, Department of Computer Science and Engineering, Daffodil International University, Daffodil Smart City, Birulia, Dhaka, 1216, Bangladesh
| | - Xujuan Zhou
- School of Business, University of Southern Queensland, Toowoomba, Australia
| | - A K M Azad
- Department of Mathematics and Statistics, Faculty of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), 13318, Riyadh, Saudi Arabia
| | - Salem A Alyami
- Department of Mathematics and Statistics, Faculty of Science, Imam Mohammad Ibn Saud Islamic University (IMSIU), 13318, Riyadh, Saudi Arabia
| | - Mohammad Ali Moni
- Centre for AI & Digital Health Technology, Artificial Intelligence & Cyber Future Institute, Charles Sturt University, Bathurst, NSW, 2795, Australia.
| |
Collapse
|