1
|
Maisonnave M, Rajabi E, Taghavi M, VanBerkel P. Explainable machine learning to identify risk factors for unplanned hospital readmissions in Nova Scotian hospitals. Comput Biol Med 2025; 190:110024. [PMID: 40147186 DOI: 10.1016/j.compbiomed.2025.110024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 02/19/2025] [Accepted: 03/11/2025] [Indexed: 03/29/2025]
Abstract
OBJECTIVE A report from the Canadian Institute for Health Information found unplanned hospital readmissions (UHR) common, costly, and potentially avoidable, estimating a $1.8 billion cost to the Canadian healthcare system associated with inpatient readmissions within 30 days of discharge for the studied period (11 months). The first step towards addressing this costly problem is enabling early detection of patients at risk through detecting UHR risk factors. METHODOLOGY We utilized Machine Learning and explainability tools to examine risk factors for UHR within 30 days of discharge, utilizing data from Nova Scotian (Canada) healthcare institutions (2015-2022). To the best of our knowledge, our research constitutes the most comprehensive study on UHR risk factors for the province. RESULTS We found that predicting UHR solely from healthcare data has limitations, as discharge information often falls short of accurately predicting readmission occurrences. However, despite this inherent limitation, integrating explainability tools offers insights into the underlying factors contributing to readmission risk, empowering medical personnel with information to improve patient care and outcomes. As part of this work, we identify and report risk factors for UHR and build a guideline to support medical personnel's decision-making regarding targeted post-discharge follow-ups. We found that conditions such as heart failure and Chronic Obstructive Pulmonary Disease (COPD) are associated with a higher likelihood of readmission. Patients admitted for procedures related to childbirth have a lower probability of readmission. We studied the impact of the admission type, patient characteristics, and patient stay characteristics on UHR. For example, we found that new and elective admission patients are less likely to be readmitted, while patients who received a transfusion are more likely to be readmitted. CONCLUSIONS We validated the risk factors and the guidelines using real-world data. Our results suggested that our proposal correctly identifies risk factors and effectively produces valuable guidelines for medical personnel. The guideline evaluation suggests we can screen half the patients while capturing more than 72% of the readmission episodes. Our study contributes insights into the challenge of identifying risk factors for UHR while providing a practical guideline for healthcare professionals to identify factors influencing patient readmission, particularly within Nova Scotia.
Collapse
Affiliation(s)
- Mariano Maisonnave
- Management Science Department, Shannon School of Business, Cape Breton University, 1250 Grand Lake Rd, Sydney, B1M 1A2, NS, Canada.
| | - Enayat Rajabi
- Management Science Department, Shannon School of Business, Cape Breton University, 1250 Grand Lake Rd, Sydney, B1M 1A2, NS, Canada.
| | - Majid Taghavi
- Sobey School of Business, Saint Mary's University, 903 Robie St, Halifax, B3H 3C2, NS, Canada.
| | - Peter VanBerkel
- Department of Industrial Engineering, Dalhousie University, 5269 Morris St, Halifax, B3J 1B6, NS, Canada.
| |
Collapse
|
2
|
Deng Q, Li S, Zhang Y, Jia Y, Yang Y. Development and validation of interpretable machine learning models to predict distant metastasis and prognosis of muscle-invasive bladder cancer patients. Sci Rep 2025; 15:11795. [PMID: 40189676 PMCID: PMC11973202 DOI: 10.1038/s41598-025-96089-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Accepted: 03/26/2025] [Indexed: 04/09/2025] Open
Abstract
Muscle-Invasive Bladder Cancer (MIBC) is a more aggressive disease than non-muscle-invasive bladder cancer (NMIBC), with greater chances of metastasis. We sought to develop machine learning (ML) models to predict metastasis and prognosis in MIBC patients. Clinical data of MIBC cases from 2000 to 2020 were sourced from the Surveillance, Epidemiology, and End Results (SEER) database. Clinical variables used to predict DM were identified through univariate and multivariate logistic regression, and Recursive Feature Elimination (RFE). Thirteen ML models predicting DM were evaluated based on AUC, PRAUC, accuracy, sensitivity, specificity, precision, cross-entropy, Brier score, balanced accuracy, and F-beta score. SHapley Additive exPlanations (SHAP) framework helped interpret the best model. Additionally, we utilized ML algorithm combinations to predict prognosis in MIBC patients with metastasis. A total of 43,951 T2-T4 MIBC patients aged over 18 years old from the SEER database were enrolled consecutively. Nine clinical variables were selected to predict DM. The CatBoost model was identified as the optimal predictor, with AUC values of 0.956 [0.933, 0.969] for the training set, 0.882 [0.857, 0.919] for the internal test set, and 0.839 [0.723, 0.936] for the external test set. The model achieved an accuracy of 0.875 [0.854, 0.896], sensitivity of 0.869 [0.851, 0.889], specificity of 0.883 [0.823, 0.912], and precision of 0.917 [0.885, 0.944]. SHAP analysis revealed that tumor size was the most influential factor in predicting distant metastasis. For prognosis, the "RSF + Enet[alpha = 0.8]" model emerged as the top performer, with C-index values of 0.683 in training, 0.688 in the internal test, and 0.666 in the external test sets. Our ML models provide high accuracy and dependability, delivering refined, individualized predictions for metastasis risk and prognosis in MIBC patients.
Collapse
Affiliation(s)
- Qian Deng
- Luoyang Central Hospital Affiliated of Zhengzhou University, Henan, China
| | - Shan Li
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing, China
| | - Yuxiang Zhang
- Department of Urology Surgery, The First Affiliated Hospital, and College of Clinical Medicine of Henan University of Science and Technology, Luoyang, Henan, China
| | - Yuanyuan Jia
- Department of Oncology, Huai'an Second People's Hospital, Affiliated to Xuzhou Medical University, Huai'an, Jiangsu, China.
| | - Yanhui Yang
- Department of Emergency Surgery (Trauma Center), The First Affiliated Hospital, and College of Clinical Medicine of Henan University of Science and Technology, Luoyang, Henan, China.
| |
Collapse
|
3
|
Li S, Wang J, Zhang Z, Ren C, He D. Individual risk and prognostic value prediction by interpretable machine learning for distant metastasis in neuroblastoma: A population-based study and an external validation. Int J Med Inform 2025; 196:105813. [PMID: 39904180 DOI: 10.1016/j.ijmedinf.2025.105813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 12/27/2024] [Accepted: 01/23/2025] [Indexed: 02/06/2025]
Abstract
PURPOSE Neuroblastoma (NB) is a childhood malignancy with a poor prognosis and a propensity for distant metastasis (DM). We aimed to establish machine learning (ML) based model to accurately predict risk of DM and prognosis of NB patients with DM. METHODS We analyzed NB patients from the Surveillance, Epidemiology, and End Results (SEER) database between 2000 and 2020. Univariate and multivariate logistic analysis were employed to select meaning variables. Recursive Feature Elimination (RFE) method based on 6 ML algorithms was utilized in feature selection. To construct predictive model, 13 ML algorithms were evaluated by area under the operating characteristic curve (AUC), accuracy, sensitivity, specificity, precision, cross-entropy, Brier scores, Balanced Accuracy and F-beta score. An optimal ML model was constructed to predict DM, and the predictive results were explained by SHapley Additive exPlanations (SHAP) framework. Meanwhile, 101 ML algorithm combinations were developed to select the best model with highest C-index to predict prognosis of NB patients with DM. RESULTS A total of 1,668 NB patients from SEER database was consecutively enrolled. We identified that tumor primary site, grade, surgery type, regional lymph nodes, radiotherapy and chemotherapy are significant risk factors for DM. CatBoost model was selected as the best prediction model, and AUC was 0.846 (95 %CI: [0.804,0.899]), 0.834 (95 %CI: [0.796,0.873]) and 0.813 (95 %CI: [0.776,0.852]) in training, internal test and external test sets, with 0.777 accuracy, 0.839 sensitivity, 0.72 specificity and 0.731 precision in training set. Grade, chemotherapy and radiotherapy had the greatest effects on DM according to SHAP results. For prognosis prediction, "RSF + GBM" algorithm was the best prognostic model with C-index of 0.656, 0.611 and 0.629 in training, internal test and external test sets. CONCLUSIONS Our ML models demonstrate excellent accuracy and reliability, offering more precise personalized metastasis diagnosis and prognostic prediction to NB patients.
Collapse
Affiliation(s)
- Shan Li
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing 400014, China; Chongqing Key Laboratory of Children Urogenital Development and Tissue Engineering, Chongqing 400014, China; China International Science and Technology Cooperation base of Child Development and Critical Disorders, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Children's Hospital of Chongqing Medical University, Chongqing 400014, China
| | - Jinkui Wang
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing 400014, China; Chongqing Key Laboratory of Children Urogenital Development and Tissue Engineering, Chongqing 400014, China; China International Science and Technology Cooperation base of Child Development and Critical Disorders, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Children's Hospital of Chongqing Medical University, Chongqing 400014, China
| | - Zhaoxia Zhang
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing 400014, China; Chongqing Key Laboratory of Children Urogenital Development and Tissue Engineering, Chongqing 400014, China; China International Science and Technology Cooperation base of Child Development and Critical Disorders, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Children's Hospital of Chongqing Medical University, Chongqing 400014, China
| | - Chunnian Ren
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing 400014, China; Chongqing Key Laboratory of Children Urogenital Development and Tissue Engineering, Chongqing 400014, China; China International Science and Technology Cooperation base of Child Development and Critical Disorders, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Children's Hospital of Chongqing Medical University, Chongqing 400014, China
| | - Dawei He
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing 400014, China; Chongqing Key Laboratory of Children Urogenital Development and Tissue Engineering, Chongqing 400014, China; China International Science and Technology Cooperation base of Child Development and Critical Disorders, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Children's Hospital of Chongqing Medical University, Chongqing 400014, China.
| |
Collapse
|
4
|
Oh EG, Oh S, Cho S, Moon M. Predicting Readmission Among High-Risk Discharged Patients Using a Machine Learning Model With Nursing Data: Retrospective Study. JMIR Med Inform 2025; 13:e56671. [PMID: 40106364 PMCID: PMC11921987 DOI: 10.2196/56671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 01/14/2025] [Accepted: 02/06/2025] [Indexed: 03/22/2025] Open
Abstract
Background Unplanned readmissions increase unnecessary health care costs and reduce the quality of care. It is essential to plan the discharge care from the beginning of hospitalization to reduce the risk of readmission. Machine learning-based readmission prediction models can support patients' preemptive discharge care services with improved predictive power. Objective This study aimed to develop a readmission early prediction model utilizing nursing data for high-risk discharge patients. Methods This retrospective study included the electronic medical records of 12,977 patients with 1 of the top 6 high-risk readmission diseases at a tertiary hospital in Seoul from January 2018 to January 2020. We used demographic, clinical, and nursing data to construct a prediction model. We constructed unplanned readmission prediction models by dividing them into Model 1 and Model 2. Model 1 used early hospitalization data (up to 1 day after admission), and Model 2 used all the data. To improve the performance of the machine learning method, we performed 5-fold cross-validation and utilized adaptive synthetic sampling to address data imbalance. The 6 algorithms of logistic regression, random forest, decision tree, XGBoost, CatBoost, and multiperceptron layer were employed to develop predictive models. The analysis was conducted using Python Language Reference, version 3.11.3. (Python Software Foundation). Results In Model 1, among the 6 prediction model algorithms, the random forest model had the best result, with an area under the receiver operating characteristic (AUROC) curve of 0.62. In Model 2, the CatBoost model had the best result, with an AUROC of 0.64. BMI, systolic blood pressure, and age consistently emerged as the most significant predictors of readmission risk across Models 1 and 2. Model 1, which enabled early readmission prediction, showed a higher proportion of nursing data variables among its important predictors compared to Model 2. Conclusions Machine learning-based readmission prediction models utilizing nursing data provide basic data for evidence-based clinical decision support for high-risk discharge patients with complex conditions and facilitate early intervention. By integrating nursing data containing diverse patient information, these models can provide more comprehensive risk assessment and improve patient outcomes.
Collapse
Affiliation(s)
- Eui Geum Oh
- College of Nursing, Yonsei University, Seoul, Republic of Korea
- Mo-Im Kim Nursing Research Institute, Yonsei University, Seoul, Republic of Korea
- Institute for Innovation in Digital Healthcare, Yonsei University, Seoul, Republic of Korea
| | - Sunyoung Oh
- School of Nursing, Yale University, New Haven, CT, United States
| | - Seunghyeon Cho
- Digital & Technology Group, CJ CheilJedang, Suwon, Republic of Korea
| | - Mir Moon
- Institute for Innovation in Digital Healthcare, Yonsei University, Seoul, Republic of Korea
- Department of Nursing, Graduate School, Yonsei University, Seoul, Republic of Korea
- Department of Nursing, Daejeon University, 62, Daehak-ro, Dong-gu, Daejeon, 34520, Republic of Korea, 82 10-9973-8813
| |
Collapse
|
5
|
Teng B, Zhang X, Ge M, Miao M, Li W, Ma J. Personalized three-year survival prediction and prognosis forecast by interpretable machine learning for pancreatic cancer patients: a population-based study and an external validation. Front Oncol 2024; 14:1488118. [PMID: 39497722 PMCID: PMC11532159 DOI: 10.3389/fonc.2024.1488118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Accepted: 09/19/2024] [Indexed: 11/07/2024] Open
Abstract
Purpose The overall survival of patients with pancreatic cancer is extremely low. We aimed to establish machine learning (ML) based model to accurately predict three-year survival and prognosis of pancreatic cancer patients. Methods We analyzed pancreatic cancer patients from the Surveillance, Epidemiology, and End Results (SEER) database between 2000 and 2021. Univariate and multivariate logistic analysis were employed to select variables. Recursive Feature Elimination (RFE) method based on 6 ML algorithms was utilized in feature selection. To construct predictive model, 13 ML algorithms were evaluated by area under the curve (AUC), area under precision-recall curve (PRAUC), accuracy, sensitivity, specificity, precision, cross-entropy, Brier scores and Balanced Accuracy (bacc) and F Beta Score (fbeta). An optimal ML model was constructed to predict three-year survival, and the predictive results were explained by SHapley Additive exPlanations (SHAP) framework. Meanwhile, 101 ML algorithm combinations were developed to select the best model with highest C-index to predict prognosis of pancreatic cancer patients. Results A total of 20,064 pancreatic cancer patients from SEER database was consecutively enrolled. We utilized eight clinical variables to establish prediction model for three-year survival. CatBoost model was selected as the best prediction model, and AUC was 0.932 [0.924, 0.939], 0.899 [0.873, 0.934] and 0.826 [0.735, 0.919] in training, internal test and external test sets, with 0.839 [0.831, 0.847] accuracy, 0.872 [0.858, 0.887] sensitivity, 0.803 [0.784, 0.825] specificity and 0.832 [0.821, 0.853] precision. Surgery type had the greatest effects on three-year survival according to SHAP results. For prognosis prediction, "RSF+GBM" algorithm was the best prognostic model with C-index of 0.774, 0.722 and 0.674 in training, internal test and external test sets. Conclusions Our ML models demonstrate excellent accuracy and reliability, offering more precise personalized prognostic prediction to pancreatic cancer patients.
Collapse
Affiliation(s)
- Buwei Teng
- Department of Hepatobiliary Surgery, The Affiliated Lianyungang Hospital of Xuzhou Medical University/The First People’s Hospital of Lianyungang, Lianyungang, China
| | - Xiaofeng Zhang
- Department of Hepatobiliary Surgery, The Affiliated Lianyungang Hospital of Xuzhou Medical University/The First People’s Hospital of Lianyungang, Lianyungang, China
| | - Mingshu Ge
- Department of Hepatobiliary Surgery, The Affiliated Lianyungang Hospital of Xuzhou Medical University/The First People’s Hospital of Lianyungang, Lianyungang, China
| | - Miao Miao
- Department of Hepatobiliary Surgery, The Affiliated Lianyungang Hospital of Xuzhou Medical University/The First People’s Hospital of Lianyungang, Lianyungang, China
| | - Wei Li
- Department of Hepatobiliary Surgery, The Affiliated Lianyungang Hospital of Xuzhou Medical University/The First People’s Hospital of Lianyungang, Lianyungang, China
| | - Jun Ma
- Department of Imaging, The Affiliated Huai’an Hospital of Xuzhou Medical University and the Second People’s Hospital of Huai’an, Huai’an, China
| |
Collapse
|
6
|
Arnold M, Liou L, Boland MR. Development, evaluation and comparison of machine learning algorithms for predicting in-hospital patient charges for congestive heart failure exacerbations, chronic obstructive pulmonary disease exacerbations and diabetic ketoacidosis. BioData Min 2024; 17:35. [PMID: 39267093 PMCID: PMC11395859 DOI: 10.1186/s13040-024-00387-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 08/30/2024] [Indexed: 09/14/2024] Open
Abstract
BACKGROUND Hospitalizations for exacerbations of congestive heart failure (CHF), chronic obstructive pulmonary disease (COPD) and diabetic ketoacidosis (DKA) are costly in the United States. The purpose of this study was to predict in-hospital charges for each condition using machine learning (ML) models. RESULTS We conducted a retrospective cohort study on national discharge records of hospitalized adult patients from January 1st, 2016, to December 31st, 2019. We constructed six ML models (linear regression, ridge regression, support vector machine, random forest, gradient boosting and extreme gradient boosting) to predict total in-hospital cost for admission for each condition. Our models had good predictive performance, with testing R-squared values of 0.701-0.750 (mean of 0.713) for CHF; 0.694-0.724 (mean 0.709) for COPD; and 0.615-0.729 (mean 0.694) for DKA. We identified important key features driving costs, including patient age, length of stay, number of procedures, and elective/nonelective admission. CONCLUSIONS ML methods may be used to accurately predict costs and identify drivers of high cost for COPD exacerbations, CHF exacerbations and DKA. Overall, our findings may inform future studies that seek to decrease the underlying high patient costs for these conditions.
Collapse
Affiliation(s)
- Monique Arnold
- Department of Emergency Medicine, The Mount Sinai Hospital at the Icahn School of Medicine, 306 E 96th Street, #4A, New York, NY, 10128, USA.
| | - Lathan Liou
- Icahn School of Medicine at Mount Sinai Hospital, New York City, NY, USA
| | - Mary Regina Boland
- Data Science, Department of Mathematics, Herbert W. Boyer School of Natural Sciences, Mathematics, and Computing, Saint Vincent College, Latrobe, PA, USA
| |
Collapse
|
7
|
Askar M, Tafavvoghi M, Småbrekke L, Bongo LA, Svendsen K. Using machine learning methods to predict all-cause somatic hospitalizations in adults: A systematic review. PLoS One 2024; 19:e0309175. [PMID: 39178283 PMCID: PMC11343463 DOI: 10.1371/journal.pone.0309175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 08/06/2024] [Indexed: 08/25/2024] Open
Abstract
AIM In this review, we investigated how Machine Learning (ML) was utilized to predict all-cause somatic hospital admissions and readmissions in adults. METHODS We searched eight databases (PubMed, Embase, Web of Science, CINAHL, ProQuest, OpenGrey, WorldCat, and MedNar) from their inception date to October 2023, and included records that predicted all-cause somatic hospital admissions and readmissions of adults using ML methodology. We used the CHARMS checklist for data extraction, PROBAST for bias and applicability assessment, and TRIPOD for reporting quality. RESULTS We screened 7,543 studies of which 163 full-text records were read and 116 met the review inclusion criteria. Among these, 45 predicted admission, 70 predicted readmission, and one study predicted both. There was a substantial variety in the types of datasets, algorithms, features, data preprocessing steps, evaluation, and validation methods. The most used types of features were demographics, diagnoses, vital signs, and laboratory tests. Area Under the ROC curve (AUC) was the most used evaluation metric. Models trained using boosting tree-based algorithms often performed better compared to others. ML algorithms commonly outperformed traditional regression techniques. Sixteen studies used Natural language processing (NLP) of clinical notes for prediction, all studies yielded good results. The overall adherence to reporting quality was poor in the review studies. Only five percent of models were implemented in clinical practice. The most frequently inadequately addressed methodological aspects were: providing model interpretations on the individual patient level, full code availability, performing external validation, calibrating models, and handling class imbalance. CONCLUSION This review has identified considerable concerns regarding methodological issues and reporting quality in studies investigating ML to predict hospitalizations. To ensure the acceptability of these models in clinical settings, it is crucial to improve the quality of future studies.
Collapse
Affiliation(s)
- Mohsen Askar
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Masoud Tafavvoghi
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Småbrekke
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Lars Ailo Bongo
- Faculty of Science and Technology, Department of Computer Science, UiT-The Arctic University of Norway, Tromsø, Norway
| | - Kristian Svendsen
- Faculty of Health Sciences, Department of Pharmacy, UiT-The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
8
|
Panchangam PVR, A T, B U T, Maniaci MJ. Machine Learning-Based Prediction of Readmission Risk in Cardiovascular and Cerebrovascular Conditions Using Patient EMR Data. Healthcare (Basel) 2024; 12:1497. [PMID: 39120200 PMCID: PMC11311788 DOI: 10.3390/healthcare12151497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 07/19/2024] [Accepted: 07/26/2024] [Indexed: 08/10/2024] Open
Abstract
The primary objective of this study was to develop a risk-based readmission prediction model using the EMR data available at discharge. This model was then validated with the LACE plus score. The study cohort consisted of about 310,000 hospital admissions of patients with cardiovascular and cerebrovascular conditions. The EMR data of the patients consisted of lab results, vitals, medications, comorbidities, and admit/discharge settings. These data served as the input to an XGBoost model v1.7.6, which was then used to predict the number of days until the next readmission. Our model achieved remarkable results, with a precision score of 0.74 (±0.03), a recall score of 0.75 (±0.02), and an overall accuracy of approximately 82% (±5%). Notably, the model demonstrated a high accuracy rate of 78.39% in identifying the patients readmitted within 30 days and 80.81% accuracy for those with readmissions exceeding six months. The model was able to outperform the LACE plus score; of the people who were readmitted within 30 days, only 47.70 percent had a LACE plus score greater than 70, and, for people with greater than 6 months, only 10.09 percent had a LACE plus score less than 30. Furthermore, our analysis revealed that the patients with a higher comorbidity burden and lower-than-normal hemoglobin levels were associated with increased readmission rates. This study opens new doors to the world of differential patient care, helping both clinical decision makers and healthcare providers make more informed and effective decisions. This model is comparatively more robust and can potentially substitute the LACE plus score in cardiovascular and cerebrovascular settings for predicting the readmission risk.
Collapse
Affiliation(s)
| | - Tejas A
- Data Science Team, Saigeware Inc., Karnataka 560070, India; (T.A.); (T.B.U.)
| | - Thejas B U
- Data Science Team, Saigeware Inc., Karnataka 560070, India; (T.A.); (T.B.U.)
| | - Michael J. Maniaci
- Enterprise Physician Lead, Advanced Care at Home Program, Mayo Clinic Hospital, Jacksonville, FL 32224, USA;
| |
Collapse
|
9
|
Chou-Chen SW, Barboza LA. Forecasting hospital discharges for respiratory conditions in Costa Rica using climate and pollution data. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024; 21:6539-6558. [PMID: 39176407 DOI: 10.3934/mbe.2024285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2024]
Abstract
Respiratory diseases represent one of the most significant economic burdens on healthcare systems worldwide. The variation in the increasing number of cases depends greatly on climatic seasonal effects, socioeconomic factors, and pollution. Therefore, understanding these variations and obtaining precise forecasts allows health authorities to make correct decisions regarding the allocation of limited economic and human resources. We aimed to model and forecast weekly hospitalizations due to respiratory conditions in seven regional hospitals in Costa Rica using four statistical learning techniques (Random Forest, XGboost, Facebook's Prophet forecasting model, and an ensemble method combining the above methods), along with 22 climate change indices and aerosol optical depth as an indicator of pollution. Models were trained using data from 2000 to 2018 and were evaluated using data from 2019 as testing data. During the training period, we set up 2-year sliding windows and a 1-year assessment period, along with the grid search method to optimize hyperparameters for each model. The best model for each region was selected using testing data, based on predictive precision and to prevent overfitting. Prediction intervals were then computed using conformal inference. The relative importance of all climatic variables was computed for the best model, and similar patterns in some of the seven regions were observed based on the selected model. Finally, reliable predictions were obtained for each of the seven regional hospitals.
Collapse
Affiliation(s)
- Shu Wei Chou-Chen
- Centro de Investigación en Matematica Pura y Aplicada, Universidad de Costa Rica, Costa Rica
- Escuela de Estadística, Universidad de Costa Rica, Costa Rica
| | - Luis A Barboza
- Centro de Investigación en Matematica Pura y Aplicada, Universidad de Costa Rica, Costa Rica
- Escuela de Matemática, Universidad de Costa Rica, Costa Rica
| |
Collapse
|
10
|
Arnold M, Liou L, Boland MR. Development and Optimization of Machine Learning Algorithms for Predicting In-hospital Patient Charges for Congestive Heart Failure Exacerbations, Chronic Obstructive Pulmonary Disease Exacerbations and Diabetic Ketoacidosis. RESEARCH SQUARE 2024:rs.3.rs-4490027. [PMID: 38947079 PMCID: PMC11213225 DOI: 10.21203/rs.3.rs-4490027/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Background Hospitalizations for exacerbations of congestive heart failure (CHF), chronic obstructive pulmonary disease (COPD) and diabetic ketoacidosis (DKA) are costly in the United States. The purpose of this study was to predict in-hospital charges for each condition using machine learning (ML) models. Results We conducted a retrospective cohort study on national discharge records of hospitalized adult patients from January 1st, 2016, to December 31st, 2019. We used numerous ML techniques to predict in-hospital total cost. We found that linear regression (LM), gradient boosting (GBM) and extreme gradient boosting (XGB) models had good predictive performance and were statistically equivalent, with training R-square values ranging from 0.49-0.95 for CHF, 0.56-0.95 for COPD, and 0.32-0.99 for DKA. We identified important key features driving costs, including patient age, length of stay, number of procedures. and elective/nonelective admission. Conclusions ML methods may be used to accurately predict costs and identify drivers of high cost for COPD exacerbations, CHF exacerbations and DKA. Overall, our findings may inform future studies that seek to decrease the underlying high patient costs for these conditions.
Collapse
Affiliation(s)
- Monique Arnold
- The Mount Sinai Hospital at the Icahn School of Medicine
| | | | - Mary Regina Boland
- Alex G McKenna School of Business, Economics and Government. Saint Vincent College
| |
Collapse
|
11
|
Naylor KL, Vinegar M, Blake PG, Bota S, Luo B, Garg AX, Ip J, Yeung A, Gingras J, Aziz A, Iskander C, McFarlane P. Comparison of Acute Health Care Utilization Between Patients Receiving In-Center Hemodialysis and the General Population: A Population-Based Matched Cohort Study From Ontario, Canada. Can J Kidney Health Dis 2024; 11:20543581241231426. [PMID: 38449711 PMCID: PMC10916490 DOI: 10.1177/20543581241231426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/22/2023] [Indexed: 03/08/2024] Open
Abstract
Background Patients receiving maintenance hemodialysis have multiple comorbidities and are at high risk of presenting to the hospital. However, the incidence and cost of acute health care utilization in the in-center hemodialysis population and how this compares with other populations is poorly understood. Objective To determine the rate, pattern, and cost of emergency department visits and hospitalizations in patients receiving in-center hemodialysis compared with a matched general population. Design Population-based matched cohort study. Setting We used linked administrative health care databases from Ontario, Canada. Patients We included 25 379 patients (incident and prevalent) receiving in-center hemodialysis between January 1, 2010, and December 31, 2018. Patients were matched on birth date (±2 years), sex, and cohort entry date using a 1:4 ratio to 101 516 individuals from the general population. Measurements Our primary outcomes were emergency department visits (allowing for multiple visits per individual) and hospital admissions from the emergency department. We also assessed all-cause hospitalizations, all-cause readmissions within 30 days of discharge from the original hospitalization, length of stay for hospital admissions (including multiple visits per individual), and the financial cost of these admissions. Methods We presented the rate, percentage, median (25th, 75th percentiles), and incidence rate per 1000 person-years for emergency department visits and hospitalizations. Individual-level health care costs for emergency department visits and all-cause hospitalization were estimated using resource intensity weights multiplied by the cost per weighted case. Results Patients receiving in-center hemodialysis had substantially more comorbidities (eg, diabetes) than the matched general population. Eighty percent (n = 20 309) of patients receiving in-center hemodialysis had at least 1 emergency department visit compared with 56% (n = 56 452) of individuals in the matched general population, over a median follow-up of 1.8 years (25th, 75th percentiles: 0.7, 3.6) and 5.2 (2.5, 8.4) years, respectively. The incidence rate of emergency department visits, allowing for multiple visits per individual, was 2274 per 1000 person-years (95% confidence interval [CI]: 2263, 2286) for patients receiving in-center hemodialysis, which was almost 5 times as high as the matched general population (471 per 1000 person-years; 95% CI: 469, 473). The rate of hospital admissions from the emergency department and the rate of all-cause hospital admissions in the in-center hemodialysis population was more than 7 times as high as the matched general population (hospital admissions from the emergency department: 786 vs 101 per 1000 person-years; all-cause hospital admissions: 1056 vs 139 per 1000 person-years). The median number of all-cause hospitalization days per patient year was 4.0 (0, 16.5) in the in-center hemodialysis population compared with 0 (0, 0.5) in the matched general population. The cost per patient-year for emergency department visits in the in-center hemodialysis population was approximately 5.5 times as high as the matched general population while the cost of hospitalizations in the in-center hemodialysis population was approximately 11 times as high as the matched general population (emergency department visits: CAN$ 1153 vs CAN$ 209; hospitalizations: CAN$ 21 151 vs CAN$ 1873 [all costs in 2023 CAN$]). Limitations External generalizability and we could not determine whether emergency department visits and hospitalizations were preventable. Conclusions Patients receiving in-center hemodialysis have high acute health care utilization. These results improve our understanding of the burden of disease and the associated costs in the in-center hemodialysis population, highlight the need to improve acute outcomes, and can aid health care capacity planning. Additional research is needed to address the risk of hospitalization after controlling for patient comorbidities. Trial registration This is not applicable as this is a population-based matched cohort study and not a clinical trial.
Collapse
Affiliation(s)
- Kyla L. Naylor
- Lawson Health Research Institute, London Health Sciences Centre, ON, Canada
- ICES, ON, Canada
- Department of Epidemiology and Biostatistics, Western University, London, ON, Canada
| | - Marlee Vinegar
- Division of Nephrology, London Health Sciences Centre, ON, Canada
| | - Peter G. Blake
- Division of Nephrology, London Health Sciences Centre, ON, Canada
- Ontario Renal Network, Ontario Health, Toronto, Canada
| | - Sarah Bota
- Lawson Health Research Institute, London Health Sciences Centre, ON, Canada
- ICES, ON, Canada
| | - Bin Luo
- Lawson Health Research Institute, London Health Sciences Centre, ON, Canada
- ICES, ON, Canada
| | - Amit X. Garg
- Lawson Health Research Institute, London Health Sciences Centre, ON, Canada
- ICES, ON, Canada
- Department of Epidemiology and Biostatistics, Western University, London, ON, Canada
- Division of Nephrology, London Health Sciences Centre, ON, Canada
| | - Jane Ip
- Ontario Renal Network, Ontario Health, Toronto, Canada
| | - Angie Yeung
- Ontario Renal Network, Ontario Health, Toronto, Canada
| | | | - Anas Aziz
- Ontario Renal Network, Ontario Health, Toronto, Canada
| | | | - Phil McFarlane
- Ontario Renal Network, Ontario Health, Toronto, Canada
- Division of Nephrology, St. Michael’s Hospital, Toronto, ON, Canada
| |
Collapse
|
12
|
Park S, Lee C, Lee SB, Lee JY. Machine learning-based prediction model for emergency department visits using prescription information in community-dwelling non-cancer older adults. Sci Rep 2023; 13:18887. [PMID: 37919353 PMCID: PMC10622449 DOI: 10.1038/s41598-023-46094-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 10/27/2023] [Indexed: 11/04/2023] Open
Abstract
Older adults are more likely to require emergency department (ED) visits than others, which might be attributed to their medication use. Being able to predict the likelihood of an ED visit using prescription information and readily available data would be useful for primary care. This study aimed to predict the likelihood of ED visits using extensive medication variables generated according to explicit clinical criteria for elderly people and high-risk medication categories by applying machine learning (ML) methods. Patients aged ≥ 65 years were included, and ED visits were predicted with 146 variables, including demographic and comprehensive medication-related factors, using nationwide claims data. Among the eight ML models, the final model was developed using LightGBM, which showed the best performance. The final model incorporated 93 predictors, including six sociodemographic, 28 comorbidity, and 59 medication-related variables. The final model had an area under the receiver operating characteristic curve of 0.689 in the validation cohort. Approximately half of the top 20 strong predictors were medication-related variables. Here, an ED visit risk prediction model for older people was developed and validated using administrative data that can be easily applied in clinical settings to screen patients who are likely to visit an ED.
Collapse
Affiliation(s)
- Soyoung Park
- College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul, 08826, Republic of Korea
| | - Changwoo Lee
- Department of Transdisciplinary Medicine, Seoul National University Hospital, Seoul, 03080, Republic of Korea
- Department of Medical Device Development, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Seung-Bo Lee
- Department of Medical Informatics, Keimyung University School of Medicine, Daegu, 42601, Republic of Korea.
| | - Ju-Yeun Lee
- College of Pharmacy and Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul, 08826, Republic of Korea.
| |
Collapse
|
13
|
Zhang Y, Wang H, Yin C, Shu T, Yu J, Jian J, Jian C, Duan M, Kadier K, Xu Q, Wang X, Xiang T, Liu X. Development of a prediction model for the risk of 30-day unplanned readmission in older patients with heart failure: A multicenter retrospective study. Nutr Metab Cardiovasc Dis 2023; 33:1878-1887. [PMID: 37500347 DOI: 10.1016/j.numecd.2023.05.034] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/21/2023] [Accepted: 05/31/2023] [Indexed: 07/29/2023]
Abstract
BACKGROUND AND AIM Heart failure (HF) imposes significant global health costs due to its high incidence, readmission, and mortality rate. Accurate assessment of readmission risk and precise interventions have become important measures to improve health for patients with HF. Therefore, this study aimed to develop a machine learning (ML) model to predict 30-day unplanned readmissions in older patients with HF. METHODS AND RESULTS This study collected data on hospitalized older patients with HF from the medical data platform of Chongqing Medical University from January 1, 2012, to December 31, 2021. A total of 5 candidate algorithms were selected from 15 ML algorithms with excellent performance, which was evaluated by area under the operating characteristic curve (AUC) and accuracy. Then, the 5 candidate algorithms were hyperparameter tuned by 5-fold cross-validation grid search, and performance was evaluated by AUC, accuracy, sensitivity, specificity, and recall. Finally, an optimal ML model was constructed, and the predictive results were explained using the SHapley Additive exPlanations (SHAP) framework. A total of 14,843 older patients with HF were consecutively enrolled. CatBoost model was selected as the best prediction model, and AUC was 0.732, with 0.712 accuracy, 0.619 sensitivity, and 0.722 specificity. NT.proBNP, length of stay (LOS), triglycerides, blood phosphorus, blood potassium, and lactate dehydrogenase had the greatest effect on 30-day unplanned readmission in older patients with HF, according to SHAP results. CONCLUSIONS The study developed a CatBoost model to predict the risk of unplanned 30-day special-cause readmission in older patients with HF, which showed more significant performance compared with the traditional logistic regression model.
Collapse
Affiliation(s)
- Yang Zhang
- College of Medical Informatics, Chongqing Medical University, Chongqing, China; Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Haolin Wang
- College of Medical Informatics, Chongqing Medical University, Chongqing, China
| | - Chengliang Yin
- Faculty of Medicine, Macau University of Science and Technology, 999078, Macau, China
| | - Tingting Shu
- Army Medical University (Third Military Medical University), Chongqing, China
| | - Jie Yu
- Department of Medical Imaging, The Affiliated Taian City Central Hospital of Qingdao University, Taian 271000, China
| | - Jie Jian
- College of Medical Informatics, Chongqing Medical University, Chongqing, China; Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Chang Jian
- College of Medical Informatics, Chongqing Medical University, Chongqing, China; Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Minjie Duan
- College of Medical Informatics, Chongqing Medical University, Chongqing, China; Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Kaisaierjiang Kadier
- Department of Cardiology, First Affiliated Hospital of Xinjiang Medical University, Ürümqi, China
| | - Qian Xu
- Collection Development Department of Library, Chongqing Medical University, Chongqing, China
| | - Xueer Wang
- College of Oncology, Guangxi Medical University, Nanning 530022, China
| | - Tianyu Xiang
- Information Center, The University-Town Hospital of Chongqing Medical University, Chongqing, China.
| | - Xiaozhu Liu
- College of Medical Informatics, Chongqing Medical University, Chongqing, China; Medical Data Science Academy, Chongqing Medical University, Chongqing, China.
| |
Collapse
|
14
|
Bloc S, Alfonsi P, Belbachir A, Beaussier M, Bouvet L, Campard S, Campion S, Cazenave L, Diemunsch P, Di Maria S, Dufour G, Fabri S, Fletcher D, Garnier M, Godier A, Grillo P, Huet O, Joosten A, Lasocki S, Le Guen M, Le Saché F, Macquer I, Marquis C, de Montblanc J, Maurice-Szamburski A, Nguyen YL, Ruscio L, Zieleskiewicz L, Caillard A, Weiss E. Guidelines on perioperative optimization protocol for the adult patient 2023. Anaesth Crit Care Pain Med 2023; 42:101264. [PMID: 37295649 DOI: 10.1016/j.accpm.2023.101264] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
OBJECTIVE The French Society of Anesthesiology and Intensive Care Medicine [Société Française d'Anesthésie et de Réanimation (SFAR)] aimed at providing guidelines for the implementation of perioperative optimization programs. DESIGN A consensus committee of 29 experts from the SFAR was convened. A formal conflict-of-interest policy was developed at the outset of the process and enforced throughout. The entire guidelines process was conducted independently of any industry funding. The authors were advised to follow the principles of the Grading of Recommendations Assessment, Development and Evaluation (GRADE) system to guide assessment of quality of evidence. METHODS Four fields were defined: 1) Generalities on perioperative optimization programs; 2) Preoperative measures; 3) Intraoperative measures and; 4) Postoperative measures. For each field, the objective of the recommendations was to answer a number of questions formulated according to the PICO model (population, intervention, comparison, and outcomes). Based on these questions, an extensive bibliographic search was carried out using predefined keywords according to PRISMA guidelines and analyzed using the GRADE® methodology. The recommendations were formulated according to the GRADE® methodology and then voted on by all the experts according to the GRADE grid method. As the GRADE® methodology could have been fully applied for the vast majority of questions, the recommendations were formulated using a "formalized expert recommendations" format. RESULTS The experts' work on synthesis and application of the GRADE® method resulted in 30 recommendations. Among the formalized recommendations, 19 were found to have a high level of evidence (GRADE 1±) and ten a low level of evidence (GRADE 2±). For one recommendation, the GRADE methodology could not be fully applied, resulting in an expert opinion. Two questions did not find any response in the literature. After two rounds of rating and several amendments, strong agreement was reached for all the recommendations. CONCLUSIONS Strong agreement among the experts was obtained to provide 30 recommendations for the elaboration and/or implementation of perioperative optimization programs in the highest number of surgical fields.
Collapse
Affiliation(s)
- Sébastien Bloc
- Clinical Research Department, Ambroise Pare Hospital Group, Neuilly-sur-Seine, France; Department of Anesthesiology, Clinique Drouot Sport, Paris, France.
| | - Pascal Alfonsi
- Department of Anesthesia, University of Paris Descartes, Groupe Hospitalier Paris Saint-Joseph, 185 rue Raymond Losserand, F-75674 Paris Cedex 14, France
| | - Anissa Belbachir
- Service d'Anesthésie Réanimation, UF Douleur, Assistance Publique Hôpitaux de Paris, APHP.Centre, Site Cochin, Paris, France
| | - Marc Beaussier
- Department of Digestive, Oncologic and Metabolic Surgery, Institut Mutualiste Montsouris, Université de Paris, 42 Boulevard Jourdan, 75014, Paris, France
| | - Lionel Bouvet
- Department of Anaesthesia and Intensive Care, Hospices Civils de Lyon, Hôpital Femme Mère Enfant, Lyon, France
| | | | - Sébastien Campion
- AP-HP, Groupe Hospitalier Universitaire APHP-Sorbonne Université, site Pitié-Salpêtrière, Département d'Anesthésie-Réanimation, F-75013 Paris, France; Sorbonne Université, INSERM, UMRS1158 Neurophysiologie Respiratoire Expérimentale et Clinique, F-75005 Paris, France
| | - Laure Cazenave
- Department of Anaesthesia and Critical Care, Hospices Civils de Lyon, Lyon, France; Groupe Jeunes, French Society of Anaesthesia and Intensive Care Medicine (SFAR), 75016 Paris, France
| | - Pierre Diemunsch
- Unité de Réanimation Chirurgicale, Service d'Anesthésie-réanimation Chirurgicale, Pôle Anesthésie-Réanimations Chirurgicales, Samu-Smur, Hôpital de Hautepierre, Hôpitaux Universitaires de Strasbourg, 1, Avenue Molière, 67098 Strasbourg Cedex, France
| | - Sophie Di Maria
- Department of Anaesthesiology and Critical Care, AP-HP, Hôpital Pitié-Salpêtrière, Paris, France
| | - Guillaume Dufour
- Service d'Anesthésie-Réanimation, CHU de Pitié-Salpêtrière, 47-83, Boulevard de l'Hôpital, 75013 Paris, France
| | - Stéphanie Fabri
- Faculty of Economics, Management & Accountancy, University of Malta, Malta
| | - Dominique Fletcher
- Université de Versailles-Saint-Quentin-en-Yvelines, Assistance Publique-Hôpitaux de Paris, Hôpital Ambroise-Paré, Service d'Anesthésie, 9, Avenue Charles-de-Gaulle, 92100 Boulogne-Billancourt, France
| | - Marc Garnier
- Sorbonne Université, GRC 29, DMU DREAM, Service d'Anesthésie-Réanimation et Médecine Périopératoire Rive Droite, Paris, France
| | - Anne Godier
- Department of Anaesthesiology and Critical Care, European Georges Pompidou Hospital, Assistance Publique-Hôpitaux de Paris, France
| | | | - Olivier Huet
- CHU de Brest, Anesthesia and Intensive Care Unit, Brest, France
| | - Alexandre Joosten
- Department of Anesthesiology, Erasme Hospital, Université Libre de Bruxelles, Brussels, Belgium; Department of Anesthesiology and Intensive Care, Hôpitaux Universitaires Paris-Sud, Université Paris-Sud, Université Paris-Saclay, Paul Brousse Hospital, Assistance Publique Hôpitaux de Paris (APHP), Villejuif, France
| | | | - Morgan Le Guen
- Paris Saclay University, Department of Anaesthesia and Pain Medicine, Foch Hospital, 92150 Suresnes, France
| | - Frédéric Le Saché
- Department of Anesthesiology, Clinique Drouot Sport, Paris, France; DMU DREAM Department of Anesthesiology and Critical Care, Pitié-Salpêtrière Hospital, Paris, France
| | - Isabelle Macquer
- Bordeaux University Hospitals, Bordeaux, Anaesthesia and Intensive Care Medicine Department, Bordeaux, France
| | - Constance Marquis
- Clinique du Sport, Département d'Anesthésie et Réanimation, Médipole Garonne, 45 rue de Gironis - CS 13 624, 31036 Toulouse Cedex 1, France
| | - Jacques de Montblanc
- Departments of Anesthesiology and Intensive Care Paris-Saclay University, Bicêtre Hospital, Assistance Publique Hôpitaux de Paris, Le Kremlin-Bicêtre, France
| | | | - Yên-Lan Nguyen
- Anesthesiology and Critical Care Medicine Department, Cochin Academic Hospital, APHP, Université de Paris, 75014 Paris, France
| | - Laura Ruscio
- Departments of Anesthesiology and Intensive Care Paris-Saclay University, Bicêtre Hospital, Assistance Publique Hôpitaux de Paris, Le Kremlin-Bicêtre, France; INSERM U 1195, Université Paris-Saclay, Saint-Aubin, Île-de-France, France
| | - Laurent Zieleskiewicz
- Service d'Anesthésie Réanimation, Hôpital Nord, AP-HM, Marseille, Aix Marseille Université, C2VN, France
| | - Anaîs Caillard
- Centre Hospitalier Universitaire La Cavale Blanche Université de Bretagne Ouest, Anaesthesiology, Critical Care and Perioperative Medicine Department, Brest, France
| | - Emmanuel Weiss
- Department of Anaesthesiology and Critical Care, Beaujon Hospital, DMU Parabol, AP-HP, Nord, Clichy, France; University of Paris, Paris, France; Inserm UMR_S1149, Centre for Research on Inflammation, Paris, France
| |
Collapse
|
15
|
Luo AL, Ravi A, Arvisais-Anhalt S, Muniyappa AN, Liu X, Wang S. Development and Internal Validation of an Interpretable Machine Learning Model to Predict Readmissions in a United States Healthcare System. INFORMATICS 2023. [DOI: 10.3390/informatics10020033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023] Open
Abstract
(1) One in four hospital readmissions is potentially preventable. Machine learning (ML) models have been developed to predict hospital readmissions and risk-stratify patients, but thus far they have been limited in clinical applicability, timeliness, and generalizability. (2) Methods: Using deidentified clinical data from the University of California, San Francisco (UCSF) between January 2016 and November 2021, we developed and compared four supervised ML models (logistic regression, random forest, gradient boosting, and XGBoost) to predict 30-day readmissions for adults admitted to a UCSF hospital. (3) Results: Of 147,358 inpatient encounters, 20,747 (13.9%) patients were readmitted within 30 days of discharge. The final model selected was XGBoost, which had an area under the receiver operating characteristic curve of 0.783 and an area under the precision-recall curve of 0.434. The most important features by Shapley Additive Explanations were days since last admission, discharge department, and inpatient length of stay. (4) Conclusions: We developed and internally validated a supervised ML model to predict 30-day readmissions in a US-based healthcare system. This model has several advantages including state-of-the-art performance metrics, the use of clinical data, the use of features available within 24 h of discharge, and generalizability to multiple disease states.
Collapse
|
16
|
Loh HW, Ooi CP, Seoni S, Barua PD, Molinari F, Acharya UR. Application of explainable artificial intelligence for healthcare: A systematic review of the last decade (2011-2022). COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107161. [PMID: 36228495 DOI: 10.1016/j.cmpb.2022.107161] [Citation(s) in RCA: 155] [Impact Index Per Article: 51.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 09/16/2022] [Accepted: 09/25/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVES Artificial intelligence (AI) has branched out to various applications in healthcare, such as health services management, predictive medicine, clinical decision-making, and patient data and diagnostics. Although AI models have achieved human-like performance, their use is still limited because they are seen as a black box. This lack of trust remains the main reason for their low use in practice, especially in healthcare. Hence, explainable artificial intelligence (XAI) has been introduced as a technique that can provide confidence in the model's prediction by explaining how the prediction is derived, thereby encouraging the use of AI systems in healthcare. The primary goal of this review is to provide areas of healthcare that require more attention from the XAI research community. METHODS Multiple journal databases were thoroughly searched using PRISMA guidelines 2020. Studies that do not appear in Q1 journals, which are highly credible, were excluded. RESULTS In this review, we surveyed 99 Q1 articles covering the following XAI techniques: SHAP, LIME, GradCAM, LRP, Fuzzy classifier, EBM, CBR, rule-based systems, and others. CONCLUSION We discovered that detecting abnormalities in 1D biosignals and identifying key text in clinical notes are areas that require more attention from the XAI research community. We hope this is review will encourage the development of a holistic cloud system for a smart city.
Collapse
Affiliation(s)
- Hui Wen Loh
- School of Science and Technology, Singapore University of Social Sciences, Singapore
| | - Chui Ping Ooi
- School of Science and Technology, Singapore University of Social Sciences, Singapore
| | - Silvia Seoni
- Department of Electronics and Telecommunications, Biolab, Politecnico di Torino, Torino 10129, Italy
| | - Prabal Datta Barua
- Faculty of Engineering and Information Technology, University of Technology Sydney, Australia; School of Business (Information Systems), Faculty of Business, Education, Law & Arts, University of Southern Queensland, Australia
| | - Filippo Molinari
- Department of Electronics and Telecommunications, Biolab, Politecnico di Torino, Torino 10129, Italy
| | - U Rajendra Acharya
- School of Science and Technology, Singapore University of Social Sciences, Singapore; School of Business (Information Systems), Faculty of Business, Education, Law & Arts, University of Southern Queensland, Australia; School of Engineering, Ngee Ann Polytechnic, Singapore; Department of Bioinformatics and Medical Engineering, Asia University, Taiwan; Research Organization for Advanced Science and Technology (IROAST), Kumamoto University, Kumamoto, Japan.
| |
Collapse
|
17
|
Duan M, Shu T, Zhao B, Xiang T, Wang J, Huang H, Zhang Y, Xiao P, Zhou B, Xie Z, Liu X. Explainable machine learning models for predicting 30-day readmission in pediatric pulmonary hypertension: A multicenter, retrospective study. Front Cardiovasc Med 2022; 9:919224. [PMID: 35958416 PMCID: PMC9360407 DOI: 10.3389/fcvm.2022.919224] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open
Abstract
BackgroundShort-term readmission for pediatric pulmonary hypertension (PH) is associated with a substantial social and personal burden. However, tools to predict individualized readmission risk are lacking. This study aimed to develop machine learning models to predict 30-day unplanned readmission in children with PH.MethodsThis study collected data on pediatric inpatients with PH from the Chongqing Medical University Medical Data Platform from January 2012 to January 2019. Key clinical variables were selected by the least absolute shrinkage and the selection operator. Prediction models were selected from 15 machine learning algorithms with excellent performance, which was evaluated by area under the operating characteristic curve (AUC). The outcome of the predictive model was interpreted by SHapley Additive exPlanations (SHAP).ResultsA total of 5,913 pediatric patients with PH were included in the final cohort. The CatBoost model was selected as the predictive model with the greatest AUC for 0.81 (95% CI: 0.77–0.86), high accuracy for 0.74 (95% CI: 0.72–0.76), sensitivity 0.78 (95% CI: 0.69–0.87), and specificity 0.74 (95% CI: 0.72–0.76). Age, length of stay (LOS), congenital heart surgery, and nonmedical order discharge showed the greatest impact on 30-day readmission in pediatric PH, according to SHAP results.ConclusionsThis study developed a CatBoost model to predict the risk of unplanned 30-day readmission in pediatric patients with PH, which showed more significant performance compared with traditional logistic regression. We found that age, LOS, congenital heart surgery, and nonmedical order discharge were important factors for 30-day readmission in pediatric PH.
Collapse
Affiliation(s)
- Minjie Duan
- College of Medical Informatics, Chongqing Medical University, Chongqing, China
- Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Tingting Shu
- Department of Cardiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Binyi Zhao
- Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Tianyu Xiang
- Information Center, The University-Town Hospital of Chongqing Medical University, Chongqing, China
| | - Jinkui Wang
- Department of Urology, Children's Hospital of Chongqing Medical University, Chongqing, China
| | - Haodong Huang
- Medical Data Science Academy, Chongqing Medical University, Chongqing, China
- Personnel Department, Chongqing Health Center for Women and Children, Chongqing, China
| | - Yang Zhang
- College of Medical Informatics, Chongqing Medical University, Chongqing, China
- Medical Data Science Academy, Chongqing Medical University, Chongqing, China
| | - Peilin Xiao
- Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Bei Zhou
- Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Zulong Xie
- Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
- *Correspondence: Zulong Xie ;
| | - Xiaozhu Liu
- Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
- Xiaozhu Liu ;
| |
Collapse
|
18
|
Wu X, Guan Q, Cheng ASK, Guan C, Su Y, Jiang J, Wang B, Zeng L, Zeng Y. Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women. Asia Pac J Oncol Nurs 2022; 9:100101. [PMID: 36276882 PMCID: PMC9579303 DOI: 10.1016/j.apjon.2022.100101] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 05/30/2022] [Indexed: 11/21/2022] Open
Abstract
Objective Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. Methods This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. Results Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD = 7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n = 206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD = 11.71). Most of the tumors were either stage I (n = 49, 31.2%) or stage II (n = 252, 68.1%). More than half of the sample had had postoperative chemotherapy (n = 227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. Conclusions This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably.
Collapse
|
19
|
Lo YT, Liao JC, Chen MH, Chang CM, Li CT. Correction to: Predictive modeling for 14-day unplanned hospital readmission risk by using machine learning algorithms. BMC Med Inform Decis Mak 2022; 22:73. [PMID: 35337321 PMCID: PMC8953066 DOI: 10.1186/s12911-022-01804-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Affiliation(s)
- Yu-Tai Lo
- Department of Geriatrics and Gerontology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan (R.O.C.)
| | - Jay Chiehen Liao
- Institute of Data Science, National Cheng Kung University, No. 1, University Road, Tainan City, 701, Taiwan (R.O.C.)
| | - Mei-Hua Chen
- Department of Geriatrics and Gerontology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan (R.O.C.)
| | - Chia-Ming Chang
- Department of Geriatrics and Gerontology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan (R.O.C.).,Department of Medicine and Institute of Gerontology, College of Medicine, National Cheng Kung University, Tainan, Taiwan (R.O.C.)
| | - Cheng-Te Li
- Institute of Data Science, National Cheng Kung University, No. 1, University Road, Tainan City, 701, Taiwan (R.O.C.).
| |
Collapse
|