Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li C, Zhang Z, Ren Y, Nie H, Lei Y, Qiu H, Xu Z, Pu X. Machine learning based early mortality prediction in the emergency department. Int J Med Inform 2021;155:104570. [PMID: 34547624 DOI: 10.1016/j.ijmedinf.2021.104570] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/01/2021] [Accepted: 09/06/2021] [Indexed: 02/08/2023]

For:	Li C, Zhang Z, Ren Y, Nie H, Lei Y, Qiu H, Xu Z, Pu X. Machine learning based early mortality prediction in the emergency department. Int J Med Inform 2021;155:104570. [PMID: 34547624 DOI: 10.1016/j.ijmedinf.2021.104570] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/01/2021] [Accepted: 09/06/2021] [Indexed: 02/08/2023]

Number

Cited by Other Article(s)

Kuo KM, Chang CS. A meta-analysis of the diagnostic test accuracy of artificial intelligence predicting emergency department dispositions. BMC Med Inform Decis Mak 2025;25:187. [PMID: 40375078 PMCID: PMC12082892 DOI: 10.1186/s12911-025-03010-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 04/23/2025] [Indexed: 05/18/2025] Open

Abstract

BACKGROUND

The rapid advancement of Artificial Intelligence (AI) has led to its widespread application across various domains, showing encouraging outcomes. Many studies have utilized AI to forecast emergency department (ED) disposition, aiming to forecast patient outcomes earlier and to allocate resources better; however, a dearth of comprehensive review literature exists to assess the objective performance standards of these predictive models using quantitative evaluations. This study aims to conduct a meta-analysis to assess the diagnostic accuracy of AI in predicting ED disposition, encompassing admission, critical care, and mortality.

METHODS

Multiple databases, including Scopus, Springer, ScienceDirect, PubMed, Wiley, Sage, and Google Scholar, were searched until December 31, 2023, to gather relevant literature. Risk of bias was assessed using the Prediction Model Risk of Bias Assessment Tool. Pooled estimates of sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC) were calculated to evaluate AI's predictive performance. Sub-group analyses were performed to explore covariates affecting AI predictive model performance.

RESULTS

The study included 88 articles possessed with 117 AI models, among which 39, 45, and 33 models predicted admission, critical care, and mortality, respectively. The reported statistics for sensitivity, specificity, and AUROC represent pooled summary measures derived from the component studies included in this meta-analysis. AI's summary sensitivity, specificity, and AUROC for predicting admission were 0.81 (95% Confidence Interval [CI] 0.74-0.86), 0.87 (95% CI 0.81-0.91), and 0.87 (95% CI 0.84-0.93), respectively. For critical care, the values were 0.86 (95% CI 0.79-0.91), 0.89 (95% CI 0.83-0.93), and 0.93 (95% CI 0.89-0.95), respectively, and for mortality, they were 0.85 (95% CI 0.80-0.89), 0.94 (95% CI 0.90-0.96), and 0.93 (95% CI 0.89-0.96), respectively. Emergent sample characteristics and AI techniques showed evidence of significant covariates influencing the heterogeneity of AI predictive models for ED disposition.

CONCLUSIONS

The meta-analysis indicates promising performance of AI in predicting ED disposition, with certain potential for improvement, especially in sensitivity. Future research could explore advanced AI techniques such as ensemble learning and cross-validation with hyper-parameter tuning to enhance predictive model efficacy.

TRIAL REGISTRATION

This systematic review was not registered with PROSPERO or any other similar registry because the review was completed prior to the opportunity for registration, and PROSPERO currently does not accept registrations for reviews that are already completed. We are committed to transparency and have adhered to best practices in systematic review methodology throughout this study.

Collapse

Hong T, Huang J, Deng J, Kuang L, Sun M, Wang Q, Luo C, Zhao J, Liu X, Wang H. The Scoring Model to Predict ICU Stay and Mortality After Emergency Admissions in Atrial Fibrillation: A Retrospective Study of 30 366 Patients. Clin Cardiol 2025;48:e70101. [PMID: 39976638 PMCID: PMC11841604 DOI: 10.1002/clc.70101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/05/2024] [Revised: 01/31/2025] [Accepted: 02/10/2025] [Indexed: 02/23/2025] Open

Martínez‐Licort R, Sahelices B, de la Torre I, Vegas J. Machine Learning Methods for Predicting Syncope Severity in the Emergency Department: A Retrospective Analysis. Health Sci Rep 2025;8:e70477. [PMID: 39995795 PMCID: PMC11847648 DOI: 10.1002/hsr2.70477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 01/10/2025] [Accepted: 01/27/2025] [Indexed: 02/26/2025] Open

Abstract

Background and Aims

Syncope is a frequent reason for hospital emergency admissions, presenting significant challenges in determining its cause and associated risks. Despite its prevalence, research on using artificial intelligence (AI) to improve patient outcomes in this context has been limited. The main objective of current study is to predict the severity of syncope cases using machine learning (ML) algorithms based on data collected during on-site treatment and ambulance transportation.

Methods

This study analyzed 572 records from five Spanish public hospitals (2018-2021), focusing on hospitalization, ICU admission, and mortality. A three-phase strategy was used: data preprocessing, model exploration, and model selection. In the exploration phase, three data transformations techniques were applied and in each of them, models were evaluated using stratified 10-fold cross-validation, optimizing AUC, accuracy, and recall, with emphasis on minimizing false negatives (FN). The top-performing models were fine-tuned and tested. The strategy was implemented using Python libraries and a diverse set of ML classifiers were applied, including linear discriminant analysis (LDA), random forest (RF), dummy classifier (DC), and gradient boosting (GB).

Results

The RF classifier performed best for predicting hospitalization, reducing FN to 37% and achieving a true negative rate (TN) of 78%, with a recall of 0.63 and accuracy of 0.74. For ICU, DC showed FN = 29%, TN = 57%, recall = 0.625, and accuracy = 0.58. The LDA classifier excelled in predicting hospital mortality, with FN = 40%, TN = 89%, recall = 0.6, and accuracy = 0.88. These results indicate that RF was superior for predicting hospitalization, while DC for ICU and LDA performed better for predicting mortality.

Conclusions

This study provides an experimental foundation for the application of ML techniques in managing syncope in ED. The intention is to stimulate AI research in this area, with a view to integrating these models into clinical workflows in the future.

Collapse

Porto BM. Improving triage performance in emergency departments using machine learning and natural language processing: a systematic review. BMC Emerg Med 2024;24:219. [PMID: 39558255 PMCID: PMC11575054 DOI: 10.1186/s12873-024-01135-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Accepted: 11/11/2024] [Indexed: 11/20/2024] Open

Abstract

BACKGROUND

In Emergency Departments (EDs), triage is crucial for determining patient severity and prioritizing care, typically using the Manchester Triage Scale (MTS). Traditional triage systems, reliant on human judgment, are prone to under-triage and over-triage, resulting in variability, bias, and incorrect patient classification. Studies suggest that Machine Learning (ML) and Natural Language Processing (NLP) could enhance triage accuracy and consistency. This review analyzes studies on ML and/or NLP algorithms for ED patient triage.

METHODS

Following Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) guidelines, we conducted a systematic review across five databases: Web of Science, PubMed, Scopus, IEEE Xplore, and ACM Digital Library, from their inception of each database to October 2023. The risk of bias was assessed using the Prediction model Risk of Bias Assessment Tool (PROBAST). Only articles employing at least one ML and/or NLP method for patient triage classification were included.

RESULTS

Sixty studies covering 57 ML algorithms were included. Logistic Regression (LR) was the most used model, while eXtreme Gradient Boosting (XGBoost), decision tree-based algorithms with Gradient Boosting (GB), and Deep Neural Networks (DNNs) showed superior performance. Frequent predictive variables included demographics and vital signs, with oxygen saturation, chief complaints, systolic blood pressure, age, and mode of arrival being the most retained. The ML algorithms showed significant bias risk due to critical bias assessment in classification models.

CONCLUSION

NLP methods improved ML algorithms' classification capability using triage nursing and medical notes and structured clinical data compared to algorithms using only structured data. Feature engineering (FE) and class imbalance correction methods enhanced ML workflows' performance, but FE and eXplainable Artificial Intelligence (XAI) were underexplored in this field. Registration and funding. This systematic review has been registered (registration number: CRD42024604529) in the International Prospective Register of Systematic Reviews (PROSPERO) and can be accessed online at the following URL: https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=604529 . Funding for this work was provided by the National Council for Scientific and Technological Development (CNPq), Brazil.

Collapse

Liu J, Duan X, Duan M, Jiang Y, Mao W, Wang L, Liu G. Development and external validation of an interpretable machine learning model for the prediction of intubation in the intensive care unit. Sci Rep 2024;14:27174. [PMID: 39511328 PMCID: PMC11544239 DOI: 10.1038/s41598-024-77798-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Accepted: 10/25/2024] [Indexed: 11/15/2024] Open

Jawad BN, Shaker SM, Altintas I, Eugen-Olsen J, Nehlin JO, Andersen O, Kallemose T. Development and validation of prognostic machine learning models for short- and long-term mortality among acutely admitted patients based on blood tests. Sci Rep 2024;14:5942. [PMID: 38467752 PMCID: PMC10928126 DOI: 10.1038/s41598-024-56638-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 03/08/2024] [Indexed: 03/13/2024] Open

Rahmatinejad Z, Dehghani T, Hoseini B, Rahmatinejad F, Lotfata A, Reihani H, Eslami S. A comparative study of explainable ensemble learning and logistic regression for predicting in-hospital mortality in the emergency department. Sci Rep 2024;14:3406. [PMID: 38337000 PMCID: PMC10858239 DOI: 10.1038/s41598-024-54038-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 02/07/2024] [Indexed: 02/12/2024] Open

Abstract

This study addresses the challenges associated with emergency department (ED) overcrowding and emphasizes the need for efficient risk stratification tools to identify high-risk patients for early intervention. While several scoring systems, often based on logistic regression (LR) models, have been proposed to indicate patient illness severity, this study aims to compare the predictive performance of ensemble learning (EL) models with LR for in-hospital mortality in the ED. A cross-sectional single-center study was conducted at the ED of Imam Reza Hospital in northeast Iran from March 2016 to March 2017. The study included adult patients with one to three levels of emergency severity index. EL models using Bagging, AdaBoost, random forests (RF), Stacking and extreme gradient boosting (XGB) algorithms, along with an LR model, were constructed. The training and validation visits from the ED were randomly divided into 80% and 20%, respectively. After training the proposed models using tenfold cross-validation, their predictive performance was evaluated. Model performance was compared using the Brier score (BS), The area under the receiver operating characteristics curve (AUROC), The area and precision-recall curve (AUCPR), Hosmer-Lemeshow (H-L) goodness-of-fit test, precision, sensitivity, accuracy, F1-score, and Matthews correlation coefficient (MCC). The study included 2025 unique patients admitted to the hospital's ED, with a total percentage of hospital deaths at approximately 19%. In the training group and the validation group, 274 of 1476 (18.6%) and 152 of 728 (20.8%) patients died during hospitalization, respectively. According to the evaluation of the presented framework, EL models, particularly Bagging, predicted in-hospital mortality with the highest AUROC (0.839, CI (0.802-0.875)) and AUCPR = 0.64 comparable in terms of discrimination power with LR (AUROC (0.826, CI (0.787-0.864)) and AUCPR = 0.61). XGB achieved the highest precision (0.83), sensitivity (0.831), accuracy (0.842), F1-score (0.833), and the highest MCC (0.48). Additionally, the most accurate models in the unbalanced dataset belonged to RF with the lowest BS (0.128). Although all studied models overestimate mortality risk and have insufficient calibration (P > 0.05), stacking demonstrated relatively good agreement between predicted and actual mortality. EL models are not superior to LR in predicting in-hospital mortality in the ED. Both EL and LR models can be considered as screening tools to identify patients at risk of mortality.

Collapse

Stoessel D, Fa R, Artemova S, von Schenck U, Nowparast Rostami H, Madiot PE, Landelle C, Olive F, Foote A, Moreau-Gaudry A, Bosson JL. Early prediction of in-hospital mortality utilizing multivariate predictive modelling of electronic medical records and socio-determinants of health of the first day of hospitalization. BMC Med Inform Decis Mak 2023;23:259. [PMID: 37957690 PMCID: PMC10644472 DOI: 10.1186/s12911-023-02356-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2023] [Accepted: 10/27/2023] [Indexed: 11/15/2023] Open

Choi A, Choi SY, Chung K, Chung HS, Song T, Choi B, Kim JH. Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department. Sci Rep 2023;13:8561. [PMID: 37237057 DOI: 10.1038/s41598-023-35617-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Accepted: 05/21/2023] [Indexed: 05/28/2023] Open

Li H, Tao X, Liang T, Jiang J, Zhu J, Wu S, Chen L, Zhang Z, Zhou C, Sun X, Huang S, Chen J, Chen T, Ye Z, Chen W, Guo H, Yao Y, Liao S, Yu C, Fan B, Liu Y, Lu C, Hu J, Xie Q, Wei X, Fang C, Liu H, Huang C, Pan S, Zhan X, Liu C. Comprehensive AI-assisted tool for ankylosing spondylitis based on multicenter research outperforms human experts. Front Public Health 2023;11:1063633. [PMID: 36844823 PMCID: PMC9947660 DOI: 10.3389/fpubh.2023.1063633] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 01/18/2023] [Indexed: 02/11/2023] Open

Abstract

Introduction

The diagnosis and treatment of ankylosing spondylitis (AS) is a difficult task, especially in less developed countries without access to experts. To address this issue, a comprehensive artificial intelligence (AI) tool was created to help diagnose and predict the course of AS.

Methods

In this retrospective study, a dataset of 5389 pelvic radiographs (PXRs) from patients treated at a single medical center between March 2014 and April 2022 was used to create an ensemble deep learning (DL) model for diagnosing AS. The model was then tested on an additional 583 images from three other medical centers, and its performance was evaluated using the area under the receiver operating characteristic curve analysis, accuracy, precision, recall, and F1 scores. Furthermore, clinical prediction models for identifying high-risk patients and triaging patients were developed and validated using clinical data from 356 patients.

Results

The ensemble DL model demonstrated impressive performance in a multicenter external test set, with precision, recall, and area under the receiver operating characteristic curve values of 0.90, 0.89, and 0.96, respectively. This performance surpassed that of human experts, and the model also significantly improved the experts' diagnostic accuracy. Furthermore, the model's diagnosis results based on smartphone-captured images were comparable to those of human experts. Additionally, a clinical prediction model was established that accurately categorizes patients with AS into high-and low-risk groups with distinct clinical trajectories. This provides a strong foundation for individualized care.

Discussion

In this study, an exceptionally comprehensive AI tool was developed for the diagnosis and management of AS in complex clinical scenarios, especially in underdeveloped or rural areas that lack access to experts. This tool is highly beneficial in providing an efficient and effective system of diagnosis and management.

Collapse

Affiliation(s)

Hao Li The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Xiang Tao The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Tuo Liang The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Jie Jiang The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Jichong Zhu The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Shaofeng Wu The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Liyi Chen The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Zide Zhang The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Chenxing Zhou The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Xuhua Sun The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Shengsheng Huang The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Jiarui Chen The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Tianyou Chen The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Zhen Ye The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Wuhua Chen The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Hao Guo The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Yuanlin Yao The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Shian Liao The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Chaojie Yu The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Binguang Fan The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Yihong Liu Guangxi Medical University, Nanning, Guangxi, China
Chunai Lu Guangxi Medical University, Nanning, Guangxi, China
Junnan Hu Guangxi Medical University, Nanning, Guangxi, China
Qinghong Xie Guangxi Medical University, Nanning, Guangxi, China
Xiao Wei Guangxi Medical University, Nanning, Guangxi, China
Cairen Fang Guangxi Medical University, Nanning, Guangxi, China
Huijiang Liu Orthopaedics of The First People's Hospital of Nanning, Nanning, Guangxi, China
Chengqian Huang Orthopaedics of People's Hospital of Baise, Baise, Guangxi, China
Shixin Pan Orthopaedics of Wuzhou Red Cross Hospital, Wuzhou, Guangxi, China
Xinli Zhan The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
Chong Liu The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China,*Correspondence: Chong Liu ✉

Collapse

Petersson L, Vincent K, Svedberg P, Nygren JM, Larsson I. Ethical considerations in implementing AI for mortality prediction in the emergency department: Linking theory and practice. Digit Health 2023;9:20552076231206588. [PMID: 37829612 PMCID: PMC10566278 DOI: 10.1177/20552076231206588] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/21/2023] [Indexed: 10/14/2023] Open

Abstract

Background

Artificial intelligence (AI) is predicted to be a solution for improving healthcare, increasing efficiency, and saving time and recourses. A lack of ethical principles for the use of AI in practice has been highlighted by several stakeholders due to the recent attention given to it. Research has shown an urgent need for more knowledge regarding the ethical implications of AI applications in healthcare. However, fundamental ethical principles may not be sufficient to describe ethical concerns associated with implementing AI applications.

Objective

The aim of this study is twofold, (1) to use the implementation of AI applications to predict patient mortality in emergency departments as a setting to explore healthcare professionals' perspectives on ethical issues in relation to ethical principles and (2) to develop a model to guide ethical considerations in AI implementation in healthcare based on ethical theory.

Methods

Semi-structured interviews were conducted with 18 participants. The abductive approach used to analyze the empirical data consisted of four steps alternating between inductive and deductive analyses.

Results

Our findings provide an ethical model demonstrating the need to address six ethical principles (autonomy, beneficence, non-maleficence, justice, explicability, and professional governance) in relation to ethical theories defined as virtue, deontology, and consequentialism when AI applications are to be implemented in clinical practice.

Conclusions

Ethical aspects of AI applications are broader than the prima facie principles of medical ethics and the principle of explicability. Ethical aspects thus need to be viewed from a broader perspective to cover different situations that healthcare professionals, in general, and physicians, in particular, may face when using AI applications in clinical practice.

Collapse

Establishment of ICU Mortality Risk Prediction Models with Machine Learning Algorithm Using MIMIC-IV Database. Diagnostics (Basel) 2022;12:diagnostics12051068. [PMID: 35626224 PMCID: PMC9139972 DOI: 10.3390/diagnostics12051068] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 04/21/2022] [Accepted: 04/22/2022] [Indexed: 12/10/2022] Open

Abstract

Objective: The mortality rate of critically ill patients in ICUs is relatively high. In order to evaluate patients’ mortality risk, different scoring systems are used to help clinicians assess prognosis in ICUs, such as the Acute Physiology and Chronic Health Evaluation III (APACHE III) and the Logistic Organ Dysfunction Score (LODS). In this research, we aimed to establish and compare multiple machine learning models with physiology subscores of APACHE III—namely, the Acute Physiology Score III (APS III)—and LODS scoring systems in order to obtain better performance for ICU mortality prediction. Methods: A total number of 67,748 patients from the Medical Information Database for Intensive Care (MIMIC-IV) were enrolled, including 7055 deceased patients, and the same number of surviving patients were selected by the random downsampling technique, for a total of 14,110 patients included in the study. The enrolled patients were randomly divided into a training dataset (n = 9877) and a validation dataset (n = 4233). Fivefold cross-validation and grid search procedures were used to find and evaluate the best hyperparameters in different machine learning models. Taking the subscores of LODS and the physiology subscores that are part of the APACHE III scoring systems as input variables, four machine learning methods of XGBoost, logistic regression, support vector machine, and decision tree were used to establish ICU mortality prediction models, with AUCs as metrics. AUCs, specificity, sensitivity, positive predictive value, negative predictive value, and calibration curves were used to find the best model. Results: For the prediction of mortality risk in ICU patients, the AUC of the XGBoost model was 0.918 (95%CI, 0.915–0.922), and the AUCs of logistic regression, SVM, and decision tree were 0.872 (95%CI, 0.867–0.877), 0.872 (95%CI, 0.867–0.877), and 0.852 (95%CI, 0.847–0.857), respectively. The calibration curves of logistic regression and support vector machine performed better than the other two models in the ranges 0–40% and 70%–100%, respectively, while XGBoost performed better in the range of 40–70%. Conclusions: The mortality risk of ICU patients can be better predicted by the characteristics of the Acute Physiology Score III and the Logistic Organ Dysfunction Score with XGBoost in terms of ROC curve, sensitivity, and specificity. The XGBoost model could assist clinicians in judging in-hospital outcome of critically ill patients, especially in patients with a more uncertain survival outcome.

Collapse