1
|
Dhakal S, Yin A, Escarra-Senmarti M, Demko ZO, Pisanic N, Johnston TS, Trejo-Zambrano MI, Kruczynski K, Lee JS, Hardick JP, Shea P, Shapiro JR, Park HS, Parish MA, Caputo C, Ganesan A, Mullapudi SK, Gould SJ, Betenbaugh MJ, Pekosz A, Heaney CD, Antar AAR, Manabe YC, Cox AL, Karaba AH, Andrade F, Zeger SL, Klein SL. Application of machine learning algorithms to identify serological predictors of COVID-19 severity and outcomes. COMMUNICATIONS MEDICINE 2024; 4:249. [PMID: 39592832 PMCID: PMC11599591 DOI: 10.1038/s43856-024-00658-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 10/29/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND Critically ill hospitalized patients with COVID-19 have greater antibody titers than those with mild to moderate illness, but their association with recovery or death from COVID-19 has not been characterized. METHODS In a cohort study of 178 COVID-19 patients, 73 non-hospitalized and 105 hospitalized patients, mucosal swabs and plasma samples were collected at hospital enrollment and up to 3 months post-enrollment (MPE) to measure virus RNA, cytokines/chemokines, binding antibodies, ACE2 binding inhibition, and Fc effector antibody responses against SARS-CoV-2. The association of demographic variables and more than 20 serological antibody measures with intubation or death due to COVID-19 was determined using machine learning algorithms. RESULTS Predictive models reveal that IgG binding and ACE2 binding inhibition responses at 1 MPE are positively and anti-Spike antibody-mediated complement activation at enrollment is negatively associated with an increased probability of intubation or death from COVID-19 within 3 MPE. CONCLUSIONS At enrollment, serological antibody measures are more predictive than demographic variables of subsequent intubation or death among hospitalized COVID-19 patients.
Collapse
Affiliation(s)
- Santosh Dhakal
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Diagnostic Medicine/Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS, USA
| | - Anna Yin
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | | | - Zoe O Demko
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Nora Pisanic
- Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Trevor S Johnston
- Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | | | - Kate Kruczynski
- Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - John S Lee
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Justin P Hardick
- Division of Infectious Diseases, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Patrick Shea
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Janna R Shapiro
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Han-Sol Park
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Maclaine A Parish
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Christopher Caputo
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Abhinaya Ganesan
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Sarika K Mullapudi
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Stephen J Gould
- Department of Biological Chemistry, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Michael J Betenbaugh
- Department of Chemical and Biomolecular Engineering, Advanced Mammalian Biomanufacturing Innovation Center, Johns Hopkins University, Baltimore, MD, USA
| | - Andrew Pekosz
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Division of Infectious Diseases, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Christopher D Heaney
- Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Annukka A R Antar
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Yukari C Manabe
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Andrea L Cox
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Division of Infectious Diseases, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Andrew H Karaba
- Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Felipe Andrade
- Division of Rheumatology, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Scott L Zeger
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Sabra L Klein
- W. Harry Feinstone Department of Molecular Microbiology and Immunology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
- Division of Infectious Diseases, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
2
|
Qin Q, Yu H, Zhao J, Xu X, Li Q, Gu W, Guo X. Machine learning-based derivation and validation of three immune phenotypes for risk stratification and prognosis in community-acquired pneumonia: a retrospective cohort study. Front Immunol 2024; 15:1441838. [PMID: 39114653 PMCID: PMC11303239 DOI: 10.3389/fimmu.2024.1441838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 07/05/2024] [Indexed: 08/10/2024] Open
Abstract
Background The clinical presentation of Community-acquired pneumonia (CAP) in hospitalized patients exhibits heterogeneity. Inflammation and immune responses play significant roles in CAP development. However, research on immunophenotypes in CAP patients is limited, with few machine learning (ML) models analyzing immune indicators. Methods A retrospective cohort study was conducted at Xinhua Hospital, affiliated with Shanghai Jiaotong University. Patients meeting predefined criteria were included and unsupervised clustering was used to identify phenotypes. Patients with distinct phenotypes were also compared in different outcomes. By machine learning methods, we comprehensively assess the disease severity of CAP patients. Results A total of 1156 CAP patients were included in this research. In the training cohort (n=809), we identified three immune phenotypes among patients: Phenotype A (42.0%), Phenotype B (40.2%), and Phenotype C (17.8%), with Phenotype C corresponding to more severe disease. Similar results can be observed in the validation cohort. The optimal prognostic model, SuperPC, achieved the highest average C-index of 0.859. For predicting CAP severity, the random forest model was highly accurate, with C-index of 0.998 and 0.794 in training and validation cohorts, respectively. Conclusion CAP patients can be categorized into three distinct immune phenotypes, each with prognostic relevance. Machine learning exhibits potential in predicting mortality and disease severity in CAP patients by leveraging clinical immunological data. Further external validation studies are crucial to confirm applicability.
Collapse
Affiliation(s)
- Qiangqiang Qin
- Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Haiyang Yu
- Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Jie Zhao
- Department of Hematology, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Xue Xu
- Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Qingxuan Li
- Department of Respiratory and Critical Care Medicine, The Second Hospital of Jilin University, Changchun, Jilin, China
| | - Wen Gu
- Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Xuejun Guo
- Department of Respiratory Medicine, Xinhua Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| |
Collapse
|
3
|
Hamar Á, Mohammed D, Váradi A, Herczeg R, Balázsfalvi N, Fülesdi B, László I, Gömöri L, Gergely PA, Kovacs GL, Jáksó K, Gombos K. COVID-19 mortality prediction in Hungarian ICU settings implementing random forest algorithm. Sci Rep 2024; 14:11941. [PMID: 38789490 PMCID: PMC11126653 DOI: 10.1038/s41598-024-62791-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Accepted: 05/19/2024] [Indexed: 05/26/2024] Open
Abstract
The emergence of newer SARS-CoV-2 variants of concern (VOCs) profoundly changed the ICU demography; this shift in the virus's genotype and its correlation to lethality in the ICUs is still not fully investigated. We aimed to survey ICU patients' clinical and laboratory parameters in correlation with SARS-CoV-2 variant genotypes to lethality. 503 COVID-19 ICU patients were included in our study beginning in January 2021 through November 2022 in Hungary. Furthermore, we implemented random forest (RF) as a potential predictor regarding SARS-CoV-2 lethality among 649 ICU patients in two ICU centers. Survival analysis and comparison of hypertension (HT), diabetes mellitus (DM), and vaccination effects were conducted. Logistic regression identified DM as a significant mortality risk factor (OR: 1.55, 95% CI 1.06-2.29, p = 0.025), while HT showed marginal significance. Additionally, vaccination demonstrated protection against mortality (p = 0.028). RF detected lethality with 81.42% accuracy (95% CI 73.01-88.11%, [AUC]: 91.6%), key predictors being PaO2/FiO2 ratio, lymphocyte count, and chest Computed Tomography Severity Score (CTSS). Although a smaller number of patients require ICU treatment among Omicron cases, the likelihood of survival has not proportionately increased for those who are admitted to the ICU. In conclusion, our RF model supports more effective clinical decision-making among ICU COVID-19 patients.
Collapse
Affiliation(s)
- Ágoston Hamar
- Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
| | - Daryan Mohammed
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
| | - Alex Váradi
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
- Institute of Metagenomics, University of Debrecen, Debrecen, Hungary
| | - Róbert Herczeg
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
| | - Norbert Balázsfalvi
- Department of Anaesthesiology and Intensive Care, University of Debrecen, Debrecen, Hungary
| | - Béla Fülesdi
- Department of Anaesthesiology and Intensive Care, University of Debrecen, Debrecen, Hungary
| | - István László
- Department of Anaesthesiology and Intensive Care, University of Debrecen, Debrecen, Hungary
| | - Lídia Gömöri
- Doctoral School of Neuroscience, University of Debrecen, Debrecen, Hungary
| | | | - Gabor Laszlo Kovacs
- Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
| | - Krisztián Jáksó
- Department of Anaesthesiology and Intensive Care, Clinical Centre, University of Pécs, Pécs, Hungary
| | - Katalin Gombos
- Department of Laboratory Medicine, Medical School, University of Pécs, Pécs, Hungary.
- Molecular Medicine Research Group, Szentágothai Research Centre, University of Pécs, Pécs, Hungary.
| |
Collapse
|
4
|
Viderman D, Kotov A, Popov M, Abdildin Y. Machine and deep learning methods for clinical outcome prediction based on physiological data of COVID-19 patients: a scoping review. Int J Med Inform 2024; 182:105308. [PMID: 38091862 DOI: 10.1016/j.ijmedinf.2023.105308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/20/2023] [Accepted: 12/03/2023] [Indexed: 01/07/2024]
Abstract
INTRODUCTION Since the beginning of the COVID-19 pandemic, numerous machine and deep learning (MDL) methods have been proposed in the literature to analyze patient physiological data. The objective of this review is to summarize various aspects of these methods and assess their practical utility for predicting various clinical outcomes. METHODS We searched PubMed, Scopus, and Cochrane Library, screened and selected the studies matching the inclusion criteria. The clinical analysis focused on the characteristics of the patient cohorts in the studies included in this review, the specific tasks in the context of the COVID-19 pandemic that machine and deep learning methods were used for, and their practical limitations. The technical analysis focused on the details of specific MDL methods and their performance. RESULTS Analysis of the 48 selected studies revealed that the majority (∼54 %) of them examined the application of MDL methods for the prediction of survival/mortality-related patient outcomes, while a smaller fraction (∼13 %) of studies also examined applications to the prediction of patients' physiological outcomes and hospital resource utilization. 21 % of the studies examined the application of MDL methods to multiple clinical tasks. Machine and deep learning methods have been shown to be effective at predicting several outcomes of COVID-19 patients, such as disease severity, complications, intensive care unit (ICU) transfer, and mortality. MDL methods also achieved high accuracy in predicting the required number of ICU beds and ventilators. CONCLUSION Machine and deep learning methods have been shown to be valuable tools for predicting disease severity, organ dysfunction and failure, patient outcomes, and hospital resource utilization during the COVID-19 pandemic. The discovered knowledge and our conclusions and recommendations can also be useful to healthcare professionals and artificial intelligence researchers in managing future pandemics.
Collapse
Affiliation(s)
- Dmitriy Viderman
- Department of Surgery, School of Medicine, Nazarbayev University, Astana, Kazakhstan; Department of Anesthesiology, Intensive Care, and Pain Medicine, National Research Oncology Center, Astana, Kazakhstan.
| | - Alexander Kotov
- Department of Computer Science, College of Engineering, Wayne State University, Detroit, USA.
| | - Maxim Popov
- Department of Computer Science, School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan.
| | - Yerkin Abdildin
- Department of Mechanical and Aerospace Engineering, School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan.
| |
Collapse
|
5
|
Klein S, Dhakal S, Yin A, Escarra-Senmarti M, Demko Z, Pisanic N, Johnston T, Trejo-Zambrano M, Kruczynski K, Lee J, Hardick J, Shea P, Shapiro J, Park HS, Parish M, Caputo C, Ganesan A, Mullapudi S, Gould S, Betenbaugh M, Pekosz A, Heaney CD, Antar A, Manabe Y, Cox A, Karaba A, Andrade F, Zeger S. Application of machine learning models to identify serological predictors of COVID-19 severity and outcomes. RESEARCH SQUARE 2023:rs.3.rs-3463155. [PMID: 38014049 PMCID: PMC10680931 DOI: 10.21203/rs.3.rs-3463155/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Critically ill people with COVID-19 have greater antibody titers than those with mild to moderate illness, but their association with recovery or death from COVID-19 has not been characterized. In 178 COVID-19 patients, 73 non-hospitalized and 105 hospitalized patients, mucosal swabs and plasma samples were collected at hospital enrollment and up to 3 months post-enrollment (MPE) to measure virus RNA, cytokines/chemokines, binding antibodies, ACE2 binding inhibition, and Fc effector antibody responses against SARS-CoV-2. The association of demographic variables and >20 serological antibody measures with intubation or death due to COVID-19 was determined using machine learning algorithms. Predictive models revealed that IgG binding and ACE2 binding inhibition responses at 1 MPE were positively and C1q complement activity at enrollment was negatively associated with an increased probability of intubation or death from COVID-19 within 3 MPE. Serological antibody measures were more predictive than demographic variables of intubation or death among COVID-19 patients.
Collapse
Affiliation(s)
- Sabra Klein
- Johns Hopkins Bloomberg School of Public Health
| | | | - Anna Yin
- Johns Hopkins Bloomberg School of Public Health
| | | | | | | | | | | | | | - John Lee
- Johns Hopkins Bloomberg School of Public Health
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Yukari Manabe
- Division of Infectious Diseases, Department of Medicine, The Johns Hopkins School of Medicine
| | | | | | | | | |
Collapse
|
6
|
Chimbunde E, Sigwadhi LN, Tamuzi JL, Okango EL, Daramola O, Ngah VD, Nyasulu PS. Machine learning algorithms for predicting determinants of COVID-19 mortality in South Africa. Front Artif Intell 2023; 6:1171256. [PMID: 37899965 PMCID: PMC10600470 DOI: 10.3389/frai.2023.1171256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 08/15/2023] [Indexed: 10/31/2023] Open
Abstract
Background COVID-19 has strained healthcare resources, necessitating efficient prognostication to triage patients effectively. This study quantified COVID-19 risk factors and predicted COVID-19 intensive care unit (ICU) mortality in South Africa based on machine learning algorithms. Methods Data for this study were obtained from 392 COVID-19 ICU patients enrolled between 26 March 2020 and 10 February 2021. We used an artificial neural network (ANN) and random forest (RF) to predict mortality among ICU patients and a semi-parametric logistic regression with nine covariates, including a grouping variable based on K-means clustering. Further evaluation of the algorithms was performed using sensitivity, accuracy, specificity, and Cohen's K statistics. Results From the semi-parametric logistic regression and ANN variable importance, age, gender, cluster, presence of severe symptoms, being on the ventilator, and comorbidities of asthma significantly contributed to ICU death. In particular, the odds of mortality were six times higher among asthmatic patients than non-asthmatic patients. In univariable and multivariate regression, advanced age, PF1 and 2, FiO2, severe symptoms, asthma, oxygen saturation, and cluster 4 were strongly predictive of mortality. The RF model revealed that intubation status, age, cluster, diabetes, and hypertension were the top five significant predictors of mortality. The ANN performed well with an accuracy of 71%, a precision of 83%, an F1 score of 100%, Matthew's correlation coefficient (MCC) score of 100%, and a recall of 88%. In addition, Cohen's k-value of 0.75 verified the most extreme discriminative power of the ANN. In comparison, the RF model provided a 76% recall, an 87% precision, and a 65% MCC. Conclusion Based on the findings, we can conclude that both ANN and RF can predict COVID-19 mortality in the ICU with accuracy. The proposed models accurately predict the prognosis of COVID-19 patients after diagnosis. The models can be used to prioritize COVID-19 patients with a high mortality risk in resource-constrained ICUs.
Collapse
Affiliation(s)
- Emmanuel Chimbunde
- Division of Epidemiology and Biostatistics, Department of Global Health, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Lovemore N. Sigwadhi
- Division of Epidemiology and Biostatistics, Department of Global Health, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Jacques L. Tamuzi
- Division of Epidemiology and Biostatistics, Department of Global Health, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | | | - Olawande Daramola
- Department of Information Technology, Faculty of Informatics and Design, Cape Peninsula University of Technology, Cape Town, South Africa
| | - Veranyuy D. Ngah
- Division of Epidemiology and Biostatistics, Department of Global Health, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Peter S. Nyasulu
- Division of Epidemiology and Biostatistics, Department of Global Health, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- Division of Epidemiology and Biostatistics, School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| |
Collapse
|
7
|
Huang J, Chen H, Deng J, Liu X, Shu T, Yin C, Duan M, Fu L, Wang K, Zeng S. Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation. Front Neurol 2023; 14:1185447. [PMID: 37614971 PMCID: PMC10443100 DOI: 10.3389/fneur.2023.1185447] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 07/18/2023] [Indexed: 08/25/2023] Open
Abstract
Background Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the prediction itself. This study aimed to develop an interpretable machine learning (IML) model that accurately predicts 28-day all-cause mortality in hypertensive ischemic or hemorrhagic stroke patients. Methods A total of 4,274 hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU in the USA from multicenter cohorts were included in this study to develop and validate the IML model. Five machine learning (ML) models were developed, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression (LR), and support vector machine (SVM), to predict mortality using the MIMIC-IV and eICU-CRD database in the USA. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. Model performance was evaluated based on the area under the curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The ML model with the best predictive performance was selected for interpretability analysis. Finally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. Results The XGBoost model demonstrated the best predictive performance, with the AUC values of 0.822, 0.739, and 0.700 in the training, test, and external cohorts, respectively. The analysis of feature importance revealed that age, ethnicity, white blood cell (WBC), hyperlipidemia, mean corpuscular volume (MCV), glucose, pulse oximeter oxygen saturation (SpO2), serum calcium, red blood cell distribution width (RDW), blood urea nitrogen (BUN), and bicarbonate were the 11 most important features. The SHAP plots were employed to interpret the XGBoost model. Conclusions The XGBoost model accurately predicted 28-day all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The SHAP method can provide explicit explanations of personalized risk prediction, which can aid physicians in understanding the model.
Collapse
Affiliation(s)
- Jian Huang
- Emergency Department, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
- The Graduate School of Guangxi University of Traditional Chinese Medicine, Nanning, China
| | - Huaqiao Chen
- Department of Cardiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Jiewen Deng
- Department of Neurosurgery, Xiu Shan People's Hospital, Chongqing, China
| | - Xiaozhu Liu
- Department of Critical Care Medicine, Beijing Shijitan Hospital, Capital Medical University, Beijing, China
| | - Tingting Shu
- Department of Cardiology, Daping Hospital, The Third Military Medical University (Army Medical University), Chongqing, China
| | - Chengliang Yin
- Faculty of Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
| | - Minjie Duan
- College of Medical Informatics, Chongqing Medical University, Chongqing, China
| | - Li Fu
- Key Laboratory of Novel Materials for Sensor of Zhejiang Province, College of Materials and Environmental Engineering, Hangzhou Dianzi University, Hangzhou, China
| | - Kai Wang
- Department of Neurology, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
| | - Song Zeng
- Emergency Department, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
| |
Collapse
|
8
|
Deng Y, Liu S, Wang Z, Wang Y, Jiang Y, Liu B. Explainable time-series deep learning models for the prediction of mortality, prolonged length of stay and 30-day readmission in intensive care patients. Front Med (Lausanne) 2022; 9:933037. [PMID: 36250092 PMCID: PMC9554013 DOI: 10.3389/fmed.2022.933037] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 09/01/2022] [Indexed: 11/14/2022] Open
Abstract
Background In-hospital mortality, prolonged length of stay (LOS), and 30-day readmission are common outcomes in the intensive care unit (ICU). Traditional scoring systems and machine learning models for predicting these outcomes usually ignore the characteristics of ICU data, which are time-series forms. We aimed to use time-series deep learning models with the selective combination of three widely used scoring systems to predict these outcomes. Materials and methods A retrospective cohort study was conducted on 40,083 patients in ICU from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. Three deep learning models, namely, recurrent neural network (RNN), gated recurrent unit (GRU), and long short-term memory (LSTM) with attention mechanisms, were trained for the prediction of in-hospital mortality, prolonged LOS, and 30-day readmission with variables collected during the initial 24 h after ICU admission or the last 24 h before discharge. The inclusion of variables was based on three widely used scoring systems, namely, APACHE II, SOFA, and SAPS II, and the predictors consisted of time-series vital signs, laboratory tests, medication, and procedures. The patients were randomly divided into a training set (80%) and a test set (20%), which were used for model development and model evaluation, respectively. The area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and Brier scores were used to evaluate model performance. Variable significance was identified through attention mechanisms. Results A total of 33 variables for 40,083 patients were enrolled for mortality and prolonged LOS prediction and 36,180 for readmission prediction. The rates of occurrence of the three outcomes were 9.74%, 27.54%, and 11.79%, respectively. In each of the three outcomes, the performance of RNN, GRU, and LSTM did not differ greatly. Mortality prediction models, prolonged LOS prediction models, and readmission prediction models achieved AUCs of 0.870 ± 0.001, 0.765 ± 0.003, and 0.635 ± 0.018, respectively. The top significant variables co-selected by the three deep learning models were Glasgow Coma Scale (GCS), age, blood urea nitrogen, and norepinephrine for mortality; GCS, invasive ventilation, and blood urea nitrogen for prolonged LOS; and blood urea nitrogen, GCS, and ethnicity for readmission. Conclusion The prognostic prediction models established in our study achieved good performance in predicting common outcomes of patients in ICU, especially in mortality prediction. In addition, GCS and blood urea nitrogen were identified as the most important factors strongly associated with adverse ICU events.
Collapse
Affiliation(s)
- Yuhan Deng
- School of Public Health, Peking University, Beijing, China
| | - Shuang Liu
- School of Public Health, Peking University, Beijing, China
| | - Ziyao Wang
- School of Public Health, Peking University, Beijing, China
| | - Yuxin Wang
- School of Public Health, Peking University, Beijing, China
| | - Yong Jiang
- Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- China National Clinical Research Center for Neurological Diseases, Beijing, China
- Yong Jiang,
| | - Baohua Liu
- School of Public Health, Peking University, Beijing, China
- *Correspondence: Baohua Liu,
| |
Collapse
|
9
|
Shimpi N, Glurich I, Panny A, Hegde H, Scannapieco FA, Acharya A. Identifying oral disease variables associated with pneumonia emergence by application of machine learning to integrated medical and dental big data to inform eHealth approaches. FRONTIERS IN DENTAL MEDICINE 2022; 3:1005140. [PMID: 36643095 PMCID: PMC9835559 DOI: 10.3389/fdmed.2022.1005140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Background The objective of this study was to build models that define variables contributing to pneumonia risk by applying supervised Machine Learning-(ML) to medical and oral disease data to define key risk variables contributing to pneumonia emergence for any pneumonia/pneumonia subtypes. Methods Retrospective medical and dental data were retrieved from Marshfield Clinic Health System's data warehouse and integrated electronic medical-dental health records (iEHR). Retrieved data were pre-processed prior to conducting analyses and included matching of cases to controls by (a) race/ethnicity and (b) 1:1 Case: Control ratio. Variables with >30% missing data were excluded from analysis. Datasets were divided into four subsets: (1) All Pneumonia (all cases and controls); (2) community (CAP)/healthcare associated (HCAP) pneumonias; (3) ventilator-associated (VAP)/hospital-acquired (HAP) pneumonias and (4) aspiration pneumonia (AP). Performance of five algorithms were compared across the four subsets: Naïve Bayes, Logistic Regression, Support Vector Machine (SVM), Multi-Layer Perceptron (MLP) and Random Forests. Feature (input variables) selection and ten-fold cross validation was performed on all the datasets. An evaluation set (10%) was extracted from the subsets for further validation. Model performance was evaluated in terms of total accuracy, sensitivity, specificity, F-measure, Mathews-correlation-coefficient and area under receiver operating characteristic curve (AUC). Results In total, 6,034 records (cases and controls) met eligibility for inclusion in the main dataset. After feature selection, the variables retained in the subsets were: All Pneumonia (n = 29 variables), CAP-HCAP (n = 26 variables); VAP-HAP (n = 40 variables) and AP (n = 37 variables), respectively. Variables retained (n = 22) were common across all four pneumonia subsets. Of these, the number of missing teeth, periodontal status, periodontal pocket depth more than 5 mm and number of restored teeth contributed to all the subsets and were retained in the model. MLP outperformed other predictive models for All Pneumonia, CAP-HCAP and AP subsets, while SVM outperformed other models in VAP-HAP subset. Conclusion This study validates previously described associations between poor oral health and pneumonia. Benefits of an integrated medical-dental record and care delivery environment for modeling pneumonia risk are highlighted. Based on findings, risk score development could inform referrals and follow-up in integrated healthcare delivery environment and coordinated patient management.
Collapse
Affiliation(s)
- Neel Shimpi
- Center for Clinical Epidemiology and Population Health, Marshfield Clinic Research Institute, Marshfield, WI, United States
| | - Ingrid Glurich
- Cancer Care and Research Center, Marshfield Clinic Research Institute, Marshfield, WI, United States
| | - Aloksagar Panny
- Security Health Plan, Marshfield Clinic Health System, Marshfield, WI, United States
| | - Harshad Hegde
- Berkeley Bioinformatics Open-source Projects, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Frank A. Scannapieco
- Department of Oral Biology, School of Dental Medicine, University at Buffalo, Buffalo, NY, United States
| | - Amit Acharya
- Advocate Aurora Research Institute, Advocate Aurora Health, Chicago, IL, United States
| |
Collapse
|