1
|
Frausto-Avila M, León-Montiel RDJ, Quiroz-Juárez MA, U'Ren AB. Prospective study using artificial neural networks for identification of high-risk COVID-19 patients. Sci Rep 2025; 15:18005. [PMID: 40410212 PMCID: PMC12102217 DOI: 10.1038/s41598-025-00925-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Accepted: 05/02/2025] [Indexed: 05/25/2025] Open
Abstract
The COVID-19 pandemic caused a major public health crisis, with severe impacts on global health and the economy. Machine learning (ML) has been crucial in developing new technologies to address challenges posed by the pandemic, particularly in identifying high-risk COVID-19 patients. This identification is vital for efficiently allocating hospital resources and controlling the virus's spread. Comprehensive validation of these intelligent approaches is necessary to confirm their clinical usefulness and help create future strategies for managing viral outbreaks. Here we present a prospective study to evaluate the performance of state-of-the-art ML models designed to identify high-risk COVID-19 patients across four clinical stages. Using artificial neural networks trained with historical patient data from Mexico, we assess the models' accuracy across six epidemiological waves without retraining them. We then compare their performance against neural networks trained with cumulative historical data up to the end of each wave. The findings reveal that models trained on early data can effectively predict high-risk patients in later waves, despite changes in vaccination rates, viral strains, and treatments. These results suggest that artificial intelligence-based patient classification methods could be robust tools for future pandemics, aiding in predicting clinical outcomes under evolving conditions.
Collapse
Affiliation(s)
- Mateo Frausto-Avila
- Centro de Física Aplicada y Tecnología Avanzada, Universidad Nacional Autónoma de México, Boulevard Juriquilla 3001, 76230, Querétaro, Mexico
| | - Roberto de J León-Montiel
- Instituto de Ciencias Nucleares, Universidad Nacional Autónoma de México, Apartado Postal 70-543, 04510, Mexico, CDMX, Mexico
| | - Mario A Quiroz-Juárez
- Centro de Física Aplicada y Tecnología Avanzada, Universidad Nacional Autónoma de México, Boulevard Juriquilla 3001, 76230, Querétaro, Mexico.
| | - Alfred B U'Ren
- Instituto de Ciencias Nucleares, Universidad Nacional Autónoma de México, Apartado Postal 70-543, 04510, Mexico, CDMX, Mexico
| |
Collapse
|
2
|
Shyamala A, Murugeswari S, Mahendran G, Jothi Chitra R. Hybrid grey assisted whale optimization based machine learning for the COVID-19 prediction. Comput Methods Biomech Biomed Engin 2025; 28:388-397. [PMID: 38112339 DOI: 10.1080/10255842.2023.2292008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 11/23/2023] [Accepted: 12/03/2023] [Indexed: 12/21/2023]
Abstract
Recently, COVID-19 (coronavirus) has been a huge influence on the socio and economic field. COVID-19 cases are seriously increasing day-day and also don't identified proper vaccine for COVID-19. Hence, COVID-19 is fast spreading virus and it causes more deaths. In order to address this, the work has proposed a machine learning (ML) scheme for the prediction of COVID-19 positive, negative, and deceased instances. Initially, the data is pre-processed by eliminating redundant and missing values. Then, the features are selected using hybrid grey assisted whale optimization algorithm (H-GAWOA). Finally, the classifier ANFIS (adaptive network-based fuzzy inference systems) is used for investigating the confirmed, survival and death rate of COVID-19. The performance is analysed on John Hopkins University dataset and the performances like MSE, RMSE, MAPE, and R2 are measured. In all the comparisons, the MSE value is very less for the proposed model. Particularly, in the deceased cases prediction, the MSE value is 0.00 for the proposed H-GAWOA-ANFIS. Finally, it is proved that the suggested model is able to generate the better results when contrast to the other approaches.
Collapse
Affiliation(s)
- A Shyamala
- Department of Electronics and Communication Engineering, Mohamed Sathak Engineering College, Kilakarai, Ramanathapuram, Chennai, Tamil Nadu, India
| | - S Murugeswari
- Department of Electronics and Communication Engineering, Syed Ammal Engineering College, Ramanathapuram, Tamil Nadu, India
| | - G Mahendran
- Department of Electronics and Communication Engineering, Syed Ammal Engineering College, Ramanathapuram, Tamil Nadu, India
| | - R Jothi Chitra
- Department of Electronics and Communication Engineering, Velammal Institute of Technology, Thiruvallur, Chennai, Tamil Nadu, India
| |
Collapse
|
3
|
Daramola O, Kavu TD, Kotze MJ, Marnewick JL, Sarumi OA, Kabaso B, Moser T, Stroetmann K, Fwemba I, Daramola F, Nyirenda M, van Rensburg SJ, Nyasulu PS. Predictive modelling and identification of critical variables of mortality risk in COVID-19 patients. Sci Rep 2025; 15:2184. [PMID: 39820088 PMCID: PMC11739597 DOI: 10.1038/s41598-023-46712-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 11/03/2023] [Indexed: 01/19/2025] Open
Abstract
South Africa was the most affected country in Africa by the coronavirus disease 2019 (COVID-19) pandemic, where over 4 million confirmed cases of COVID-19 and over 102,000 deaths have been recorded since 2019. Aside from clinical methods, artificial intelligence (AI)-based solutions such as machine learning (ML) models have been employed in treating COVID-19 cases. However, limited application of AI for COVID-19 in Africa has been reported in the literature. This study aimed to investigate the performance and interpretability of several ML algorithms, including deep multilayer perceptron (Deep MLP), support vector machine (SVM) and Extreme gradient boosting trees (XGBoost) for predicting COVID-19 mortality risk with an emphasis on the effect of cross-validation (CV) and principal component analysis (PCA) on the results. For this purpose, a dataset with 154 features from 490 COVID-19 patients admitted into the intensive care unit (ICU) of Tygerberg Hospital in Cape Town, South Africa, during the first wave of COVID-19 in 2020 was retrospectively analysed. Our results show that Deep MLP had the best overall performance (F1 = 0.92; area under the curve (AUC) = 0.94) when CV and the synthetic minority oversampling technique (SMOTE) were applied without PCA. By using the Shapley Additive exPlanations (SHAP) model to interpret the mortality risk predictions, we identified the Length of stay (LOS) in the hospital, LOS in the ICU, Time to ICU from admission, days discharged alive or death, D-dimer (blood clotting factor), and blood pH as the six most critical variables for mortality risk prediction. Also, Age at admission, Pf ratio (PaO2/FiO2 ratio), troponin T (TropT), ferritin, ventilation, C-reactive protein (CRP), and symptoms of acute respiratory distress syndrome (ARDS) were associated with the severity and fatality of COVID-19 cases. The study reveals how ML could assist medical practitioners in making informed decisions on handling critically ill COVID-19 patients with comorbidities. It also offers insight into the combined effect of CV, PCA, and SMOTE on the performance of ML models for COVID-19 mortality risk prediction, which has been little explored.
Collapse
Affiliation(s)
- Olawande Daramola
- Department of Information Technology, Cape Peninsula University of Technology, Cape Town, South Africa.
| | - Tatenda Duncan Kavu
- Department of Information Technology, Cape Peninsula University of Technology, Cape Town, South Africa
| | - Maritha J Kotze
- Division of Chemical Pathology, Department of Pathology, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
- Division of Chemical Pathology, Department of Pathology, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Jeanine L Marnewick
- Applied Microbial and Health Biotechnology Institute, Cape Peninsula University of Technology, Cape Town, South Africa
| | - Oluwafemi A Sarumi
- Department of Mathematics and Computer Science, Philipps University of Marburg, Hans-Meerwein Str. 6 D-35032, Marburg, Germany
| | - Boniface Kabaso
- Department of Information Technology, Cape Peninsula University of Technology, Cape Town, South Africa
| | - Thomas Moser
- St Pölten University of Applied Sciences, St Pölten, Austria
| | - Karl Stroetmann
- School of Health Information Science, University of Victoria, Victoria, BC, Canada
| | - Isaac Fwemba
- Division of Epidemiology and Biostatistics, Faculty of Medicine, and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Fisayo Daramola
- Division of Epidemiology and Biostatistics, Faculty of Medicine, and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Martha Nyirenda
- Division of Epidemiology and Biostatistics, Faculty of Medicine, and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Susan J van Rensburg
- Division of Chemical Pathology, Department of Pathology, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Peter S Nyasulu
- Division of Epidemiology and Biostatistics, Faculty of Medicine, and Health Sciences, Stellenbosch University, Cape Town, South Africa
| |
Collapse
|
4
|
Hussain S, Ahmad S, Wasid M. Artificial intelligence-driven intelligent learning models for identification and prediction of cardioneurological disorders: A comprehensive study. Comput Biol Med 2025; 184:109342. [PMID: 39571276 DOI: 10.1016/j.compbiomed.2024.109342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Revised: 10/19/2024] [Accepted: 10/30/2024] [Indexed: 12/22/2024]
Abstract
The integration of Artificial Intelligence (AI) and Intelligent Learning Models (ILMs) in healthcare has transformed the field, offering precise diagnostics, remote monitoring, personalized treatment, and more. Cardioneurological disorders (CD), affecting the cardiovascular and neurological systems, present significant diagnostic and management challenges. Traditional testing methods often lack sensitivity and specificity, leading to delayed or inaccurate diagnoses. AI-driven ILMs trained on large datasets offer promise for accurate identification and prediction of CD by analyzing complex data patterns. However, there is a lack of comprehensive studies reviewing AI applications for the diagnosis of CD and inter related disorders. This paper comprehensively reviews existing integrated solutions involving AI and ILMs in CD, examining their clinical manifestations, epidemiology, diagnostic challenges, and therapeutic considerations. The study examines recent research on CD, reviews AI-driven models' landscape, evaluates existing models, addresses practical considerations, and outlines future research directions. Through this work, we aim to provide insights into the transformative potential of AI-driven ILMs in improving clinical practice and patient outcomes for CD.
Collapse
Affiliation(s)
- Shahadat Hussain
- School of Computer Science Engineering & Technology, Bennett University, Greater Noida 201310, India
| | - Shahnawaz Ahmad
- School of Computer Science Engineering & Technology, Bennett University, Greater Noida 201310, India
| | - Mohammed Wasid
- School of Computer Science Engineering & Technology, Bennett University, Greater Noida 201310, India.
| |
Collapse
|
5
|
Tsanakas AT, Mueller YM, van de Werken HJG, Pujol Borrell R, Ouzounis CA, Katsikis PD. An explainable machine learning model for COVID-19 severity prognosis at hospital admission. INFORMATICS IN MEDICINE UNLOCKED 2025; 52:101602. [DOI: 10.1016/j.imu.2024.101602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2025] Open
|
6
|
Neuman I, Shvartser L, Teppler S, Friedman Y, Levine JJ, Kagan I, Bishara J, Kushinir S, Singer P. A machine-learning model for prediction of Acinetobacter baumannii hospital acquired infection. PLoS One 2024; 19:e0311576. [PMID: 39636870 PMCID: PMC11620386 DOI: 10.1371/journal.pone.0311576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 09/21/2024] [Indexed: 12/07/2024] Open
Abstract
BACKGROUND Acinetobacter baumanni infection is a leading cause of morbidity and mortality in the Intensive Care Unit (ICU). Early recognition of patients at risk for infection allows early proper treatment and is associated with improved outcomes. This study aimed to construct an innovative Machine Learning (ML) based prediction tool for Acinetobacter baumanni infection, among patients in the ICU, and to examine its robustness and predictive power. METHODS For model development and internal validation, we used The Medical Information Mart for Intensive Care database (MIMIC) III data from 19,690 consecutive adult patients admitted between 2001 and 2012 at a Boston tertiary center ICU. For external validation, we used a different dataset from Rabin Medical Center (RMC, Israeli tertiary center) ICU, of 1,700 patients admitted between 2017 and 2021. After training on MIMIC cohorts, we adapted the algorithm from MIMIC to RMC and evaluated its discriminating power in terms of Area Under the Receiver Operating Curve (AUROC), sensitivity, specificity, Negative Predictive Value and Positive Predictive Value. RESULTS The prediction model achieved AUROC = 0.624 (95% CI 0.604-0.647). The most significant predictors were (i) physiological parameters of cardio-respiratory function, such as carbon dioxide (CO2) levels and respiratory rate, (ii) metabolic disturbances such as lactate and acidosis (pH) and (iii) past administration of antibiotics. CONCLUSIONS Infection with Acinetobacter baumanni is more likely to occur in patients with respiratory failure and higher lactate levels, as well as patients who have used larger amounts of antibiotics. The accuracy of Acinetobacter prediction may be enhanced by future studies.
Collapse
Affiliation(s)
- Ido Neuman
- Department of General Intensive Care and Institute for Nutrition Research, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel
| | | | | | | | | | - Ilya Kagan
- Department of General Intensive Care and Institute for Nutrition Research, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel
| | - Jihad Bishara
- Infectious Diseases Unit, Rabin Medical Center, Beilinson Hospital, Petah-Tikva, Israel
| | - Shiri Kushinir
- Rabin Medical Center Research Authority, Beilinson Hospital, Petah Tikva, Israel
| | - Pierre Singer
- Department of General Intensive Care and Institute for Nutrition Research, Rabin Medical Center, Beilinson Hospital, Petah Tikva, Israel
| |
Collapse
|
7
|
Weaver M, Goodin DA, Miller HA, Karmali D, Agarwal AA, Frieboes HB, Suliman SA. Prediction of prolonged mechanical ventilation in the intensive care unit via machine learning: a COVID-19 perspective. Sci Rep 2024; 14:30173. [PMID: 39627490 PMCID: PMC11615281 DOI: 10.1038/s41598-024-81980-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 12/02/2024] [Indexed: 12/06/2024] Open
Abstract
Early recognition of risk factors for prolonged mechanical ventilation (PMV) could allow for early clinical interventions, prevention of secondary complications such as nosocomial infections, and effective triage of hospital resources. This study tested the hypothesis that an ensemble machine learning (ML) analysis of clinical data at time of intubation could identify patients at risk of PMV, using a COVID-19 dataset to classify patients into PMV (> 14 days) and non-PMV (≤ 14 days) groups. While several factors are known to cause PMV, including acid-base, weakness, and delirium, lesser-utilized but routinely measured parameters such as platelet count, glucose levels and fevers may also be relevant. Patient data from a single University Hospital were analyzed via the ML workflow to predict patients at risk of PMV and identify key clinical markers. Model performance was evaluated on a chronologically distinct cohort. The ML workflow identified patients at risk of PMV with AUROCTRAIN=0.960 (F1TRAIN = 0.935) and AUROCTEST=0.804 (F1TEST = 0.800). Top key features for classification included glucose, platelet count, temperature, LVEF, bicarbonate (arterial blood gas), and BMI. Data analysis at intubation time via the proposed workflow offers the potential to accurately predict patients at risk of PMV, with the goal to improve patient management and triage of hospital resources.
Collapse
Affiliation(s)
- Marianna Weaver
- Division of Pulmonary Medicine, University of Louisville, Louisville, KY, 40292, USA
| | - Dylan A Goodin
- Department of Bioengineering, University of Louisville, Lutz Hall 419, Louisville, KY, 40292, USA
| | - Hunter A Miller
- Department of Bioengineering, University of Louisville, Lutz Hall 419, Louisville, KY, 40292, USA
| | - Dipan Karmali
- Division of Pulmonary Medicine, University of Louisville, Louisville, KY, 40292, USA
| | - Apurv A Agarwal
- Division of Pulmonary Medicine, University of Louisville, Louisville, KY, 40292, USA
| | - Hermann B Frieboes
- Department of Bioengineering, University of Louisville, Lutz Hall 419, Louisville, KY, 40292, USA.
- Department of Pharmacology/Toxicology, University of Louisville, Louisville, KY, 40292, USA.
- Center for Predictive Medicine, University of Louisville, Louisville, KY, 40292, USA.
- James Graham Brown Cancer Center, University of Louisville, Louisville, KY, 40292, USA.
| | - Sally A Suliman
- University of Arizona Medical Center Phoenix, Phoenix, AZ, 85004, USA
- Formerly at: Division of Pulmonary Medicine, University of Louisville, Louisville, KY, 40292, USA
| |
Collapse
|
8
|
Mohd Nor NH, Mansor NI, Hasim NA. Artificial Neural Networks: A New Frontier in Dental Tissue Regeneration. TISSUE ENGINEERING. PART B, REVIEWS 2024. [PMID: 39556233 DOI: 10.1089/ten.teb.2024.0216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2024]
Abstract
In the realm of dental tissue regeneration research, various constraints exist such as the potential variance in cell quality, potency arising from differences in donor tissue and tissue microenvironment, the difficulties associated with sustaining long-term and large-scale cell expansion while preserving stemness and therapeutic attributes, as well as the need for extensive investigation into the enduring safety and effectiveness in clinical settings. The adoption of artificial intelligence (AI) technologies has been suggested as a means to tackle these challenges. This is because, tissue regeneration research could be advanced through the use of diagnostic systems that incorporate mining methods such as neural networks (NN), fuzzy, predictive modeling, genetic algorithms, machine learning (ML), cluster analysis, and decision trees. This article seeks to offer foundational insights into a subset of AI referred to as artificial neural networks (ANNs) and assess their potential applications as essential decision-making support tools in the field of dentistry, with a particular focus on tissue engineering research. Although ANNs may initially appear complex and resource intensive, they have proven to be effective in laboratory and therapeutic settings. This expert system can be trained using clinical data alone, enabling their deployment in situations where rule-based decision-making is impractical. As ANNs progress further, it is likely to play a significant role in revolutionizing dental tissue regeneration research, providing promising results in streamlining dental procedures and improving patient outcomes in the clinical setting.
Collapse
Affiliation(s)
| | - Nur Izzati Mansor
- Department of Nursing, Faculty of Medicine, Universiti Kebangsaan Malaysia, Cheras, Malaysia
| | - Nur Asmadayana Hasim
- Pusat Pengajian Citra Universiti, Universiti Kebangsaan Malaysia, Bangi, Malaysia
| |
Collapse
|
9
|
Kalra A, Bachina P, Shou BL, Hwang J, Barshay M, Kulkarni S, Sears I, Eickhoff C, Bermudez CA, Brodie D, Ventetuolo CE, Whitman GJ, Abbasi A, Cho SM. Using machine learning to predict neurologic injury in venovenous extracorporeal membrane oxygenation recipients: An ELSO Registry analysis. JTCVS OPEN 2024; 21:140-167. [PMID: 39534333 PMCID: PMC11551311 DOI: 10.1016/j.xjon.2024.06.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 05/10/2024] [Accepted: 06/10/2024] [Indexed: 11/16/2024]
Abstract
Background Venovenous extracorporeal membrane oxygenation (VV-ECMO) is associated with acute brain injury (ABI), including central nervous system (CNS) ischemia (defined as ischemic stroke or hypoxic-ischemic brain injury [HIBI]) and intracranial hemorrhage (ICH). Data on prediction models for neurologic outcomes in VV-ECMO are limited. Methods We analyzed adult (age ≥18 years) VV-ECMO patients in the Extracorporeal Life Support Organization (ELSO) Registry (2009-2021) from 676 centers. ABI was defined as CNS ischemia, ICH, brain death, and seizures. Data on 67 variables were extracted, including clinical characteristics and pre-ECMO/on-ECMO variables. Random forest, CatBoost, LightGBM, and XGBoost machine learning (ML) algorithms (10-fold leave-one-out cross-validation) were used to predict ABI. Feature importance scores were used to pinpoint the most important variables for predicting ABI. Results Of 37,473 VV-ECMO patients (median age, 48.1 years; 63% male), 2644 (7.1%) experienced ABI, including 610 (2%) with CNS ischemia and 1591 (4%) with ICH. The areas under the receiver operating characteristic curve for predicting ABI, CNS ischemia, and ICH were 0.70, 0.68, and 0.70, respectively. The accuracy, positive predictive value, and negative predictive value for ABI were 85%, 19%, and 95%, respectively. ML identified higher center volume, pre-ECMO cardiac arrest, higher ECMO pump flow, and elevated on-ECMO serum lactate level as the most important risk factors for ABI and its subtypes. Conclusions This is the largest study of VV-ECMO patients to use ML to predict ABI reported to date. Performance was suboptimal, likely due to lack of standardization of neuromonitoring/imaging protocols and data granularity in the ELSO Registry. Standardized neurologic monitoring and imaging are needed across ELSO centers to detect the true prevalence of ABI.
Collapse
Affiliation(s)
- Andrew Kalra
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
- Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, Pa
| | - Preetham Bachina
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Benjamin L. Shou
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Jaeho Hwang
- Division of Epilepsy, Department of Neurology, Johns Hopkins Hospital, Baltimore, Md
| | - Meylakh Barshay
- Warren Alpert Medical School of Brown University, Providence, RI
| | - Shreyas Kulkarni
- Warren Alpert Medical School of Brown University, Providence, RI
| | - Isaac Sears
- Warren Alpert Medical School of Brown University, Providence, RI
| | - Carsten Eickhoff
- Department of Computer Science, Brown University, Providence, RI
- Faculty of Medicine, University of Tübingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Christian A. Bermudez
- Division of Cardiovascular Surgery, Department of Surgery, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pa
| | - Daniel Brodie
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Md
| | - Corey E. Ventetuolo
- Department of Health Services, Policy and Practice, Brown School of Public Health, Providence, RI
- Division of Pulmonary, Critical Care and Sleep Medicine, Warren Alpert Medical School of Brown University, Providence, RI
| | - Glenn J.R. Whitman
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Adeel Abbasi
- Division of Pulmonary, Critical Care and Sleep Medicine, Warren Alpert Medical School of Brown University, Providence, RI
| | - Sung-Min Cho
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
- Division of Neurosciences Critical Care, Department of Neurology, Neurosurgery, Anesthesiology and Critical Care Medicine, Johns Hopkins Hospital, Baltimore, Md
| |
Collapse
|
10
|
Alie MS, Negesse Y, Kindie K, Merawi DS. Machine learning algorithms for predicting COVID-19 mortality in Ethiopia. BMC Public Health 2024; 24:1728. [PMID: 38943093 PMCID: PMC11212371 DOI: 10.1186/s12889-024-19196-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 06/19/2024] [Indexed: 07/01/2024] Open
Abstract
BACKGROUND Coronavirus disease 2019 (COVID-19), a global public health crisis, continues to pose challenges despite preventive measures. The daily rise in COVID-19 cases is concerning, and the testing process is both time-consuming and costly. While several models have been created to predict mortality in COVID-19 patients, only a few have shown sufficient accuracy. Machine learning algorithms offer a promising approach to data-driven prediction of clinical outcomes, surpassing traditional statistical modeling. Leveraging machine learning (ML) algorithms could potentially provide a solution for predicting mortality in hospitalized COVID-19 patients in Ethiopia. Therefore, the aim of this study is to develop and validate machine-learning models for accurately predicting mortality in COVID-19 hospitalized patients in Ethiopia. METHODS Our study involved analyzing electronic medical records of COVID-19 patients who were admitted to public hospitals in Ethiopia. Specifically, we developed seven different machine learning models to predict COVID-19 patient mortality. These models included J48 decision tree, random forest (RF), k-nearest neighborhood (k-NN), multi-layer perceptron (MLP), Naïve Bayes (NB), eXtreme gradient boosting (XGBoost), and logistic regression (LR). We then compared the performance of these models using data from a cohort of 696 patients through statistical analysis. To evaluate the effectiveness of the models, we utilized metrics derived from the confusion matrix such as sensitivity, specificity, precision, and receiver operating characteristic (ROC). RESULTS The study included a total of 696 patients, with a higher number of females (440 patients, accounting for 63.2%) compared to males. The median age of the participants was 35.0 years old, with an interquartile range of 18-79. After conducting different feature selection procedures, 23 features were examined, and identified as predictors of mortality, and it was determined that gender, Intensive care unit (ICU) admission, and alcohol drinking/addiction were the top three predictors of COVID-19 mortality. On the other hand, loss of smell, loss of taste, and hypertension were identified as the three lowest predictors of COVID-19 mortality. The experimental results revealed that the k-nearest neighbor (k-NN) algorithm outperformed than other machine learning algorithms, achieving an accuracy of 95.25%, sensitivity of 95.30%, precision of 92.7%, specificity of 93.30%, F1 score 93.98% and a receiver operating characteristic (ROC) score of 96.90%. These findings highlight the effectiveness of the k-NN algorithm in predicting COVID-19 outcomes based on the selected features. CONCLUSION Our study has developed an innovative model that utilizes hospital data to accurately predict the mortality risk of COVID-19 patients. The main objective of this model is to prioritize early treatment for high-risk patients and optimize strained healthcare systems during the ongoing pandemic. By integrating machine learning with comprehensive hospital databases, our model effectively classifies patients' mortality risk, enabling targeted medical interventions and improved resource management. Among the various methods tested, the K-nearest neighbors (KNN) algorithm demonstrated the highest accuracy, allowing for early identification of high-risk patients. Through KNN feature identification, we identified 23 predictors that significantly contribute to predicting COVID-19 mortality. The top five predictors are gender (female), intensive care unit (ICU) admission, alcohol drinking, smoking, and symptoms of headache and chills. This advancement holds great promise in enhancing healthcare outcomes and decision-making during the pandemic. By providing services and prioritizing patients based on the identified predictors, healthcare facilities and providers can improve the chances of survival for individuals. This model provides valuable insights that can guide healthcare professionals in allocating resources and delivering appropriate care to those at highest risk.
Collapse
Affiliation(s)
- Melsew Setegn Alie
- Department Public Health, School of Public Health, College of Medicine and Health Science, Mizan-Tepi University, Mizan-Aman, Ethiopia.
| | - Yilkal Negesse
- Department of Public Health, College of Medicine and Health Science, Debre Markos University, Gojjam, Ethiopia
| | - Kassa Kindie
- Department Nursing, College of Medicine and Health Science, Mizan-Tepi University, Mizan-Aman, Ethiopia
| | - Dereje Senay Merawi
- Department of Information Technology, Faculty of Technology, Debre Tabor University, Gonder, Ethiopia
| |
Collapse
|
11
|
Das S, Nayak SP, Sahoo B, Nayak SC. Machine Learning in Healthcare Analytics: A State-of-the-Art Review. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING 2024. [DOI: 10.1007/s11831-024-10098-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 02/23/2024] [Indexed: 01/06/2025]
|
12
|
Gabaldi CQ, Cypriano AS, Pedrotti CHS, Malheiro DT, Laselva CR, Cendoroglo M, Teich VD. Is it possible to estimate the number of patients with COVID-19 admitted to intensive care units and general wards using clinical and telemedicine data? EINSTEIN-SAO PAULO 2024; 22:eAO0328. [PMID: 38477720 PMCID: PMC10948090 DOI: 10.31744/einstein_journal/2024ao0328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 11/14/2023] [Indexed: 03/14/2024] Open
Abstract
BACKGROUND Gabaldi et al. utilized telemedicine data, web search trends, hospitalized patient characteristics, and resource usage data to estimate bed occupancy during the COVID-19 pandemic. The results showcase the potential of data-driven strategies to enhance resource allocation decisions for an effective pandemic response. OBJECTIVE To develop and validate predictive models to estimate the number of COVID-19 patients hospitalized in the intensive care units and general wards of a private not-for-profit hospital in São Paulo, Brazil. METHODS Two main models were developed. The first model calculated hospital occupation as the difference between predicted COVID-19 patient admissions, transfers between departments, and discharges, estimating admissions based on their weekly moving averages, segmented by general wards and intensive care units. Patient discharge predictions were based on a length of stay predictive model, assessing the clinical characteristics of patients hospitalized with COVID-19, including age group and usage of mechanical ventilation devices. The second model estimated hospital occupation based on the correlation with the number of telemedicine visits by patients diagnosed with COVID-19, utilizing correlational analysis to define the lag that maximized the correlation between the studied series. Both models were monitored for 365 days, from May 20th, 2021, to May 20th, 2022. RESULTS The first model predicted the number of hospitalized patients by department within an interval of up to 14 days. The second model estimated the total number of hospitalized patients for the following 8 days, considering calls attended by Hospital Israelita Albert Einstein's telemedicine department. Considering the average daily predicted values for the intensive care unit and general ward across a forecast horizon of 8 days, as limited by the second model, the first and second models obtained R² values of 0.900 and 0.996, respectively and mean absolute errors of 8.885 and 2.524 beds, respectively. The performances of both models were monitored using the mean error, mean absolute error, and root mean squared error as a function of the forecast horizon in days. CONCLUSION The model based on telemedicine use was the most accurate in the current analysis and was used to estimate COVID-19 hospital occupancy 8 days in advance, validating predictions of this nature in similar clinical contexts. The results encourage the expansion of this method to other pathologies, aiming to guarantee the standards of hospital care and conscious consumption of resources. BACKGROUND Developed models to forecast bed occupancy for up to 14 days and monitored errors for 365 days. BACKGROUND Telemedicine calls from COVID-19 patients correlated with the number of patients hospitalized in the next 8 days.
Collapse
Affiliation(s)
- Caio Querino Gabaldi
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| | - Adriana Serra Cypriano
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| | | | - Daniel Tavares Malheiro
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| | - Claudia Regina Laselva
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| | - Miguel Cendoroglo
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| | - Vanessa Damazio Teich
- Hospital Israelita Albert EinsteinSão PauloSPBrazilHospital Israelita Albert Einstein, São Paulo, SP, Brazil.
| |
Collapse
|
13
|
Wubineh BZ, Deriba FG, Woldeyohannis MM. Exploring the opportunities and challenges of implementing artificial intelligence in healthcare: A systematic literature review. Urol Oncol 2024; 42:48-56. [PMID: 38101991 DOI: 10.1016/j.urolonc.2023.11.019] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 11/25/2023] [Indexed: 12/17/2023]
Abstract
Recent progress in the realm of artificial intelligence has shown effectiveness in various industries, particularly within the healthcare sector. However, there are limited insights on existing studies regarding ethical, social, privacy, and technological aspects of AI in the health sector, which is the gap our study aims to fill. This study aimed to synthesize empirical studies on the challenges and opportunities of using AI by conducting a systematic review. We reviewed 33 articles published between 2015 and 2022 in the PubMed, IEEE Xplore, and Science Direct databases. The results show that artificial intelligence has the promise of improving health care and faces obstacles when implemented. Most of the reviewed studies indicated that the use of AI provides several opportunities, including teamwork and decision-making, technological advancement, diagnosis and patient monitoring, drug development, and virtual health assistance. However, the findings show that the use of AI in the health sector hinders multifaceted challenges, including ethical and privacy-related issues, lack of awareness, unreliability of technology, and professional liability. The findings highlight that artificial intelligence has the potential to transform healthcare and that addressing these challenges is crucial to fully utilize its potential.
Collapse
Affiliation(s)
- Betelhem Zewdu Wubineh
- Faculty of Information and Communication Technology, Wroclaw University of Science and Technology, Wroclaw, Poland; School of Computing and Informatics, Wachemo University, Hosaena, Ethiopia.
| | - Fitsum Gizachew Deriba
- School of Computing, University of Eastern Finland, Joensuu, Finland; School of Computing and Informatics, Wachemo University, Hosaena, Ethiopia
| | | |
Collapse
|
14
|
Tiemi Enokida Mori M, Name Colado Simão A, Danelli T, Rangel Oliveira S, Luis Candido de Souza Cassela P, Lerner Trigo G, Morais Cardoso K, Mestre Tejo A, Naomi Tano Z, Regina Delicato de Almeida E, Maria Vissoci Reiche E, Maes M, Alysson Batisti Lozovoy M. Protective effects of IL18-105G > A and IL18-137C > Ggenetic variants on severity of COVID-19. Cytokine 2024; 174:156476. [PMID: 38128426 DOI: 10.1016/j.cyto.2023.156476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 11/29/2023] [Accepted: 12/08/2023] [Indexed: 12/23/2023]
Abstract
OBJECTIVE AND DESIGN A cross-sectional study evaluated the IL18-105G > A (rs360717) and IL18-137C > G (rs187238) variants on Coronavírus Disease 2019 (COVID-19) severity. SUBJECTS AND METHODS 528 patients with COVID-19 classifed with mild (n = 157), moderate (n = 63) and critical (n = 308) disease were genotpyed for the IL18-105G > A and IL18-137C > G variants. RESULTS We observed associations between severe + critical COVID-19 groups (reference group was mild COVID-19) and the IL18-105G > A (p = 0.008) and IL18-137C > G (p = 0.01) variants, which remained significant after adjusting for sex, ethnicity and age. Consequently, we have examined the associations between moderate + critical COVID-19 and the genotypes of both variants using different genetic models. The IL18-105G > A was associated with severe disease (moderate + critical), with effects of the GA genotype in the codominant [Odds ratio (OR), (95 % confidence interval) 0.55, 0.34-0.89, p = 0.015], overdominant (0.56, 0.35-0.89, p = 0.014) and dominant (0.60, 0.38-0.96, p = 0.031) models. IL18-105 GA coupled with age, chest computed tomograhy scan anormalities, body mass index, heart diseases, type 2 diabetes mellitus, hypertension, and inflammation may be used to predict the patients who develop severe disease with an accuracy of 84.3 % (sensitivity: 83.3 % and specificity: 86.5 %). Therefore, the presence of the IL18-105 A allele in homozygosis or heterozygosis conferred about 44.0 % of protection in the development of moderate and severe COVID-19. The IL18-137C > G variant was also associated with protective effects in the codominant (0.55, 0.34-0.89, p = 0.015), overdominant (0.57, 0.36-0.91, p = 0.018), and dominant models (0.59, 0.37-0.93, p = 0.025). Therefore, the IL18-137 G allele showed a protective effect against COVID-19 severity. CONCLUSION The IL18-105G > A and IL18-137C > Gvariants may contribute with protective effects for COVID-19 severity and the effects of IL18-137C > G may be modulating IL-18 production and Th1-mediated immune responses.
Collapse
Affiliation(s)
| | - Andréa Name Colado Simão
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil; Department of Pathology, Clinical Analysis and Toxicology, Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil.
| | - Tiago Danelli
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil
| | - Sayonara Rangel Oliveira
- Department of Pathology, Clinical Analysis and Toxicology, Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil
| | | | - Guilherme Lerner Trigo
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil
| | - Kauê Morais Cardoso
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil.
| | | | - Zuleica Naomi Tano
- Depertment of Medical Clinic, University of Londrina, Londrina, PR, Brazil.
| | - Elaine Regina Delicato de Almeida
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil; Department of Pathology, Clinical Analysis and Toxicology, Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil
| | - Edna Maria Vissoci Reiche
- Postgraduate Program of Clinical and Laboratory Pathophysiology, Health Sciences Center, Londrina State University, Lodrina, Paraná, Brazil; Pontifical Catholic University of Paraná, School of Medicine, Campus Londrina, Lonidrna, Paraná, Brazil.
| | - Michael Maes
- Department of Psychiatry, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand; Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu 610072, China; Key Laboratory of Psychosomatic Medicine, Chinese Academy of Medical Sciences, Chengdu 610072, China.
| | - Marcell Alysson Batisti Lozovoy
- Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil; Department of Pathology, Clinical Analysis and Toxicology, Laboratory of Research in Applied Immunology, University of Londrina, Londrina, PR, Brazil
| |
Collapse
|
15
|
Viderman D, Kotov A, Popov M, Abdildin Y. Machine and deep learning methods for clinical outcome prediction based on physiological data of COVID-19 patients: a scoping review. Int J Med Inform 2024; 182:105308. [PMID: 38091862 DOI: 10.1016/j.ijmedinf.2023.105308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/20/2023] [Accepted: 12/03/2023] [Indexed: 01/07/2024]
Abstract
INTRODUCTION Since the beginning of the COVID-19 pandemic, numerous machine and deep learning (MDL) methods have been proposed in the literature to analyze patient physiological data. The objective of this review is to summarize various aspects of these methods and assess their practical utility for predicting various clinical outcomes. METHODS We searched PubMed, Scopus, and Cochrane Library, screened and selected the studies matching the inclusion criteria. The clinical analysis focused on the characteristics of the patient cohorts in the studies included in this review, the specific tasks in the context of the COVID-19 pandemic that machine and deep learning methods were used for, and their practical limitations. The technical analysis focused on the details of specific MDL methods and their performance. RESULTS Analysis of the 48 selected studies revealed that the majority (∼54 %) of them examined the application of MDL methods for the prediction of survival/mortality-related patient outcomes, while a smaller fraction (∼13 %) of studies also examined applications to the prediction of patients' physiological outcomes and hospital resource utilization. 21 % of the studies examined the application of MDL methods to multiple clinical tasks. Machine and deep learning methods have been shown to be effective at predicting several outcomes of COVID-19 patients, such as disease severity, complications, intensive care unit (ICU) transfer, and mortality. MDL methods also achieved high accuracy in predicting the required number of ICU beds and ventilators. CONCLUSION Machine and deep learning methods have been shown to be valuable tools for predicting disease severity, organ dysfunction and failure, patient outcomes, and hospital resource utilization during the COVID-19 pandemic. The discovered knowledge and our conclusions and recommendations can also be useful to healthcare professionals and artificial intelligence researchers in managing future pandemics.
Collapse
Affiliation(s)
- Dmitriy Viderman
- Department of Surgery, School of Medicine, Nazarbayev University, Astana, Kazakhstan; Department of Anesthesiology, Intensive Care, and Pain Medicine, National Research Oncology Center, Astana, Kazakhstan.
| | - Alexander Kotov
- Department of Computer Science, College of Engineering, Wayne State University, Detroit, USA.
| | - Maxim Popov
- Department of Computer Science, School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan.
| | - Yerkin Abdildin
- Department of Mechanical and Aerospace Engineering, School of Engineering and Digital Sciences, Nazarbayev University, Astana, Kazakhstan.
| |
Collapse
|
16
|
Reina-Reina A, Barrera J, Maté A, Trujillo J, Valdivieso B, Gas ME. Developing an interpretable machine learning model for predicting COVID-19 patients deteriorating prior to intensive care unit admission using laboratory markers. Heliyon 2023; 9:e22878. [PMID: 38125502 PMCID: PMC10731083 DOI: 10.1016/j.heliyon.2023.e22878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 11/15/2023] [Accepted: 11/22/2023] [Indexed: 12/23/2023] Open
Abstract
Coronavirus disease (COVID-19) remains a significant global health challenge, prompting a transition from emergency response to comprehensive management strategies. Furthermore, the emergence of new variants of concern, such as BA.2.286, underscores the need for early detection and response to new variants, which continues to be a crucial strategy for mitigating the impact of COVID-19, especially among the vulnerable population. This study aims to anticipate patients requiring intensive care or facing elevated mortality risk throughout their COVID-19 infection while also identifying laboratory predictive markers for early diagnosis of patients. Therefore, haematological, biochemical, and demographic variables were retrospectively evaluated in 8,844 blood samples obtained from 2,935 patients before intensive care unit admission using an interpretable machine learning model. Feature selection techniques were applied using precision-recall measures to address data imbalance and evaluate the suitability of the different variables. The model was trained using stratified cross-validation with k=5 and internally validated, achieving an accuracy of 77.27%, sensitivity of 78.55%, and area under the receiver operating characteristic (AUC) of 0.85; successfully identifying patients at increased risk of severe progression. From a medical perspective, the most important features of the progression or severity of patients with COVID-19 were lactate dehydrogenase, age, red blood cell distribution standard deviation, neutrophils, and platelets, which align with findings from several prior investigations. In light of these insights, diagnostic processes can be significantly expedited through the use of laboratory tests, with a greater focus on key indicators. This strategic approach not only improves diagnostic efficiency but also extends its reach to a broader spectrum of patients. In addition, it allows healthcare professionals to take early preventive measures for those most at risk of adverse outcomes, thereby optimising patient care and prognosis.
Collapse
Affiliation(s)
- A. Reina-Reina
- Lucentia Research. Department of Software and Computing System, University of Alicante, Carretera San Vicente del Raspeig s/n, 03690, Alicante, Spain
- Lucentia Lab, Av. Pintor Pérez Gil, 16, 03540, Alicante, Spain
| | - J.M. Barrera
- Lucentia Research. Department of Software and Computing System, University of Alicante, Carretera San Vicente del Raspeig s/n, 03690, Alicante, Spain
- Lucentia Lab, Av. Pintor Pérez Gil, 16, 03540, Alicante, Spain
| | - A. Maté
- Lucentia Research. Department of Software and Computing System, University of Alicante, Carretera San Vicente del Raspeig s/n, 03690, Alicante, Spain
- Lucentia Lab, Av. Pintor Pérez Gil, 16, 03540, Alicante, Spain
| | - J.C. Trujillo
- Lucentia Research. Department of Software and Computing System, University of Alicante, Carretera San Vicente del Raspeig s/n, 03690, Alicante, Spain
- Lucentia Lab, Av. Pintor Pérez Gil, 16, 03540, Alicante, Spain
| | - B. Valdivieso
- The University and Polytechnic La Fe Hospital of Valencia, Avenida Fernando Abril Martorell, 106 Torre H 1st floor, 46026, Valencia, Spain
- The Medical Research Institute of Hospital La Fe, Avenida Fernando Abril Martorell, 106 Torre F 7th floor, 46026, Valencia, Spain
| | - María-Eugenia Gas
- The Medical Research Institute of Hospital La Fe, Avenida Fernando Abril Martorell, 106 Torre F 7th floor, 46026, Valencia, Spain
| |
Collapse
|
17
|
Chen Y, Wang Y, Chen J, Ma X, Su L, Wei Y, Li L, Ma D, Zhang F, Zhu W, Meng X, Sun G, Ma L, Jiang H, Yin C, Li T, Zhou X, China National Critical Care Quality Control Center Group. Multidimensional dynamic prediction model for hospitalized patients with the omicron variant in China. Infect Dis Model 2023; 8:1097-1107. [PMID: 37854788 PMCID: PMC10579104 DOI: 10.1016/j.idm.2023.09.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 09/09/2023] [Accepted: 09/18/2023] [Indexed: 10/20/2023] Open
Abstract
Purpose To establish dynamic prediction models by machine learning using daily multidimensional data for coronavirus disease 2019 (COVID-19) patients. Methods Hospitalized COVID-19 patients at Peking Union Medical College Hospital from Nov 2nd, 2022, to Jan 13th, 2023, were enrolled in this study. The outcome was defined as deterioration or recovery of the patient's condition. Demographics, comorbidities, laboratory test results, vital signs, and treatments were used to train the model. To predict the following days, a separate XGBoost model was trained and validated. The Shapley additive explanations method was used to analyze feature importance. Results A total of 995 patients were enrolled, generating 7228 and 3170 observations for each prediction model. In the deterioration prediction model, the minimum area under the receiver operating characteristic curve (AUROC) for the following 7 days was 0.786 (95% CI 0.721-0.851), while the AUROC on the next day was 0.872 (0.831-0.913). In the recovery prediction model, the minimum AUROC for the following 3 days was 0.675 (0.583-0.767), while the AUROC on the next day was 0.823 (0.770-0.876). The top 5 features for deterioration prediction on the 7th day were disease course, length of hospital stay, hypertension, and diastolic blood pressure. Those for recovery prediction on the 3rd day were age, D-dimer levels, disease course, creatinine levels and corticosteroid therapy. Conclusion The models could accurately predict the dynamics of Omicron patients' conditions using daily multidimensional variables, revealing important features including comorbidities (e.g., hyperlipidemia), age, disease course, vital signs, D-dimer levels, corticosteroid therapy and oxygen therapy.
Collapse
Affiliation(s)
- Yujie Chen
- Department of Critical Care Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Yao Wang
- Yidu Cloud Technology Inc., Beijing, China
| | - Jieqing Chen
- Information Center Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | - Xudong Ma
- Department of Medical Administration, National Health Commission of the People's Republic of China, Beijing, 100044, China
| | - Longxiang Su
- Department of Critical Care Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Yuna Wei
- Yidu Cloud Technology Inc., Beijing, China
| | - Linfeng Li
- Yidu Cloud Technology Inc., Beijing, China
| | - Dandan Ma
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Feng Zhang
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Wen Zhu
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Xiaoyang Meng
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Guoqiang Sun
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Lian Ma
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Huizhen Jiang
- Information Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Chang Yin
- National Institute of Hospital Administration, Beijing, China
| | - Taisheng Li
- Department of Infectious Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China
| | - Xiang Zhou
- Department of Critical Care Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
- Information Center Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing, 100730, China
| | | |
Collapse
|
18
|
Benjamin R. Reproduction number projection for the COVID-19 pandemic. ADVANCES IN CONTINUOUS AND DISCRETE MODELS 2023; 2023:46. [DOI: 10.1186/s13662-023-03792-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 11/10/2023] [Indexed: 01/02/2025]
Abstract
AbstractThe recently derived Hybrid-Incidence Susceptible-Transmissible-Removed (HI-STR) prototype is a deterministic compartment model for epidemics and an alternative to the Susceptible-Infected-Removed (SIR) model. The HI-STR predicts that pathogen transmission depends on host population characteristics including population size, population density and social behaviour common within that population.The HI-STR prototype is applied to the ancestral Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV2) to show that the original estimates of the Coronavirus Disease 2019 (COVID-19) basic reproduction number $\mathcal{R}_{0}$
R
0
for the United Kingdom (UK) could have been projected onto the individual states of the United States of America (USA) prior to being detected in the USA.The Imperial College London (ICL) group’s estimate of $\mathcal{R}_{0}$
R
0
for the UK is projected onto each USA state. The difference between these projections and the ICL’s estimates for USA states is either not statistically significant on the paired Student t-test or not epidemiologically significant.The SARS-CoV2 Delta variant’s $\mathcal{R}_{0}$
R
0
is also projected from the UK to the USA to prove that projection can be applied to a Variant of Concern (VOC). Projection provides both a localised baseline for evaluating the implementation of an intervention policy and a mechanism for anticipating the impact of a VOC before local manifestation.
Collapse
|
19
|
Giuste FO, He L, Lais P, Shi W, Zhu Y, Hornback A, Tsai C, Isgut M, Anderson B, Wang MD. Early and fair COVID-19 outcome risk assessment using robust feature selection. Sci Rep 2023; 13:18981. [PMID: 37923795 PMCID: PMC10624921 DOI: 10.1038/s41598-023-36175-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 05/29/2023] [Indexed: 11/06/2023] Open
Abstract
Personalized medicine plays an important role in treatment optimization for COVID-19 patient management. Early treatment in patients at high risk of severe complications is vital to prevent death and ventilator use. Predicting COVID-19 clinical outcomes using machine learning may provide a fast and data-driven solution for optimizing patient care by estimating the need for early treatment. In addition, it is essential to accurately predict risk across demographic groups, particularly those underrepresented in existing models. Unfortunately, there is a lack of studies demonstrating the equitable performance of machine learning models across patient demographics. To overcome this existing limitation, we generate a robust machine learning model to predict patient-specific risk of death or ventilator use in COVID-19 positive patients using features available at the time of diagnosis. We establish the value of our solution across patient demographics, including gender and race. In addition, we improve clinical trust in our automated predictions by generating interpretable patient clustering, patient-level clinical feature importance, and global clinical feature importance within our large real-world COVID-19 positive patient dataset. We achieved 89.38% area under receiver operating curve (AUROC) performance for severe outcomes prediction and our robust feature ranking approach identified the presence of dementia as a key indicator for worse patient outcomes. We also demonstrated that our deep-learning clustering approach outperforms traditional clustering in separating patients by severity of outcome based on mutual information performance. Finally, we developed an application for automated and fair patient risk assessment with minimal manual data entry using existing data exchange standards.
Collapse
Affiliation(s)
- Felipe O Giuste
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30322, USA
| | - Lawrence He
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30322, USA
| | - Peter Lais
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30322, USA
| | - Wenqi Shi
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30322, USA
| | - Yuanda Zhu
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30322, USA
| | - Andrew Hornback
- School of Computer Science and Engineering, Georgia Institute of Technology, Atlanta, GA, 30322, USA
| | - Chiche Tsai
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30322, USA
| | - Monica Isgut
- School of Biology, Georgia Institute of Technology, Atlanta, GA, 30322, USA
| | - Blake Anderson
- Department of Medicine, Emory University, Atlanta, GA, 30322, USA
| | - May D Wang
- The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, 30322, USA.
| |
Collapse
|
20
|
Grovu R, Huo Y, Nguyen A, Mourad O, Pan Z, El-Gharib K, Wei C, Mustafa A, Quan T, Slobodnick A. Machine learning: Predicting hospital length of stay in patients admitted for lupus flares. Lupus 2023; 32:1418-1429. [PMID: 37831499 DOI: 10.1177/09612033231206830] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
BACKGROUND Although rare, severe systemic lupus erythematosus (SLE) flares requiring hospitalization account for most of the direct costs of SLE care. New machine learning (ML) methods may optimize lupus care by predicting which patients will have a prolonged hospital length of stay (LOS). Our study uses a machine learning approach to predict the LOS in patients admitted for lupus flares and assesses which features prolong LOS. METHODS Our study sampled 5831 patients admitted for lupus flares from the National Inpatient Sample Database 2016-2018 and collected 90 demographics and comorbidity features. Four machine learning (ML) models were built (XGBoost, Linear Support Vector Machines, K Nearest Neighbors, and Logistic Regression) to predict LOS, and their performance was evaluated using multiple metrics, including accuracy, receiver operator area under the curve (ROC-AUC), precision-recall area under the curve (PR- AUC), and F1-score. Using the highest-performing model (XGBoost), we assessed the feature importance of our input features using Shapley value explanations (SHAP) to rank their impact on LOS. RESULTS Our XGB model performed the best with a ROC-AUC of 0.87, PR-AUC of 0.61, an F1 score of 0.56, and an accuracy of 95%. The features with the most significant impact on the model were "the need for a central line," "acute dialysis," and "acute renal failure." Other top features include those related to renal and infectious comorbidities. CONCLUSION Our results were consistent with the established literature and showed promise in ML over traditional methods of predictive analyses, even with rare rheumatic events such as lupus flare hospitalizations.
Collapse
Affiliation(s)
- Radu Grovu
- Internal Medicine Department, Staten Island University Hospital, Staten Island, NY, USA
| | - Yanran Huo
- Department of Engineering, University of Massachusetts, Dartmouth, MA, USA
| | - Andrew Nguyen
- Medicine Department, Harvard Medical School, Boston, MA, USA
| | - Omar Mourad
- Internal Medicine Department, Staten Island University Hospital, Staten Island, NY, USA
| | - Zihang Pan
- Medicine Department, Duke-NUS Medical School, Singapore
| | - Khalil El-Gharib
- Internal Medicine Department, Staten Island University Hospital, Staten Island, NY, USA
| | - Chapman Wei
- Internal Medicine Department, Staten Island University Hospital, Staten Island, NY, USA
| | - Ahmad Mustafa
- Internal Medicine Department, Staten Island University Hospital, Staten Island, NY, USA
| | - Theodore Quan
- Medicine Department, George Washington University School of Medicine, Washington, DC, USA
| | - Anastasia Slobodnick
- Rheumatology Department, Staten Island University Hospital, Staten Island, NY, USA
| |
Collapse
|
21
|
Le JP, Shashikumar SP, Malhotra A, Nemati S, Wardi G. Making the Improbable Possible: Generalizing Models Designed for a Syndrome-Based, Heterogeneous Patient Landscape. Crit Care Clin 2023; 39:751-768. [PMID: 37704338 PMCID: PMC10758922 DOI: 10.1016/j.ccc.2023.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2023]
Abstract
Syndromic conditions, such as sepsis, are commonly encountered in the intensive care unit. Although these conditions are easy for clinicians to grasp, these conditions may limit the performance of machine-learning algorithms. Individual hospital practice patterns may limit external generalizability. Data missingness is another barrier to optimal algorithm performance and various strategies exist to mitigate this. Recent advances in data science, such as transfer learning, conformal prediction, and continual learning, may improve generalizability of machine-learning algorithms in critically ill patients. Randomized trials with these approaches are indicated to demonstrate improvements in patient-centered outcomes at this point.
Collapse
Affiliation(s)
- Joshua Pei Le
- School of Medicine, University of Limerick, Castletroy, Co, Limerick V94 T9PX, Ireland
| | | | - Atul Malhotra
- Division of Pulmonary, Critical Care and Sleep Medicine, University of California San Diego, San Diego, CA, USA
| | - Shamim Nemati
- Division of Biomedical Informatics, University of California San Diego, San Diego, CA, USA
| | - Gabriel Wardi
- Division of Pulmonary, Critical Care and Sleep Medicine, University of California San Diego, San Diego, CA, USA; Department of Emergency Medicine, University of California San Diego, 200 W Arbor Drive, San Diego, CA 92103, USA.
| |
Collapse
|
22
|
Casas-Rojo JM, Ventura PS, Antón Santos JM, de Latierro AO, Arévalo-Lorido JC, Mauri M, Rubio-Rivas M, González-Vega R, Giner-Galvañ V, Otero Perpiñá B, Fonseca-Aizpuru E, Muiño A, Del Corral-Beamonte E, Gómez-Huelgas R, Arnalich-Fernández F, Llorente Barrio M, Sancha-Lloret A, Rábago Lorite I, Loureiro-Amigo J, Pintos-Martínez S, García-Sardón E, Montaño-Martínez A, Rojano-Rivero MG, Ramos-Rincón JM, López-Escobar A. Improving prediction of COVID-19 mortality using machine learning in the Spanish SEMI-COVID-19 registry. Intern Emerg Med 2023; 18:1711-1722. [PMID: 37349618 DOI: 10.1007/s11739-023-03338-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 06/01/2023] [Indexed: 06/24/2023]
Abstract
COVID-19 is responsible for high mortality, but robust machine learning-based predictors of mortality are lacking. To generate a model for predicting mortality in patients hospitalized with COVID-19 using Gradient Boosting Decision Trees (GBDT). The Spanish SEMI-COVID-19 registry includes 24,514 pseudo-anonymized cases of patients hospitalized with COVID-19 from 1 February 2020 to 5 December 2021. This registry was used as a GBDT machine learning model, employing the CatBoost and BorutaShap classifier to select the most relevant indicators and generate a mortality prediction model by risk level, ranging from 0 to 1. The model was validated by separating patients according to admission date, using the period 1 February to 31 December 2020 (first and second waves, pre-vaccination period) for training, and 1 January to 30 November 2021 (vaccination period) for the test group. An ensemble of ten models with different random seeds was constructed, separating 80% of the patients for training and 20% from the end of the training period for cross-validation. The area under the receiver operating characteristics curve (AUC) was used as a performance metric. Clinical and laboratory data from 23,983 patients were analyzed. CatBoost mortality prediction models achieved an AUC performance of 84.76 (standard deviation 0.45) for patients in the test group (potentially vaccinated patients not included in model training) using 16 features. The performance of the 16-parameter GBDT model for predicting COVID-19 hospital mortality, although requiring a relatively large number of predictors, shows a high predictive capacity.
Collapse
Affiliation(s)
- José-Manuel Casas-Rojo
- Internal Medicine Department, Infanta Cristina University Hospital, Parla, 28981, Madrid, Spain
| | - Paula Sol Ventura
- Department of Pediatric Endocrinology, Hospital HM Nens, HM Hospitales, 08009, Barcelona, Spain
| | | | | | | | - Marc Mauri
- Data Scientist, Kaizen AI, Barcelona, Spain
| | - Manuel Rubio-Rivas
- Internal Medicine Department, Bellvitge University Hospital, Hospitalet de Llobregat, Barcelona, Spain
| | - Rocío González-Vega
- Internal Medicine Department, Hospital Costa del Sol, Marbella, Málaga, Spain
| | - Vicente Giner-Galvañ
- Internal Medicine Department, Hospital Universitario San Juan. San Juan de Alicante, Alicante, Spain
| | | | - Eva Fonseca-Aizpuru
- Internal Medicine Department, Hospital Universitario de Cabueñes, Gijón, Asturias, Spain
| | - Antonio Muiño
- Internal Medicine Department, Hospital Universitario Gregorio Marañón, Madrid, Spain
| | | | - Ricardo Gómez-Huelgas
- Internal Medicine Department, Regional University Hospital of Málaga, Biomedical Research Institute of Málaga (IBIMA), University of Málaga (UMA), Málaga, Spain
| | | | | | | | - Isabel Rábago Lorite
- Internal Medicine Department, Hospital Universitario Infanta Sofía. San Sebastián de los Reyes, Madrid, Spain
| | - José Loureiro-Amigo
- Internal Medicine Department, Hospital Moisès Broggi, Sant Joan Despí, Barcelona, Spain
| | - Santiago Pintos-Martínez
- Internal Medicine Department, Hospital Universitario de Sagunto, Puerto de Sagunto, Valencia, Spain
| | - Eva García-Sardón
- Internal Medicine Department, Hospital Universitario de Cáceres, Cáceres, Spain
| | | | | | | | - Alejandro López-Escobar
- Pediatrics Department, Clinical Research Unit, Hospital Universitario Vithas Madrid La Milagrosa, Fundación Vithas, Madrid, Spain.
| |
Collapse
|
23
|
Zakariaee SS, Naderi N, Ebrahimi M, Kazemi-Arpanahi H. Comparing machine learning algorithms to predict COVID‑19 mortality using a dataset including chest computed tomography severity score data. Sci Rep 2023; 13:11343. [PMID: 37443373 PMCID: PMC10345104 DOI: 10.1038/s41598-023-38133-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 07/04/2023] [Indexed: 07/15/2023] Open
Abstract
Since the beginning of the COVID-19 pandemic, new and non-invasive digital technologies such as artificial intelligence (AI) had been introduced for mortality prediction of COVID-19 patients. The prognostic performances of the machine learning (ML)-based models for predicting clinical outcomes of COVID-19 patients had been mainly evaluated using demographics, risk factors, clinical manifestations, and laboratory results. There is a lack of information about the prognostic role of imaging manifestations in combination with demographics, clinical manifestations, and laboratory predictors. The purpose of the present study is to develop an efficient ML prognostic model based on a more comprehensive dataset including chest CT severity score (CT-SS). Fifty-five primary features in six main classes were retrospectively reviewed for 6854 suspected cases. The independence test of Chi-square was used to determine the most important features in the mortality prediction of COVID-19 patients. The most relevant predictors were used to train and test ML algorithms. The predictive models were developed using eight ML algorithms including the J48 decision tree (J48), support vector machine (SVM), multi-layer perceptron (MLP), k-nearest neighbourhood (k-NN), Naïve Bayes (NB), logistic regression (LR), random forest (RF), and eXtreme gradient boosting (XGBoost). The performances of the predictive models were evaluated using accuracy, precision, sensitivity, specificity, and area under the ROC curve (AUC) metrics. After applying the exclusion criteria, a total of 815 positive RT-PCR patients were the final sample size, where 54.85% of the patients were male and the mean age of the study population was 57.22 ± 16.76 years. The RF algorithm with an accuracy of 97.2%, the sensitivity of 100%, a precision of 94.8%, specificity of 94.5%, F1-score of 97.3%, and AUC of 99.9% had the best performance. Other ML algorithms with AUC ranging from 81.2 to 93.9% had also good prediction performances in predicting COVID-19 mortality. Results showed that timely and accurate risk stratification of COVID-19 patients could be performed using ML-based predictive models fed by routine data. The proposed algorithm with the more comprehensive dataset including CT-SS could efficiently predict the mortality of COVID-19 patients. This could lead to promptly targeting high-risk patients on admission, the optimal use of hospital resources, and an increased probability of survival of patients.
Collapse
Affiliation(s)
| | - Negar Naderi
- Department of Midwifery, Ilam University of Medical Sciences, Ilam, Iran
| | - Mahdi Ebrahimi
- Department of Emergency Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Hadi Kazemi-Arpanahi
- Department of Health Information Technology, Abadan University of Medical Sciences, Abadan, Iran.
| |
Collapse
|
24
|
Dobrijević D, Vilotijević-Dautović G, Katanić J, Horvat M, Horvat Z, Pastor K. Rapid Triage of Children with Suspected COVID-19 Using Laboratory-Based Machine-Learning Algorithms. Viruses 2023; 15:1522. [PMID: 37515208 PMCID: PMC10383367 DOI: 10.3390/v15071522] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/05/2023] [Accepted: 07/06/2023] [Indexed: 07/30/2023] Open
Abstract
In order to limit the spread of the novel betacoronavirus (SARS-CoV-2), it is necessary to detect positive cases as soon as possible and isolate them. For this purpose, machine-learning algorithms, as a field of artificial intelligence, have been recognized as a promising tool. The aim of this study was to assess the utility of the most common machine-learning algorithms in the rapid triage of children with suspected COVID-19 using easily accessible and inexpensive laboratory parameters. A cross-sectional study was conducted on 566 children treated for respiratory diseases: 280 children with PCR-confirmed SARS-CoV-2 infection and 286 children with respiratory symptoms who were SARS-CoV-2 PCR-negative (control group). Six machine-learning algorithms, based on the blood laboratory data, were tested: random forest, support vector machine, linear discriminant analysis, artificial neural network, k-nearest neighbors, and decision tree. The training set was validated through stratified cross-validation, while the performance of each algorithm was confirmed by an independent test set. Random forest and support vector machine models demonstrated the highest accuracy of 85% and 82.1%, respectively. The models demonstrated better sensitivity than specificity and better negative predictive value than positive predictive value. The F1 score was higher for the random forest than for the support vector machine model, 85.2% and 82.3%, respectively. This study might have significant clinical applications, helping healthcare providers identify children with COVID-19 in the early stage, prior to PCR and/or antigen testing. Additionally, machine-learning algorithms could improve overall testing efficiency with no extra costs for the healthcare facility.
Collapse
Affiliation(s)
- Dejan Dobrijević
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia
- Institute for Child and Youth Health Care of Vojvodina, 21000 Novi Sad, Serbia
| | - Gordana Vilotijević-Dautović
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia
- Institute for Child and Youth Health Care of Vojvodina, 21000 Novi Sad, Serbia
| | - Jasmina Katanić
- Faculty of Medicine, University of Novi Sad, 21000 Novi Sad, Serbia
- Institute for Child and Youth Health Care of Vojvodina, 21000 Novi Sad, Serbia
| | - Mirjana Horvat
- Faculty of Civil Engineering Subotica, University of Novi Sad, 24000 Subotica, Serbia
| | - Zoltan Horvat
- Faculty of Civil Engineering Subotica, University of Novi Sad, 24000 Subotica, Serbia
| | - Kristian Pastor
- Faculty of Technology, University of Novi Sad, 21000 Novi Sad, Serbia
| |
Collapse
|
25
|
Li H, Drukker K, Hu Q, Whitney HM, Fuhrman JD, Giger ML. Predicting intensive care need for COVID-19 patients using deep learning on chest radiography. J Med Imaging (Bellingham) 2023; 10:044504. [PMID: 37608852 PMCID: PMC10440543 DOI: 10.1117/1.jmi.10.4.044504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2023] [Revised: 07/12/2023] [Accepted: 08/01/2023] [Indexed: 08/24/2023] Open
Abstract
Purpose Image-based prediction of coronavirus disease 2019 (COVID-19) severity and resource needs can be an important means to address the COVID-19 pandemic. In this study, we propose an artificial intelligence/machine learning (AI/ML) COVID-19 prognosis method to predict patients' needs for intensive care by analyzing chest X-ray radiography (CXR) images using deep learning. Approach The dataset consisted of 8357 CXR exams from 5046 COVID-19-positive patients as confirmed by reverse transcription polymerase chain reaction (RT-PCR) tests for the SARS-CoV-2 virus with a training/validation/test split of 64%/16%/20% on a by patient level. Our model involved a DenseNet121 network with a sequential transfer learning technique employed to train on a sequence of gradually more specific and complex tasks: (1) fine-tuning a model pretrained on ImageNet using a previously established CXR dataset with a broad spectrum of pathologies; (2) refining on another established dataset to detect pneumonia; and (3) fine-tuning using our in-house training/validation datasets to predict patients' needs for intensive care within 24, 48, 72, and 96 h following the CXR exams. The classification performances were evaluated on our independent test set (CXR exams of 1048 patients) using the area under the receiver operating characteristic curve (AUC) as the figure of merit in the task of distinguishing between those COVID-19-positive patients who required intensive care following the imaging exam and those who did not. Results Our proposed AI/ML model achieved an AUC (95% confidence interval) of 0.78 (0.74, 0.81) when predicting the need for intensive care 24 h in advance, and at least 0.76 (0.73, 0.80) for 48 h or more in advance using predictions based on the AI prognostic marker derived from CXR images. Conclusions This AI/ML prediction model for patients' needs for intensive care has the potential to support both clinical decision-making and resource management.
Collapse
Affiliation(s)
- Hui Li
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| | - Karen Drukker
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| | - Qiyuan Hu
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| | - Heather M. Whitney
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| | - Jordan D. Fuhrman
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| | - Maryellen L. Giger
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
| |
Collapse
|
26
|
Mehrdad S, Shamout FE, Wang Y, Atashzar SF. Deep learning for deterioration prediction of COVID-19 patients based on time-series of three vital signs. Sci Rep 2023; 13:9968. [PMID: 37339986 PMCID: PMC10282033 DOI: 10.1038/s41598-023-37013-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 06/14/2023] [Indexed: 06/22/2023] Open
Abstract
Unrecognized deterioration of COVID-19 patients can lead to high morbidity and mortality. Most existing deterioration prediction models require a large number of clinical information, typically collected in hospital settings, such as medical images or comprehensive laboratory tests. This is infeasible for telehealth solutions and highlights a gap in deterioration prediction models based on minimal data, which can be recorded at a large scale in any clinic, nursing home, or even at the patient's home. In this study, we develop and compare two prognostic models that predict if a patient will experience deterioration in the forthcoming 3 to 24 h. The models sequentially process routine triadic vital signs: (a) oxygen saturation, (b) heart rate, and (c) temperature. These models are also provided with basic patient information, including sex, age, vaccination status, vaccination date, and status of obesity, hypertension, or diabetes. The difference between the two models is the way that the temporal dynamics of the vital signs are processed. Model #1 utilizes a temporally-dilated version of the Long-Short Term Memory model (LSTM) for temporal processes, and Model #2 utilizes a residual temporal convolutional network (TCN) for this purpose. We train and evaluate the models using data collected from 37,006 COVID-19 patients at NYU Langone Health in New York, USA. The convolution-based model outperforms the LSTM based model, achieving a high AUROC of 0.8844-0.9336 for 3 to 24 h deterioration prediction on a held-out test set. We also conduct occlusion experiments to evaluate the importance of each input feature, which reveals the significance of continuously monitoring the variation of the vital signs. Our results show the prospect for accurate deterioration forecast using a minimum feature set that can be relatively easily obtained using wearable devices and self-reported patient information.
Collapse
Affiliation(s)
- Sarmad Mehrdad
- Department of Electrical and Computer Engineering, New York University (NYU), New York, USA
| | - Farah E Shamout
- Department of Biomedical Engineering, New York University (NYU), New York, USA
- Division of Engineering, New York University Abu Dhabi (NYUAD), Abu Dhabi, UAE
- Computer Science and Engineering, New York University (NYU), New York, USA
| | - Yao Wang
- Department of Electrical and Computer Engineering, New York University (NYU), New York, USA
- Department of Biomedical Engineering, New York University (NYU), New York, USA
| | - S Farokh Atashzar
- Department of Electrical and Computer Engineering, New York University (NYU), New York, USA.
- Department of Biomedical Engineering, New York University (NYU), New York, USA.
- Department of Mechanical and Aerospace Engineering, New York University (NYU), New York, USA.
| |
Collapse
|
27
|
A machine learning and explainable artificial intelligence triage-prediction system for COVID-19. DECISION ANALYTICS JOURNAL 2023; 7:100246. [PMCID: PMC10163946 DOI: 10.1016/j.dajour.2023.100246] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 04/21/2023] [Accepted: 05/02/2023] [Indexed: 06/02/2024]
Abstract
COVID-19 is a respiratory disease caused by the SARS-CoV-2 contagion, severely disrupted the healthcare infrastructure. Various countries have developed COVID-19 vaccines that have effectively prevented the severe symptoms caused by the virus to a certain extent. However, a small section of people continues to perish. Artificial intelligence advances have revolutionized healthcare diagnosis and prognosis infrastructure. In this study, we predict the severity of COVID-19 using heterogenous Machine Learning and Deep Learning algorithms by considering clinical markers, vital signs, and other critical factors. This study extensively reviews various classifier architectures to predict the COVID-19 severity. We built and evaluated multiple pipelines entailing combinations of five state-of-the-art data-balancing techniques (Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic, Borderline SMOTE, SMOTE with Tomek links, and SMOTE with Edited Nearest Neighbor (ENN)) and twelve heterogeneous classifiers such as Logistic Regression, Decision Tree, Random Forest, Support Vector Machine, K-Nearest Neighbors, Naïve Bayes, Xgboost, Extratrees, Adaboost, Light GBM, Catboost, and 1-D Convolution Neural Network. The best-performing pipeline consists of Random Forest trained on Borderline SMOTE balanced data that produced the highest recall of 83%. We deployed Explainable Artificial Intelligence tools such as Shapley Additive Explanations and Local Interpretable Model-agnostic Explanations, ELI5, Qlattice, Anchor, and Feature Importance to demystify complex tree-based ensemble models. These tools provide valuable insights into the significance of critical features in the severity prediction of a COVID-19 patient. It was observed that changes in respiratory rate, blood pressure, lactate, and calcium values were the primary contributors to the increase in severity of a COVID-19 patient. This architecture aims to be an explainable decision-support triaging system for medical professionals in countries lacking advanced medical technology and infrastructure to reduce fatalities.
Collapse
|
28
|
Utilizing CNN-LSTM techniques for the enhancement of medical systems. ALEXANDRIA ENGINEERING JOURNAL 2023; 72:323-338. [PMCID: PMC10105249 DOI: 10.1016/j.aej.2023.04.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/30/2023] [Accepted: 04/05/2023] [Indexed: 04/04/2024]
Abstract
COVID-19 is one of the most chronic and serious infections of recent years due to its worldwide spread. Determining who was genuinely affected when the disease spreads more widely is challenging. More than 60% of affected individuals report having a dry cough. In many recent studies, diagnostic models were developed using coughing and other breathing sounds. With the development of technology, body sounds are now collected using digital techniques for respiratory and cardiovascular tests. Early research on identifying COVID-19 utilizing speech and diagnosing signs yielded encouraging findings. The gathering of extensive, multi-group, airborne acoustical sound data is used in the developed framework to conduct an efficient assessment to test for COVID-19. An effective classification model is created to assess COVID-19 utilizing deep learning methods. The MIT-Covid-19 dataset is used as the input, and the Weiner filter is used for pre-processing. Following feature extraction done by Mel-frequency cepstral coefficients, the classification is performed using the CNN-LSTM approach. The study compared the performance of the developed framework with other techniques such as CNN, GRU, and LSTM. Study results revealed that CNN-LSTM outperformed other existing approaches by 97.7%.
Collapse
|
29
|
Karimian H, Fan Q, Li Q, Chen Y, Shi J. Spatiotemporal transmission of infectious nanochemical particles in water environment: A case study of Covid-19. CHEMOSPHERE 2023:139065. [PMID: 37247670 DOI: 10.1016/j.chemosphere.2023.139065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 05/09/2023] [Accepted: 05/26/2023] [Indexed: 05/31/2023]
Abstract
This study explores the dynamic transmission of nanochemical infectious particles due to COVID-19 in the water environment using a spatiotemporal epidemiological approach. We proposed a novel multi-agent model to simulate the spread of COVID-19 by considering several influencing factors. The model divides the population into susceptible and infected and analyzes the impact of different prevention and control measures, such as limiting the number of people and wearing masks on the spread of COVID-19. The findings suggest that reducing population density and wearing masks can significantly reduce the likelihood of virus transmission. Specifically, the research shows that if the population moves within a fixed range, almost everyone will eventually be infected within 1 h. When the population density is 50%, the infection rate is as high as 96%. If everyone does not wear a mask, nearly 72.33% of the people will be infected after 1 h. However, when people wear masks, the infection rate is consistently lower than when they do not wear masks. Even if only 25% of people wear masks, the infection rate with masks is 27.67% lower than without masks, which is strong evidence of the importance of wearing a mask. As people's daily activities are mostly carried out indoors, and many super-spreading events of the new crown epidemic also originated from indoor gatherings, the research on indoor epidemic prevention and control is essential. This study provides decision-making support for epidemic preventions and controls and the proposed methodology can be used in other regions and future epidemics.
Collapse
Affiliation(s)
- Hamed Karimian
- School of Marine Technology and Geomatics, Jiangsu Ocean University, Lianyungang, 222005, China.
| | - Qin Fan
- School of Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou, 341000, China
| | - Qun Li
- Ganzhou Land Space Survey and Planning Research Center, Ganzhou, 341000, China
| | - Youliang Chen
- School of Civil and Surveying & Mapping Engineering, Jiangxi University of Science and Technology, Ganzhou, 341000, China.
| | - Juan Shi
- School of Marine Technology and Geomatics, Jiangsu Ocean University, Lianyungang, 222005, China.
| |
Collapse
|
30
|
Mulenga C, Kaonga P, Hamoonga R, Mazaba ML, Chabala F, Musonda P. Predicting Mortality in Hospitalized COVID-19 Patients in Zambia: An Application of Machine Learning. Glob Health Epidemiol Genom 2023; 2023:8921220. [PMID: 37260675 PMCID: PMC10228226 DOI: 10.1155/2023/8921220] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/23/2023] [Accepted: 04/27/2023] [Indexed: 06/02/2023] Open
Abstract
The coronavirus disease 2019 (COVID-19) has wreaked havoc globally, resulting in millions of cases and deaths. The objective of this study was to predict mortality in hospitalized COVID-19 patients in Zambia using machine learning (ML) methods based on factors that have been shown to be predictive of mortality and thereby improve pandemic preparedness. This research employed seven powerful ML models that included decision tree (DT), random forest (RF), support vector machines (SVM), logistic regression (LR), Naïve Bayes (NB), gradient boosting (GB), and XGBoost (XGB). These classifiers were trained on 1,433 hospitalized COVID-19 patients from various health facilities in Zambia. The performances achieved by these models were checked using accuracy, recall, F1-Score, area under the receiver operating characteristic curve (ROC_AUC), area under the precision-recall curve (PRC_AUC), and other metrics. The best-performing model was the XGB which had an accuracy of 92.3%, recall of 94.2%, F1-Score of 92.4%, and ROC_AUC of 97.5%. The pairwise Mann-Whitney U-test analysis showed that the second-best model (GB) and the third-best model (RF) did not perform significantly worse than the best model (XGB) and had the following: GB had an accuracy of 91.7%, recall of 94.2%, F1-Score of 91.9%, and ROC_AUC of 97.1%. RF had an accuracy of 90.8%, recall of 93.6%, F1-Score of 91.0%, and ROC_AUC of 96.8%. Other models showed similar results for the same metrics checked. The study successfully derived and validated the selected ML models and predicted mortality effectively with reasonably high performance in the stated metrics. The feature importance analysis found that knowledge of underlying health conditions about patients' hospital length of stay (LOS), white blood cell count, age, and other factors can help healthcare providers offer lifesaving services on time, improve pandemic preparedness, and decongest health facilities in Zambia and other countries with similar settings.
Collapse
Affiliation(s)
- Clyde Mulenga
- Department of Epidemiology and Biostatistics, University of Zambia, Lusaka, Zambia
- Institute of Basic and Biomedical Sciences, Levy Mwanawasa Medical University, Lusaka, Zambia
| | - Patrick Kaonga
- Department of Epidemiology and Biostatistics, University of Zambia, Lusaka, Zambia
| | - Raymond Hamoonga
- The Health Press, Zambia National Public Health Institute, Lusaka, Zambia
| | - Mazyanga Lucy Mazaba
- Communication Information and Research, Zambia National Public Health Institute, Lusaka, Zambia
| | - Freeman Chabala
- Institute of Basic and Biomedical Sciences, Levy Mwanawasa Medical University, Lusaka, Zambia
| | - Patrick Musonda
- Department of Epidemiology and Biostatistics, University of Zambia, Lusaka, Zambia
| |
Collapse
|
31
|
Ortiz-Barrios M, Arias-Fonseca S, Ishizaka A, Barbati M, Avendaño-Collante B, Navarro-Jiménez E. Artificial intelligence and discrete-event simulation for capacity management of intensive care units during the Covid-19 pandemic: A case study. JOURNAL OF BUSINESS RESEARCH 2023; 160:113806. [PMID: 36895308 PMCID: PMC9981538 DOI: 10.1016/j.jbusres.2023.113806] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 01/18/2023] [Accepted: 02/23/2023] [Indexed: 06/18/2023]
Abstract
The Covid-19 pandemic has pushed the Intensive Care Units (ICUs) into significant operational disruptions. The rapid evolution of this disease, the bed capacity constraints, the wide variety of patient profiles, and the imbalances within health supply chains still represent a challenge for policymakers. This paper aims to use Artificial Intelligence (AI) and Discrete-Event Simulation (DES) to support ICU bed capacity management during Covid-19. The proposed approach was validated in a Spanish hospital chain where we initially identified the predictors of ICU admission in Covid-19 patients. Second, we applied Random Forest (RF) to predict ICU admission likelihood using patient data collected in the Emergency Department (ED). Finally, we included the RF outcomes in a DES model to assist decision-makers in evaluating new ICU bed configurations responding to the patient transfer expected from downstream services. The results evidenced that the median bed waiting time declined between 32.42 and 48.03 min after intervention.
Collapse
Affiliation(s)
- Miguel Ortiz-Barrios
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 080002, Colombia
| | - Sebastián Arias-Fonseca
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 080002, Colombia
| | - Alessio Ishizaka
- NEOMA Business School, 1 rue du Maréchal Juin, Mont-Saint-Aignan 76130, France
| | - Maria Barbati
- Department of Economics, University Ca' Foscari, Cannaregio 873, Fondamenta San Giobbe, 30121 Venice, Italy
| | | | | |
Collapse
|
32
|
Ye J, Shao X, Yang Y, Zhu F. Predicting the negative conversion time of nonsevere COVID-19 patients using machine learning methods. J Med Virol 2023; 95:e28747. [PMID: 37185847 DOI: 10.1002/jmv.28747] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 03/10/2023] [Accepted: 04/10/2023] [Indexed: 05/17/2023]
Abstract
Based on the patient's clinical characteristics and laboratory indicators, different machine-learning methods were used to develop models for predicting the negative conversion time of nonsevere coronavirus disease 2019 (COVID-19) patients. A retrospective analysis was performed on 376 nonsevere COVID-19 patients admitted to Wuxi Fifth People's Hospital from May 2, 2022, to May 14, 2022. The patients were divided into training set (n = 309) and test set (n = 67). The clinical features and laboratory parameters of the patients were collected. In the training set, the least absolute shrinkage and selection operator (LASSO) was used to select predictive features and train six machine learning models: multiple linear regression (MLR), K-Nearest Neighbors Regression (KNNR), random forest regression (RFR), support vector machine regression (SVR), XGBoost regression (XGBR), and multilayer perceptron regression (MLPR). Seven best predictive features selected by LASSO included: age, gender, vaccination status, IgG, lymphocyte ratio, monocyte ratio, and lymphocyte count. The predictive performance of the models in the test set was MLPR > SVR > MLR > KNNR > XGBR > RFR, and MLPR had the strongest generalization performance, which is significantly better than SVR and MLR. In the MLPR model, vaccination status, IgG, lymphocyte count, and lymphocyte ratio were protective factors for negative conversion time; male gender, age, and monocyte ratio were risk factors. The top three features with the highest weights were vaccination status, gender, and IgG. Machine learning methods (especially MLPR) can effectively predict the negative conversion time of non-severe COVID-19 patients. It can help to rationally allocate limited medical resources and prevent disease transmission, especially during the Omicron pandemic.
Collapse
Affiliation(s)
- Jiru Ye
- Department of Respiratory and Critical Care Medicine, The Third Affiliated Hospital of Soochow University, Changzhou, China
| | - Xiaonan Shao
- Department of Nuclear Medicine, The Third Affiliated Hospital of Soochow University, Institute of Clinical Translation of Nuclear Medicine and Molecular Imaging of Soochow University, Changzhou Clinical Medical Center, Changzhou, China
| | - Yong Yang
- Department of Pediatrics, The Third Affiliated Hospital of Soochow University, Changzhou, China
| | - Feng Zhu
- Department of Respiratory and Critical Care Medicine, Affiliated Wuxi Fifth Hospital of Jiangnan University, Wuxi Fifth People's Hospital, Wuxi, China
| |
Collapse
|
33
|
Artificial Intelligence Functionalities During the COVID-19 Pandemic. Disaster Med Public Health Prep 2023; 17:e336. [PMID: 36847255 DOI: 10.1017/dmp.2023.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
Abstract
BACKGROUND The coronavirus disease 2019 (COVID-19) pandemic has led us to use virtual solutions and emerging technologies such as artificial intelligence (AI). Recent studies have clearly demonstrated the role of AI in health care and medical practice; however, a comprehensive review can identify potential yet not fulfilled functionalities of such technologies in pandemics. Therefore, this scoping review study aims at assessing AI functionalities in the COVID-19 pandemic in 2022. METHODS A systematic search was carried out in PubMed, Cochran Library, Scopus, Science Direct, ProQuest, and Web of Science from 2019 to May 9, 2022. Researchers selected the articles according to the search keywords. Finally, the articles mentioning the functionalities of AI in the COVID-19 pandemic were evaluated. Two investigators performed this process. RESULTS Initial search resulted in 9123 articles. After reviewing the title, abstract, and full text of these articles, and applying the inclusion and exclusion criteria, 4 articles were selectd for the final analysis. All 4 were cross-sectional studies. Two studies (50%) were performed in the United States, 1 (25%) in Israel, and 1 (25%) in Saudi Arabia. They covered the functionalities of AI in the prediction, detection, and diagnosis of COVID-19. CONCLUSIONS To the extent of the researchers' knowledge, this study is the first scoping review that assesses the AI functionalities in the COVID-19 pandemic. Health-care organizations need decision support technologies and evidence-based apparatuses that can perceive, think, and reason not dissimilar to human beings. Potential functionalities of such technologies can be used to predict mortality, detect, screen, and trace current and former patients, analyze health data, prioritize high-risk patients, and better allocate hospital resources in pandemics, and generally in health-care settings.
Collapse
|
34
|
Tiwari S, Chanak P, Singh SK. A Review of the Machine Learning Algorithms for Covid-19 Case Analysis. IEEE TRANSACTIONS ON ARTIFICIAL INTELLIGENCE 2023; 4:44-59. [PMID: 36908643 PMCID: PMC9983698 DOI: 10.1109/tai.2022.3142241] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 12/25/2021] [Indexed: 11/09/2022]
Abstract
The purpose of this article is to see how machine learning (ML) algorithms and applications are used in the COVID-19 inquiry and for other purposes. The available traditional methods for COVID-19 international epidemic prediction, researchers and authorities have given more attention to simple statistical and epidemiological methodologies. The inadequacy and absence of medical testing for diagnosing and identifying a solution is one of the key challenges in preventing the spread of COVID-19. A few statistical-based improvements are being strengthened to answer this challenge, resulting in a partial resolution up to a certain level. ML have advocated a wide range of intelligence-based approaches, frameworks, and equipment to cope with the issues of the medical industry. The application of inventive structure, such as ML and other in handling COVID-19 relevant outbreak difficulties, has been investigated in this article. The major goal of this article is to 1) Examining the impact of the data type and data nature, as well as obstacles in data processing for COVID-19. 2) Better grasp the importance of intelligent approaches like ML for the COVID-19 pandemic. 3) The development of improved ML algorithms and types of ML for COVID-19 prognosis. 4) Examining the effectiveness and influence of various strategies in COVID-19 pandemic. 5) To target on certain potential issues in COVID-19 diagnosis in order to motivate academics to innovate and expand their knowledge and research into additional COVID-19-affected industries.
Collapse
Affiliation(s)
- Shrikant Tiwari
- Department of Computer Science and EngineeringIndian Institute of Technology (BHU)Varanasi221005India
| | - Prasenjit Chanak
- Department of Computer Science and EngineeringIndian Institute of Technology (BHU)Varanasi221005India
| | - Sanjay Kumar Singh
- Department of Computer Science and EngineeringIndian Institute of Technology (BHU)Varanasi221005India
| |
Collapse
|
35
|
Nopour R, Shanbezadeh M, Kazemi-Arpanahi H. Predicting intubation risk among COVID-19 hospitalized patients using artificial neural networks. JOURNAL OF EDUCATION AND HEALTH PROMOTION 2023; 12:16. [PMID: 37034879 PMCID: PMC10079178 DOI: 10.4103/jehp.jehp_20_22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 04/26/2022] [Indexed: 06/19/2023]
Abstract
BACKGROUND Accurately predicting the intubation risk in COVID-19 patients at the admission time is critical to optimal use of limited hospital resources, providing customized and evidence-based treatments, and improving the quality of delivered medical care services. This study aimed to design a statistical algorithm to select the best features influencing intubation prediction in coronavirus disease 2019 (COVID-19) hospitalized patients. Then, using selected features, multiple artificial neural network (ANN) configurations were developed to predict intubation risk. MATERIAL AND METHODS In this retrospective single-center study, a dataset containing 482 COVID-19 patients who were hospitalized between February 9, 2020 and July 20, 2021 was used. First, the Phi correlation coefficient method was performed for selecting the most important features affecting COVID-19 patients' intubation. Then, the different configurations of ANN were developed. Finally, the performance of ANN configurations was assessed using several evaluation metrics, and the best structure was determined for predicting intubation requirements among hospitalized COVID-19 patients. RESULTS The ANN models were developed based on 18 validated features. The results indicated that the best performance belongs to the 18-20-1 ANN configuration with positive predictive value (PPV) = 0.907, negative predictive value (NPV) = 0.941, sensitivity = 0.898, specificity = 0.951, and area under curve (AUC) = 0.906. CONCLUSIONS The results demonstrate the effectiveness of the ANN models for timely and reliable prediction of intubation risk in COVID-19 hospitalized patients. Our models can inform clinicians and those involved in policymaking and decision making for prioritizing restricted mechanical ventilation and other related resources for critically COVID-19 patients.
Collapse
Affiliation(s)
- Raoof Nopour
- Department of Health Information Management, Student Research Committee, School of Health Management and Information Sciences Branch, Iran University of Medical Sciences, Tehran, Iran
| | - Mostafa Shanbezadeh
- Department of Health Information Technology, School of Paramedical, Ilam University of Medical Sciences, Ilam, Iran
| | - Hadi Kazemi-Arpanahi
- Department of Health Information Technology, Abadan University of Medical Sciences, Abadan, Iran
| |
Collapse
|
36
|
Jeyananthan P. SARS-CoV-2 Diagnosis Using Transcriptome Data: A Machine Learning Approach. SN COMPUTER SCIENCE 2023; 4:218. [PMID: 36844504 PMCID: PMC9936926 DOI: 10.1007/s42979-023-01703-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 01/24/2023] [Indexed: 05/02/2023]
Abstract
SARS-CoV-2 pandemic is the big issue of the whole world right now. The health community is struggling to rescue the public and countries from this spread, which revives time to time with different waves. Even the vaccination seems to be not prevents this spread. Accurate identification of infected people on time is essential these days to control the spread. So far, Polymerase chain reaction (PCR) and rapid antigen tests are widely used in this identification, accepting their own drawbacks. False negative cases are the menaces in this scenario. To avoid these problems, this study uses machine learning techniques to build a classification model with higher accuracy to filter the COVID-19 cases from the non-COVID individuals. Transcriptome data of the SARS-CoV-2 patients along with the control are used in this stratification using three different feature selection algorithms and seven classification models. Differently expressed genes also studied between these two groups of people and used in this classification. Results shows that mutual information (or DEGs) along with naïve Bayes (or SVM) gives the best accuracy (0.98 ± 0.04) among these methods. Supplementary Information The online version contains supplementary material available at 10.1007/s42979-023-01703-6.
Collapse
|
37
|
Amiri P, Montazeri M, Ghasemian F, Asadi F, Niksaz S, Sarafzadeh F, Khajouei R. Prediction of mortality risk and duration of hospitalization of COVID-19 patients with chronic comorbidities based on machine learning algorithms. Digit Health 2023; 9:20552076231170493. [PMID: 37312960 PMCID: PMC10259141 DOI: 10.1177/20552076231170493] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 03/31/2023] [Indexed: 06/15/2023] Open
Abstract
Background The severity of coronavirus (COVID-19) in patients with chronic comorbidities is much higher than in other patients, which can lead to their death. Machine learning (ML) algorithms as a potential solution for rapid and early clinical evaluation of the severity of the disease can help in allocating and prioritizing resources to reduce mortality. Objective The objective of this study was to predict the mortality risk and length of stay (LoS) of patients with COVID-19 and history of chronic comorbidities using ML algorithms. Methods This retrospective study was conducted by reviewing the medical records of COVID-19 patients with a history of chronic comorbidities from March 2020 to January 2021 in Afzalipour Hospital in Kerman, Iran. The outcome of patients, hospitalization was recorded as discharge or death. The filtering technique used to score the features and well-known ML algorithms were applied to predict the risk of mortality and LoS of patients. Ensemble Learning methods is also used. To evaluate the performance of the models, different measures including F1, precision, recall, and accuracy were calculated. The TRIPOD guideline assessed transparent reporting. Results This study was performed on 1291 patients, including 900 alive and 391 dead patients. Shortness of breath (53.6%), fever (30.1%), and cough (25.3%) were the three most common symptoms in patients. Diabetes mellitus(DM) (31.3%), hypertension (HTN) (27.3%), and ischemic heart disease (IHD) (14.2%) were the three most common chronic comorbidities of patients. Twenty-six important factors were extracted from each patient's record. Gradient boosting model with 84.15% accuracy was the best model for predicting mortality risk and multilayer perceptron (MLP) with rectified linear unit function (MSE = 38.96) was the best model for predicting the LoS. The most common chronic comorbidities among these patients were DM (31.3%), HTN (27.3%), and IHD (14.2%). The most important factors in predicting the risk of mortality were hyperlipidemia, diabetes, asthma, and cancer, and in predicting LoS was shortness of breath. Conclusion The results of this study showed that the use of ML algorithms can be a good tool to predict the risk of mortality and LoS of patients with COVID-19 and chronic comorbidities based on physiological conditions, symptoms, and demographic information of patients. The Gradient boosting and MLP algorithms can quickly identify patients at risk of death or long-term hospitalization and notify physicians to do appropriate interventions.
Collapse
Affiliation(s)
- Parastoo Amiri
- Student Research Committee, Kerman University of Medical Sciences, Kerman, Iran
| | - Mahdieh Montazeri
- Department of Health Information Sciences, Faculty of Management and Medical Information Sciences, Kerman University of Medical Sciences, Kerman, Iran
| | - Fahimeh Ghasemian
- Computer Engineering Department, Faculty of Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Fatemeh Asadi
- Student Research Committee, School of Management and Medical Information, Kerman University of Medical Sciences, Kerman, Iran
| | - Saeed Niksaz
- Computer Engineering Department, Faculty of Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Farhad Sarafzadeh
- Infectious and Internal Medicine Department, Afzalipour Hospital, Kerman University of Medical Sciences, Kerman, Iran
| | - Reza Khajouei
- Department of Health Information Sciences, Faculty of Management and Medical Information Sciences, Kerman University of Medical Sciences, Kerman, Iran
| |
Collapse
|
38
|
Kwekha-Rashid AS, Abduljabbar HN, Alhayani B. Coronavirus disease (COVID-19) cases analysis using machine-learning applications. APPLIED NANOSCIENCE 2023; 13:2013-2025. [PMID: 34036034 PMCID: PMC8138510 DOI: 10.1007/s13204-021-01868-7] [Citation(s) in RCA: 74] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 05/04/2021] [Indexed: 12/23/2022]
Abstract
Today world thinks about coronavirus disease that which means all even this pandemic disease is not unique. The purpose of this study is to detect the role of machine-learning applications and algorithms in investigating and various purposes that deals with COVID-19. Review of the studies that had been published during 2020 and were related to this topic by seeking in Science Direct, Springer, Hindawi, and MDPI using COVID-19, machine learning, supervised learning, and unsupervised learning as keywords. The total articles obtained were 16,306 overall but after limitation; only 14 researches of these articles were included in this study. Our findings show that machine learning can produce an important role in COVID-19 investigations, prediction, and discrimination. In conclusion, machine learning can be involved in the health provider programs and plans to assess and triage the COVID-19 cases. Supervised learning showed better results than other Unsupervised learning algorithms by having 92.9% testing accuracy. In the future recurrent supervised learning can be utilized for superior accuracy.
Collapse
Affiliation(s)
- Ameer Sardar Kwekha-Rashid
- Business Information Technology, College of Administration and Economics, University of Sulaimani, Sulaimaniya, Iraq
| | - Heamn N. Abduljabbar
- College of Education, Physics Department, Salahaddin University, Shaqlawa, Iraq ,Department of radiology and imagingFaculty of Medicine and Health Sciences, Universiti Putra Malaysia UPM, Seri Kembangan, Malaysia
| | - Bilal Alhayani
- Electronics and Communication Department, Yildiz Technical University, Istanbul, Turkey
| |
Collapse
|
39
|
Ustebay S, Sarmis A, Kaya GK, Sujan M. A comparison of machine learning algorithms in predicting COVID-19 prognostics. Intern Emerg Med 2023; 18:229-239. [PMID: 36116079 PMCID: PMC9483274 DOI: 10.1007/s11739-022-03101-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 09/05/2022] [Indexed: 02/01/2023]
Abstract
ML algorithms are used to develop prognostic and diagnostic models and so to support clinical decision-making. This study uses eight supervised ML algorithms to predict the need for intensive care, intubation, and mortality risk for COVID-19 patients. The study uses two datasets: (1) patient demographics and clinical data (n = 11,712), and (2) patient demographics, clinical data, and blood test results (n = 602) for developing the prediction models, understanding the most significant features, and comparing the performances of eight different ML algorithms. Experimental findings showed that all prognostic prediction models reported an AUROC value of over 0.92, in which extra tree and CatBoost classifiers were often outperformed (AUROC over 0.94). The findings revealed that the features of C-reactive protein, the ratio of lymphocytes, lactic acid, and serum calcium have a substantial impact on COVID-19 prognostic predictions. This study provides evidence of the value of tree-based supervised ML algorithms for predicting prognosis in health care.
Collapse
Affiliation(s)
- Serpil Ustebay
- Department of Computer Engineering, Istanbul Medeniyet University, Istanbul, Turkey
| | - Abdurrahman Sarmis
- Department of Microbiology Laboratory, Goztepe Prof. Dr. Suleyman Yalcin City Hospital, Istanbul, Turkey
| | - Gulsum Kubra Kaya
- Department of Industrial Engineering, Istanbul Medeniyet University, Istanbul, Turkey.
- School of Aerospace, Transport and Manufacturing, Cranfield University, Bedford, MK430AL, UK.
| | | |
Collapse
|
40
|
Zakariaee SS, Abdi AI, Naderi N, Babashahi M. Prognostic significance of chest CT severity score in mortality prediction of COVID-19 patients, a machine learning study. THE EGYPTIAN JOURNAL OF RADIOLOGY AND NUCLEAR MEDICINE 2023; 54:73. [PMCID: PMC10116092 DOI: 10.1186/s43055-023-01022-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 04/13/2023] [Indexed: 04/05/2024] Open
Abstract
Background The high mortality rate of COVID-19 makes it necessary to seek early identification of high-risk patients with poor prognoses. Although the association between CT-SS and mortality of COVID-19 patients was reported, its prognosis significance in combination with other prognostic parameters was not evaluated yet. Methods This retrospective single-center study reviewed a total of 6854 suspected patients referred to Imam Khomeini hospital, Ilam city, west of Iran, from February 9, 2020 to December 20, 2020. The prognostic performances of k-Nearest Neighbors (kNN), Multilayer Perceptron (MLP), Support Vector Machine (SVM), and J48 decision tree algorithms were evaluated based on the most important and relevant predictors. The metrics derived from the confusion matrix were used to determine the performance of the ML models. Results After applying exclusion criteria, 815 hospitalized cases were entered into the study. Of these, 447(54.85%) were male and the mean (± SD) age of participants was 57.22(± 16.76) years. The results showed that the performances of the ML algorithms were improved when they are fed by the dataset with CT-SS data. The kNN model with an accuracy of 94.1%, sensitivity of 100. 0%, precision of 89.5%, specificity of 88.3%, and AUC around 97.2% had the best performance among the other three ML techniques. Conclusions The integration of CT-SS data with demographics, risk factors, clinical manifestations, and laboratory parameters improved the prognostic performances of the ML algorithms. An ML model with a comprehensive collection of predictors could identify high-risk patients more efficiently and lead to the optimal use of hospital resources.
Collapse
Affiliation(s)
- Seyed Salman Zakariaee
- Department of Medical Physics, Faculty of Paramedical Sciences, Ilam University of Medical Sciences, Ilam, Iran
| | - Aza Ismail Abdi
- Department of Radiology, Erbil Medical Technical Institute, Erbil Polytechnic University, Erbil, Iraq
| | - Negar Naderi
- Department of Midwifery, Faculty of Nursing and Midwifery, Ilam University of Medical Sciences, Ilam, Iran
| | - Mashallah Babashahi
- Department of Pathology, Faculty of Paramedical Sciences, Ilam University of Medical Sciences, Ilam, Iran
| |
Collapse
|
41
|
Walston SL, Matsumoto T, Miki Y, Ueda D. Artificial intelligence-based model for COVID-19 prognosis incorporating chest radiographs and clinical data; a retrospective model development and validation study. Br J Radiol 2022; 95:20220058. [PMID: 36193755 PMCID: PMC9733620 DOI: 10.1259/bjr.20220058] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 08/19/2022] [Accepted: 08/23/2022] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVES The purpose of this study was to develop an artificial intelligence-based model to prognosticate COVID-19 patients at admission by combining clinical data and chest radiographs. METHODS This retrospective study used the Stony Brook University COVID-19 dataset of 1384 inpatients. After exclusions, 1356 patients were randomly divided into training (1083) and test datasets (273). We implemented three artificial intelligence models, which classified mortality, ICU admission, or ventilation risk. Each model had three submodels with different inputs: clinical data, chest radiographs, and both. We showed the importance of the variables using SHapley Additive exPlanations (SHAP) values. RESULTS The mortality prediction model was best overall with area under the curve, sensitivity, specificity, and accuracy of 0.79 (0.72-0.86), 0.74 (0.68-0.79), 0.77 (0.61-0.88), and 0.74 (0.69-0.79) for the clinical data-based model; 0.77 (0.69-0.85), 0.67 (0.61-0.73), 0.81 (0.67-0.92), 0.70 (0.64-0.75) for the image-based model, and 0.86 (0.81-0.91), 0.76 (0.70-0.81), 0.77 (0.61-0.88), 0.76 (0.70-0.81) for the mixed model. The mixed model had the best performance (p value < 0.05). The radiographs ranked fourth for prognostication overall, and first of the inpatient tests assessed. CONCLUSIONS These results suggest that prognosis models become more accurate if AI-derived chest radiograph features and clinical data are used together. ADVANCES IN KNOWLEDGE This AI model evaluates chest radiographs together with clinical data in order to classify patients as having high or low mortality risk. This work shows that chest radiographs taken at admission have significant COVID-19 prognostic information compared to clinical data other than age and sex.
Collapse
Affiliation(s)
| | | | - Yukio Miki
- Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka Metropolitan University,1-4-3 Asahi-machi, Abeno-ku, Osaka, Japan
| | | |
Collapse
|
42
|
Fadja AN, Fraccaroli M, Bizzarri A, Mazzuchelli G, Lamma E. Neural-Symbolic Ensemble Learning for early-stage prediction of critical state of Covid-19 patients. Med Biol Eng Comput 2022; 60:3461-3474. [PMID: 36201136 PMCID: PMC9540054 DOI: 10.1007/s11517-022-02674-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 09/17/2022] [Indexed: 11/11/2022]
Abstract
Recently, Artificial Intelligence (AI) and Machine Learning (ML) have been successfully applied to many domains of interest including medical diagnosis. Due to the availability of a large quantity of data, it is possible to build reliable AI systems that assist humans in making decisions. The recent Covid-19 pandemic quickly spread over the world causing serious health problems and severe economic and social damage. Computer scientists are actively working together with doctors on different ML models to diagnose Covid-19 patients using Computed Tomography (CT) scans and clinical data. In this work, we propose a neural-symbolic system that predicts if a Covid-19 patient arriving at the hospital will end in a critical condition. The proposed system relies on Deep 3D Convolutional Neural Networks (3D-CNNs) for analyzing lung CT scans of Covid-19 patients, Decision Trees (DTs) for predicting if a Covid-19 patient will eventually pass away by analyzing its clinical data, and a neural system that integrates the previous ones using Hierarchical Probabilistic Logic Programs (HPLPs). Predicting if a Covid-19 patient will end in a critical condition is useful for managing the limited number of intensive care at the hospital. Moreover, knowing early that a Covid-19 patient could end in serious conditions allows doctors to gain early knowledge on patients and provide special treatment to those predicted to finish in critical conditions. The proposed system, entitled Neural HPLP, obtains good performance in terms of area under the receiver operating characteristic and precision curves with values of about 0.96 for both metrics. Therefore, with Neural HPLP, it is possible not only to efficiently predict if Covid-19 patients will end in severe conditions but also possible to provide an explanation of the prediction. This makes Neural HPLP explainable, interpretable, and reliable. Graphical abstract Representation of Neural HPLP. From top to bottom, the two different types of data collected from the same patient and used in this project are represented. This data feeds the two different machine learning systems and the integration of the two systems using Hierarchical Probabilistic Logic Program.
Collapse
Affiliation(s)
- Arnaud Nguembang Fadja
- Department of Mathematics and Computer Science, University of Ferrara, Via Nicolò Machiavelli 30, Ferrara, 44121 Italy
| | - Michele Fraccaroli
- DE - Department of Engineering, University of Ferrara, Via Saragat 1, Ferrara, 44122 Italy
| | - Alice Bizzarri
- DE - Department of Engineering, University of Ferrara, Via Saragat 1, Ferrara, 44122 Italy
| | - Giulia Mazzuchelli
- DE - Department of Engineering, University of Ferrara, Via Saragat 1, Ferrara, 44122 Italy
| | - Evelina Lamma
- DE - Department of Engineering, University of Ferrara, Via Saragat 1, Ferrara, 44122 Italy
| |
Collapse
|
43
|
Lodato I, Iyer AV, To IZ, Lai ZY, Chan HSY, Leung WSW, Tang THC, Cheung VKL, Wu TC, Ng GWY. Prognostic Model of COVID-19 Severity and Survival among Hospitalized Patients Using Machine Learning Techniques. Diagnostics (Basel) 2022; 12:2728. [PMID: 36359571 PMCID: PMC9689804 DOI: 10.3390/diagnostics12112728] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/25/2022] [Accepted: 11/03/2022] [Indexed: 08/22/2023] Open
Abstract
We conducted a statistical study and developed a machine learning model to triage COVID-19 patients affected during the height of the COVID-19 pandemic in Hong Kong based on their medical records and test results (features) collected during their hospitalization. The correlation between the values of these features is studied against discharge status and disease severity as a preliminary step to identify those features with a more pronounced effect on the patient outcome. Once identified, they constitute the inputs of four machine learning models, Decision Tree, Random Forest, Gradient and RUSBoosting, which predict both the Mortality and Severity associated with the disease. We test the accuracy of the models when the number of input features is varied, demonstrating their stability; i.e., the models are already highly predictive when run over a core set of (6) features. We show that Random Forest and Gradient Boosting classifiers are highly accurate in predicting patients' Mortality (average accuracy ∼99%) as well as categorize patients (average accuracy ∼91%) into four distinct risk classes (Severity of COVID-19 infection). Our methodical and broad approach combines statistical insights with various machine learning models, which paves the way forward in the AI-assisted triage and prognosis of COVID-19 cases, which is potentially generalizable to other seasonal flus.
Collapse
Affiliation(s)
- Ivano Lodato
- Allos Limited, 1 Hok Cheung Street, Kowloon, Hong Kong, China
| | - Aditya Varna Iyer
- Allos Limited, 1 Hok Cheung Street, Kowloon, Hong Kong, China
- Department of Physics, University of Oxford, Oxford OX1 3PJ, UK
| | | | - Zhong-Yuan Lai
- Allos Limited, 1 Hok Cheung Street, Kowloon, Hong Kong, China
- Department of Physics, Fudan University, Shanghai 200433, China
| | - Helen Shuk-Ying Chan
- Division of Infectious Diseases, Department of Medicine, Queen Elizabeth Hospital, Hong Kong, China
| | - Winnie Suk-Wai Leung
- Division of Integrative Systems and Design, Hong Kong University of Science and Technology, Hong Kong, China
| | - Tommy Hing-Cheung Tang
- Division of Infectious Diseases, Department of Medicine, Queen Elizabeth Hospital, Hong Kong, China
| | - Victor Kai-Lam Cheung
- Multi-Disciplinary Simulation and Skills Centre, Queen Elizabeth Hospital, Hong Kong, China
| | - Tak-Chiu Wu
- Division of Infectious Diseases, Department of Medicine, Queen Elizabeth Hospital, Hong Kong, China
| | - George Wing-Yiu Ng
- Intensive Care Unit, Department of Medicine, Queen Elizabeth Hospital, Hong Kong, China
| |
Collapse
|
44
|
Mavrogiorgou A, Kiourtis A, Kleftakis S, Mavrogiorgos K, Zafeiropoulos N, Kyriazis D. A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions. SENSORS (BASEL, SWITZERLAND) 2022; 22:8615. [PMID: 36433212 PMCID: PMC9695983 DOI: 10.3390/s22228615] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 11/04/2022] [Accepted: 11/04/2022] [Indexed: 05/27/2023]
Abstract
Extracting useful knowledge from proper data analysis is a very challenging task for efficient and timely decision-making. To achieve this, there exist a plethora of machine learning (ML) algorithms, while, especially in healthcare, this complexity increases due to the domain's requirements for analytics-based risk predictions. This manuscript proposes a data analysis mechanism experimented in diverse healthcare scenarios, towards constructing a catalogue of the most efficient ML algorithms to be used depending on the healthcare scenario's requirements and datasets, for efficiently predicting the onset of a disease. To this context, seven (7) different ML algorithms (Naïve Bayes, K-Nearest Neighbors, Decision Tree, Logistic Regression, Random Forest, Neural Networks, Stochastic Gradient Descent) have been executed on top of diverse healthcare scenarios (stroke, COVID-19, diabetes, breast cancer, kidney disease, heart failure). Based on a variety of performance metrics (accuracy, recall, precision, F1-score, specificity, confusion matrix), it has been identified that a sub-set of ML algorithms are more efficient for timely predictions under specific healthcare scenarios, and that is why the envisioned ML catalogue prioritizes the ML algorithms to be used, depending on the scenarios' nature and needed metrics. Further evaluation must be performed considering additional scenarios, involving state-of-the-art techniques (e.g., cloud deployment, federated ML) for improving the mechanism's efficiency.
Collapse
Affiliation(s)
- Argyro Mavrogiorgou
- Department of Digital Systems, University of Piraeus, 185 34 Piraeus, Greece
| | | | | | | | | | | |
Collapse
|
45
|
COVID-19 machine learning model predicts outcomes in older patients from various European countries, between pandemic waves, and in a cohort of Asian, African, and American patients. PLOS DIGITAL HEALTH 2022; 1:e0000136. [PMID: 36812571 PMCID: PMC9931233 DOI: 10.1371/journal.pdig.0000136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 09/26/2022] [Indexed: 11/11/2022]
Abstract
BACKGROUND COVID-19 remains a complex disease in terms of its trajectory and the diversity of outcomes rendering disease management and clinical resource allocation challenging. Varying symptomatology in older patients as well as limitation of clinical scoring systems have created the need for more objective and consistent methods to aid clinical decision making. In this regard, machine learning methods have been shown to enhance prognostication, while improving consistency. However, current machine learning approaches have been limited by lack of generalisation to diverse patient populations, between patients admitted at different waves and small sample sizes. OBJECTIVES We sought to investigate whether machine learning models, derived on routinely collected clinical data, can generalise well i) between European countries, ii) between European patients admitted at different COVID-19 waves, and iii) between geographically diverse patients, namely whether a model derived on the European patient cohort can be used to predict outcomes of patients admitted to Asian, African and American ICUs. METHODS We compare Logistic Regression, Feed Forward Neural Network and XGBoost algorithms to analyse data from 3,933 older patients with a confirmed COVID-19 diagnosis in predicting three outcomes, namely: ICU mortality, 30-day mortality and patients at low risk of deterioration. The patients were admitted to ICUs located in 37 countries, between January 11, 2020, and April 27, 2021. RESULTS The XGBoost model derived on the European cohort and externally validated in cohorts of Asian, African, and American patients, achieved AUC of 0.89 (95% CI 0.89-0.89) in predicting ICU mortality, AUC of 0.86 (95% CI 0.86-0.86) for 30-day mortality prediction and AUC of 0.86 (95% CI 0.86-0.86) in predicting low-risk patients. Similar AUC performance was achieved also when predicting outcomes between European countries and between pandemic waves, while the models showed high calibration quality. Furthermore, saliency analysis showed that FiO2 values of up to 40% do not appear to increase the predicted risk of ICU and 30-day mortality, while PaO2 values of 75 mmHg or lower are associated with a sharp increase in the predicted risk of ICU and 30-day mortality. Lastly, increase in SOFA scores also increase the predicted risk, but only up to a value of 8. Beyond these scores the predicted risk remains consistently high. CONCLUSION The models captured both the dynamic course of the disease as well as similarities and differences between the diverse patient cohorts, enabling prediction of disease severity, identification of low-risk patients and potentially supporting effective planning of essential clinical resources. TRIAL REGISTRATION NUMBER NCT04321265.
Collapse
|
46
|
Iles RK, Iles JK, Lacey J, Gardiner A, Zmuidinaite R. Direct Detection of Glycated Human Serum Albumin and Hyperglycosylated IgG3 in Serum, by MALDI-ToF Mass Spectrometry, as a Predictor of COVID-19 Severity. Diagnostics (Basel) 2022; 12:diagnostics12102521. [PMID: 36292212 PMCID: PMC9601263 DOI: 10.3390/diagnostics12102521] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 10/10/2022] [Accepted: 10/11/2022] [Indexed: 11/16/2022] Open
Abstract
The prefusion spike protein of SARS-CoV-2 binds advanced glycation end product (AGE)-glycated human serum albumin (HSA) and a higher mass (hyperglycosylated/glycated) immunoglobulin (Ig) G3, as determined by matrix assisted laser desorption mass spectrometry (MALDI-ToF). We set out to investigate if the total blood plasma of patients who had recovered from acute respiratory distress syndrome (ARDS) as a result of COVID-19, contained more glycated HSA and higher mass (glycosylated/glycated) IgG3 than those with only clinically mild or asymptomatic infections. A direct serum dilution, and disulphide bond reduction, method was developed and applied to plasma samples from SARS-CoV-2 seronegative (n = 30) and seropositive (n = 31) healthcare workers (HCWs) and 38 convalescent plasma samples from patients who had been admitted with acute respiratory distress (ARDS) associated with COVID-19. Patients recovering from COVID-19 ARDS had significantly higher mass AGE-glycated HSA and higher mass IgG3 levels. This would indicate that increased levels and/or ratios of hyper-glycosylation (probably terminal sialic acid) IgG3 and AGE glycated HSA may be predisposition markers for the development of COVID-19 ARDS as a result of SARS-CoV2 infection. Furthermore, rapid direct analysis of serum/plasma samples by MALDI-ToF for such humoral immune correlates of COVID-19 presents a feasible screening technology for the most at risk; regardless of age or known health conditions.
Collapse
Affiliation(s)
- Ray K. Iles
- MAP Sciences, The iLab, Stannard Way, Bedford MK44 3RZ, UK
- Laboratory of Viral Zoonotics, Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
- NISAD, Sundstorget 2, 252-21 Helsingborg, Sweden
- Correspondence:
| | - Jason K. Iles
- MAP Sciences, The iLab, Stannard Way, Bedford MK44 3RZ, UK
- Laboratory of Viral Zoonotics, Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Jonathan Lacey
- MAP Sciences, The iLab, Stannard Way, Bedford MK44 3RZ, UK
| | - Anna Gardiner
- MAP Sciences, The iLab, Stannard Way, Bedford MK44 3RZ, UK
| | - Raminta Zmuidinaite
- MAP Sciences, The iLab, Stannard Way, Bedford MK44 3RZ, UK
- Laboratory of Viral Zoonotics, Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| |
Collapse
|
47
|
Kistenev YV, Vrazhnov DA, Shnaider EE, Zuhayri H. Predictive models for COVID-19 detection using routine blood tests and machine learning. Heliyon 2022; 8:e11185. [PMID: 36311357 PMCID: PMC9595489 DOI: 10.1016/j.heliyon.2022.e11185] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 03/25/2022] [Accepted: 10/16/2022] [Indexed: 11/06/2022] Open
Abstract
The problem of accurate, fast, and inexpensive COVID-19 tests has been urgent till now. Standard COVID-19 tests need high-cost reagents and specialized laboratories with high safety requirements, are time-consuming. Data of routine blood tests as a base of SARS-CoV-2 invasion detection allows using the most practical medicine facilities. But blood tests give general information about a patient's state, which is not directly associated with COVID-19. COVID-19-specific features should be selected from the list of standard blood characteristics, and decision-making software based on appropriate clinical data should be created. This review describes the abilities to develop predictive models for COVID-19 detection using routine blood tests and machine learning.
Collapse
Affiliation(s)
- Yury V. Kistenev
- Laboratory of Laser Molecular Imaging and Machine Learning, Tomsk State University, 36 Lenin Av., 634050 Tomsk, Russia
| | - Denis A. Vrazhnov
- Laboratory of Laser Molecular Imaging and Machine Learning, Tomsk State University, 36 Lenin Av., 634050 Tomsk, Russia
| | - Ekaterina E. Shnaider
- Laboratory of Laser Molecular Imaging and Machine Learning, Tomsk State University, 36 Lenin Av., 634050 Tomsk, Russia
| | - Hala Zuhayri
- Laboratory of Laser Molecular Imaging and Machine Learning, Tomsk State University, 36 Lenin Av., 634050 Tomsk, Russia
| |
Collapse
|
48
|
Factor S, Neuman Y, Vidra M, Shalom M, Lichtenstein A, Amar E, Rath E. Violation of expectations is correlated with satisfaction following hip arthroscopy. Knee Surg Sports Traumatol Arthrosc 2022; 31:2023-2029. [PMID: 36181523 DOI: 10.1007/s00167-022-07182-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 09/26/2022] [Indexed: 11/07/2022]
Abstract
PURPOSE The mechanism by which preoperative expectations may be associated with patient satisfaction and procedural outcomes following hip preservation surgery (HPS) is far from simple or linear. The purpose of this study is to better understand patient expectations regarding HPS and their relationship with patient-reported outcomes (PROs) and satisfaction using machine learning (ML) algorithms. METHODS Patients scheduled for hip arthroscopy completed the Hip Preservation Surgery Expectations Survey (HPSES) and the pre- and a minimum 2 year postoperative International Hip Outcome Tool (iHOT-33). Patient demographics, including age, gender, occupation, and body mass index (BMI), were also collected. At the latest follow-up, patients were evaluated for subjective satisfaction and postoperative complications. ML algorithms and standard statistics were used. RESULTS A total of 69 patients were included in this study (mean age 33.7 ± 13.1 years, 62.3% males). The mean follow-up period was 27 months. The mean HPSES score, patient satisfaction, preoperative, and postoperative iHOT-33 were 83.8 ± 16.5, 75.9 ± 26.9, 31.6 ± 15.8, and 73 ± 25.9, respectively. Fifty-nine patients (86%) reported that they would undergo the surgery again, with no significant difference with regards to expectations. A significant difference was found with regards to expectation violation (p < 0.001). Expectation violation scores were also found to be significantly correlated with satisfaction. CONCLUSION ML algorithms utilized in this study demonstrate that violation of expectations plays an important predictive role in postoperative outcomes and patient satisfaction and is associated with patients' willingness to undergo surgery again. LEVEL OF EVIDENCE IV.
Collapse
Affiliation(s)
- Shai Factor
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel.
| | - Yair Neuman
- Department of Cognitive and Brain Sciences and the Zlotowski Center for Neuroscience, Ben-Gurion University of the Negev, 84105, Beer-Sheva, Israel
| | - Matias Vidra
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel
| | - Moshe Shalom
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel
| | - Adi Lichtenstein
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel
| | - Eyal Amar
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel
| | - Ehud Rath
- Orthopedic Division, Department of Orthopedic Surgery, Affiliated with the Sackler Faculty of Medicine, Tel Aviv Medical Center, 6 Weitzman St., 6423906, Tel Aviv, Israel
| |
Collapse
|
49
|
Winston L, McCann M, Onofrei G. Exploring Socioeconomic Status as a Global Determinant of COVID-19 Prevalence, Using Exploratory Data Analytic and Supervised Machine Learning Techniques: Algorithm Development and Validation Study. JMIR Form Res 2022; 6:e35114. [PMID: 36001798 PMCID: PMC9518652 DOI: 10.2196/35114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 04/12/2022] [Accepted: 04/27/2022] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND The COVID-19 pandemic represents the most unprecedented global challenge in recent times. As the global community attempts to manage the pandemic in the long term, it is pivotal to understand what factors drive prevalence rates and to predict the future trajectory of the virus. OBJECTIVE This study had 2 objectives. First, it tested the statistical relationship between socioeconomic status and COVID-19 prevalence. Second, it used machine learning techniques to predict cumulative COVID-19 cases in a multicountry sample of 182 countries. Taken together, these objectives will shed light on socioeconomic status as a global risk factor of the COVID-19 pandemic. METHODS This research used exploratory data analysis and supervised machine learning methods. Exploratory analysis included variable distribution, variable correlations, and outlier detection. Following this, the following 3 supervised regression techniques were applied: linear regression, random forest, and adaptive boosting (AdaBoost). Results were evaluated using k-fold cross-validation and subsequently compared to analyze algorithmic suitability. The analysis involved 2 models. First, the algorithms were trained to predict 2021 COVID-19 prevalence using only 2020 reported case data. Following this, socioeconomic indicators were added as features and the algorithms were trained again. The Human Development Index (HDI) metrics of life expectancy, mean years of schooling, expected years of schooling, and gross national income were used to approximate socioeconomic status. RESULTS All variables correlated positively with the 2021 COVID-19 prevalence, with R2 values ranging from 0.55 to 0.85. Using socioeconomic indicators, COVID-19 prevalence was predicted with a reasonable degree of accuracy. Using 2020 reported case rates as a lone predictor to predict 2021 prevalence rates, the average predictive accuracy of the algorithms was low (R2=0.543). When socioeconomic indicators were added alongside 2020 prevalence rates as features, the average predictive performance improved considerably (R2=0.721) and all error statistics decreased. Thus, adding socioeconomic indicators alongside 2020 reported case data optimized the prediction of COVID-19 prevalence to a considerable degree. Linear regression was the strongest learner with R2=0.693 on the first model and R2=0.763 on the second model, followed by random forest (0.481 and 0.722) and AdaBoost (0.454 and 0.679). Following this, the second model was retrained using a selection of additional COVID-19 risk factors (population density, median age, and vaccination uptake) instead of the HDI metrics. However, average accuracy dropped to 0.649, which highlights the value of socioeconomic status as a predictor of COVID-19 cases in the chosen sample. CONCLUSIONS The results show that socioeconomic status is an important variable to consider in future epidemiological modeling, and highlights the reality of the COVID-19 pandemic as a social phenomenon and a health care phenomenon. This paper also puts forward new considerations about the application of statistical and machine learning techniques to understand and combat the COVID-19 pandemic.
Collapse
Affiliation(s)
- Luke Winston
- Department of Computing, Atlantic Technological University, Letterkenny, Ireland
| | - Michael McCann
- Department of Computing, Atlantic Technological University, Letterkenny, Ireland
| | - George Onofrei
- Department of Business, Atlantic Technological University, Letterkenny, Ireland
| |
Collapse
|
50
|
Chen J, Li Y, Guo L, Zhou X, Zhu Y, He Q, Han H, Feng Q. Machine learning techniques for CT imaging diagnosis of novel coronavirus pneumonia: a review. Neural Comput Appl 2022; 36:1-19. [PMID: 36159188 PMCID: PMC9483435 DOI: 10.1007/s00521-022-07709-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 08/04/2022] [Indexed: 11/20/2022]
Abstract
Since 2020, novel coronavirus pneumonia has been spreading rapidly around the world, bringing tremendous pressure on medical diagnosis and treatment for hospitals. Medical imaging methods, such as computed tomography (CT), play a crucial role in diagnosing and treating COVID-19. A large number of CT images (with large volume) are produced during the CT-based medical diagnosis. In such a situation, the diagnostic judgement by human eyes on the thousands of CT images is inefficient and time-consuming. Recently, in order to improve diagnostic efficiency, the machine learning technology is being widely used in computer-aided diagnosis and treatment systems (i.e., CT Imaging) to help doctors perform accurate analysis and provide them with effective diagnostic decision support. In this paper, we comprehensively review these frequently used machine learning methods applied in the CT Imaging Diagnosis for the COVID-19, discuss the machine learning-based applications from the various kinds of aspects including the image acquisition and pre-processing, image segmentation, quantitative analysis and diagnosis, and disease follow-up and prognosis. Moreover, we also discuss the limitations of the up-to-date machine learning technology in the context of CT imaging computer-aided diagnosis.
Collapse
Affiliation(s)
- Jingjing Chen
- Zhejiang University City College, Hangzhou, China
- Zhijiang College of Zhejiang University of Technology, Shaoxing, China
| | - Yixiao Li
- Faculty of Science, Zhejiang University of Technology, Hangzhou, China
| | - Lingling Guo
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Xiaokang Zhou
- Faculty of Data Science, Shiga University, Hikone, Japan
- RIKEN Center for Advanced Intelligence Project, Tokyo, Japan
| | - Yihan Zhu
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| | - Qingfeng He
- School of Pharmacy, Fudan University, Shanghai, China
| | - Haijun Han
- School of Medicine, Zhejiang University City College, Hangzhou, China
| | - Qilong Feng
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|