1
|
Campagnini S, Sodero A, Baccini M, Hakiki B, Grippo A, Macchi C, Mannini A, Cecchi F. Prediction of the functional outcome of intensive inpatient rehabilitation after stroke using machine learning methods. Sci Rep 2025; 15:16083. [PMID: 40341247 PMCID: PMC12062331 DOI: 10.1038/s41598-025-00781-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Accepted: 04/30/2025] [Indexed: 05/10/2025] Open
Abstract
An accurate and reliable functional prognosis is vital to stroke patients addressing rehabilitation, to their families, and healthcare providers. This study aimed at developing and validating externally patient-wise prognostic models of the global functional outcome at discharge from intensive inpatient post-acute rehabilitation after stroke, based on a standardized comprehensive multidimensional assessment performed at admission to rehabilitation. Patients addressing intensive inpatient rehabilitation pathways within 30 days from stroke were prospectively enrolled in two consecutive multisite studies. Demographics, description of the event, clinical/functional, and psycho-social data were collected. The outcome of interest was disability in basic daily living activities at discharge, measured by the modified Barthel Index (mBI). Machine learning-based prognostic models were developed, internally cross-validated, and externally validated. Interpretability techniques were applied for the analysis of predictors. 385 patients were considered, 220 (165) for training (external test) sets. A 50.9% (55.8%) of women, 79.5% (80.0%) of ischemic, and a median [interquartile range- IQR] age of 80.0[15.0] (79.0[17.0]) were registered. The Support Vector Machine obtained the best validation performances and a median absolute error [IQR] on discharge mBI estimation of 11.5[15.0] and 9.2[13.0] points on the internal and external testing, respectively. The baseline variables providing the main contributions to the predictions were mBI, motor upper-limb score, age, and cognitive screening score. We achieved a solution to support the formulation of a functional prognosis at intensive rehabilitation admission. The interpretability analysis confirms the relevance of easily collected motor and cognitive dataat admission and of the patient's age.Trial registration: Prospectively registered on ClinicalTrials.gov (registration numbers RIPS NCT03866057, STRATEGY NCT05389878).
Collapse
Affiliation(s)
- Silvia Campagnini
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
| | - Alessandro Sodero
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
- Department of Neurofarba, Università degli Studi di Firenze, Firenze, Italy
| | - Marco Baccini
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
| | - Bahia Hakiki
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
- Department of Experimental and Clinical Medicine, Università degli Studi di Firenze, Firenze, Italy
| | - Antonello Grippo
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
- Azienda Ospedaliera Universitaria Careggi (AOUC), Firenze, Italy
| | - Claudio Macchi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
| | - Andrea Mannini
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy.
| | - Francesca Cecchi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Via di Scandicci 269, Firenze, 50143, Italy
- Department of Experimental and Clinical Medicine, Università degli Studi di Firenze, Firenze, Italy
| |
Collapse
|
2
|
Aude CA, Vattipally VN, Das O, Ran KR, Giwa GA, Rincon-Torroella J, Xu R, Byrne JP, Muehlschlegel S, Suarez JI, Mukherjee D, Huang J, Azad TD, Bettegowda C. Machine Learning Reveals Demographic Disparities in Palliative Care Timing Among Patients With Traumatic Brain Injury Receiving Neurosurgical Consultation. Neurocrit Care 2024:10.1007/s12028-024-02172-2. [PMID: 39653977 DOI: 10.1007/s12028-024-02172-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 10/31/2024] [Indexed: 12/25/2024]
Abstract
BACKGROUND Timely palliative care (PC) consultations offer demonstrable benefits for patients with traumatic brain injury (TBI), yet their implementation remains inconsistent. This study employs machine learning methods to identify distinct patient phenotypes and elucidate the primary drivers of PC consultation timing variability in TBI management, aiming to uncover disparities and inform more equitable care strategies. METHODS Data on admission, hospital course, and outcomes were collected for a cohort of 232 patients with TBI who received both PC consultations and neurosurgical consultations during the same hospitalization. Patient phenotypes were uncovered using principal component analysis and K-means clustering; time-to-PC consultation for each phenotype was subsequently compared by Kaplan-Meier analysis. An extreme gradient boosting model with Shapley Additive Explanations identified key factors influencing PC consultation timing. RESULTS Three distinct patient clusters emerged: cluster A (n = 86), comprising older adult White women (median 87 years) with mild TBI, received the earliest PC consultations (median 2.5 days); cluster B (n = 108), older adult White men (median 81 years) with mild TBI, experienced delayed PC consultations (median 5.0 days); and cluster C (n = 38), middle-aged (median: 46.5 years), severely injured, non-White patients, had the latest PC consultations (median 9.0 days). The clusters did not differ by discharge disposition (p = 0.4) or inpatient mortality (p > 0.9); however, Kaplan-Meier analysis revealed a significant difference in time-to-PC consultation (p < 0.001), despite no differences in time-to-mortality (p = 0.18). Shapley Additive Explanations analysis of the extreme gradient boosting model identified age, sex, and race as the most influential drivers of PC consultation timing. CONCLUSIONS This study unveils crucial disparities in PC consultation timing for patients with TBI, primarily driven by demographic factors rather than clinical presentation or injury characteristics. The identification of distinct patient phenotypes and quantification of factors influencing PC consultation timing provide a foundation for developing for standardized protocols and decision support tools to ensure timely and equitable palliative care access for patients with TBI.
Collapse
Affiliation(s)
- Carlos A Aude
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Vikas N Vattipally
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Oishika Das
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Kathleen R Ran
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Ganiat A Giwa
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Jordina Rincon-Torroella
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Risheng Xu
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - James P Byrne
- Division of Acute Care Surgery, Department of Surgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Susanne Muehlschlegel
- Division of Neurosciences Critical Care, Departments of Anesthesiology and Critical Care Medicine, Neurology, and Neurosurgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Jose I Suarez
- Division of Neurosciences Critical Care, Departments of Anesthesiology and Critical Care Medicine, Neurology, and Neurosurgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Debraj Mukherjee
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Judy Huang
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| | - Tej D Azad
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA.
| | - Chetan Bettegowda
- Department of Neurosurgery, Johns Hopkins University School of Medicine, 1800 Orleans Street, Baltimore, 21287, MD, USA
| |
Collapse
|
3
|
Finocchi A, Campagnini S, Mannini A, Doronzio S, Baccini M, Hakiki B, Bardi D, Grippo A, Macchi C, Navarro Solano J, Baccini M, Cecchi F. Multiple imputation integrated to machine learning: predicting post-stroke recovery of ambulation after intensive inpatient rehabilitation. Sci Rep 2024; 14:25188. [PMID: 39448629 PMCID: PMC11502899 DOI: 10.1038/s41598-024-74537-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Accepted: 09/26/2024] [Indexed: 10/26/2024] Open
Abstract
Good data quality is vital for personalising plans in rehabilitation. Machine learning (ML) improves prognostics but integrating it with Multiple Imputation (MImp) for dealing missingness is an unexplored field. This work aims to provide post-stroke ambulation prognosis, integrating MImp with ML, and identify the prognostic influential factors. Stroke survivors in intensive rehabilitation were enrolled. Data on demographics, events, clinical, physiotherapy, and psycho-social assessment were collected. An independent ambulation at discharge, using the Functional Ambulation Category scale, was the outcome. After handling missingness using MImp, ML models were optimised, cross-validated, and tested. Interpretability techniques analysed predictor contributions. Pre-MImp, the dataset included 54.1% women, 79.2% ischaemic patients, median age 80.0 (interquartile range: 15.0). Post-MImp, 368 non-ambulatory patients on 10 imputed datasets were used for training, 80 for testing. The random forest (the validation best-performing algorithm) obtained 75.5% aggregated balanced accuracy on the test set. The main predictors included modified Barthel index, Fugl-Meyer assessment/motricity index, short physical performance battery, age, Charlson comorbidity index/cumulative illness rating scale, and trunk control test. This is among the first studies applying ML, together with MImp, to predict ambulation recovery in post-stroke rehabilitation. This pipeline reliably exploits the potential of incomplete datasets for healthcare prognosis, identifying relevant predictors.
Collapse
Affiliation(s)
- Alice Finocchi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
| | | | - Andrea Mannini
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
| | - Stefano Doronzio
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
- Department of Experimental and Clinical Medicine, University of Florence, Firenze, Italy
| | - Marco Baccini
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
| | - Bahia Hakiki
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
- Department of Experimental and Clinical Medicine, University of Florence, Firenze, Italy
| | - Donata Bardi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
| | - Antonello Grippo
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
- Azienda Ospedaliera Universitaria Careggi (AOUC), Firenze, Italy
| | - Claudio Macchi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
- Department of Experimental and Clinical Medicine, University of Florence, Firenze, Italy
| | | | - Michela Baccini
- Department of Statistics, Computer Science, Applications, University of Florence, Firenze, Italy
| | - Francesca Cecchi
- IRCCS Fondazione Don Carlo Gnocchi onlus, Firenze, Italy
- Department of Experimental and Clinical Medicine, University of Florence, Firenze, Italy
| |
Collapse
|
4
|
Kommer M, Hawthorne C, Moss L, Piper I, O'Kane R, Czosnyka M, Enblad P, Hemphill JC, Spiegelberg A, Riddell JS, Shaw M. International e-Delphi survey to define best practice in the reporting of intracranial pressure monitoring recording data. BRAIN & SPINE 2024; 4:102860. [PMID: 39149423 PMCID: PMC11325271 DOI: 10.1016/j.bas.2024.102860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 04/25/2024] [Accepted: 07/03/2024] [Indexed: 08/17/2024]
Abstract
Introduction Intracranial pressure (ICP) monitoring is a very commonly performed neurosurgical procedure but there is a wide variation in how it is reported, hindering analysis of it. The current study sought to generate consensus on the reporting of ICP monitoring recording data. Research question "What should be included in an ICP monitoring report?" Material and methods The exercise was completed via a modified eDelphi survey. An expert panel discussion was held from which themes were identified and used to produce a code to annotate the transcript of the discussion. Statements were generated for a further two rounds of electronic questionnaires distributed via the REDcap platform. A Likert scale was used to grade agreement with each statement in the survey. A statement was accepted if more than 70% agreement was achieved between respondents. Data was collated using Microsoft Excel and analysed using R. Results 149 relevant statements were identified from the transcript and categorised into recording parameters, waveform characteristics or reporting. A total of 22 statements were generated for the first round of the survey which was answered by 39 respondents. Following the electronic round of surveys consensus was achieved for all but one statement regarding the acceptability of automating ICP reporting. This was put forward to a second round after which 79% agreement was reached. Discussion and conclusion The themes and statements from this eDelphi can be used as a framework to allow the standardisation of the reporting of intracranial pressure monitoring data.
Collapse
Affiliation(s)
- Maya Kommer
- NHS Greater Glasgow Clyde, UK
- University of Glasgow, UK
| | | | - Laura Moss
- NHS Greater Glasgow Clyde, UK
- University of Glasgow, UK
| | | | - Roddy O'Kane
- NHS Greater Glasgow Clyde, UK
- Royal Hospital for Children Glasgow, UK
| | - Marek Czosnyka
- Division of Neurosurgery, Cambridge University Hospitals, UK
| | - Per Enblad
- Department of Medical Sciences, Section of Neurosurgery, Uppsala University, Uppsala, Sweden
| | - J Claude Hemphill
- Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - Andreas Spiegelberg
- The Interface Group, Institute of Physiology, University of Zurich, Switzerland
| | - John S Riddell
- University of Glasgow, UK
- Centre for Neuroscience, Psychology & Neuroscience Education Hub, School of Psychology and Neuroscience, University of Glasgow, UK
| | - Martin Shaw
- NHS Greater Glasgow Clyde, UK
- University of Glasgow, UK
| |
Collapse
|
5
|
Moss L, Shaw M, Piper I, Hawthorne C. From bed to bench and back again: Challenges facing deployment of intracranial pressure data analysis in clinical environments. BRAIN & SPINE 2024; 4:102858. [PMID: 39105104 PMCID: PMC11298855 DOI: 10.1016/j.bas.2024.102858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 05/29/2024] [Accepted: 07/03/2024] [Indexed: 08/07/2024]
Abstract
Introduction Numerous complex physiological models derived from intracranial pressure (ICP) monitoring data have been developed. More recently, techniques such as machine learning are being used to develop increasingly sophisticated models to aid in clinical decision-making tasks such as diagnosis and prediction. Whilst their potential clinical impact may be significant, few models based on ICP data are routinely available at a patient's bedside. Further, the ability to refine models using ongoing patient data collection is rare. In this paper we identify and discuss the challenges faced when converting insight from ICP data analysis into deployable tools at the patient bedside. Research question To provide an overview of challenges facing implementation of sophisticated ICP models and analyses at the patient bedside. Material and methods A narrative review of the barriers facing implementation of sophisticated ICP models and analyses at the patient bedside in a neurocritical care unit combined with a descriptive case study (the CHART-ADAPT project) on the topic. Results Key barriers found were technical, analytical, and integrity related. Examples included: lack of interoperability of medical devices for data collection and/or model deployment; inadequate infrastructure, hindering analysis of large volumes of high frequency patient data; a lack of clinical confidence in a model; and ethical, trust, security and patient confidentiality considerations governing the secondary use of patient data. Discussion and conclusion To realise the benefits of ICP data analysis, the results need to be promptly delivered and meaningfully communicated. Multiple barriers to implementation remain and solutions which address real-world challenges are required.
Collapse
Affiliation(s)
- Laura Moss
- Dept. of Clinical Physics, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom
- College of Medicine, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Martin Shaw
- Dept. of Clinical Physics, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom
- College of Medicine, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Ian Piper
- College of Medicine, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Christopher Hawthorne
- Dept. of Neuroanaesthesia, Institute of Neurological Sciences, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom
| |
Collapse
|
6
|
Aude CA, Vattipally VN, Das O, Ran KR, Giwa GA, Rincon-Torroella J, Xu R, Byrne JP, Muehlschlegel S, Suarez JI, Mukherjee D, Huang J, Azad TD, Bettegowda C. Machine Learning Identifies Variation in Timing of Palliative Care Consultations Among Traumatic Brain Injury Patients. RESEARCH SQUARE 2024:rs.3.rs-4290808. [PMID: 38746163 PMCID: PMC11092864 DOI: 10.21203/rs.3.rs-4290808/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Background and Objective Timely palliative care involvement offers demonstrable benefits for traumatic brain injury (TBI) patients; however, palliative care consultations (PCCs) are used inconsistently during TBI management. This study aimed to employ advanced machine learning techniques to elucidate the primary drivers of PCC timing variability for TBI patients. Methods Data on admission, hospital course, and outcomes were collected for a cohort of 232 TBI patients who received both PCCs and neurosurgical consultations during the same hospitalization. Principal Component Analysis (PCA) and K-means clustering were used to identify patient phenotypes, which were then compared using Kaplan-Meier analysis. An extreme gradient boosting model (XGBoost) was employed to determine drivers of PCC timing, with model interpretation performed using SHapley Additive exPlanations (SHAP). Results Cluster A (n = 86) consisted mainly of older (median [IQR] = 87 [78, 94] years), White females with mild TBIs and demonstrated the shortest time-to-PCC (2.5 [1.0, 7.0] days). Cluster B (n = 108) also sustained mild TBIs but comprised moderately younger (81 [75, 86] years) married White males with later PCC (5.0 [3.0, 10.8] days). Cluster C (n = 38) represented much younger (46.5 [29.5, 59.8] years), more severely injured, non-White patients with the latest PCC initiation (9.0 [4.2, 17.0] days). The clusters did not differ by discharge disposition (p = 0.4) or frequency inpatient mortality (p > 0.9); however, Kaplan-Meier analysis revealed a significant difference in the time from admission to PCC (p < 0.001), despite no differences in time from admission to mortality (p = 0.18). SHAP analysis of the XGBoost model identified age, sex, and race as the most influential drivers of PCC timing. Conclusions This study highlights crucial disparities in PCC timing for TBI patients and underscores the need for targeted strategies to ensure timely and equitable palliative care integration for this vulnerable population.
Collapse
Affiliation(s)
| | | | - Oishika Das
- The Johns Hopkins University School of Medicine
| | | | | | | | - Risheng Xu
- The Johns Hopkins University School of Medicine
| | | | | | | | | | - Judy Huang
- The Johns Hopkins University School of Medicine
| | - Tej D Azad
- The Johns Hopkins University School of Medicine
| | | |
Collapse
|
7
|
Jaeger BC, Welden S, Lenoir K, Speiser JL, Segar MW, Pandey A, Pajewski NM. Accelerated and Interpretable Oblique Random Survival Forests. J Comput Graph Stat 2024; 33:192-207. [PMID: 39184344 PMCID: PMC11343578 DOI: 10.1080/10618600.2023.2231048] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 06/12/2023] [Indexed: 08/27/2024]
Abstract
The oblique random survival forest (RSF) is an ensemble supervised learning method for right-censored outcomes. Trees in the oblique RSF are grown using linear combinations of predictors, whereas in the standard RSF, a single predictor is used. Oblique RSF ensembles have high prediction accuracy, but assessing many linear combinations of predictors induces high computational overhead. In addition, few methods have been developed for estimation of variable importance (VI) with oblique RSFs. We introduce a method to increase computational efficiency of the oblique RSF and a method to estimate VI with the oblique RSF. Our computational approach uses Newton-Raphson scoring in each non-leaf node, We estimate VI by negating each coefficient used for a given predictor in linear combinations, and then computing the reduction in out-of-bag accuracy. In benchmarking experiments, we find our implementation of the oblique RSF is hundreds of times faster, with equivalent prediction accuracy, compared to existing software for oblique RSFs. We find in simulation studies that "negation VI" discriminates between relevant and irrelevant numeric predictors more accurately than permutation VI, Shapley VI, and a technique to measure VI using analysis of variance. All oblique RSF methods in the current study are available in the aorsf R package, and additional supplemental materials are available online.
Collapse
Affiliation(s)
- Byron C Jaeger
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC
| | - Sawyer Welden
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC
| | - Kristin Lenoir
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC
| | - Jaime L Speiser
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC
| | - Matthew W Segar
- Department of Cardiology, Texas Heart Institute, Houston, TX
| | - Ambarish Pandey
- Division of Cardiology, Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX
| | - Nicholas M Pajewski
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC
| |
Collapse
|
8
|
Sikora A, Zhang T, Murphy DJ, Smith SE, Murray B, Kamaleswaran R, Chen X, Buckley MS, Rowe S, Devlin JW. Machine learning vs. traditional regression analysis for fluid overload prediction in the ICU. Sci Rep 2023; 13:19654. [PMID: 37949982 PMCID: PMC10638304 DOI: 10.1038/s41598-023-46735-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 11/04/2023] [Indexed: 11/12/2023] Open
Abstract
Fluid overload, while common in the ICU and associated with serious sequelae, is hard to predict and may be influenced by ICU medication use. Machine learning (ML) approaches may offer advantages over traditional regression techniques to predict it. We compared the ability of traditional regression techniques and different ML-based modeling approaches to identify clinically meaningful fluid overload predictors. This was a retrospective, observational cohort study of adult patients admitted to an ICU ≥ 72 h between 10/1/2015 and 10/31/2020 with available fluid balance data. Models to predict fluid overload (a positive fluid balance ≥ 10% of the admission body weight) in the 48-72 h after ICU admission were created. Potential patient and medication fluid overload predictor variables (n = 28) were collected at either baseline or 24 h after ICU admission. The optimal traditional logistic regression model was created using backward selection. Supervised, classification-based ML models were trained and optimized, including a meta-modeling approach. Area under the receiver operating characteristic (AUROC), positive predictive value (PPV), and negative predictive value (NPV) were compared between the traditional and ML fluid prediction models. A total of 49 of the 391 (12.5%) patients developed fluid overload. Among the ML models, the XGBoost model had the highest performance (AUROC 0.78, PPV 0.27, NPV 0.94) for fluid overload prediction. The XGBoost model performed similarly to the final traditional logistic regression model (AUROC 0.70; PPV 0.20, NPV 0.94). Feature importance analysis revealed severity of illness scores and medication-related data were the most important predictors of fluid overload. In the context of our study, ML and traditional models appear to perform similarly to predict fluid overload in the ICU. Baseline severity of illness and ICU medication regimen complexity are important predictors of fluid overload.
Collapse
Affiliation(s)
- Andrea Sikora
- Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, 1120 15th Street, HM-118, Augusta, GA, 30912, USA
| | - Tianyi Zhang
- Department of Statistics, University of Georgia Franklin College of Arts and Sciences, Athens, GA, USA
| | - David J Murphy
- Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Emory University, Atlanta, GA, USA
| | - Susan E Smith
- Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, 1120 15th Street, HM-118, Augusta, GA, 30912, USA
| | - Brian Murray
- Department of Pharmacy, University of North Carolina Medical Center, Chapel Hill, NC, USA
| | - Rishikesan Kamaleswaran
- Department of Biomedical Informatics, Emory University School of Medicine, Atlanta, GA, USA
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | - Xianyan Chen
- Department of Statistics, University of Georgia Franklin College of Arts and Sciences, Athens, GA, USA
| | | | - Sandra Rowe
- Department of Pharmacy, Oregon Health and Science University, Portland, OR, USA
| | - John W Devlin
- Northeastern University School of Pharmacy, Boston, MA, USA.
- Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA.
| |
Collapse
|
9
|
Hosseinzadeh Kasani P, Lee JE, Park C, Yun CH, Jang JW, Lee SA. Evaluation of nutritional status and clinical depression classification using an explainable machine learning method. Front Nutr 2023; 10:1165854. [PMID: 37229464 PMCID: PMC10203418 DOI: 10.3389/fnut.2023.1165854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 03/27/2023] [Indexed: 05/27/2023] Open
Abstract
Introduction Depression is a prevalent disorder worldwide, with potentially severe implications. It contributes significantly to an increased risk of diseases associated with multiple risk factors. Early accurate diagnosis of depressive symptoms is a critical first step toward management, intervention, and prevention. Various nutritional and dietary compounds have been suggested to be involved in the onset, maintenance, and severity of depressive disorders. Despite the challenges to better understanding the association between nutritional risk factors and the occurrence of depression, assessing the interplay of these markers through supervised machine learning remains to be fully explored. Methods This study aimed to determine the ability of machine learning-based decision support methods to identify the presence of depression using publicly available health data from the Korean National Health and Nutrition Examination Survey. Two exploration techniques, namely, uniform manifold approximation and projection and Pearson correlation, were performed for explanatory analysis among datasets. A grid search optimization with cross-validation was performed to fine-tune the models for classifying depression with the highest accuracy. Several performance measures, including accuracy, precision, recall, F1 score, confusion matrix, areas under the precision-recall and receiver operating characteristic curves, and calibration plot, were used to compare classifier performances. We further investigated the importance of the features provided: visualized interpretation using ELI5, partial dependence plots, and local interpretable using model-agnostic explanations and Shapley additive explanation for the prediction at both the population and individual levels. Results The best model achieved an accuracy of 86.18% for XGBoost and an area under the curve of 84.96% for the random forest model in original dataset and the XGBoost algorithm with an accuracy of 86.02% and an area under the curve of 85.34% in the quantile-based dataset. The explainable results revealed a complementary observation of the relative changes in feature values, and, thus, the importance of emergent depression risks could be identified. Discussion The strength of our approach is the large sample size used for training with a fine-tuned model. The machine learning-based analysis showed that the hyper-tuned model has empirically higher accuracy in classifying patients with depressive disorder, as evidenced by the set of interpretable experiments, and can be an effective solution for disease control.
Collapse
Affiliation(s)
- Payam Hosseinzadeh Kasani
- Department of Neurology, Kangwon National University Hospital, Chuncheon, Republic of Korea
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of Korea
| | - Jung Eun Lee
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of Korea
| | - Chihyun Park
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of Korea
- Department of Computer Science and Engineering, Kangwon National University, Chuncheon, Republic of Korea
| | - Cheol-Heui Yun
- Department of Agricultural Biotechnology, Seoul National University, Seoul, Republic of Korea
- Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul, Republic of Korea
| | - Jae-Won Jang
- Department of Neurology, Kangwon National University Hospital, Chuncheon, Republic of Korea
- Department of Neurology, Kangwon National University School of Medicine, Chuncheon, Republic of Korea
| | - Sang-Ah Lee
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon, Republic of Korea
- Department of Preventive Medicine, College of Medicine, Kangwon National University, Chuncheon, Republic of Korea
| |
Collapse
|
10
|
Qin K, Huang W, Zhang T, Tang S. Machine learning and deep learning for blood pressure prediction: a methodological review from multiple perspectives. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10353-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
11
|
Navigating the Ocean of Big Data in Neurocritical Care. Neurocrit Care 2022; 37:157-159. [PMID: 35799093 DOI: 10.1007/s12028-022-01558-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|