1
|
Gao J, Zhu Y, Wang W, Wang Z, Dong G, Tang W, Wang H, Wang Y, Harrison EM, Ma L. A comprehensive benchmark for COVID-19 predictive modeling using electronic health records in intensive care. PATTERNS (NEW YORK, N.Y.) 2024; 5:100951. [PMID: 38645764 PMCID: PMC11026964 DOI: 10.1016/j.patter.2024.100951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 02/12/2024] [Accepted: 02/13/2024] [Indexed: 04/23/2024]
Abstract
The COVID-19 pandemic highlighted the need for predictive deep-learning models in health care. However, practical prediction task design, fair comparison, and model selection for clinical applications remain a challenge. To address this, we introduce and evaluate two new prediction tasks-outcome-specific length-of-stay and early-mortality prediction for COVID-19 patients in intensive care-which better reflect clinical realities. We developed evaluation metrics, model adaptation designs, and open-source data preprocessing pipelines for these tasks while also evaluating 18 predictive models, including clinical scoring methods and traditional machine-learning, basic deep-learning, and advanced deep-learning models, tailored for electronic health record (EHR) data. Benchmarking results from two real-world COVID-19 EHR datasets are provided, and all results and trained models have been released on an online platform for use by clinicians and researchers. Our efforts contribute to the advancement of deep-learning and machine-learning research in pandemic predictive modeling.
Collapse
Affiliation(s)
- Junyi Gao
- Centre for Medical Informatics, University of Edinburgh, EH16 4UX Edinburgh, UK
- Health Data Research UK, NW1 2BE London, UK
| | | | | | | | - Guiying Dong
- Peking University People’s Hospital, Beijing 100044, China
| | - Wen Tang
- Peking University Third Hospital, Beijing 100191, China
| | - Hao Wang
- Peking University, Beijing 100871, China
| | - Yasha Wang
- Peking University, Beijing 100871, China
| | - Ewen M. Harrison
- Centre for Medical Informatics, University of Edinburgh, EH16 4UX Edinburgh, UK
| | - Liantao Ma
- Peking University, Beijing 100871, China
| |
Collapse
|
2
|
Jiang S, Gai X, Treggiari MM, Stead WW, Zhao Y, Page CD, Zhang AR. Soft phenotyping for sepsis via EHR time-aware soft clustering. J Biomed Inform 2024; 152:104615. [PMID: 38423266 PMCID: PMC11073833 DOI: 10.1016/j.jbi.2024.104615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/25/2024] [Accepted: 02/20/2024] [Indexed: 03/02/2024]
Abstract
OBJECTIVE Sepsis is one of the most serious hospital conditions associated with high mortality. Sepsis is the result of a dysregulated immune response to infection that can lead to multiple organ dysfunction and death. Due to the wide variability in the causes of sepsis, clinical presentation, and the recovery trajectories, identifying sepsis sub-phenotypes is crucial to advance our understanding of sepsis characterization, to choose targeted treatments and optimal timing of interventions, and to improve prognostication. Prior studies have described different sub-phenotypes of sepsis using organ-specific characteristics. These studies applied clustering algorithms to electronic health records (EHRs) to identify disease sub-phenotypes. However, prior approaches did not capture temporal information and made uncertain assumptions about the relationships among the sub-phenotypes for clustering procedures. METHODS We developed a time-aware soft clustering algorithm guided by clinical variables to identify sepsis sub-phenotypes using data available in the EHR. RESULTS We identified six novel sepsis hybrid sub-phenotypes and evaluated them for medical plausibility. In addition, we built an early-warning sepsis prediction model using logistic regression. CONCLUSION Our results suggest that these novel sepsis hybrid sub-phenotypes are promising to provide more accurate information on sepsis-related organ dysfunction and sepsis recovery trajectories which can be important to inform management decisions and sepsis prognosis.
Collapse
Affiliation(s)
- Shiyi Jiang
- Department of Electrical & Computer Engineering, Duke University, Durham, 27708, NC, USA
| | - Xin Gai
- Department of Statistical Science, Duke University, Durham, 27708, NC, USA
| | | | - William W Stead
- Department of Biomedical Informatics, Vanderbilt University, Nashville, 37235, TN, USA
| | - Yuankang Zhao
- Department of Biostatistics & Bioinformatics, Duke University, Durham, 27708, NC, USA
| | - C David Page
- Department of Biostatistics & Bioinformatics, Duke University, Durham, 27708, NC, USA
| | - Anru R Zhang
- Department of Biostatistics & Bioinformatics, Duke University, Durham, 27708, NC, USA; Department of Computer Science, Duke University, Durham, 27708, NC, USA.
| |
Collapse
|
3
|
Marwaha JS, Beaulieu-Jones BR, Berrigan M, Yuan W, Odom SR, Cook CH, Scott BB, Gupta A, Parsons CS, Seshadri AJ, Brat GA. Quantifying the Prognostic Value of Preoperative Surgeon Intuition: Comparing Surgeon Intuition and Clinical Risk Prediction as Derived from the American College of Surgeons NSQIP Risk Calculator. J Am Coll Surg 2023; 236:1093-1103. [PMID: 36815715 PMCID: PMC10192014 DOI: 10.1097/xcs.0000000000000658] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
BACKGROUND Surgical risk prediction models traditionally use patient attributes and measures of physiology to generate predictions about postoperative outcomes. However, the surgeon's assessment of the patient may be a valuable predictor, given the surgeon's ability to detect and incorporate factors that existing models cannot capture. We compare the predictive utility of surgeon intuition and a risk calculator derived from the American College of Surgeons (ACS) NSQIP. STUDY DESIGN From January 10, 2021 to January 9, 2022, surgeons were surveyed immediately before performing surgery to assess their perception of a patient's risk of developing any postoperative complication. Clinical data were abstracted from ACS NSQIP. Both sources of data were independently used to build models to predict the likelihood of a patient experiencing any 30-day postoperative complication as defined by ACS NSQIP. RESULTS Preoperative surgeon assessment was obtained for 216 patients. NSQIP data were available for 9,182 patients who underwent general surgery (January 1, 2017 to January 9, 2022). A binomial regression model trained on clinical data alone had an area under the receiver operating characteristic curve (AUC) of 0.83 (95% CI 0.80 to 0.85) in predicting any complication. A model trained on only preoperative surgeon intuition had an AUC of 0.70 (95% CI 0.63 to 0.78). A model trained on surgeon intuition and a subset of clinical predictors had an AUC of 0.83 (95% CI 0.77 to 0.89). CONCLUSIONS Preoperative surgeon intuition alone is an independent predictor of patient outcomes; however, a risk calculator derived from ACS NSQIP is a more robust predictor of postoperative complication. Combining intuition and clinical data did not strengthen prediction.
Collapse
Affiliation(s)
- Jayson S Marwaha
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
- the Department of Biomedical Informatics, Harvard Medical School, Boston, MA (Marwaha, Beaulieu-Jones, Yuan, Brat)
| | - Brendin R Beaulieu-Jones
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
- the Department of Biomedical Informatics, Harvard Medical School, Boston, MA (Marwaha, Beaulieu-Jones, Yuan, Brat)
| | - Margaret Berrigan
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - William Yuan
- the Department of Biomedical Informatics, Harvard Medical School, Boston, MA (Marwaha, Beaulieu-Jones, Yuan, Brat)
| | - Stephen R Odom
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Charles H Cook
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Benjamin B Scott
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Alok Gupta
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Charles S Parsons
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Anupamaa J Seshadri
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
| | - Gabriel A Brat
- From the Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA (Marwaha, Beaulieu-Jones, Berrigan, Odom, Cook, Scott, Gupta, Parsons, Seshadri, Brat)
- the Department of Biomedical Informatics, Harvard Medical School, Boston, MA (Marwaha, Beaulieu-Jones, Yuan, Brat)
| |
Collapse
|
4
|
Yang S, Varghese P, Stephenson E, Tu K, Gronsbell J. Machine learning approaches for electronic health records phenotyping: a methodical review. J Am Med Inform Assoc 2023; 30:367-381. [PMID: 36413056 PMCID: PMC9846699 DOI: 10.1093/jamia/ocac216] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 09/27/2022] [Accepted: 10/27/2022] [Indexed: 11/23/2022] Open
Abstract
OBJECTIVE Accurate and rapid phenotyping is a prerequisite to leveraging electronic health records for biomedical research. While early phenotyping relied on rule-based algorithms curated by experts, machine learning (ML) approaches have emerged as an alternative to improve scalability across phenotypes and healthcare settings. This study evaluates ML-based phenotyping with respect to (1) the data sources used, (2) the phenotypes considered, (3) the methods applied, and (4) the reporting and evaluation methods used. MATERIALS AND METHODS We searched PubMed and Web of Science for articles published between 2018 and 2022. After screening 850 articles, we recorded 37 variables on 100 studies. RESULTS Most studies utilized data from a single institution and included information in clinical notes. Although chronic conditions were most commonly considered, ML also enabled the characterization of nuanced phenotypes such as social determinants of health. Supervised deep learning was the most popular ML paradigm, while semi-supervised and weakly supervised learning were applied to expedite algorithm development and unsupervised learning to facilitate phenotype discovery. ML approaches did not uniformly outperform rule-based algorithms, but deep learning offered a marginal improvement over traditional ML for many conditions. DISCUSSION Despite the progress in ML-based phenotyping, most articles focused on binary phenotypes and few articles evaluated external validity or used multi-institution data. Study settings were infrequently reported and analytic code was rarely released. CONCLUSION Continued research in ML-based phenotyping is warranted, with emphasis on characterizing nuanced phenotypes, establishing reporting and evaluation standards, and developing methods to accommodate misclassified phenotypes due to algorithm errors in downstream applications.
Collapse
Affiliation(s)
- Siyue Yang
- Department of Statistical Sciences, University of Toronto, Toronto, Ontario, Canada
| | | | - Ellen Stephenson
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Karen Tu
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Jessica Gronsbell
- Department of Statistical Sciences, University of Toronto, Toronto, Ontario, Canada
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
5
|
Tan Q, Ye M, Ma AJ, Yip TCF, Wong GLH, Yuen PC. Importance-aware personalized learning for early risk prediction using static and dynamic health data. J Am Med Inform Assoc 2021; 28:713-726. [PMID: 33496786 DOI: 10.1093/jamia/ocaa306] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 11/17/2020] [Accepted: 11/21/2020] [Indexed: 01/23/2023] Open
Abstract
OBJECTIVE Accurate risk prediction is important for evaluating early medical treatment effects and improving health care quality. Existing methods are usually designed for dynamic medical data, which require long-term observations. Meanwhile, important personalized static information is ignored due to the underlying uncertainty and unquantifiable ambiguity. It is urgent to develop an early risk prediction method that can adaptively integrate both static and dynamic health data. MATERIALS AND METHODS Data were from 6367 patients with Peptic Ulcer Bleeding between 2007 and 2016. This article develops a novel End-to-end Importance-Aware Personalized Deep Learning Approach (eiPDLA) to achieve accurate early clinical risk prediction. Specifically, eiPDLA introduces a long short-term memory with temporal attention to learn sequential dependencies from time-stamped records and simultaneously incorporating a residual network with correlation attention to capture their influencing relationship with static medical data. Furthermore, a new multi-residual multi-scale network with the importance-aware mechanism is designed to adaptively fuse the learned multisource features, automatically assigning larger weights to important features while weakening the influence of less important features. RESULTS Extensive experimental results on a real-world dataset illustrate that our method significantly outperforms the state-of-the-arts for early risk prediction under various settings (eg, achieving an AUC score of 0.944 at 1 year ahead of risk prediction). Case studies indicate that the achieved prediction results are highly interpretable. CONCLUSION These results reflect the importance of combining static and dynamic health data, mining their influencing relationship, and incorporating the importance-aware mechanism to automatically identify important features. The achieved accurate early risk prediction results save precious time for doctors to timely design effective treatments and improve clinical outcomes.
Collapse
Affiliation(s)
- Qingxiong Tan
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, Hong Kong
| | - Mang Ye
- School of Computer Science, Wuhan University, Wuhan, China
| | - Andy Jinhua Ma
- School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China
| | - Terry Cheuk-Fung Yip
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong, Hong Kong
| | - Grace Lai-Hung Wong
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Hong Kong, Hong Kong
| | - Pong C Yuen
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, Hong Kong
| |
Collapse
|
6
|
Hong S, Hou X, Jing J, Ge W, Zhang L. Predicting Risk of Mortality in Pediatric ICU Based on Ensemble Step-Wise Feature Selection. HEALTH DATA SCIENCE 2021; 2021:9365125. [PMID: 38487508 PMCID: PMC10880178 DOI: 10.34133/2021/9365125] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 01/21/2021] [Indexed: 03/17/2024]
Abstract
Background. Prediction of mortality risk in intensive care units (ICU) is an important task. Data-driven methods such as scoring systems, machine learning methods, and deep learning methods have been investigated for a long time. However, few data-driven methods are specially developed for pediatric ICU. In this paper, we aim to amend this gap-build a simple yet effective linear machine learning model from a number of hand-crafted features for mortality prediction in pediatric ICU.Methods. We use a recently released publicly available pediatric ICU dataset named pediatric intensive care (PIC) from Children's Hospital of Zhejiang University School of Medicine in China. Unlike previous sophisticated machine learning methods, we want our method to keep simple that can be easily understood by clinical staffs. Thus, an ensemble step-wise feature ranking and selection method is proposed to select a small subset of effective features from the entire feature set. A logistic regression classifier is built upon selected features for mortality prediction.Results. The final predictive linear model with 11 features achieves a 0.7531 ROC-AUC score on the hold-out test set, which is comparable with a logistic regression classifier using all 397 features (0.7610 ROC-AUC score) and is higher than the existing well known pediatric mortality risk scorer PRISM III (0.6895 ROC-AUC score).Conclusions. Our method improves feature ranking and selection by utilizing an ensemble method while keeping a simple linear form of the predictive model and therefore achieves better generalizability and performance on mortality prediction in pediatric ICU.
Collapse
Affiliation(s)
- Shenda Hong
- National Institute of Health Data Science at Peking University, Beijing, China
- Institute of Medical Technology, Health Science Center of Peking University, Beijing, China
| | - Xinlin Hou
- Neonatology Department of Peking University First Hospital, BeijingChina
| | - Jin Jing
- Harvard Medical School, Boston, MA, USA
- Clinical Data Animation Center (CDAC), Massachusetts General Hospital, Boston, MA, USA
| | - Wendong Ge
- Harvard Medical School, Boston, MA, USA
- Clinical Data Animation Center (CDAC), Massachusetts General Hospital, Boston, MA, USA
| | - Luxia Zhang
- National Institute of Health Data Science at Peking University, Beijing, China
- Institute of Medical Technology, Health Science Center of Peking University, Beijing, China
| |
Collapse
|